-
公开(公告)号:US20240355341A1
公开(公告)日:2024-10-24
申请号:US18574918
申请日:2022-06-16
IPC分类号: G10L19/008
CPC分类号: G10L19/008
摘要: An apparatus for spatial audio encoding including circuitry configured to: obtain a first spatial audio stream of a first spatial audio format configured to be encoded with a low bitrate, wherein the first spatial audio stream includes an audio signal and a first metadata; obtain a second and different spatial audio stream of a second spatial audio format, wherein the second spatial audio stream includes a second audio signal and a second metadata; convert the second spatial audio format into the first spatial audio format to encode a converted second spatial audio stream with the low bitrate, wherein the converted spatial audio stream represents spatial audio properties of the second spatial audio stream; combine the first spatial audio stream and the converted second spatial audio stream to generate a combined spatial audio stream; and encode the combined spatial audio stream.
-
公开(公告)号:US12126985B2
公开(公告)日:2024-10-22
申请号:US17896005
申请日:2022-08-25
发明人: Leon Terentiv , Christof Fersch , Daniel Fischer
IPC分类号: H04S7/00 , G10L19/008 , G10L19/16 , H04S3/00
CPC分类号: H04S7/303 , G10L19/008 , G10L19/167 , H04S3/008 , H04S2400/01 , H04S2400/11
摘要: The present disclosure relates to methods, apparatus and systems for encoding an audio signal into a bitstream, in particular at an encoder, comprising: encoding or including audio signal data associated with 3DoF audio rendering into one or more first bitstream parts of the bitstream, and encoding or including metadata associated with 6DoF audio rendering into one or more second bitstream parts of the bitstream. The present disclosure further relates to methods, apparatus and systems for decoding an audio signal and audio rendering based on the bitstream.
-
公开(公告)号:US12125492B2
公开(公告)日:2024-10-22
申请号:US17071299
申请日:2020-10-15
申请人: VOICEAGE CORPORATION
发明人: Tommy Vaillancourt , Milan Jelinek
IPC分类号: G10L19/008
CPC分类号: G10L19/008
摘要: A stereo sound decoding method and system decode left and right channels of a stereo sound signal, using received encoding parameters comprising encoding parameters of a primary channel, encoding parameters of a secondary channel, and a factor β. The primary channel encoding parameters comprise LP filter coefficients of the primary channel. The primary channel is decoded in response to the primary channel encoding parameters. The secondary channel is decoded using one of a plurality of coding models, wherein at least one of the coding models uses the primary channel LP filter coefficients to decode the secondary channel. The decoded primary and secondary channels are time domain up-mixed using the factor β to produce the decoded left and right channels of the stereo sound signal, wherein the factor β determines respective contributions of the primary and secondary channels upon production of the left and right channels.
-
公开(公告)号:US12120500B2
公开(公告)日:2024-10-15
申请号:US17939114
申请日:2022-09-07
发明人: Seigo Enomoto , Tomokazu Ishikawa
IPC分类号: H04R5/02 , G10L19/008 , H04S3/00 , H04S7/00
CPC分类号: H04S7/304 , G10L19/008 , H04S3/008 , H04S2400/01 , H04S2400/11 , H04S2400/15 , H04S2420/01
摘要: An acoustic reproduction method includes: localizing a first sound image at a first position in a target space in which a user is present; and localizing, at a second position in the target space, a second sound image that represents an anchor sound for indicating a reference position.
-
公开(公告)号:US12119009B2
公开(公告)日:2024-10-15
申请号:US17909698
申请日:2021-02-08
IPC分类号: G10L19/008 , G10L19/24 , H04S1/00 , H04S7/00
CPC分类号: G10L19/008 , G10L19/24 , H04S1/007 , H04S7/30 , H04S2400/03
摘要: A sound signal downmix method includes an inter-channel relationship information obtaining step of obtaining an inter-channel correlation value and an inter-channel time difference in an approximate manner, and a downmix step of obtaining a downmix signal based on the obtained information. In the inter-channel relationship information obtaining step, multiple channel signals are sorted such that signals of adjacent channels are similar to each other, the inter-channel correlation value and the inter-channel time difference are determined only between adjacent channels after the sorting, the inter-channel correlation value between non-adjacent channels is obtained by determining a value that has a monotonically non-decreasing relationship with the inter-channel correlation between the adjacent channels, and the inter-channel time difference between non-adjacent channels is obtained by adding up the inter-channel time differences of adjacent channels.
-
公开(公告)号:US12106763B2
公开(公告)日:2024-10-01
申请号:US17571970
申请日:2022-01-10
发明人: Guillaume Fuchs , Jürgen Herre , Fabian Küch , Stefan Döhla , Markus Multrus , Oliver Thiergart , Oliver Wübbolt , Florin Ghido , Stefan Bayer , Wolfgang Jaegers
IPC分类号: G10L19/008 , G10L19/02 , G10L19/032 , G10L19/038 , G10L19/16 , G10L19/26 , H03M7/30
CPC分类号: G10L19/008 , G10L19/0204 , G10L19/032 , G10L19/038 , G10L19/167 , G10L19/26 , H03M7/3082 , H03M7/6005 , H03M7/6011
摘要: An apparatus for encoding directional audio coding parameters comprising diffuseness parameters and direction parameters having a parameter calculator (100) for calculating the diffuseness parameters with a first time or frequency resolution and for calculating the direction parameters with a second time or frequency resolution; and a quantizer and encoder processor (200) for generating a quantized and encoded representation of the diffuseness parameters and the direction parameters.
-
公开(公告)号:US20240323630A1
公开(公告)日:2024-09-26
申请号:US18676347
申请日:2024-05-28
IPC分类号: H04S7/00 , G06T7/70 , G10L15/06 , G10L15/22 , G10L19/00 , G10L19/008 , G10L19/16 , G10L21/0208 , G10L21/0216 , H04R1/40 , H04R3/00 , H04R5/027 , H04S3/00
CPC分类号: H04S7/30 , G06T7/70 , G10L15/063 , G10L15/22 , G10L19/008 , G10L19/167 , G10L21/0208 , H04R1/406 , H04R3/005 , H04R5/027 , H04S3/008 , G10L2019/0001 , G10L2019/0002 , G10L2021/02166 , H04R2201/401 , H04S2400/01 , H04S2400/15
摘要: A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.
-
公开(公告)号:US20240321285A1
公开(公告)日:2024-09-26
申请号:US18259971
申请日:2022-01-05
申请人: VOICEAGE CORPORATION
IPC分类号: G10L19/12 , G10L19/008 , G10L19/22 , G10L21/0232
CPC分类号: G10L19/12 , G10L19/008 , G10L19/22 , G10L21/0232
摘要: A unified time-domain/frequency-domain coding method and device for coding an input sound signal comprise a classifier of the input sound signal into one of a plurality of sound signal categories comprising an unclear signal type category showing that the nature of the input sound signal is unclear. One of a plurality of coding sub-modes is selected for coding the input sound signal if the input sound signal is classified in the unclear signal type category. A mixed time-domain/frequency-domain encoder codes the input sound signal using the selected coding sub-mode. The mixed time-domain/frequency-domain encoder comprises a selector of frequency bands and allocator of bits for selecting frequency bands to quantize and for distributing a bit budget available to quantization between the selected frequency bands. Corresponding sound signal decoder and decoding method are also provided.
-
公开(公告)号:US12100403B2
公开(公告)日:2024-09-24
申请号:US17909666
申请日:2020-11-04
IPC分类号: G10L19/008 , G10L19/24 , H04S1/00 , H04S7/00
CPC分类号: G10L19/008 , G10L19/24 , H04S1/007 , H04S7/30 , H04S2400/03
摘要: A sound signal downmix device for obtaining a downmix signal that is a signal obtained by mixing a left channel input sound signal and a right channel input sound signal includes a left-right relationship information acquisition unit 185 that obtains preceding channel information that is information indicating which of the left channel input sound signal and the right channel input sound signal is preceding and a left-right correlation coefficient that is a correlation coefficient between the left channel input sound signal and the right channel input sound signal and a downmix unit 112 that obtains the downmix signal by weighted averaging the left channel input sound signal and the right channel input sound signal to include a larger amount of an input sound signal of a preceding channel among the left channel input sound signal and the right channel input sound signal as the left-right correlation coefficient is greater, based on the preceding channel information and the left-right correlation coefficient.
-
10.
公开(公告)号:US12100402B2
公开(公告)日:2024-09-24
申请号:US17824297
申请日:2022-05-25
发明人: Jan Buethe , Guillaume Fuchs , Wolfgang Jagers , Franz Reutelhuber , Juergen Herre , Eleni Fotopoulou , Markus Multrus , Srikanth Korse
IPC分类号: G10L19/008 , H04S1/00 , H04S3/00 , H04S3/02 , H04S7/00
CPC分类号: G10L19/008 , H04S1/007 , H04S3/008 , H04S7/30 , H04S2400/01 , H04S2420/03
摘要: An apparatus for downmixing a multi-channel signal having at least two channels, has: a downmixer for calculating a downmix signal from the multi-channel signal, wherein the downmixer is configured to calculate the downmix using an absolute phase compensation, so that a channel having a lower energy among the at least two channels is only rotated or is rotated stronger than a channel having a greater energy in calculating the downmix signal; and an output interface for generating an output signal, the output signal having information on the downmix signal.
-
-
-
-
-
-
-
-
-