-
公开(公告)号:US12080310B2
公开(公告)日:2024-09-03
申请号:US17336132
申请日:2021-06-01
发明人: Sascha Disch , Martin Dietz , Markus Multrus , Guillaume Fuchs , Emmanuel Ravelli , Matthias Neusinger , Markus Schnell , Benjamin Schubert , Bernhard Grill
IPC分类号: G10L19/18 , G10L19/02 , G10L19/028 , G10L19/032 , G10L19/04 , G10L19/06 , G10L19/20 , G10L19/24 , G10L19/26 , G10L21/038
CPC分类号: G10L19/18 , G10L19/028 , G10L19/032 , G10L19/06 , G10L19/265 , G10L19/02 , G10L19/04 , G10L19/20 , G10L19/24 , G10L21/038
摘要: An audio encoder for encoding an audio signal has: a first encoding processor for encoding a first audio signal portion in a frequency domain, having: a time frequency converter for converting the first audio signal portion into a frequency domain representation; an analyzer for analyzing the frequency domain representation to determine first spectral portions to be encoded with a first spectral resolution and second regions to be encoded with a second resolution; and a spectral encoder for encoding the first spectral portions with the first spectral resolution and encoding the second portions with the second resolution; a second encoding processor for encoding a second different audio signal portion in the time domain; a controller for analyzing and determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal having a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second portion.
-
2.
公开(公告)号:US20240282322A1
公开(公告)日:2024-08-22
申请号:US18592965
申请日:2024-03-01
发明人: Sripal S. MEHTA , Thomas ZIEGLER , Stewart MURRIE
CPC分类号: G10L19/008 , G06F3/165 , G10L19/20 , H04S3/008 , H04S7/30 , G10L19/167 , H04S2400/01 , H04S2400/13 , H04S2400/15 , H04S2420/03
摘要: Methods and audio processing units for generating an object based audio program including conditional rendering metadata corresponding to at least one object channel of the program, where the conditional rendering metadata is indicative of at least one rendering constraint, based on playback speaker array configuration, which applies to each corresponding object channel, and methods for rendering audio content determined by such a program, including by rendering content of at least one audio channel of the program in a manner compliant with each applicable rendering constraint in response to at least some of the conditional rendering metadata. Rendering of a selected mix of content of the program may provide an immersive experience.
-
公开(公告)号:US20240265930A1
公开(公告)日:2024-08-08
申请号:US18416775
申请日:2024-01-18
发明人: Simone FUEG , Jan PLOGSTIES , Sascha DICK , Johannes HILPERT , Julien ROBILLIARD , Achim KUNTZ , Andreas HOELZER
IPC分类号: G10L19/20 , G10L19/008 , H04N21/233 , H04N21/431 , H04N21/439 , H04N21/45 , H04N21/81 , H04N21/84 , H04S3/00 , H04S7/00 , G10L19/16
CPC分类号: G10L19/20 , G10L19/008 , H04N21/233 , H04N21/4318 , H04N21/439 , H04N21/4516 , H04N21/8106 , H04N21/84 , H04S3/008 , H04S7/00 , H04S7/30 , H04S7/308 , G10L19/167 , H04S2400/11
摘要: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
-
4.
公开(公告)号:US12057130B2
公开(公告)日:2024-08-06
申请号:US17852479
申请日:2022-06-29
发明人: Dejun Zhang
IPC分类号: G10L19/02 , G10L19/008 , G10L19/09 , G10L19/20 , G10L19/26
CPC分类号: G10L19/02 , G10L19/008 , G10L19/09 , G10L19/20 , G10L19/265
摘要: An audio signal encoding method and apparatus, and an audio signal decoding method and apparatus are disclosed. The audio signal encoding method includes: obtaining a frequency-domain coefficient of a current frame and a frequency-domain coefficient of a reference signal of the current frame; performing filtering processing on the frequency-domain coefficient of the current frame to obtain a filtering parameter; determining a target frequency-domain coefficient of the current frame based on the filtering parameter; performing filtering processing on the frequency-domain coefficient of the reference signal and a reference frequency-domain coefficient based on the filtering parameter to obtain a target frequency-domain coefficient of the reference signal; and encoding the target frequency-domain coefficient of the current frame based on the target frequency-domain coefficient of the current frame, the target frequency-domain coefficient of the reference signal, and a reference target frequency-domain coefficient. The method can improve audio signal encoding/decoding efficiency.
-
公开(公告)号:US12033648B2
公开(公告)日:2024-07-09
申请号:US18339915
申请日:2023-06-22
发明人: Emmanuel Ravelli , Manuel Jander , Grzegorz Pietrzyk , Martin Dietz , Marc Gayer
IPC分类号: G10L19/26 , G10L19/005 , G10L19/022 , G10L19/03 , G10L19/12 , G10L19/20 , H04B1/10 , G10L21/0364 , G11B27/038
CPC分类号: G10L19/26 , G10L19/005 , G10L19/022 , G10L19/03 , G10L19/12 , G10L19/20 , G10L21/0364 , G11B27/038 , H04B1/1027
摘要: A method is described that processes an audio signal. A discontinuity between a filtered past frame and a filtered current frame of the audio signal is removed using linear predictive filter. Removing the discontinuity further comprises processing a beginning portion of the filtered current frame, wherein the beginning portion of the current frame comprises a predefined number of samples being less or equal than a total number of samples in the current frame, and wherein processing the beginning portion of the current frame comprises subtracting a beginning portion of a zero-input-response (ZIR) from the beginning portion of the filtered current frame.
-
公开(公告)号:US20240153517A1
公开(公告)日:2024-05-09
申请号:US18504879
申请日:2023-11-08
CPC分类号: G10L19/20 , G10L19/008 , G10L19/0212 , G10L19/167 , G10L25/18 , H04S3/008 , H04S2400/03 , H04S2420/03
摘要: A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal comprising spectral coefficients corresponding to frequencies up to a first cross-over frequency for a time frame and performing parametric decoding at a second cross-over frequency for the time frame to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method also includes extracting from the encoded audio bitstream a second waveform-coded signal comprising spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency for the time frame and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal for the time frame.
-
公开(公告)号:US20240153516A1
公开(公告)日:2024-05-09
申请号:US18416154
申请日:2024-01-18
发明人: Yuki Yamamoto , Toru Chinen , Minoru Tsuji
IPC分类号: G10L19/20 , G10L19/008 , G10L25/51
CPC分类号: G10L19/20 , G10L19/008 , G10L25/51 , G10L19/02
摘要: The present technology relates to a signal processing device and method, and a program making it possible to reduce the computational complexity of decoding at low cost.
A signal processing device includes: a priority information generation unit configured to generate priority information about an audio object on the basis of a plurality of elements expressing a feature of the audio object. The present technology may be applied to an encoding device and a decoding device.-
公开(公告)号:US11900956B2
公开(公告)日:2024-02-13
申请号:US18154187
申请日:2023-01-13
发明人: Yuki Yamamoto , Toru Chinen , Minoru Tsuji
IPC分类号: G10L19/20 , G10L25/51 , G10L19/008 , G10L19/02 , G10L25/78
CPC分类号: G10L19/20 , G10L19/008 , G10L25/51 , G10L19/02 , G10L25/78
摘要: The present technology relates to a signal processing device and method, and a program making it possible to reduce the computational complexity of decoding at low cost.
A signal processing device includes: a priority information generation unit configured to generate priority information about an audio object on the basis of a plurality of elements expressing a feature of the audio object. The present technology may be applied to an encoding device and a decoding device.-
公开(公告)号:US11792591B2
公开(公告)日:2023-10-17
申请号:US17548485
申请日:2021-12-10
IPC分类号: G10L19/008 , H04S3/00 , H04H20/89 , G10L19/20 , H04S3/02
CPC分类号: H04S3/008 , G10L19/008 , G10L19/20 , H04H20/89 , H04S3/02 , H04S2420/11
摘要: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.
-
公开(公告)号:US11785414B2
公开(公告)日:2023-10-10
申请号:US17815860
申请日:2022-07-28
发明人: Christian Borss , Christian Ertel , Johannes Hilpert , Achim Kuntz , Michael Fischer , Florian Schuh , Bernhard Grill
IPC分类号: H04S3/02 , H04S7/00 , G10L19/008 , G10L19/20
CPC分类号: H04S7/308 , G10L19/008 , G10L19/20 , H04S3/02 , H04S7/30 , H04S2400/01 , H04S2400/03 , H04S2400/11
摘要: An apparatus for generating a plurality of audio channels for a speaker setup, comprises a processor repeating an energy distribution from a speaker not contained in the speaker setup to the speakers in the speaker setup to acquire a downmix information for a downmix to the speaker setup; and a renderer for generating the plurality of audio channels using the downmix information.
-
-
-
-
-
-
-
-
-