专利检索 cpc:"G10L19/20" 第 1 页

1.

发明授权
Audio encoder and decoder using a frequency domain processor with full-band gap filling and a time domain processor 有权

公开(公告)号：US12080310B2

公开(公告)日：2024-09-03

申请号：US17336132

申请日：2021-06-01

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Sascha Disch , Martin Dietz , Markus Multrus , Guillaume Fuchs , Emmanuel Ravelli , Matthias Neusinger , Markus Schnell , Benjamin Schubert , Bernhard Grill

IPC分类号： G10L19/18 , G10L19/02 , G10L19/028 , G10L19/032 , G10L19/04 , G10L19/06 , G10L19/20 , G10L19/24 , G10L19/26 , G10L21/038

CPC分类号： G10L19/18 , G10L19/028 , G10L19/032 , G10L19/06 , G10L19/265 , G10L19/02 , G10L19/04 , G10L19/20 , G10L19/24 , G10L21/038

摘要： An audio encoder for encoding an audio signal has: a first encoding processor for encoding a first audio signal portion in a frequency domain, having: a time frequency converter for converting the first audio signal portion into a frequency domain representation; an analyzer for analyzing the frequency domain representation to determine first spectral portions to be encoded with a first spectral resolution and second regions to be encoded with a second resolution; and a spectral encoder for encoding the first spectral portions with the first spectral resolution and encoding the second portions with the second resolution; a second encoding processor for encoding a second different audio signal portion in the time domain; a controller for analyzing and determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal having a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second portion.

2.

发明公开
METHODS AND SYSTEMS FOR GENERATING AND RENDERING OBJECT BASED AUDIO WITH CONDITIONAL RENDERING METADATA 审中-公开

公开(公告)号：US20240282322A1

公开(公告)日：2024-08-22

申请号：US18592965

申请日：2024-03-01

申请人： Dolby Laboratories Licensing Corporation , DOLBY INTERNATIONAL AB

发明人： Sripal S. MEHTA , Thomas ZIEGLER , Stewart MURRIE

IPC分类号： G10L19/008 , G06F3/16 , G10L19/16 , G10L19/20 , H04S3/00 , H04S7/00

CPC分类号： G10L19/008 , G06F3/165 , G10L19/20 , H04S3/008 , H04S7/30 , G10L19/167 , H04S2400/01 , H04S2400/13 , H04S2400/15 , H04S2420/03

摘要： Methods and audio processing units for generating an object based audio program including conditional rendering metadata corresponding to at least one object channel of the program, where the conditional rendering metadata is indicative of at least one rendering constraint, based on playback speaker array configuration, which applies to each corresponding object channel, and methods for rendering audio content determined by such a program, including by rendering content of at least one audio channel of the program in a manner compliant with each applicable rendering constraint in response to at least some of the conditional rendering metadata. Rendering of a selected mix of content of the program may provide an immersive experience.

3.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开

公开(公告)号：US20240265930A1

公开(公告)日：2024-08-08

申请号：US18416775

申请日：2024-01-18

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Simone FUEG , Jan PLOGSTIES , Sascha DICK , Johannes HILPERT , Julien ROBILLIARD , Achim KUNTZ , Andreas HOELZER

IPC分类号： G10L19/20 , G10L19/008 , H04N21/233 , H04N21/431 , H04N21/439 , H04N21/45 , H04N21/81 , H04N21/84 , H04S3/00 , H04S7/00 , G10L19/16

CPC分类号： G10L19/20 , G10L19/008 , H04N21/233 , H04N21/4318 , H04N21/439 , H04N21/4516 , H04N21/8106 , H04N21/84 , H04S3/008 , H04S7/00 , H04S7/30 , H04S7/308 , G10L19/167 , H04S2400/11

摘要： An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.

4.

发明授权
Audio signal encoding method and apparatus, and audio signal decoding method and apparatus 有权

公开(公告)号：US12057130B2

公开(公告)日：2024-08-06

申请号：US17852479

申请日：2022-06-29

申请人： HUAWEI TECHNOLOGIES CO., LTD.

发明人： Dejun Zhang

IPC分类号： G10L19/02 , G10L19/008 , G10L19/09 , G10L19/20 , G10L19/26

CPC分类号： G10L19/02 , G10L19/008 , G10L19/09 , G10L19/20 , G10L19/265

摘要： An audio signal encoding method and apparatus, and an audio signal decoding method and apparatus are disclosed. The audio signal encoding method includes: obtaining a frequency-domain coefficient of a current frame and a frequency-domain coefficient of a reference signal of the current frame; performing filtering processing on the frequency-domain coefficient of the current frame to obtain a filtering parameter; determining a target frequency-domain coefficient of the current frame based on the filtering parameter; performing filtering processing on the frequency-domain coefficient of the reference signal and a reference frequency-domain coefficient based on the filtering parameter to obtain a target frequency-domain coefficient of the reference signal; and encoding the target frequency-domain coefficient of the current frame based on the target frequency-domain coefficient of the current frame, the target frequency-domain coefficient of the reference signal, and a reference target frequency-domain coefficient. The method can improve audio signal encoding/decoding efficiency.

5.

发明授权
Method and apparatus for processing an audio signal, audio decoder, and audio encoder for removing a discontinuity between frames by subtracting a portion of a zero-input-reponse 有权

公开(公告)号：US12033648B2

公开(公告)日：2024-07-09

申请号：US18339915

申请日：2023-06-22

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Emmanuel Ravelli , Manuel Jander , Grzegorz Pietrzyk , Martin Dietz , Marc Gayer

IPC分类号： G10L19/26 , G10L19/005 , G10L19/022 , G10L19/03 , G10L19/12 , G10L19/20 , H04B1/10 , G10L21/0364 , G11B27/038

CPC分类号： G10L19/26 , G10L19/005 , G10L19/022 , G10L19/03 , G10L19/12 , G10L19/20 , G10L21/0364 , G11B27/038 , H04B1/1027

摘要： A method is described that processes an audio signal. A discontinuity between a filtered past frame and a filtered current frame of the audio signal is removed using linear predictive filter. Removing the discontinuity further comprises processing a beginning portion of the filtered current frame, wherein the beginning portion of the current frame comprises a predefined number of samples being less or equal than a total number of samples in the current frame, and wherein processing the beginning portion of the current frame comprises subtracting a beginning portion of a zero-input-response (ZIR) from the beginning portion of the filtered current frame.

6.

发明公开
AUDIO DECODER FOR INTERLEAVING SIGNALS 审中-公开

公开(公告)号：US20240153517A1

公开(公告)日：2024-05-09

申请号：US18504879

申请日：2023-11-08

申请人： Dolby International AB

发明人： Kristofer Kjörling , Heiko Purnhagen , Harald Mundt , Karl Jonas Roeden , Leif Sehlström

IPC分类号： G10L19/20 , G10L19/008 , G10L19/02 , G10L19/16 , G10L25/18 , H04S3/00

CPC分类号： G10L19/20 , G10L19/008 , G10L19/0212 , G10L19/167 , G10L25/18 , H04S3/008 , H04S2400/03 , H04S2420/03

摘要： A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal comprising spectral coefficients corresponding to frequencies up to a first cross-over frequency for a time frame and performing parametric decoding at a second cross-over frequency for the time frame to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method also includes extracting from the encoded audio bitstream a second waveform-coded signal comprising spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency for the time frame and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal for the time frame.

7.

发明公开
SIGNAL PROCESSING DEVICE AND METHOD, AND PROGRAM 审中-公开

公开(公告)号：US20240153516A1

公开(公告)日：2024-05-09

申请号：US18416154

申请日：2024-01-18

申请人： Sony Group Corporation

发明人： Yuki Yamamoto , Toru Chinen , Minoru Tsuji

IPC分类号： G10L19/20 , G10L19/008 , G10L25/51

CPC分类号： G10L19/20 , G10L19/008 , G10L25/51 , G10L19/02

摘要： The present technology relates to a signal processing device and method, and a program making it possible to reduce the computational complexity of decoding at low cost.
A signal processing device includes: a priority information generation unit configured to generate priority information about an audio object on the basis of a plurality of elements expressing a feature of the audio object. The present technology may be applied to an encoding device and a decoding device.

8.

发明授权
Signal processing device and method, and program 有权

公开(公告)号：US11900956B2

公开(公告)日：2024-02-13

申请号：US18154187

申请日：2023-01-13

申请人： Sony Group Corporation

发明人： Yuki Yamamoto , Toru Chinen , Minoru Tsuji

IPC分类号： G10L19/20 , G10L25/51 , G10L19/008 , G10L19/02 , G10L25/78

CPC分类号： G10L19/20 , G10L19/008 , G10L25/51 , G10L19/02 , G10L25/78

摘要： The present technology relates to a signal processing device and method, and a program making it possible to reduce the computational complexity of decoding at low cost.
A signal processing device includes: a priority information generation unit configured to generate priority information about an audio object on the basis of a plurality of elements expressing a feature of the audio object. The present technology may be applied to an encoding device and a decoding device.

9.

发明授权
Method and apparatus for compressing and decompressing a higher order Ambisonics signal representation 有权

公开(公告)号：US11792591B2

公开(公告)日：2023-10-17

申请号：US17548485

申请日：2021-12-10

申请人： DOLBY LABORATORIES LICENSING CORPORATION

发明人： Alexander Krueger , Sven Kordon , Johannes Boehm , Johann-Markus Batke

IPC分类号： G10L19/008 , H04S3/00 , H04H20/89 , G10L19/20 , H04S3/02

CPC分类号： H04S3/008 , G10L19/008 , G10L19/20 , H04H20/89 , H04S3/02 , H04S2420/11

摘要： A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.

10.

发明授权
Apparatus and method for generating a plurality of audio channels 有权

公开(公告)号：US11785414B2

公开(公告)日：2023-10-10

申请号：US17815860

申请日：2022-07-28

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Christian Borss , Christian Ertel , Johannes Hilpert , Achim Kuntz , Michael Fischer , Florian Schuh , Bernhard Grill

IPC分类号： H04S3/02 , H04S7/00 , G10L19/008 , G10L19/20

CPC分类号： H04S7/308 , G10L19/008 , G10L19/20 , H04S3/02 , H04S7/30 , H04S2400/01 , H04S2400/03 , H04S2400/11

摘要： An apparatus for generating a plurality of audio channels for a speaker setup, comprises a processor repeating an energy distribution from a speaker not contained in the speaker setup to the speakers in the speaker setup to acquire a downmix information for a downmix to the speaker setup; and a renderer for generating the plurality of audio channels using the downmix information.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类