专利检索 ipc:"G10L19/13" 第 1 页

1.

发明授权
Audio signal compression method and apparatus using deep neural network-based multilayer structure and training method thereof 有权

公开(公告)号：US11881227B2

公开(公告)日：2024-01-23

申请号：US18097054

申请日：2023-01-13

申请人： Electronics and Telecommunications Research Institute , Industry-Academic Cooperation Foundation, Yonsei University

发明人： In Seon Jang , Seung Kwon Beack , Jong Mo Sung , Tae Jin Lee , Woo Taek Lim , Byeong Ho Cho , Hong Goo Kang , Ji Hyun Lee , Chan Woo Lee , Hyung Seob Lim

IPC分类号： G10L19/04 , G10L19/038 , G10L19/002 , G10L19/12 , G10L19/08 , G10L19/09 , G10L19/06 , G10L19/13 , G10L19/02

CPC分类号： G10L19/038 , G10L19/002 , G10L19/0204 , G10L19/04 , G10L19/06 , G10L19/08 , G10L19/09 , G10L19/12 , G10L19/13

摘要： A method, executed by a processor for compressing an audio signal in multiple layers, may comprise: (a) restoring, in a highest layer, an input audio signal as a first signal; (b) restoring, in at least one intermediate layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in the highest layer or an immediately previous intermediate layer, from the input audio signal as a second signal; and (c) restoring, in a lowest layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in an intermediate layer immediately before the lowest layer, from the input audio signal as a third signal, wherein the first signal, the second signal, and the third signal are combined to output a final restoration audio signal.

2.

发明申请
AUDIO ENCODER FOR ENCODING A MULTICHANNEL SIGNAL AND AUDIO DECODER FOR DECODING AN ENCODED AUDIO SIGNAL 有权

公开(公告)号：US20220093112A1

公开(公告)日：2022-03-24

申请号：US17410033

申请日：2021-08-24

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Sascha DISCH , Guillaume FUCHS , Emmanuel RAVELLI , Christian NEUKAM , Konstantin SCHMIDT , Conrad BENNDORF , Andreas NIEDERMEIER , Benjamin SCHUBERT , Ralf GEIGER

IPC分类号： G10L19/008 , G10L19/02 , G10L19/04 , G10L19/18 , G10L21/038 , G10L19/032 , G10L19/13

摘要： A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.

3.

发明授权
Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal 有权

公开(公告)号：US11107483B2

公开(公告)日：2021-08-31

申请号：US17008428

申请日：2020-08-31

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Sascha Disch , Guillaume Fuchs , Emmanuel Ravelli , Christian Neukam , Konstantin Schmidt , Conrad Benndorf , Andreas Niedermeier , Benjamin Schubert , Ralf Geiger

IPC分类号： G10L19/008 , G10L19/02 , G10L19/04 , G10L19/18 , G10L21/038 , G10L19/032 , G10L19/13

摘要： A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.

4.

发明申请
AUDIO SIGNAL ENCODING AND DECODING METHOD USING LEARNING MODEL, TRAINING METHOD OF LEARNING MODEL, AND ENCODER AND DECODER THAT PERFORM THE METHODS 有权

公开(公告)号：US20220262378A1

公开(公告)日：2022-08-18

申请号：US17672041

申请日：2022-02-15

申请人： ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

发明人： Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Inseon JANG

IPC分类号： G10L19/13 , G10L25/12 , G06N3/08

摘要： An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficients and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.

5.

发明申请
METHOD OF GENERATING RESIDUAL SIGNAL, AND ENCODER AND DECODER PERFORMING THE METHOD 有权

公开(公告)号：US20220157326A1

公开(公告)日：2022-05-19

申请号：US17507746

申请日：2021-10-21

申请人： ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

发明人： Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG

IPC分类号： G10L19/13 , G10L19/032 , G10L19/06

摘要： A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a less information amount than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a less information amount than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.

6.

发明授权
Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal 有权

公开(公告)号：US10388287B2

公开(公告)日：2019-08-20

申请号：US15695668

申请日：2017-09-05

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Sascha Disch , Guillaume Fuchs , Emmanuel Ravelli , Christian Neukam , Konstantin Schmidt , Conrad Benndorf , Andreas Niedermeier , Benjamin Schubert , Ralf Geiger

IPC分类号： G10L19/02 , G10L19/04 , G10L19/13 , G10L19/18 , G10L19/008 , G10L19/032 , G10L21/038

摘要： Audio encoder for encoding a multichannel signal is shown. The audio encoder includes a downmixer for downmixing the multichannel signal to obtain a downmix signal, a linear prediction domain core encoder for encoding the downmix signal, wherein the downmix signal has a low band and a high band, wherein the linear prediction domain core encoder is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank for generating a spectral representation of the multichannel signal, and a joint multichannel encoder configured to process the spectral representation including the low band and the high band of the multichannel signal to generate multichannel information.

7.

发明申请
AUDIO ENCODER FOR ENCODING A MULTICHANNEL SIGNAL AND AUDIO DECODER FOR DECODING AN ENCODED AUDIO SIGNAL 审中-公开

公开(公告)号：US20170365264A1

公开(公告)日：2017-12-21

申请号：US15695668

申请日：2017-09-05

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Sascha DISCH , Guillaume FUCHS , Emmanuel RAVELLI , Christian NEUKAM , Konstantin SCHMIDT , Conrad BENNDORF , Andreas NIEDERMEIER , Benjamin SCHUBERT , Ralf GEIGER

IPC分类号： G10L19/008 , G10L19/032 , G10L19/13 , G10L19/18

CPC分类号： G10L19/008 , G10L19/02 , G10L19/032 , G10L19/04 , G10L19/13 , G10L19/18 , G10L21/038

摘要： Audio encoder for encoding a multichannel signal is shown. The audio encoder includes a downmixer for downmixing the multichannel signal to obtain a downmix signal, a linear prediction domain core encoder for encoding the downmix signal, wherein the downmix signal has a low band and a high band, wherein the linear prediction domain core encoder is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank for generating a spectral representation of the multichannel signal, and a joint multichannel encoder configured to process the spectral representation including the low band and the high band of the multichannel signal to generate multichannel information.

8.

发明授权
Method of generating residual signal, and encoder and decoder performing the method 有权

公开(公告)号：US11978465B2

公开(公告)日：2024-05-07

申请号：US17507746

申请日：2021-10-21

申请人： ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

发明人： Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Woo-taek Lim , Inseon Jang

IPC分类号： G10L19/13 , G10L19/032 , G10L19/06

CPC分类号： G10L19/13 , G10L19/032 , G10L19/06

摘要： A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a less information amount than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a less information amount than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.

9.

发明授权
Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal 有权

公开(公告)号：US11741973B2

公开(公告)日：2023-08-29

申请号：US17410033

申请日：2021-08-24

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

发明人： Sascha Disch , Guillaume Fuchs , Emmanuel Ravelli , Christian Neukam , Konstantin Schmidt , Conrad Benndorf , Andreas Niedermeier , Benjamin Schubert , Ralf Geiger

IPC分类号： G10L19/008 , G10L19/02 , G10L19/04 , G10L19/18 , G10L21/038 , G10L19/032 , G10L19/13

CPC分类号： G10L19/008 , G10L19/02 , G10L19/032 , G10L19/04 , G10L19/13 , G10L19/18 , G10L21/038

摘要： A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.

10.

发明授权
Artificial intelligence based audio coding 有权

公开(公告)号：US11437050B2

公开(公告)日：2022-09-06

申请号：US16709873

申请日：2019-12-10

申请人： QUALCOMM Incorporated

发明人： Zisis Iason Skordilis , Vivek Rajendran , Guillaume Konrad Sautière , Daniel Jared Sinder

IPC分类号： G10L19/13 , G06N3/08 , G10L25/30 , G10L19/09 , G10L19/12 , G10L19/08 , G06N3/04

摘要： Techniques are described for coding audio signals. For example, using a neural network, a residual signal is generated for a sample of an audio signal based on inputs to the neural network. The residual signal is configured to excite a long-term prediction filter and/or a short-term prediction filter. Using the long-term prediction filter and/or the short-term prediction filter, a sample of a reconstructed audio signal is determined. The sample of the reconstructed audio signal is determined based on the residual signal generated using the neural network for the sample of the audio signal.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类