-
公开(公告)号:US11881227B2
公开(公告)日:2024-01-23
申请号:US18097054
申请日:2023-01-13
申请人: Electronics and Telecommunications Research Institute , Industry-Academic Cooperation Foundation, Yonsei University
发明人: In Seon Jang , Seung Kwon Beack , Jong Mo Sung , Tae Jin Lee , Woo Taek Lim , Byeong Ho Cho , Hong Goo Kang , Ji Hyun Lee , Chan Woo Lee , Hyung Seob Lim
IPC分类号: G10L19/04 , G10L19/038 , G10L19/002 , G10L19/12 , G10L19/08 , G10L19/09 , G10L19/06 , G10L19/13 , G10L19/02
CPC分类号: G10L19/038 , G10L19/002 , G10L19/0204 , G10L19/04 , G10L19/06 , G10L19/08 , G10L19/09 , G10L19/12 , G10L19/13
摘要: A method, executed by a processor for compressing an audio signal in multiple layers, may comprise: (a) restoring, in a highest layer, an input audio signal as a first signal; (b) restoring, in at least one intermediate layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in the highest layer or an immediately previous intermediate layer, from the input audio signal as a second signal; and (c) restoring, in a lowest layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in an intermediate layer immediately before the lowest layer, from the input audio signal as a third signal, wherein the first signal, the second signal, and the third signal are combined to output a final restoration audio signal.
-
公开(公告)号:US20220093112A1
公开(公告)日:2022-03-24
申请号:US17410033
申请日:2021-08-24
发明人: Sascha DISCH , Guillaume FUCHS , Emmanuel RAVELLI , Christian NEUKAM , Konstantin SCHMIDT , Conrad BENNDORF , Andreas NIEDERMEIER , Benjamin SCHUBERT , Ralf GEIGER
IPC分类号: G10L19/008 , G10L19/02 , G10L19/04 , G10L19/18 , G10L21/038 , G10L19/032 , G10L19/13
摘要: A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.
-
公开(公告)号:US11107483B2
公开(公告)日:2021-08-31
申请号:US17008428
申请日:2020-08-31
发明人: Sascha Disch , Guillaume Fuchs , Emmanuel Ravelli , Christian Neukam , Konstantin Schmidt , Conrad Benndorf , Andreas Niedermeier , Benjamin Schubert , Ralf Geiger
IPC分类号: G10L19/008 , G10L19/02 , G10L19/04 , G10L19/18 , G10L21/038 , G10L19/032 , G10L19/13
摘要: A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.
-
公开(公告)号:US20220262378A1
公开(公告)日:2022-08-18
申请号:US17672041
申请日:2022-02-15
发明人: Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Inseon JANG
摘要: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficients and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.
-
公开(公告)号:US20220157326A1
公开(公告)日:2022-05-19
申请号:US17507746
申请日:2021-10-21
发明人: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC分类号: G10L19/13 , G10L19/032 , G10L19/06
摘要: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a less information amount than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a less information amount than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
-
公开(公告)号:US10388287B2
公开(公告)日:2019-08-20
申请号:US15695668
申请日:2017-09-05
发明人: Sascha Disch , Guillaume Fuchs , Emmanuel Ravelli , Christian Neukam , Konstantin Schmidt , Conrad Benndorf , Andreas Niedermeier , Benjamin Schubert , Ralf Geiger
IPC分类号: G10L19/02 , G10L19/04 , G10L19/13 , G10L19/18 , G10L19/008 , G10L19/032 , G10L21/038
摘要: Audio encoder for encoding a multichannel signal is shown. The audio encoder includes a downmixer for downmixing the multichannel signal to obtain a downmix signal, a linear prediction domain core encoder for encoding the downmix signal, wherein the downmix signal has a low band and a high band, wherein the linear prediction domain core encoder is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank for generating a spectral representation of the multichannel signal, and a joint multichannel encoder configured to process the spectral representation including the low band and the high band of the multichannel signal to generate multichannel information.
-
7.
公开(公告)号:US20170365264A1
公开(公告)日:2017-12-21
申请号:US15695668
申请日:2017-09-05
发明人: Sascha DISCH , Guillaume FUCHS , Emmanuel RAVELLI , Christian NEUKAM , Konstantin SCHMIDT , Conrad BENNDORF , Andreas NIEDERMEIER , Benjamin SCHUBERT , Ralf GEIGER
IPC分类号: G10L19/008 , G10L19/032 , G10L19/13 , G10L19/18
CPC分类号: G10L19/008 , G10L19/02 , G10L19/032 , G10L19/04 , G10L19/13 , G10L19/18 , G10L21/038
摘要: Audio encoder for encoding a multichannel signal is shown. The audio encoder includes a downmixer for downmixing the multichannel signal to obtain a downmix signal, a linear prediction domain core encoder for encoding the downmix signal, wherein the downmix signal has a low band and a high band, wherein the linear prediction domain core encoder is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank for generating a spectral representation of the multichannel signal, and a joint multichannel encoder configured to process the spectral representation including the low band and the high band of the multichannel signal to generate multichannel information.
-
公开(公告)号:US11978465B2
公开(公告)日:2024-05-07
申请号:US17507746
申请日:2021-10-21
发明人: Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Woo-taek Lim , Inseon Jang
IPC分类号: G10L19/13 , G10L19/032 , G10L19/06
CPC分类号: G10L19/13 , G10L19/032 , G10L19/06
摘要: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a less information amount than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a less information amount than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
-
公开(公告)号:US11741973B2
公开(公告)日:2023-08-29
申请号:US17410033
申请日:2021-08-24
发明人: Sascha Disch , Guillaume Fuchs , Emmanuel Ravelli , Christian Neukam , Konstantin Schmidt , Conrad Benndorf , Andreas Niedermeier , Benjamin Schubert , Ralf Geiger
IPC分类号: G10L19/008 , G10L19/02 , G10L19/04 , G10L19/18 , G10L21/038 , G10L19/032 , G10L19/13
CPC分类号: G10L19/008 , G10L19/02 , G10L19/032 , G10L19/04 , G10L19/13 , G10L19/18 , G10L21/038
摘要: A schematic block diagram of an audio encoder for encoding a multichannel audio signal is shown. The audio encoder includes a linear prediction domain encoder, a frequency domain encoder, and a controller for switching between the linear prediction domain encoder and the frequency domain encoder. The controller is configured such that a portion of the multichannel signal is represented either by an encoded frame of the linear prediction domain encoder or by an encoded frame of the frequency domain encoder. The linear prediction domain encoder includes a downmixer for downmixing the multichannel signal to obtain a downmixed signal. The linear prediction domain encoder further includes a linear prediction domain core encoder for encoding the downmix signal and furthermore, the linear prediction domain encoder includes a first joint multichannel encoder for generating first multichannel information from the multichannel signal.
-
公开(公告)号:US11437050B2
公开(公告)日:2022-09-06
申请号:US16709873
申请日:2019-12-10
摘要: Techniques are described for coding audio signals. For example, using a neural network, a residual signal is generated for a sample of an audio signal based on inputs to the neural network. The residual signal is configured to excite a long-term prediction filter and/or a short-term prediction filter. Using the long-term prediction filter and/or the short-term prediction filter, a sample of a reconstructed audio signal is determined. The sample of the reconstructed audio signal is determined based on the residual signal generated using the neural network for the sample of the audio signal.
-
-
-
-
-
-
-
-
-