-
公开(公告)号:US20110119067A1
公开(公告)日:2011-05-19
申请号:US13054343
申请日:2009-07-14
申请人: Seung Kwon Beack , Tae Jin Lee , Minje Kim , Dae Young Jang , Kyeongok Kang , Jeongil Seo , Jin Woo Hong , Hochong Park , Young-Cheol Park
发明人: Seung Kwon Beack , Tae Jin Lee , Minje Kim , Dae Young Jang , Kyeongok Kang , Jeongil Seo , Jin Woo Hong , Hochong Park , Young-Cheol Park
IPC分类号: G10L19/00
CPC分类号: G10L19/22 , G10L19/0212 , G10L19/04 , G10L19/20
摘要: A module capable of appropriately selecting a linear predictive coding (LPC)-based or a code excitation linear prediction (CELP)-based speech or audio encoder and a transform-based audio encoder according to a feature of an input signal is a module that performs as a bridge for overcoming a performance barrier between a conventional LPC-based encoder and an audio encoder. Also, an integral audio encoder that provides consistent audio quality regardless of a type of the input audio signal can be designed based on the module.
摘要翻译: 能够根据输入信号的特征适当地选择基于线性预测编码(LPC)的或基于代码激励线性预测(CELP)的语音或音频编码器和基于变换的音频编码器的模块是执行 作为克服传统的基于LPC的编码器和音频编码器之间的性能障碍的桥梁。 此外,可以基于该模块来设计不考虑输入音频信号的类型而提供一致的音频质量的整体音频编码器。
-
2.
公开(公告)号:US08959015B2
公开(公告)日:2015-02-17
申请号:US13054377
申请日:2009-07-14
申请人: Tae Jin Lee , Seung Kwon Beack , Minje Kim , Dae Young Jang , Kyeongok Kang , Jin Woo Hong , Hochong Park , Young-Cheol Park
发明人: Tae Jin Lee , Seung Kwon Beack , Minje Kim , Dae Young Jang , Kyeongok Kang , Jin Woo Hong , Hochong Park , Young-Cheol Park
CPC分类号: G10L19/20 , G10L19/0212 , G10L19/12
摘要: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.
摘要翻译: 提供了一种用于对语音信号和音频信号进行整体编码和解码的装置。 用于对语音信号和音频信号进行整体编码的编码装置可以包括:模块选择单元,用于分析输入信号的特性,并选择用于编码输入信号的第一帧的第一编码模块; 语音编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成语音比特流; 音频编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成音频位流; 以及比特流生成单元,用于根据模块选择单元的选择从语音编码单元或音频编码单元生成输出比特流。
-
公开(公告)号:US20110119054A1
公开(公告)日:2011-05-19
申请号:US13054377
申请日:2009-07-14
申请人: Tae Jin Lee , Seung Kwon Beack , Minje Kim , Dae Young Jang , Kyeongok Kang , Jin Woo Hong , Hochong Park , Young-Cheol Park
发明人: Tae Jin Lee , Seung Kwon Beack , Minje Kim , Dae Young Jang , Kyeongok Kang , Jin Woo Hong , Hochong Park , Young-Cheol Park
CPC分类号: G10L19/20 , G10L19/0212 , G10L19/12
摘要: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.
摘要翻译: 提供了一种用于对语音信号和音频信号进行整体编码和解码的装置。 用于对语音信号和音频信号进行整体编码的编码装置可以包括:模块选择单元,用于分析输入信号的特性,并选择用于编码输入信号的第一帧的第一编码模块; 语音编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成语音比特流; 音频编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成音频位流; 以及比特流生成单元,用于根据模块选择单元的选择从语音编码单元或音频编码单元生成输出比特流。
-
4.
公开(公告)号:US20200349965A1
公开(公告)日:2020-11-05
申请号:US16865111
申请日:2020-05-01
申请人: Francesco Nesta , Minje Kim , Sanna Wager
发明人: Francesco Nesta , Minje Kim , Sanna Wager
IPC分类号: G10L21/0264 , G06N3/08 , G10L21/0216
摘要: Systems and methods for generating an enhanced audio signal comprise a trained neural network configured to receive an input audio signal and generate an enhanced target signal, the trained neural network comprising a pre-processing neural network configured to receive a segment of the input audio signal and output an audio classification, the pre-processing neural network including at least one hidden layer comprising an embedding vector, and a noise reduction neural network configured to receive the segment of the input audio signal, and the embedding vector and generate the enhanced target signal. The pre-processing neural network may comprise a target signal pre-processing neural network configured to output a target signal classification and comprising at least one hidden layer comprising a target embedding vector. The pre-processing neural network may comprise a noise pre-processing neural network configured output a noise classification and comprising at least one hidden layer comprising a noise embedding vector.
-
-
-