-
公开(公告)号:US20180144757A1
公开(公告)日:2018-05-24
申请号:US15820852
申请日:2017-11-22
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Young Ho Jeong , Tae Jin Lee , Sang Won Suh
CPC分类号: G10L19/167 , G06F17/2252
摘要: Disclosed is a bitstream generation method performed by an acoustic data transmission (ADT) encoder, the method including receiving a first audio signal, receiving additional information converted into a bitstream, and transmitting a second audio signal obtained by inserting the bitstream into the first audio signal, to an ADT decoder.
-
公开(公告)号:US11508386B2
公开(公告)日:2022-11-22
申请号:US16843649
申请日:2020-04-08
申请人: Electronics and Telecommunications Research Institute , Kwangwoon University Industry-Academic Collaboration Foundation
发明人: Hochong Park , Seung Kwon Beack , Jongmo Sung , Seong-Hyeon Shin , Mi Suk Lee , Tae Jin Lee , Jin Soo Choi
摘要: An inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. An inventive concept transmits a part of frequency spectral coefficients generated in transform coding to a decoder and the decoder recovers the frequency spectral coefficient not transmitted. Furthermore, the signs of frequency spectral coefficient are transmitted from an encoder to the decoder depending on a sign transmission rule.
-
公开(公告)号:US11508385B2
公开(公告)日:2022-11-22
申请号:US16686859
申请日:2019-11-18
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC分类号: G06N3/04 , G06N3/08 , G10L19/032 , G10L19/02
摘要: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
-
公开(公告)号:US11488613B2
公开(公告)日:2022-11-01
申请号:US17098090
申请日:2020-11-13
发明人: Minje Kim , Kai Zhen , Mi Suk Lee , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Jin Soo Choi
IPC分类号: G10L19/08 , G10L19/032 , G10L19/26 , G06N3/08 , G10L25/30 , G10L13/02 , G10L21/0208
摘要: Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; Determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; Performing LPC synthesis using the coded LPC coefficients and the predicted LPC residual signal; It may include the step of determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
-
公开(公告)号:US11416742B2
公开(公告)日:2022-08-16
申请号:US16122708
申请日:2018-09-05
发明人: Jongmo Sung , Minje Kim , Aswin Sivaraman , Kai Zhen
IPC分类号: G06N3/08 , G10L19/008 , G10L19/032 , G10L25/30 , G10L25/69
摘要: Provided is a training method of a neural network that is applied to an audio signal encoding method using an audio signal encoding apparatus, the training method including generating a masking threshold of a first audio signal before training is performed, calculating a weight matrix to be applied to a frequency component of the first audio signal based on the masking threshold, generating a weighted error function obtained by correcting a preset error function using the weight matrix, and generating a second audio signal by applying a parameter learned using the weighted error function to the first audio signal.
-
6.
公开(公告)号:US20220020385A1
公开(公告)日:2022-01-20
申请号:US17377157
申请日:2021-07-15
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang , Jin Soo Choi
IPC分类号: G10L19/06 , G10L19/032
摘要: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block using frequency-domain linear predictive coding (LPC), generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
-
公开(公告)号:US20210005208A1
公开(公告)日:2021-01-07
申请号:US16686859
申请日:2019-11-18
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC分类号: G10L19/02 , G10L19/032 , G06N3/08 , G06N3/04
摘要: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
-
8.
公开(公告)号:US11581000B2
公开(公告)日:2023-02-14
申请号:US17105835
申请日:2020-11-27
发明人: Woo-Taek Lim , Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee
IPC分类号: G10L19/00 , G10L25/30 , G10L19/16 , G06N3/08 , G10L19/038
摘要: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
-
公开(公告)号:US11580999B2
公开(公告)日:2023-02-14
申请号:US17331416
申请日:2021-05-26
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang
IPC分类号: G10L19/022 , G10L19/06 , G10L19/16 , G10L19/035
摘要: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
-
10.
公开(公告)号:US11276413B2
公开(公告)日:2022-03-15
申请号:US16543095
申请日:2019-08-16
发明人: Mi Suk Lee , Jongmo Sung , Minje Kim , Kai Zhen
摘要: Disclosed are an audio signal encoding method and audio signal decoding method, and an encoder and decoder performing the same. The audio signal encoding method includes applying an audio signal to a training model including N autoencoders provided in a cascade structure, encoding an output result derived through the training model, and generating a bitstream with respect to the audio signal based on the encoded output result.
-
-
-
-
-
-
-
-
-