-
公开(公告)号:US11456001B2
公开(公告)日:2022-09-27
申请号:US16814103
申请日:2020-03-10
申请人: Electronics and Telecommunications Research Institute , Kwangwoon University Industry-Academic Collaboration Foundation
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hochong Park
IPC分类号: G10L19/02 , G06N3/04 , G10L21/038 , G10L19/032
摘要: Disclosed are a method of encoding a high band of an audio, a method of decoding a high band of an audio, and an encoder and a decoder for performing the methods. The method of decoding a high band of an audio, the method performed by a decoder, includes identifying a parameter extracted through a first neural network, identifying side information extracted through a second neural network, and restoring a high band of an audio by applying the parameter and the side information to a third neural network.
-
公开(公告)号:US11508386B2
公开(公告)日:2022-11-22
申请号:US16843649
申请日:2020-04-08
申请人: Electronics and Telecommunications Research Institute , Kwangwoon University Industry-Academic Collaboration Foundation
发明人: Hochong Park , Seung Kwon Beack , Jongmo Sung , Seong-Hyeon Shin , Mi Suk Lee , Tae Jin Lee , Jin Soo Choi
摘要: An inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. An inventive concept transmits a part of frequency spectral coefficients generated in transform coding to a decoder and the decoder recovers the frequency spectral coefficient not transmitted. Furthermore, the signs of frequency spectral coefficient are transmitted from an encoder to the decoder depending on a sign transmission rule.
-
公开(公告)号:US20180144757A1
公开(公告)日:2018-05-24
申请号:US15820852
申请日:2017-11-22
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Young Ho Jeong , Tae Jin Lee , Sang Won Suh
CPC分类号: G10L19/167 , G06F17/2252
摘要: Disclosed is a bitstream generation method performed by an acoustic data transmission (ADT) encoder, the method including receiving a first audio signal, receiving additional information converted into a bitstream, and transmitting a second audio signal obtained by inserting the bitstream into the first audio signal, to an ADT decoder.
-
公开(公告)号:US09407871B2
公开(公告)日:2016-08-02
申请号:US14625962
申请日:2015-02-19
发明人: Mi Suk Lee , In Ki Hwang
CPC分类号: H04N7/15 , G06F3/013 , G06F3/017 , G06F3/0304 , G06K9/00335 , G06K9/00597 , G06T3/0093 , H04N7/144 , H04N7/147
摘要: Disclosed are an apparatus and a method of controlling an eye-to-eye contact function, which provide a natural eye-to-eye contact by controlling an eye-to-eye contact function based on gaze information about a local participant and position information about a remote participant on a screen when providing the eye-to-eye contact function by using an image combination method and the like in a teleconference system, thereby improving absorption to a teleconference.
摘要翻译: 公开了一种控制眼睛接触功能的装置和方法,其通过基于关于本地参与者的凝视信息和关于本地参与者的位置信息来控制眼睛接触功能来提供自然的眼睛接触接触 通过在电话会议系统中使用图像组合方法等来提供眼睛接触功能,从而提高对电话会议的吸收的屏幕上的远程参与者。
-
公开(公告)号:US11508385B2
公开(公告)日:2022-11-22
申请号:US16686859
申请日:2019-11-18
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC分类号: G06N3/04 , G06N3/08 , G10L19/032 , G10L19/02
摘要: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
-
公开(公告)号:US11488613B2
公开(公告)日:2022-11-01
申请号:US17098090
申请日:2020-11-13
发明人: Minje Kim , Kai Zhen , Mi Suk Lee , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Jin Soo Choi
IPC分类号: G10L19/08 , G10L19/032 , G10L19/26 , G06N3/08 , G10L25/30 , G10L13/02 , G10L21/0208
摘要: Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; Determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; Performing LPC synthesis using the coded LPC coefficients and the predicted LPC residual signal; It may include the step of determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
-
7.
公开(公告)号:US20220020385A1
公开(公告)日:2022-01-20
申请号:US17377157
申请日:2021-07-15
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang , Jin Soo Choi
IPC分类号: G10L19/06 , G10L19/032
摘要: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block using frequency-domain linear predictive coding (LPC), generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
-
公开(公告)号:US20210005208A1
公开(公告)日:2021-01-07
申请号:US16686859
申请日:2019-11-18
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Hui Yong Kim
IPC分类号: G10L19/02 , G10L19/032 , G06N3/08 , G06N3/04
摘要: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
-
9.
公开(公告)号:US11581000B2
公开(公告)日:2023-02-14
申请号:US17105835
申请日:2020-11-27
发明人: Woo-Taek Lim , Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee
IPC分类号: G10L19/00 , G10L25/30 , G10L19/16 , G06N3/08 , G10L19/038
摘要: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
-
10.
公开(公告)号:US11580999B2
公开(公告)日:2023-02-14
申请号:US17331416
申请日:2021-05-26
发明人: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang
IPC分类号: G10L19/022 , G10L19/06 , G10L19/16 , G10L19/035
摘要: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
-
-
-
-
-
-
-
-
-