-
公开(公告)号:US20160232902A1
公开(公告)日:2016-08-11
申请号:US15131623
申请日:2016-04-18
Inventor: Yong Ju LEE , Jeong Il SEO , Jae Hyoun YOO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Kyeong Ok KANG , Jin Woong KIM , Tae Jin PARK , Dae Young JANG , Keun Woo CHOI
IPC: G10L19/008 , H04S7/00
CPC classification number: G10L19/008 , H04S7/00 , H04S7/30 , H04S2400/01 , H04S2400/03
Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
-
公开(公告)号:US20240420712A1
公开(公告)日:2024-12-19
申请号:US18732758
申请日:2024-06-04
Inventor: Byeongho CHO , Seung Kwon BEACK , Jung Won KANG , Soo Young PARK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/028 , G10L19/02 , G10L19/035 , G10L19/06
Abstract: A method of encoding/decoding an audio signal and a device for performing the same are provided. The method of encoding an audio signal includes generating, based on the audio signal, a linear prediction coding (LPC) bitstream and a frequency-domain signal of the audio signal, generating, based on the LPC bitstream and the frequency-domain signal, a first residual signal including information on a frequency envelope of the frequency-domain signal, and outputting a second residual signal by processing a first residual signal through one of a plurality of signal processing paths.
-
13.
公开(公告)号:US20240153513A1
公开(公告)日:2024-05-09
申请号:US18502648
申请日:2023-11-06
Inventor: Byeong Ho CHO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Woo Taek LIM , In Seon JANG
IPC: G10L19/035
CPC classification number: G10L19/035
Abstract: A complex number quantization-based audio signal encoding method may comprise: estimating a scale factor for each subband of an input audio signal; performing complex magnitude scaling for each subband based on the scale factor; and performing polar quantization on a complex frequency coefficient for each subband, wherein the performing the polar quantization for each subband comprises applying two or more different magnitude quantization techniques based on the magnitude of the complex frequency coefficient scaled for each subband.
-
公开(公告)号:US20240119949A1
公开(公告)日:2024-04-11
申请号:US18525181
申请日:2023-11-30
Inventor: Jeong Il SEO , Seung Kwon BEACK , Dae Young JANG , Kyeong Ok KANG , Tae Jin PARK , Yong Ju LEE , Keun Woo CHOI , Jin Woong KIM
IPC: G10L19/008
CPC classification number: G10L19/008 , H04S3/00
Abstract: An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
-
15.
公开(公告)号:US20230274141A1
公开(公告)日:2023-08-31
申请号:US18166407
申请日:2023-02-08
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , YONSEI UNIVERSITY WONJU INDUSTRY-ACADEMIC COOPERATION FOUNDATION
Inventor: Jongmo SUNG , Seung Kwon BEACK , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO , Young Cheol PARK , Joon BYUN , Seungmin SHIN
IPC: G06N3/08 , G10L19/038 , G10L25/30 , G10L19/028 , G10L25/69 , G10L25/60
CPC classification number: G06N3/08 , G10L19/038 , G10L25/30 , G10L19/028 , G10L25/69 , G10L25/60
Abstract: Provided is a method and apparatus for designing and testing an audio codec using quantization based on white noise modeling. A neural network-based audio encoder design method includes generating a quantized latent vector and a reconstructed signal corresponding to an input signal by using a white noise modeling-based quantization process, computing a total loss for training a neural network-based audio codec, based on the input signal, the reconstruction signal, and the quantized latent vector, training the neural network-based audio codec by using the total loss, and validating the trained neural network-based audio codec to select the best neural network-based audio codec.
-
公开(公告)号:US20230048402A1
公开(公告)日:2023-02-16
申请号:US17884364
申请日:2022-08-09
Inventor: Jongmo SUNG , Seung Kwon BEACK , Tae Jin LEE , Woo-taek KIM , Inseon JANG
Abstract: Provided is an encoding method according to various example embodiments and an encoder performing the method. The encoding method includes outputting a linear prediction(LP) coefficients bitstream and a residual signal by performing a linear prediction analysis on an input signal, outputting a first latent signal obtained by encoding a periodic component of the residual signal, using a first neural network module, outputting a first bitstream obtained by quantizing the first latent signal, using a quantization module, outputting a second latent signal obtained by encoding an aperiodic component of the residual signal, using the first neural network module, and outputting a second bitstream obtained by quantizing the second latent signal, using the quantization module, wherein the aperiodic component of the residual signal is calculated based on a periodic component of the residual signal decoded from the quantized first latent signal output by de-quantizing the first bitstream.
-
公开(公告)号:US20220262378A1
公开(公告)日:2022-08-18
申请号:US17672041
申请日:2022-02-15
Inventor: Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Inseon JANG
Abstract: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficients and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.
-
公开(公告)号:US20220238126A1
公开(公告)日:2022-07-28
申请号:US17570489
申请日:2022-01-07
Inventor: Jongmo SUNG , Seung Kwon BEACK , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032 , G10L19/008 , G10L25/90 , G10L25/30
Abstract: Methods of encoding and decoding an audio signal using a learning model and an encoder and a decoder for performing the methods are disclosed. A method of encoding an audio signal using a learning model may include extracting pitch information of the audio signal, determining a dilation factor of a receptive field of a first expandable neural network block to extract a feature map from the audio signal based on the pitch information, generating a first feature map of the audio signal using the first expandable neural network block in which the dilation factor is determined, determining a second feature map by inputting the first feature map into a second expandable neural network block to process the first feature map, and converting the second feature map and the pitch information into a bitstream.
-
公开(公告)号:US20220157326A1
公开(公告)日:2022-05-19
申请号:US17507746
申请日:2021-10-21
Inventor: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/13 , G10L19/032 , G10L19/06
Abstract: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a less information amount than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a less information amount than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
-
公开(公告)号:US20210166701A1
公开(公告)日:2021-06-03
申请号:US17104400
申请日:2020-11-25
Inventor: Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE
IPC: G10L19/002
Abstract: An audio signal encoding/decoding device and method using a filter bank is disclosed. The audio signal encoding method includes generating a plurality of first audio signals by performing filtering on an input audio signal using an analysis filter bank, generating a plurality of second audio signals by performing downsampling on the first audio signals, and outputting a bitstream by encoding and quantizing the second audio signals.
-
-
-
-
-
-
-
-
-