-
61.
Publication No.: US12223426B2
Publication date: 2025-02-11
Application No.: US18166407
Filing date: 2023-02-08
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , YONSEI UNIVERSITY WONJU INDUSTRY-ACADEMIC COOPERATION FOUNDATION
Inventor: Jongmo Sung , Seung Kwon Beack , Tae Jin Lee , Woo-taek Lim , Inseon Jang , Byeongho Cho , Young Cheol Park , Joon Byun , Seungmin Shin
IPC: G10L19/00 , G06N3/08 , G10L19/028 , G10L19/038 , G10L25/30 , G10L25/60 , G10L25/69 , G06N3/084 , G10L15/00 , G10L19/22
Abstract: Provided are a method and an apparatus for designing and testing an audio codec using quantization based on white noise modeling. A neural network-based audio encoder design method includes generating a quantized latent vector and a reconstructed signal corresponding to an input signal by using a white noise modeling-based quantization process, computing a total loss for training a neural network-based audio codec, based on the input signal, the reconstructed signal, and the quantized latent vector, training the neural network-based audio codec by using the total loss, and validating the trained neural network-based audio codec to select the best neural network-based audio codec.
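The quantization step can be illustrated with a minimal sketch. It assumes the common practice of replacing rounding with additive uniform noise in [-0.5, 0.5] during training, which the abstract does not confirm; the function names `quantize_train` and `quantize_infer` are hypothetical.

```python
import random

def quantize_train(latent, rng):
    """Training-time surrogate (assumed): add white uniform noise instead of rounding."""
    return [z + rng.uniform(-0.5, 0.5) for z in latent]

def quantize_infer(latent):
    """Inference-time hard quantization: round each value to the nearest integer."""
    return [float(round(z)) for z in latent]

rng = random.Random(0)          # fixed seed so the sketch is repeatable
latent = [0.2, 1.7, -2.4]
noisy = quantize_train(latent, rng)
hard = quantize_infer(latent)
```

The noisy version keeps the mapping differentiable, so a loss computed on the quantized latent vector can still drive gradient-based training.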
-
Publication No.: US11887612B2
Publication date: 2024-01-30
Application No.: US17895233
Filing date: 2022-08-25
Inventor: Seung Kwon Beack , Tae Jin Lee , Min Je Kim , Kyeongok Kang , Dae Young Jang , Jin Woo Hong , Jeongil Seo , Chieteuk Ahn , Hochong Park , Young-Cheol Park
IPC: G10L19/087 , G10L19/125 , G10L19/22 , G10L19/26
CPC classification number: G10L19/087 , G10L19/125 , G10L19/22 , G10L19/26
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT-based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encodes the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
Publication No.: US11871204B2
Publication date: 2024-01-09
Application No.: US17877696
Filing date: 2022-07-29
Inventor: Yong Ju Lee , Jeong Il Seo , Seung Kwon Beack , Kyeong Ok Kang , Jin Woong Kim , Jae Hyoun Yoo
IPC: H04S3/00 , G10L19/008
CPC classification number: H04S3/008 , G10L19/008 , H04S2400/01
Abstract: Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
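The two steps in this abstract can be sketched as follows. This is an illustration only, assuming a fixed N-by-M down-mix matrix and one pair of head-related impulse responses per down-mixed channel, all of equal length; the patent may use different filters and weights, and every name and value here is hypothetical.

```python
def downmix(channels, matrix):
    """Mix M input channels into N outputs; matrix has N rows and M columns."""
    n_samples = len(channels[0])
    return [
        [sum(row[m] * channels[m][t] for m in range(len(channels)))
         for t in range(n_samples)]
        for row in matrix
    ]

def convolve(signal, kernel):
    """Direct-form linear convolution."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out

def binaural_render(channels, hrirs):
    """hrirs[n] = (left_ir, right_ir); sum the filtered channels for each ear."""
    length = len(channels[0]) + len(hrirs[0][0]) - 1
    left, right = [0.0] * length, [0.0] * length
    for ch, (h_left, h_right) in zip(channels, hrirs):
        for t, v in enumerate(convolve(ch, h_left)):
            left[t] += v
        for t, v in enumerate(convolve(ch, h_right)):
            right[t] += v
    return left, right

# M = 3 input channels down-mixed to N = 2, then rendered to stereo.
m_channels = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
mix_matrix = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]
n_channels = downmix(m_channels, mix_matrix)
toy_hrirs = [([1.0, 0.0], [0.5, 0.0]), ([0.5, 0.0], [1.0, 0.0])]
left, right = binaural_render(n_channels, toy_hrirs)
```

Down-mixing first means only N channels, rather than M, have to be convolved with binaural filters.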
-
Publication No.: US11694703B2
Publication date: 2023-07-04
Application No.: US17672041
Filing date: 2022-02-15
Inventor: Woo-taek Lim , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Inseon Jang
Abstract: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficient and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.
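Generating an audio signal from a residual and linear prediction coefficients is conventionally done with an all-pole synthesis filter, and the residual is recovered with the matching inverse filter. The sketch below assumes the sign convention A(z) = 1 + a1 z^-1 + ..., which the abstract does not state; it omits the trained learning model and the estimation of coefficients, and the function names are hypothetical.

```python
def lpc_synthesis(residual, lpc):
    """All-pole synthesis: s[n] = e[n] - sum_k a[k] * s[n-k], with lpc = [a1, a2, ...]."""
    out = []
    for n, e in enumerate(residual):
        acc = e
        for k, a in enumerate(lpc, start=1):
            if n - k >= 0:
                acc -= a * out[n - k]
        out.append(acc)
    return out

def lpc_residual(signal, lpc):
    """Inverse (analysis) filter: e[n] = s[n] + sum_k a[k] * s[n-k]."""
    out = []
    for n, s in enumerate(signal):
        acc = s
        for k, a in enumerate(lpc, start=1):
            if n - k >= 0:
                acc += a * signal[n - k]
        out.append(acc)
    return out

coeffs = [-0.5]                      # toy first-order predictor
audio = lpc_synthesis([1.0, 0.0, 0.0, 0.0], coeffs)
recovered = lpc_residual(audio, coeffs)
```

Filtering with the same coefficients in both directions returns the original residual, which is why the decoder can re-derive a residual from its first audio signal and then resynthesize it with refined coefficients.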
-
65.
Publication No.: US11651778B2
Publication date: 2023-05-16
Application No.: US17520895
Filing date: 2021-11-08
Inventor: Woo-taek Lim , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Inseon Jang , Jong-Won Seok , Yunsu Kim
IPC: G10L19/16 , G10L19/02 , G10L25/30 , G10L19/038
CPC classification number: G10L19/038 , G10L19/02 , G10L19/167 , G10L25/30
Abstract: Disclosed are methods of encoding and decoding an audio signal, and an encoder and a decoder for performing the methods. The method of encoding an audio signal includes identifying an input signal corresponding to a low frequency band of the audio signal, windowing the input signal, generating a first latent vector by inputting the windowed input signal to a first encoding model, transforming the windowed input signal into a frequency domain, generating a second latent vector by inputting the transformed input signal to a second encoding model, generating a final latent vector by combining the first latent vector and the second latent vector, and generating a bitstream corresponding to the final latent vector.
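The encoding flow above can be sketched as a two-path pipeline. This is only an outline under several assumptions: a Hann window, a DFT magnitude as the frequency-domain transform, and concatenation as the way the two latent vectors are combined. The abstract specifies none of these, and `toy_encoder` is a hypothetical stand-in for the trained encoding models.

```python
import cmath
import math

def hann(n):
    """Periodic Hann window of length n (assumed window shape)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]

def dft_magnitude(frame):
    """Naive DFT magnitude for bins 0..n/2 (assumed frequency transform)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]

def encode_frame(frame, time_encoder, freq_encoder):
    windowed = [x * w for x, w in zip(frame, hann(len(frame)))]
    z_time = time_encoder(windowed)                  # first latent vector
    z_freq = freq_encoder(dft_magnitude(windowed))   # second latent vector
    return z_time + z_freq                           # combined by concatenation (assumed)

def toy_encoder(values):
    """Stand-in for a trained model: sums each half of its input."""
    half = len(values) // 2
    return [sum(values[:half]), sum(values[half:])]

latent = encode_frame([1.0] * 8, toy_encoder, toy_encoder)
```

In a real codec the final latent vector would then be quantized and packed into the bitstream.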
-
Publication No.: US11289105B2
Publication date: 2022-03-29
Application No.: US16447573
Filing date: 2019-06-20
Inventor: Jeong Il Seo , Seung Kwon Beack , Dae Young Jang , Kyeong Ok Kang , Tae Jin Park , Yong Ju Lee , Keun Woo Choi , Jin Woong Kim
IPC: G10L19/008 , H04S3/00
Abstract: An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
-
Publication No.: US10950248B2
Publication date: 2021-03-16
Application No.: US16841428
Filing date: 2020-04-06
Inventor: Yong Ju Lee , Jeong Il Seo , Jae Hyoun Yoo , Seung Kwon Beack , Jong Mo Sung , Tae Jin Lee , Kyeong Ok Kang , Jin Woong Kim , Tae Jin Park , Dae Young Jang , Keun Woo Choi
IPC: G10L19/008 , H04S7/00
Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal based on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
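One simple way to picture the extraction step is to split a binaural impulse response at a fixed sample index. This sketch assumes such a time-domain split; the abstract does not say how the components are separated, and the function name and the filter values are hypothetical.

```python
def split_binaural_filter(impulse_response, split_index):
    """Split an impulse response into an early part and a zero-padded late part."""
    early = impulse_response[:split_index]
    late = [0.0] * split_index + impulse_response[split_index:]
    return early, late

brir = [1.0, 0.6, 0.3, 0.1, 0.05]    # toy impulse response
early, late = split_binaural_filter(brir, 2)

# Padding the early part back to full length and adding the late part
# reproduces the original filter.
padded_early = early + [0.0] * (len(brir) - len(early))
recombined = [e + l for e, l in zip(padded_early, late)]
```

Because the early part is short, it is cheap to apply to every input channel, while the longer late part only needs to be applied once to the stereo result.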
-
Publication No.: US10552711B2
Publication date: 2020-02-04
Application No.: US16203668
Filing date: 2018-11-29
Inventor: Woo-taek Lim , Seung Kwon Beack
IPC: G06K9/62 , G06N3/02 , G06F16/683 , G10L19/008
Abstract: Disclosed is an apparatus and method for extracting a sound source from a multi-channel audio signal. A sound source extracting method includes transforming a multi-channel audio signal into two-dimensional (2D) data, extracting a plurality of feature maps by inputting the 2D data into a convolutional neural network (CNN) including at least one layer, and extracting a sound source from the multi-channel audio signal using the feature maps.
-
Publication No.: US10332526B2
Publication date: 2019-06-25
Application No.: US15652055
Filing date: 2017-07-17
Inventor: Seung Kwon Beack , Tae Jin Lee , Jong Mo Sung , Kyeong Ok Kang , Keun Woo Choi
IPC: G10L19/00 , G10L19/02 , G10L19/22 , G10L19/002 , G10L19/008
Abstract: An audio encoding apparatus to encode an audio signal using lossless coding or lossy coding and an audio decoding apparatus to decode an encoded audio signal are disclosed. An audio encoding apparatus according to an exemplary embodiment may include an input signal type determination unit to determine a type of an input signal based on characteristics of the input signal, a residual signal generation unit to generate a residual signal based on an output signal from the input signal type determination unit, and a coding unit to perform lossless coding or lossy coding using the residual signal.
-
70.
Publication No.: US10237673B2
Publication date: 2019-03-19
Application No.: US15871669
Filing date: 2018-01-15
Inventor: Seung Kwon Beack , Tae Jin Lee , Jong Mo Sung , Kyeong Ok Kang , Jeong Il Seo , Dae Young Jang , Yong Ju Lee , Jin Woong Kim
IPC: H04S3/00 , G10L19/008
Abstract: An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
-