Patent search ap:("ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE") AND inv:"Woo-taek Lim" Page 1

1.

Granted invention patent
Methods of encoding and decoding audio signal using neural network model, and devices for performing the methods [Status: in force]

Publication (announcement) number: US11862183B2

Publication (announcement) date: 2024-01-02

Application number: US17368390

Application date: 2021-07-06

Applicant: Electronics and Telecommunications Research Institute

Inventor: Jongmo Sung, Seung Kwon Beack, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang

IPC: G10L19/032

CPC classification number: G10L19/032

Abstract: An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
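The two-stage structure in this abstract (encode, decode, subtract, encode the residual) can be sketched as follows. This is only an illustration under stated assumptions: a coarse uniform quantizer stands in for the recurrent encoding/decoding models and a finer one for the nonrecurrent model, the step sizes are arbitrary, and the final bitstream conversion is left out. None of the names below come from the patent.

```python
# Sketch of two-stage residual coding. ASSUMPTION: plain uniform quantizers
# replace the neural network models named in the abstract.

def quantize(values, step):
    """Uniform scalar quantizer: map each value to an integer index."""
    return [round(v / step) for v in values]

def dequantize(indices, step):
    """Inverse of quantize: map each index back to a reconstruction value."""
    return [i * step for i in indices]

def encode_two_stage(signal, coarse_step=0.5, fine_step=0.05):
    # Stage 1: first feature information of the input signal.
    first_features = quantize(signal, coarse_step)
    # Decode stage 1 inside the encoder to get the output signal.
    first_output = dequantize(first_features, coarse_step)
    # Residual = input signal minus stage-1 output signal.
    residual = [x - y for x, y in zip(signal, first_output)]
    # Stage 2: second feature information of the residual signal.
    second_features = quantize(residual, fine_step)
    return first_features, second_features

def decode_two_stage(first_features, second_features,
                     coarse_step=0.5, fine_step=0.05):
    first_output = dequantize(first_features, coarse_step)
    residual = dequantize(second_features, fine_step)
    return [a + b for a, b in zip(first_output, residual)]

def max_abs_error(reference, estimate):
    return max(abs(r - e) for r, e in zip(reference, estimate))

signal = [0.12, -0.37, 0.81, 0.44, -0.93]
f1, f2 = encode_two_stage(signal)
decoded = decode_two_stage(f1, f2)
coarse_only = dequantize(f1, 0.5)
```

With these stand-ins, `max_abs_error(signal, decoded)` is bounded by half the fine step, while `coarse_only` is bounded only by half the coarse step, which is the point of coding the residual separately.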

2.

Granted invention patent
Methods of encoding and decoding speech signal using neural network model recognizing sound sources, and encoding and decoding apparatuses for performing the same [Status: in force]

Publication (announcement) number: US11664037B2

Publication (announcement) date: 2023-05-30

Application number: US17326035

Application date: 2021-05-20

Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University

Inventor: Woo-taek Lim, Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Inseon Jang, Minje Kim, Haici Yang

IPC: G10L19/032, G10L21/0272

CPC classification number: G10L19/032, G10L21/0272

Abstract: Methods of encoding and decoding a speech signal using a neural network model that recognizes sound sources, and encoding and decoding apparatuses for performing the methods are provided. A method of encoding a speech signal includes identifying an input signal for a plurality of sound sources; generating a latent signal by encoding the input signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining a number of bits used for quantization of each of the plurality of sound source signals according to a type of each of the plurality of sound sources; quantizing each of the plurality of sound source signals based on the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
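The source-dependent bit allocation described in this abstract might look roughly like the sketch below. It assumes the sound sources have already been separated (the patent does this on a latent signal with a neural network, which is not reproduced here), and the source types, the bit budget table, and the `[-1, 1]` sample range are all hypothetical choices made for illustration.

```python
# Sketch of per-source bit allocation. ASSUMPTION: the table below and the
# source type names are hypothetical, not taken from the patent.
BITS_PER_SOURCE_TYPE = {"speech": 8, "background": 3}

def quantize_uniform(samples, num_bits):
    """Quantize samples in [-1, 1] to integer codes that fit in num_bits bits."""
    levels = (1 << num_bits) - 1
    codes = []
    for s in samples:
        clipped = max(-1.0, min(1.0, s))
        codes.append(round((clipped + 1.0) / 2.0 * levels))
    return codes

def pack_bits(codes, num_bits):
    """Render each code as a fixed-width binary string and concatenate."""
    return "".join(format(c, "0{}b".format(num_bits)) for c in codes)

def encode_sources(sources):
    """sources: list of (source_type, samples) pairs, one per separated source."""
    parts = []
    for source_type, samples in sources:
        # The number of bits depends on the type of the sound source.
        num_bits = BITS_PER_SOURCE_TYPE[source_type]
        parts.append(pack_bits(quantize_uniform(samples, num_bits), num_bits))
    # Combine the quantized source signals into one bitstream.
    return "".join(parts)

sources = [
    ("speech", [0.5, -0.25, 0.0]),
    ("background", [0.1, -0.9, 0.3]),
]
bitstream = encode_sources(sources)
```

Here the speech source costs 8 bits per sample and the background source 3, so the six samples take 33 bits in total instead of the 48 a uniform 8-bit budget would need.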

3.

Granted invention patent
Method and apparatus for recognition of sound events based on convolutional neural network [Status: in force]

Publication (announcement) number: US11205442B2

Publication (announcement) date: 2021-12-21

Application number: US16562110

Application date: 2019-09-05

Applicant: Electronics and Telecommunications Research Institute

Inventor: Young Ho Jeong, Sang Won Suh, Tae Jin Lee, Woo-taek Lim, Hui Yong Kim

IPC: G10L15/14, G06N3/04, G10L25/30, G10L25/84, G10L17/04

Abstract: Provided is a sound event recognition method that may improve sound event recognition performance by using a correlation between different sound signal feature parameters based on a neural network. Specifically, the method may extract a sound signal feature parameter from a sound signal including a sound event, and recognize the sound event included in the sound signal by applying a convolutional neural network (CNN) trained using the sound signal feature parameter.

4.

Granted invention patent
Method of encoding and decoding audio signal using linear predictive coding and encoder and decoder performing the method [Status: in force]

Publication (announcement) number: US11562757B2

Publication (announcement) date: 2023-01-24

Application number: US17377157

Application date: 2021-07-15

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi

IPC: G10L19/06, G10L19/032

Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted, using frequency-domain linear predictive coding (LPC), from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block are combined, generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.

5.

Granted invention patent
Method and device for predicting channel parameter of audio signal [Status: in force]

Publication (announcement) number: US11133015B2

Publication (announcement) date: 2021-09-28

Application number: US16180298

Application date: 2018-11-05

Applicant: Electronics and Telecommunications Research Institute

Inventor: Seung Kwon Beack, Woo-taek Lim, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim

IPC: G10L19/04, G10L25/30, G10L19/008

Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.

6.

Granted invention patent
Method and apparatus for sound event detection robust to frequency change [Status: in force]

Publication (announcement) number: US10540988B2

Publication (announcement) date: 2020-01-21

Application number: US16196356

Application date: 2018-11-20

Applicant: Electronics and Telecommunications Research Institute

Inventor: Woo-taek Lim

IPC: G10L21/14, G10L25/30, G10L21/12

Abstract: Disclosed is a sound event detecting method including receiving an audio signal, transforming the audio signal into a two-dimensional (2D) signal, extracting a feature map by training a convolutional neural network (CNN) using the 2D signal, pooling the feature map based on a frequency, and determining whether a sound event occurs with respect to each of at least one time interval based on a result of the pooling.
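The pooling and decision steps in this abstract can be illustrated with a small sketch. The abstract does not say which pooling operator or decision rule is used, so max pooling along the frequency axis and a fixed threshold are assumptions here, and the feature map is a hand-written stand-in for CNN output.

```python
# Sketch of frequency pooling for sound event detection. ASSUMPTION: max
# pooling and a fixed threshold are illustrative choices, not from the patent.

def pool_over_frequency(feature_map):
    """feature_map[f][t] -> one pooled value per time interval t (max over f)."""
    return [max(column) for column in zip(*feature_map)]

def detect_events(feature_map, threshold=0.5):
    """Return True for each time interval whose pooled value reaches threshold."""
    return [value >= threshold for value in pool_over_frequency(feature_map)]

# Rows are frequency bins, columns are time intervals (hypothetical values).
feature_map = [
    [0.1, 0.2, 0.9, 0.1],
    [0.0, 0.7, 0.1, 0.2],
    [0.3, 0.1, 0.2, 0.1],
]

# The same activations moved to different frequency bins.
shifted = feature_map[1:] + feature_map[:1]

decisions = detect_events(feature_map)
shifted_decisions = detect_events(shifted)
```

Because the pooled value ignores which frequency bin an activation came from, `decisions` and `shifted_decisions` are identical, which is one simple way to read "robust to frequency change" in the title.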

7.

Granted invention patent
Methods of encoding and decoding, encoder and decoder performing the methods [Status: in force]

Publication (announcement) number: US12159640B2

Publication (announcement) date: 2024-12-03

Application number: US17884364

Application date: 2022-08-09

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Jongmo Sung, Seung Kwon Beack, Tae Jin Lee, Woo-taek Lim, Inseon Jang

IPC: G10L19/06, G06N3/045

Abstract: Provided is an encoding method according to various example embodiments and an encoder performing the method. The encoding method includes outputting a linear prediction (LP) coefficients bitstream and a residual signal by performing a linear prediction analysis on an input signal, outputting a first latent signal obtained by encoding a periodic component of the residual signal, using a first neural network module, outputting a first bitstream obtained by quantizing the first latent signal, using a quantization module, outputting a second latent signal obtained by encoding an aperiodic component of the residual signal, using the first neural network module, and outputting a second bitstream obtained by quantizing the second latent signal, using the quantization module, wherein the aperiodic component of the residual signal is calculated based on a periodic component of the residual signal decoded from the quantized first latent signal output by de-quantizing the first bitstream.

8.

Granted invention patent
Method of generating residual signal, and encoder and decoder performing the method [Status: in force]

Publication (announcement) number: US11978465B2

Publication (announcement) date: 2024-05-07

Application number: US17507746

Application date: 2021-10-21

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Woo-taek Lim, Inseon Jang

IPC: G10L19/13, G10L19/032, G10L19/06

CPC classification number: G10L19/13, G10L19/032, G10L19/06

Abstract: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal containing less information than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal containing less information than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
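As a rough illustration of the first step only (generating a residual signal with linear predictive coding), the sketch below uses a first-order predictor whose coefficient is estimated by the autocorrelation method. The predictor order, the estimation method, and the sample values are assumptions; the later transform and frequency-domain prediction (FDP) stages are not shown.

```python
# Sketch of an LPC residual. ASSUMPTION: a first-order predictor estimated by
# the autocorrelation method; the patent does not specify these details here.

def first_order_lpc_coefficient(signal):
    """Estimate one prediction coefficient as r(1) / r(0)."""
    r0 = sum(x * x for x in signal)
    r1 = sum(signal[n] * signal[n - 1] for n in range(1, len(signal)))
    return r1 / r0 if r0 else 0.0

def lpc_residual(signal, coeff):
    """Analysis filter: e[n] = x[n] - coeff * x[n-1], with x[-1] taken as 0."""
    previous = 0.0
    residual = []
    for x in signal:
        residual.append(x - coeff * previous)
        previous = x
    return residual

def lpc_synthesis(residual, coeff):
    """Synthesis filter: x[n] = e[n] + coeff * x[n-1], inverse of lpc_residual."""
    previous = 0.0
    signal = []
    for e in residual:
        x = e + coeff * previous
        signal.append(x)
        previous = x
    return signal

def energy(values):
    return sum(v * v for v in values)

signal = [1.0, 0.9, 0.8, 0.7, 0.6, 0.5]  # hypothetical, strongly correlated samples
coeff = first_order_lpc_coefficient(signal)
residual = lpc_residual(signal, coeff)
restored = lpc_synthesis(residual, coeff)
```

For a correlated input like this one, the residual carries much less energy than the signal itself while `lpc_synthesis` still recovers the input, which is why the residual is the cheaper thing to quantize.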

9.

Granted invention patent
Audio encoding/decoding apparatus and method using vector quantized residual error feature [Status: in force]

Publication (announcement) number: US11804230B2

Publication (announcement) date: 2023-10-31

Application number: US17711908

Application date: 2022-04-01

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, Gwangju Institute of Science and Technology

Inventor: Inseon Jang, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Woo-taek Lim, Jongwon Shin, Youngju Cheon, Sangwook Han, Soojoong Hwang

IPC: G10L19/02, G10L19/038, G06N3/04

CPC classification number: G10L19/038, G06N3/04, G10L19/02

Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
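One way to picture the residual error feature step is the sketch below. It assumes the residual error feature vector is the element-wise difference between the original and decoded feature vectors (the abstract only says it is determined "from" them), and it encodes that vector as the index of the nearest entry in a small, hypothetical codebook. The feature values are made up for illustration.

```python
# Sketch of vector quantizing a residual error feature. ASSUMPTIONS: the
# residual is a plain difference, and CODEBOOK is a hypothetical toy codebook.
CODEBOOK = [
    [0.0, 0.0],
    [0.25, -0.25],
    [-0.25, 0.25],
]

def residual_error_feature(original_features, decoded_features):
    """Element-wise difference between original and decoded feature vectors."""
    return [o - d for o, d in zip(original_features, decoded_features)]

def nearest_codeword(vector, codebook):
    """Return the index of the codebook entry closest in squared distance."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)),
               key=lambda i: squared_distance(vector, codebook[i]))

def enhance(decoded_features, index, codebook):
    """Decoder side: add the selected codeword back to the decoded features."""
    return [d + c for d, c in zip(decoded_features, codebook[index])]

original_features = [1.0, 2.0]  # features of the original signal
decoded_features = [0.8, 2.3]   # features after the main codec round trip

residual = residual_error_feature(original_features, decoded_features)
index = nearest_codeword(residual, CODEBOOK)  # sent as additional information
enhanced = enhance(decoded_features, index, CODEBOOK)
```

Only `index` would need to be transmitted alongside the main codec bitstream; in this toy case the enhanced features end up closer to the original than the decoded ones.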

10.

Invention patent application
METHOD OF ENCODING AND DECODING AUDIO SIGNAL AND ENCODER AND DECODER PERFORMING THE METHOD [Status: in force]

Publication (announcement) number: US20220020385A1

Publication (announcement) date: 2022-01-20

Application number: US17377157

Application date: 2021-07-15

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi

IPC: G10L19/06, G10L19/032

Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted, using frequency-domain linear predictive coding (LPC), from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block are combined, generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
