Patent search ap:("Electronics AND Telecommunications Research Institute") AND inv:"Inseon JANG" Page 4

31.

Invention publication
ENCODING METHOD AND ENCODING DEVICE USING COMPLEX SIGNAL AND DECODING METHOD AND DECODING DEVICE USING COMPLEX SIGNAL Status: under examination (published)

Publication No.: US20230298599A1

Publication date: 2023-09-21

Application No.: US18108431

Application date: 2023-06-12

Applicant: Electronics and Telecommunications Research Institute

Inventor: Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Inseon JANG, Byeongho CHO

IPC: G10L19/008

CPC classification number: G10L19/008

Abstract: An encoding method and an encoding device using a complex signal and a decoding method and a decoding device using a complex signal are provided. The encoding method includes converting a first channel signal and a second channel signal constituting an audio signal corresponding to a stereo signal from a real domain to a complex domain, determining one of a sum operation, a difference operation, and a bypass operation to be performed on the second channel signal converted to the complex domain, determining a complex spatial cue according to the determined operation, converting a residual signal for the second channel signal to a real domain using the complex spatial cue, converting the first channel signal to a real domain, encoding the first channel signal converted to the real domain, and encoding the residual signal for the second channel signal converted to the real domain.

32.

Invention application
AUDIO SIGNAL ENCODING AND DECODING METHOD, AND ENCODER AND DECODER PERFORMING THE METHODS Status: granted

Publication No.: US20230038394A1

Publication date: 2023-02-09

Application No.: US17390753

Application date: 2021-07-30

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, The Trustees of Indiana University

Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Minje KIM

IPC: G10L19/008, G10L19/032, G06N3/04

Abstract: Disclosed are a method of encoding and decoding an audio signal and an encoder and a decoder performing the method. The method of encoding an audio signal includes identifying an input signal, and generating a bitstring of each encoding layer by applying, to the input signal, an encoding model including a plurality of successive encoding layers that encodes the input signal, in which a current encoding layer among the encoding layers is trained to generate a bitstring of the current encoding layer by encoding an encoded signal which is a signal encoded in a previous encoding layer and quantizing an encoded signal which is a signal encoded in the current encoding layer.
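
The layered structure described in this abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `make_layer` is a hypothetical stand-in for a trained encoding layer (it only averages adjacent samples), and `quantize_to_bits` is a plain uniform quantizer, not the quantization the patent claims. The sketch shows only the data flow: each layer encodes the output of the previous layer, and each layer's output is quantized into its own bitstring.

```python
def make_layer(scale):
    """Hypothetical stand-in for one trained encoding layer."""
    def encode(signal):
        # Halve the length by averaging adjacent pairs, then apply a gain.
        return [scale * (signal[i] + signal[i + 1]) / 2
                for i in range(0, len(signal) - 1, 2)]
    return encode


def quantize_to_bits(signal, bits=4):
    """Uniformly quantize values in [-1, 1] and return them as a bitstring."""
    levels = (1 << bits) - 1
    out = ""
    for x in signal:
        x = max(-1.0, min(1.0, x))                   # clip to the quantizer range
        code = int((x + 1.0) / 2.0 * levels + 0.5)   # round to nearest level
        out += format(code, "0{}b".format(bits))
    return out


def encode_layers(input_signal, layers, bits=4):
    """Run successive layers; return one bitstring per encoding layer."""
    bitstrings = []
    current = input_signal
    for layer in layers:
        current = layer(current)  # encode the signal from the previous layer
        bitstrings.append(quantize_to_bits(current, bits))
    return bitstrings


layers = [make_layer(1.0), make_layer(1.0)]
print(encode_layers([1.0, 1.0, -1.0, -1.0], layers))
```

In a real system each layer would be a learned model; the stand-ins above exist only to make the chain of "encode the previous layer's output, then quantize" concrete.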

33.

Invention application
METHODS OF ENCODING AND DECODING AUDIO SIGNAL, AND ENCODER AND DECODER FOR PERFORMING THE METHODS Status: granted

Publication No.: US20220375483A1

Publication date: 2022-11-24

Application No.: US17520895

Application date: 2021-11-08

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Jong-won SEOK, YUNSU KIM

IPC: G10L19/038, G10L19/16, G10L25/30

Abstract: Disclosed are methods of encoding and decoding an audio signal, and an encoder and a decoder for performing the methods. The method of encoding an audio signal includes identifying an input signal corresponding to a low frequency band of the audio signal, windowing the input signal, generating a first latent vector by inputting the windowed input signal to a first encoding model, transforming the windowed input signal into a frequency domain, generating a second latent vector by inputting the transformed input signal to a second encoding model, generating a final latent vector by combining the first latent vector and the second latent vector, and generating a bitstream corresponding to the final latent vector.
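
The two-path flow in this abstract (a time-domain path and a frequency-domain path whose latent vectors are combined) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: `time_encoder` and `freq_encoder` are hypothetical stand-ins for the first and second encoding models, the Hann window and the DFT are assumed choices for "windowing" and "transforming into a frequency domain", and concatenation is assumed as the way of "combining". The sketch stops at the final latent vector and does not generate a bitstream.

```python
import cmath
import math


def hann_window(frame):
    """Apply a Hann window (an assumed window choice); needs len(frame) > 1."""
    n = len(frame)
    return [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
            for i, x in enumerate(frame)]


def dft_magnitudes(frame):
    """Magnitude spectrum of a real frame for bins 0 .. n//2 (direct DFT)."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame)))
            for k in range(n // 2 + 1)]


def time_encoder(frame):
    """Hypothetical first encoding model: mean of each half of the frame."""
    half = len(frame) // 2
    return [sum(frame[:half]) / half,
            sum(frame[half:]) / (len(frame) - half)]


def freq_encoder(spectrum):
    """Hypothetical second encoding model: peak magnitude and its bin index."""
    peak = max(spectrum)
    return [peak, float(spectrum.index(peak))]


def encode_frame(frame):
    """Return the final latent vector for one low-band frame."""
    windowed = hann_window(frame)
    z_time = time_encoder(windowed)                  # first latent vector
    z_freq = freq_encoder(dft_magnitudes(windowed))  # second latent vector
    return z_time + z_freq                           # combine by concatenation


frame = [math.sin(2 * math.pi * i / 8) for i in range(8)]
print(encode_frame(frame))
```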

34.

Invention application
METHODS OF ENCODING AND DECODING SPEECH SIGNAL USING NEURAL NETWORK MODEL RECOGNIZING SOUND SOURCES, AND ENCODING AND DECODING APPARATUSES FOR PERFORMING THE SAME Status: granted

Publication No.: US20210366497A1

Publication date: 2021-11-25

Application No.: US17326035

Application date: 2021-05-20

Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University

Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Inseon JANG, Minje KIM, Haici YANG

IPC: G10L19/032

Abstract: Methods of encoding and decoding a speech signal using a neural network model that recognizes sound sources, and encoding and decoding apparatuses for performing the methods are provided. A method of encoding a speech signal includes identifying an input signal for a plurality of sound sources; generating a latent signal by encoding the input signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining a number of bits used for quantization of each of the plurality of sound source signals according to a type of each of the plurality of sound sources; quantizing each of the plurality of sound source signals based on the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
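
The last three steps of this abstract (choosing a bit count by source type, quantizing each source signal, and combining the results into one bitstream) can be sketched as below. The sketch assumes the sound source signals have already been separated, so the latent encoding and separation steps are not shown. The `BITS_BY_TYPE` table, its values, and the uniform quantizer are hypothetical choices for illustration and are not taken from the patent.

```python
# Hypothetical bit budget per sound-source type (values are assumptions).
BITS_BY_TYPE = {"speech": 8, "noise": 3}


def quantize(samples, bits):
    """Uniformly quantize values in [-1, 1] to integer codes of `bits` bits."""
    levels = (1 << bits) - 1
    codes = []
    for x in samples:
        x = max(-1.0, min(1.0, x))                          # clip to range
        codes.append(int((x + 1.0) / 2.0 * levels + 0.5))   # nearest level
    return codes


def encode_sources(sources):
    """sources: list of (source_type, samples) pairs, already separated.

    Returns a single bitstring formed by concatenating each source's codes.
    """
    bitstream = ""
    for source_type, samples in sources:
        bits = BITS_BY_TYPE[source_type]  # bit count depends on source type
        for code in quantize(samples, bits):
            bitstream += format(code, "0{}b".format(bits))
    return bitstream


sources = [("speech", [-1.0, 1.0]), ("noise", [1.0, -1.0])]
print(encode_sources(sources))
```

Giving speech more bits than noise reflects the abstract's idea that the bit count follows the source type; a real decoder would also need the source types and lengths to parse the stream, which this sketch omits.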
