-
31.
Publication No.: US20230298599A1
Publication date: 2023-09-21
Application No.: US18108431
Filing date: 2023-06-12
Inventor: Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Inseon JANG, Byeongho CHO
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: An encoding method and an encoding device using a complex signal and a decoding method and a decoding device using a complex signal are provided. The encoding method includes converting a first channel signal and a second channel signal constituting an audio signal corresponding to a stereo signal from a real domain to a complex domain, determining one of a sum operation, a difference operation, and a bypass operation to be performed on the second channel signal converted to the complex domain, determining a complex spatial cue according to the determined operation, converting a residual signal for the second channel signal to a real domain using the complex spatial cue, converting the first channel signal to a real domain, encoding the first channel signal converted to the real domain, and encoding the residual signal for the second channel signal converted to the real domain.
-
32.
Publication No.: US20230038394A1
Publication date: 2023-02-09
Application No.: US17390753
Filing date: 2021-07-30
Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Minje KIM
IPC: G10L19/008, G10L19/032, G06N3/04
Abstract: Disclosed are a method of encoding and decoding an audio signal and an encoder and a decoder performing the method. The method of encoding an audio signal includes identifying an input signal, and generating a bitstring of each encoding layer by applying, to the input signal, an encoding model including a plurality of successive encoding layers that encodes the input signal, in which a current encoding layer among the encoding layers is trained to generate a bitstring of the current encoding layer by encoding an encoded signal which is a signal encoded in a previous encoding layer and quantizing an encoded signal which is a signal encoded in the current encoding layer.
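The layered structure in this abstract can be sketched as a cascade in which each layer encodes the output of the previous layer and emits its own bitstring. This is only a minimal illustration under stated assumptions: `halve` is a hypothetical stand-in for a trained encoding layer, and the 4-bit uniform quantizer is an assumption, not something the abstract specifies.

```python
from typing import Callable, List

Layer = Callable[[List[float]], List[float]]


def halve(signal: List[float]) -> List[float]:
    """Hypothetical stand-in for a trained layer: average adjacent pairs."""
    return [(signal[i] + signal[i + 1]) / 2.0 for i in range(0, len(signal) - 1, 2)]


def quantize_bits(signal: List[float], bits: int = 4) -> str:
    """Uniformly quantize values clipped to [-1, 1] and return a bitstring."""
    levels = (1 << bits) - 1
    out = []
    for v in signal:
        clipped = max(-1.0, min(1.0, v))
        code = round((clipped + 1.0) / 2.0 * levels)
        out.append(format(code, f"0{bits}b"))
    return "".join(out)


def encode_layers(x: List[float], layers: List[Layer]) -> List[str]:
    """Run the layers in sequence; each layer encodes the previous layer's
    output, and that layer's encoded signal is quantized into its bitstring."""
    bitstrings = []
    encoded = x
    for layer in layers:
        encoded = layer(encoded)
        bitstrings.append(quantize_bits(encoded))
    return bitstrings


if __name__ == "__main__":
    print(encode_layers([1.0, 1.0, -1.0, -1.0], [halve, halve]))
```

Each entry of the returned list corresponds to one encoding layer, so a decoder could in principle consume as many layers as are available.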
-
33.
Publication No.: US20220375483A1
Publication date: 2022-11-24
Application No.: US17520895
Filing date: 2021-11-08
Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Jong-won SEOK, YUNSU KIM
IPC: G10L19/038, G10L19/16, G10L25/30
Abstract: Disclosed are methods of encoding and decoding an audio signal, and an encoder and a decoder for performing the methods. The method of encoding an audio signal includes identifying an input signal corresponding to a low frequency band of the audio signal, windowing the input signal, generating a first latent vector by inputting the windowed input signal to a first encoding model, transforming the windowed input signal into a frequency domain, generating a second latent vector by inputting the transformed input signal to a second encoding model, generating a final latent vector by combining the first latent vector and the second latent vector, and generating a bitstream corresponding to the final latent vector.
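The two-path flow in this abstract (window, encode in the time domain, transform, encode in the frequency domain, combine, emit a bitstream) might look roughly like the sketch below. Everything concrete here is an assumption for illustration: `toy_encoder` is a hypothetical placeholder for the trained encoding models, the Hann window and DFT magnitudes are one possible choice of window and transform, and combination by concatenation and 4-bit quantization are not stated in the abstract. Selecting the low frequency band is omitted.

```python
import cmath
import math
from typing import List, Tuple


def hann_window(frame: List[float]) -> List[float]:
    """Apply a Hann window to the frame (requires at least 2 samples)."""
    n = len(frame)
    return [x * 0.5 * (1.0 - math.cos(2.0 * math.pi * i / (n - 1)))
            for i, x in enumerate(frame)]


def dft_magnitudes(frame: List[float]) -> List[float]:
    """Magnitudes of the non-negative-frequency DFT bins."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame)))
            for k in range(n // 2 + 1)]


def toy_encoder(features: List[float], latent_dim: int) -> List[float]:
    """Hypothetical stand-in for a trained model: average the features
    into latent_dim buckets (requires len(features) >= latent_dim)."""
    n = len(features)
    latent = []
    for j in range(latent_dim):
        chunk = features[j * n // latent_dim:(j + 1) * n // latent_dim]
        latent.append(sum(chunk) / len(chunk))
    return latent


def encode_frame(frame: List[float], latent_dim: int = 2) -> Tuple[List[float], str]:
    windowed = hann_window(frame)
    first = toy_encoder(windowed, latent_dim)                    # time-domain path
    second = toy_encoder(dft_magnitudes(windowed), latent_dim)   # frequency-domain path
    final = first + second                                       # combine by concatenation (assumed)
    # Assumed 4-bit uniform quantization after scaling to [-1, 1];
    # a real codec would also need to convey the scale.
    scale = max(abs(v) for v in final) or 1.0
    bits = "".join(format(round((v / scale + 1.0) / 2.0 * 15), "04b") for v in final)
    return final, bits
```

For an 8-sample frame and `latent_dim=2`, the final latent vector has 4 entries and the bitstream has 16 bits.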
-
34.
Publication No.: US20210366497A1
Publication date: 2021-11-25
Application No.: US17326035
Filing date: 2021-05-20
Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Inseon JANG, Minje KIM, Haici YANG
IPC: G10L19/032
Abstract: Methods of encoding and decoding a speech signal using a neural network model that recognizes sound sources, and encoding and decoding apparatuses for performing the methods are provided. A method of encoding a speech signal includes identifying an input signal for a plurality of sound sources; generating a latent signal by encoding the input signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining a number of bits used for quantization of each of the plurality of sound source signals according to a type of each of the plurality of sound sources; quantizing each of the plurality of sound source signals based on the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
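The type-dependent bit allocation described in this abstract can be illustrated with a small sketch. It assumes the sound sources have already been separated (the neural encoding and latent-space separation are not modeled), and the source types, bit depths in `BITS_BY_TYPE`, and the uniform quantizer are hypothetical values chosen only for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical bit budget per source type; the abstract gives no values.
BITS_BY_TYPE: Dict[str, int] = {"speech": 8, "noise": 3}


def quantize(values: List[float], bits: int) -> List[int]:
    """Uniformly quantize values clipped to [-1, 1] into integer codes."""
    levels = (1 << bits) - 1
    codes = []
    for v in values:
        clipped = max(-1.0, min(1.0, v))
        codes.append(round((clipped + 1.0) / 2.0 * levels))
    return codes


def pack(codes: List[int], bits: int) -> str:
    """Write each code as a fixed-width binary string."""
    return "".join(format(c, f"0{bits}b") for c in codes)


def encode_sources(sources: List[Tuple[str, List[float]]]) -> str:
    """Quantize each source with the bit depth chosen for its type,
    then combine the quantized sources into one bitstream."""
    parts = []
    for source_type, signal in sources:
        bits = BITS_BY_TYPE[source_type]
        parts.append(pack(quantize(signal, bits), bits))
    return "".join(parts)


if __name__ == "__main__":
    stream = encode_sources([("speech", [-1.0, 1.0]), ("noise", [-1.0, 1.0])])
    print(stream, len(stream))
```

In this sketch the speech source consumes 8 bits per sample and the noise source 3, so perceptually more important sources receive finer quantization.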
-