-
公开(公告)号:US20240420712A1
公开(公告)日:2024-12-19
申请号:US18732758
申请日:2024-06-04
Inventor: Byeongho CHO , Seung Kwon BEACK , Jung Won KANG , Soo Young PARK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/028 , G10L19/02 , G10L19/035 , G10L19/06
Abstract: A method of encoding/decoding an audio signal and a device for performing the same are provided. The method of encoding an audio signal includes generating, based on the audio signal, a linear prediction coding (LPC) bitstream and a frequency-domain signal of the audio signal, generating, based on the LPC bitstream and the frequency-domain signal, a first residual signal including information on a frequency envelope of the frequency-domain signal, and outputting a second residual signal by processing a first residual signal through one of a plurality of signal processing paths.
-
2.
公开(公告)号:US20230274141A1
公开(公告)日:2023-08-31
申请号:US18166407
申请日:2023-02-08
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , YONSEI UNIVERSITY WONJU INDUSTRY-ACADEMIC COOPERATION FOUNDATION
Inventor: Jongmo SUNG , Seung Kwon BEACK , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO , Young Cheol PARK , Joon BYUN , Seungmin SHIN
IPC: G06N3/08 , G10L19/038 , G10L25/30 , G10L19/028 , G10L25/69 , G10L25/60
CPC classification number: G06N3/08 , G10L19/038 , G10L25/30 , G10L19/028 , G10L25/69 , G10L25/60
Abstract: Provided is a method and apparatus for designing and testing an audio codec using quantization based on white noise modeling. A neural network-based audio encoder design method includes generating a quantized latent vector and a reconstructed signal corresponding to an input signal by using a white noise modeling-based quantization process, computing a total loss for training a neural network-based audio codec, based on the input signal, the reconstruction signal, and the quantized latent vector, training the neural network-based audio codec by using the total loss, and validating the trained neural network-based audio codec to select the best neural network-based audio codec.
-
公开(公告)号:US20250104721A1
公开(公告)日:2025-03-27
申请号:US18686568
申请日:2022-12-15
Inventor: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO
IPC: G10L19/032 , G10L19/03
Abstract: Disclosed are a device and method for audio signal processing. The audio signal processing device according to an embodiment includes a receiver configured to receive a bitstream corresponding to a compressed audio signal and a processor. The processor may be configured to generate a real restoration signal or a complex restoration signal by performing inverse quantization on real data of the bitstream or complex data of the bitstream, generate a result of real Frequency Domain Noise Shaping (FDNS) synthesis or a result of complex FDNS synthesis by performing FDNS synthesis on the real restoration signal or the complex restoration signal, and generate a restored audio signal by performing frequency-to-time transform on the result of the real FDNS synthesis or the result of the complex FDNS synthesis.
-
公开(公告)号:US20240135941A1
公开(公告)日:2024-04-25
申请号:US18358646
申请日:2023-07-24
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN
IPC: G10L19/02
CPC classification number: G10L19/02
Abstract: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
公开(公告)号:US20240290335A1
公开(公告)日:2024-08-29
申请号:US18435334
申请日:2024-02-07
Inventor: Seung Kwon BEACK , Byeongho CHO
IPC: G10L19/032 , G10L19/06
CPC classification number: G10L19/032 , G10L19/06
Abstract: Disclosed are an audio signal encoding/decoding method and an apparatus for performing the same. An audio signal encoding method includes receiving a current frame signal and a reconstructed previous frame signal, generating a predicted current frame signal, based on the current frame signal and the reconstructed previous frame signal, and outputting a reconstructed residual signal, based on the current frame signal and the predicted current frame signal.
-
6.
公开(公告)号:US20230245666A1
公开(公告)日:2023-08-03
申请号:US18102472
申请日:2023-01-27
Inventor: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO
IPC: G10L19/035 , G10L19/038 , G10L19/00
CPC classification number: G10L19/035 , G10L19/038 , G10L19/0017
Abstract: Provided are an encoding method, an encoding device, a decoding method, and a decoding device using a scalar quantization and a vector quantization. The encoding method includes converting an input signal of a time domain into a frequency domain, generating a first residual signal from an input signal of a frequency domain by using a scale factor, performing a scalar quantization of the first residual signal, generating a second residual signal from the scalar-quantized first residual signal, performing a lossless encoding of the scalar-quantized first residual signal, performing a vector quantization of the second residual signal, and transmitting a bitstream including the lossless-encoded first residual signal and the vector-quantized second residual signal.
-
公开(公告)号:US20240233738A9
公开(公告)日:2024-07-11
申请号:US18358646
申请日:2023-07-25
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN
IPC: G10L19/02
CPC classification number: G10L19/02
Abstract: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
公开(公告)号:US20240055009A1
公开(公告)日:2024-02-15
申请号:US18349680
申请日:2023-07-10
Inventor: Byeongho CHO , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032
CPC classification number: G10L19/032
Abstract: Provided are an apparatus for encoding an audio signal and a method of an operation thereof. An audio signal encoding method includes obtaining quantized linear prediction (LP) coefficients by performing a linear predictive coding (LPC) analysis and quantization on an input audio signal, generating a reference signal by applying discrete Fourier transform (DFT) to the input audio signal, obtaining LP residual coefficients from the reference signal, scaling magnitudes of the LP residual coefficients using the quantized LP coefficients and the reference signal, and quantizing phases of the LP residual coefficients and the scaled magnitudes of the LP residual coefficients.
-
9.
公开(公告)号:US20230230604A1
公开(公告)日:2023-07-20
申请号:US18099119
申请日:2023-01-19
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Tae Jin LEE , Seung Kwon BEACK , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN , Soojoong HWANG , Eunkyun LEE , Youngwon CHOI , Sangwook HAN
CPC classification number: G10L19/0204 , G10L25/30
Abstract: A method of encoding an audio signal and an encoder and a method of decoding an audio signal and a decoder are provided. The method of encoding an audio signal includes outputting a decoded signal by using a bitstream that encodes an audio signal, separating the decoded signal into a low-band signal and a high-band signal by using a sound source separator, upsampling the low-band signal, upsampling the high-band signal, and restoring the audio signal by synthesizing the upsampled low-band signal with the upsampled high-band signal, wherein the bitstream is generated by encoding a superimposed signal in which a signal in a high frequency band of the audio signal is superimposed on a low frequency band of the audio signal.
-
公开(公告)号:US20250104724A1
公开(公告)日:2025-03-27
申请号:US18886296
申请日:2024-09-16
Applicant: Electronics and Telecommunications Research Institute , The Trustees of Indiana University
Inventor: Inseon JANG , Soo Young PARK , Seung Kwon BEACK , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jung Won KANG , Tae Jin LEE , Minje KIM , Haici YANG
IPC: G10L19/16
Abstract: A method and apparatus for encoding/decoding a neural network-based personalized speech are provided. The method includes outputting a first bit stream in which an input speech signal is encrypted, based on the input speech signal, and outputting a second bit stream in which speaker information of the input speech signal is encrypted, based on the input speech signal.
-
-
-
-
-
-
-
-
-