-
Publication No.: US20250104721A1
Publication date: 2025-03-27
Application No.: US18686568
Filing date: 2022-12-15
Inventor: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO
IPC: G10L19/032 , G10L19/03
Abstract: Disclosed are a device and method for audio signal processing. The audio signal processing device according to an embodiment includes a receiver configured to receive a bitstream corresponding to a compressed audio signal and a processor. The processor may be configured to generate a real restoration signal or a complex restoration signal by performing inverse quantization on real data of the bitstream or complex data of the bitstream, generate a result of real Frequency Domain Noise Shaping (FDNS) synthesis or a result of complex FDNS synthesis by performing FDNS synthesis on the real restoration signal or the complex restoration signal, and generate a restored audio signal by performing frequency-to-time transform on the result of the real FDNS synthesis or the result of the complex FDNS synthesis.
-
Publication No.: US20240135941A1
Publication date: 2024-04-25
Application No.: US18358646
Filing date: 2023-07-24
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN
IPC: G10L19/02
CPC classification number: G10L19/02
Abstract: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
Publication No.: US20220005487A1
Publication date: 2022-01-06
Application No.: US17368390
Filing date: 2021-07-06
Inventor: Jongmo SUNG , Seung Kwon BEACK , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032
Abstract: An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
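The cascaded data flow in this abstract (first features, reconstruction, residual, second features) can be sketched as follows. This is only an illustration of the signal path under stated assumptions: the patent's recurrent and nonrecurrent neural models are replaced here by hypothetical toy callables (`coarse_enc`, `coarse_dec`, `fine_enc`, `fine_dec`), the decoder side is an assumed mirror of the encoder, and bitstream packing is omitted.

```python
from typing import Callable, List, Tuple

Signal = List[float]
Features = List[int]


def encode_cascade(
    x: Signal,
    enc1: Callable[[Signal], Features],
    dec1: Callable[[Features], Signal],
    enc2: Callable[[Signal], Features],
) -> Tuple[Features, Features]:
    """Two-stage encoding: the second stage codes what the first stage missed."""
    f1 = enc1(x)                                  # first feature information
    y = dec1(f1)                                  # output of the first decoding model
    residual = [a - b for a, b in zip(x, y)]      # input minus first-stage output
    f2 = enc2(residual)                           # second feature information
    return f1, f2


def decode_cascade(
    f1: Features,
    f2: Features,
    dec1: Callable[[Features], Signal],
    dec2: Callable[[Features], Signal],
) -> Signal:
    """Assumed decoder: sum of the two stage reconstructions."""
    return [a + b for a, b in zip(dec1(f1), dec2(f2))]


# Hypothetical stand-ins for the neural models (NOT from the patent).
def coarse_enc(x: Signal) -> Features:
    return [round(v) for v in x]


def coarse_dec(f: Features) -> Signal:
    return [float(v) for v in f]


def fine_enc(r: Signal) -> Features:
    return [round(v * 10) for v in r]


def fine_dec(f: Features) -> Signal:
    return [v / 10 for v in f]
```

With these toy stand-ins, `encode_cascade([0.42, -1.27, 2.04], coarse_enc, coarse_dec, fine_enc)` yields one coarse and one fine feature list, and `decode_cascade` reconstructs the input more closely than the coarse stage alone.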
-
Publication No.: US20190180763A1
Publication date: 2019-06-13
Application No.: US16180298
Filing date: 2018-11-05
Inventor: Seung Kwon BEACK , Woo-taek LIM , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE , Hui Yong KIM
Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
-
Publication No.: US20180144755A1
Publication date: 2018-05-24
Application No.: US15710353
Filing date: 2017-09-20
Inventor: Mi Suk LEE , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE
IPC: G10L19/018 , H04H20/31 , H04H60/37 , H04H60/58 , H04N21/2389 , H04N21/8358 , H04N5/067
CPC classification number: G10L19/018 , G06F16/683 , G06F16/955 , H04H20/31 , H04H60/37 , H04H60/58 , H04H2201/50 , H04N5/0675 , H04N21/23892 , H04N21/4394 , H04N21/8358
Abstract: Disclosed is an audio watermark insertion method. The audio watermark insertion method includes performing a modulated complex lapped transform (MCLT) on a first audio signal, inserting a bit string of a watermark in the first audio signal obtained by performing the MCLT, performing an inverse modified discrete cosine transform (IMDCT) on the first audio signal in which the bit string is inserted, and obtaining a second audio signal, which is the first audio signal in which the watermark is inserted, by performing an overlap-add on a signal obtained by performing the IMDCT and a neighbor frame signal.
-
Publication No.: US20230245666A1
Publication date: 2023-08-03
Application No.: US18102472
Filing date: 2023-01-27
Inventor: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO
IPC: G10L19/035 , G10L19/038 , G10L19/00
CPC classification number: G10L19/035 , G10L19/038 , G10L19/0017
Abstract: Provided are an encoding method, an encoding device, a decoding method, and a decoding device using a scalar quantization and a vector quantization. The encoding method includes converting an input signal of a time domain into a frequency domain, generating a first residual signal from an input signal of a frequency domain by using a scale factor, performing a scalar quantization of the first residual signal, generating a second residual signal from the scalar-quantized first residual signal, performing a lossless encoding of the scalar-quantized first residual signal, performing a vector quantization of the second residual signal, and transmitting a bitstream including the lossless-encoded first residual signal and the vector-quantized second residual signal.
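The two-stage quantization described in this abstract can be illustrated with a minimal sketch. Several points are assumptions rather than the patent's method: the first residual is taken as the coefficients divided by a single scale factor, the second residual is taken as the scalar quantization error, the codebook is a toy example, and the time-to-frequency transform and lossless coding steps are omitted. All function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def scalar_quantize(coeffs: Sequence[float], scale_factor: float) -> List[int]:
    """Normalize by the scale factor and round to integer indices."""
    return [round(c / scale_factor) for c in coeffs]


def second_residual(
    coeffs: Sequence[float], indices: Sequence[int], scale_factor: float
) -> List[float]:
    """Assumed definition: what scalar quantization failed to represent."""
    return [c - q * scale_factor for c, q in zip(coeffs, indices)]


def vector_quantize(vec: Sequence[float], codebook: Sequence[Sequence[float]]) -> int:
    """Return the index of the nearest codeword (squared Euclidean distance)."""
    def dist(codeword: Sequence[float]) -> float:
        return sum((a - b) ** 2 for a, b in zip(vec, codeword))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))


def encode(
    coeffs: Sequence[float], scale_factor: float, codebook: Sequence[Sequence[float]]
) -> Tuple[List[int], int]:
    """Scalar-quantize the coefficients, then vector-quantize the leftover error."""
    q = scalar_quantize(coeffs, scale_factor)
    r2 = second_residual(coeffs, q, scale_factor)
    return q, vector_quantize(r2, codebook)


def decode(
    q: Sequence[int], vq_index: int, scale_factor: float,
    codebook: Sequence[Sequence[float]],
) -> List[float]:
    """Assumed decoder: dequantized scalars plus the selected codeword."""
    return [qi * scale_factor + cw for qi, cw in zip(q, codebook[vq_index])]
```

In a real codec the integer indices `q` would go through the lossless (entropy) coder and the codebook would be trained; here both are left out to keep the signal path visible.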
-
Publication No.: US20230039546A1
Publication date: 2023-02-09
Application No.: US17711908
Filing date: 2022-04-01
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Jongwon SHIN , Youngju CHEON , Sangwook HAN , Soojoong HWANG
IPC: G10L19/038 , G06N3/04
Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
-
Publication No.: US20220335963A1
Publication date: 2022-10-20
Application No.: US17670172
Filing date: 2022-02-11
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
Inventor: Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Hong-Goo KANG , Jihyun LEE , Chanwoo LEE , Hyungseob LIM
IPC: G10L19/038 , G10L25/30
Abstract: An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model that encodes the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
-
Publication No.: US20220005488A1
Publication date: 2022-01-06
Application No.: US17368484
Filing date: 2021-07-06
Inventor: Jongmo SUNG , Seung Kwon BEACK , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032 , G10L19/16 , H04B17/309 , G06N3/08
Abstract: The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, quantizing the first feature information and producing the first feature bitstream, computing the first output signal from the quantized first feature information using a recurrent decoding model, computing the second feature information of the input signal using a nonrecurrent encoding model, quantizing the second feature information and producing the second feature bitstream, computing the second output signal from the quantized second feature information using a nonrecurrent decoding model, determining an encoding mode based on the input signal, the first and second output signals, and the first and second feature bitstreams, and outputting an overall bitstream by multiplexing an encoding mode bit and one of the first feature bitstream and the second feature bitstream depending on the encoding mode.
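The mode decision and multiplexing step at the end of this abstract can be sketched as below. The abstract does not state the decision criterion, so this sketch assumes a simple rate-distortion cost (squared error plus a weighted bit count); the weight `lam`, the function names, and the representation of bitstreams as strings of "0" and "1" are all hypothetical.

```python
from typing import Sequence, Tuple


def squared_error(x: Sequence[float], y: Sequence[float]) -> float:
    """Sum of squared differences between two equal-length signals."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def select_and_multiplex(
    x: Sequence[float],
    out1: Sequence[float], bits1: str,
    out2: Sequence[float], bits2: str,
    lam: float = 0.01,
) -> str:
    """Pick the cheaper of two candidate encodings and prepend a mode bit.

    Assumed cost: distortion + lam * bitstream length.
    Mode bit "0" selects the first (recurrent) path, "1" the second.
    """
    cost1 = squared_error(x, out1) + lam * len(bits1)
    cost2 = squared_error(x, out2) + lam * len(bits2)
    mode_bit = "0" if cost1 <= cost2 else "1"
    payload = bits1 if mode_bit == "0" else bits2
    return mode_bit + payload


def demultiplex(bitstream: str) -> Tuple[int, str]:
    """Split the overall bitstream back into the mode and the feature bitstream."""
    return int(bitstream[0]), bitstream[1:]
```

A decoder would read the mode bit first and then route the remaining bits to the matching decoding model.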
-
Publication No.: US20200349959A1
Publication date: 2020-11-05
Application No.: US16843649
Filing date: 2020-04-08
Applicant: Electronics and Telecommunications Research Institute , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Hochong PARK , Seung Kwon BEACK , Jongmo SUNG , Seong-Hyeon SHIN , Mi Suk LEE , Tae Jin LEE , Jin Soo CHOI
IPC: G10L19/032 , G10L25/18 , G10L25/21 , G06N3/08 , G06N20/00
Abstract: The inventive concept relates to an audio coding method that applies CNN-based frequency spectrum recovery. The encoder transmits only a part of the frequency spectral coefficients generated in transform coding to a decoder, and the decoder recovers the frequency spectral coefficients that were not transmitted. Furthermore, the signs of the frequency spectral coefficients are transmitted from the encoder to the decoder according to a sign transmission rule.