Patent search ap:("Electronics AND Telecommunications Research Institute") AND inv:"Jongmo SUNG" Page 3

21.

Invention publication
METHOD OF ENCODING AUDIO SIGNAL AND ENCODER, METHOD OF DECODING AUDIO SIGNAL AND DECODER

Legal status: Under examination (published)

Publication number: US20230230604A1

Publication date: 2023-07-20

Application number: US18099119

Application date: 2023-01-19

Applicant: Electronics and Telecommunications Research Institute, Gwangju Institute of Science and Technology

Inventor: Inseon JANG, Tae Jin LEE, Seung Kwon BEACK, Jongmo SUNG, Woo-taek LIM, Byeongho CHO, Jongwon SHIN, Soojoong HWANG, Eunkyun LEE, Youngwon CHOI, Sangwook HAN

IPC: G10L19/02, G10L25/30

CPC classification number: G10L19/0204, G10L25/30

Abstract: A method of encoding an audio signal and an encoder and a method of decoding an audio signal and a decoder are provided. The method of encoding an audio signal includes outputting a decoded signal by using a bitstream that encodes an audio signal, separating the decoded signal into a low-band signal and a high-band signal by using a sound source separator, upsampling the low-band signal, upsampling the high-band signal, and restoring the audio signal by synthesizing the upsampled low-band signal with the upsampled high-band signal, wherein the bitstream is generated by encoding a superimposed signal in which a signal in a high frequency band of the audio signal is superimposed on a low frequency band of the audio signal.

22.

Invention application
METHODS OF ENCODING AND DECODING AUDIO SIGNAL USING SIDE INFORMATION, AND ENCODER AND DECODER FOR PERFORMING THE METHODS

Legal status: In force

Publication number: US20220358940A1

Publication date: 2022-11-10

Application number: US17527351

Application date: 2021-11-16

Applicant: Electronics and Telecommunications Research Institute, Gwangju Institute of Science and Technology

Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Jong Won SHIN, Soojoong HWANG, Youngju CHEON, Sangwook HAN

IPC: G10L19/02, G10L25/21, G10L25/30, G06N3/04

Abstract: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.

23.

Invention application
METHOD AND APPARATUS FOR ENCODING AND DECODING AUDIO SIGNAL TO REDUCE QUANTIZATION NOISE

Legal status: In force

Publication number: US20210398547A1

Publication date: 2021-12-23

Application number: US17331416

Application date: 2021-05-26

Applicant: Electronics and Telecommunications Research Institute

Inventor: Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG

IPC: G10L19/035, G10L19/022, G10L19/06, G10L19/16

Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.

24.

Invention application
METHOD AND APPARATUS FOR ENCODING AND DECODING AUDIO SIGNAL USING LINEAR PREDICTIVE CODING

Legal status: In force

Publication number: US20210390967A1

Publication date: 2021-12-16

Application number: US17242828

Application date: 2021-04-28

Applicant: Electronics and Telecommunications Research Institute

Inventor: Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG, Jin Soo CHOI

IPC: G10L19/032, G10L19/08

Abstract: Disclosed is a method of encoding and decoding an audio signal using linear predictive coding (LPC) and an encoder and a decoder that perform the method. The method of encoding an audio signal to be performed by the encoder includes identifying a time-domain audio signal block-wise, quantizing a linear prediction coefficient obtained from a block of the audio signal through the LPC, generating an envelope based on the quantized linear prediction coefficient, extracting a residual signal based on the envelope and a result of converting the block into a frequency domain, grouping the residual signal by each sub-band and determining a scale factor for quantizing the grouped residual signal, quantizing the residual signal using the scale factor, and converting the quantized residual signal and the quantized linear prediction coefficient into a bitstream and transmitting the bitstream to a decoder.
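The sub-band scale-factor step in this abstract can be illustrated with a minimal sketch. It is not the patented method: the function names, the fixed `band_size`, the `max_level` range, and the peak-based choice of scale factor are all assumptions made for illustration, and the LPC envelope and bitstream steps are omitted.

```python
import math


def quantize_by_subband(residual, band_size=4, max_level=7):
    """Group a residual into sub-bands and quantize each with its own scale factor.

    band_size and max_level are illustrative assumptions, not values from the patent.
    """
    bands = []
    for start in range(0, len(residual), band_size):
        band = residual[start:start + band_size]
        peak = max(abs(x) for x in band)
        # Assumed rule: pick the scale factor so the largest magnitude maps to max_level.
        scale = peak / max_level if peak > 0 else 1.0
        # Round each scaled value to the nearest integer code.
        codes = [math.floor(x / scale + 0.5) for x in band]
        bands.append((scale, codes))
    return bands


def dequantize(bands):
    """Rebuild an approximate residual from (scale factor, codes) pairs."""
    return [scale * code for scale, codes in bands for code in codes]
```

Each sub-band carries its own scale factor, so a quiet band is not forced onto the coarse step size that a loud band would need.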

25.

Invention application
APPARATUS AND METHOD FOR SPEECH PROCESSING USING A DENSELY CONNECTED HYBRID NEURAL NETWORK

Legal status: In force

Publication number: US20210350796A1

Publication date: 2021-11-11

Application number: US17308800

Application date: 2021-05-05

Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University

Inventor: Minje KIM, Mi Suk LEE, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Jin Soo CHOI, Kai ZHEN

IPC: G10L15/16, G06N3/04, G06F17/15

Abstract: Disclosed is a speech processing apparatus and method using a densely connected hybrid neural network. The speech processing method includes inputting a time domain sample of N*1 dimension for an input speech into a densely connected hybrid network; passing the time domain sample through a plurality of dense blocks in the densely connected hybrid network; reshaping the time domain samples into M subframes by passing the time domain samples through the plurality of dense blocks; inputting the M subframes into gated recurrent unit (GRU) components of N/M-dimension; and outputting clean speech from which noise is removed from the input speech by passing the M subframes through the GRU components.

26.

Invention application
QUANTIZATION METHOD OF LATENT VECTOR FOR AUDIO ENCODING AND COMPUTING DEVICE FOR PERFORMING THE METHOD

Legal status: In force

Publication number: US20210174815A1

Publication date: 2021-06-10

Application number: US17112480

Application date: 2020-12-04

Applicant: Electronics and Telecommunications Research Institute

Inventor: Seung Kwon BEACK, Jooyoung LEE, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Seunghyun CHO, Jin Soo CHOI

IPC: G10L19/038, G10L25/30, G10L19/028, G10L19/24, G06N3/02

Abstract: Disclosed are a quantizing method for a latent vector and a computing device for performing the quantization method. A quantizing method of a latent vector includes performing information shaping on the latent vector resulting from reduction in a dimension of an input signal using a target neural network; clamping a residual signal of the latent vector derived based on the information shaping; performing rescaling on the clamped residual signal; and performing quantization on the rescaled residual signal.
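The clamp, rescale, and quantize sequence named in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the function name, the symmetric `clamp_limit`, the uniform quantizer with `levels` steps, and the omission of the information-shaping stage are all choices made here for brevity.

```python
import math


def quantize_residual(residual, clamp_limit=1.0, levels=16):
    """Clamp, rescale, and uniformly quantize a residual vector.

    clamp_limit and levels are hypothetical parameters chosen for illustration.
    """
    if levels < 2 or clamp_limit <= 0:
        raise ValueError("levels must be >= 2 and clamp_limit must be positive")
    # Clamp: limit every value to the range [-clamp_limit, clamp_limit].
    clamped = [max(-clamp_limit, min(clamp_limit, r)) for r in residual]
    # Rescale: map the clamped range onto [0, levels - 1].
    scale = (levels - 1) / (2 * clamp_limit)
    rescaled = [(c + clamp_limit) * scale for c in clamped]
    # Quantize: round to the nearest integer index.
    return [math.floor(x + 0.5) for x in rescaled]
```

Clamping before rescaling keeps rare outliers from stretching the quantizer range, at the cost of saturating those outliers.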

27.

Invention application
APPARATUS AND METHOD FOR ENCODING/DECODING AUDIO SIGNAL USING INFORMATION OF PREVIOUS FRAME

Legal status: In force

Publication number: US20210166706A1

Publication date: 2021-06-03

Application number: US17105835

Application date: 2020-11-27

Applicant: Electronics and Telecommunications Research Institute

Inventor: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE

IPC: G10L19/16, G10L19/038, G10L25/30, G06N3/08

Abstract: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
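The concatenation step in this abstract can be sketched as below. The sketch rests on assumptions: `reduce_dimension` is a toy stand-in (pairwise averaging) for the learned dimension reduction, the zero vector used for the first frame is a hypothetical choice, and the subsequent encoding and quantization of the concatenation vector are left out.

```python
def reduce_dimension(frame):
    """Toy stand-in for a learned encoder: average adjacent sample pairs."""
    return [(frame[i] + frame[i + 1]) / 2 for i in range(0, len(frame) - 1, 2)]


def concatenation_vectors(frames):
    """Pair each frame's latent vector with the latent vector of the previous frame."""
    previous = None
    result = []
    for frame in frames:
        current = reduce_dimension(frame)
        if previous is None:
            # Assumption: the first frame has no predecessor, so use zeros.
            previous = [0.0] * len(current)
        # Previous latent first, then the current latent.
        result.append(previous + current)
        previous = current
    return result
```

For example, `concatenation_vectors([[1.0, 3.0, 5.0, 7.0], [2.0, 2.0, 4.0, 4.0]])` yields one vector per frame, each twice the length of a single latent vector.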

28.

Invention application
ENCODING METHOD AND DECODING METHOD FOR AUDIO SIGNAL USING DYNAMIC MODEL PARAMETER, AUDIO ENCODING APPARATUS AND AUDIO DECODING APPARATUS

Legal status: In force

Publication number: US20210074306A1

Publication date: 2021-03-11

Application number: US17017413

Application date: 2020-09-10

Applicant: Electronics and Telecommunications Research Institute

Inventor: Jongmo SUNG, Seung Kwon BEACK, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Jin Soo CHOI

IPC: G10L19/032, H03M7/30, G06N3/08, G06N5/04

Abstract: Provided are an audio encoding method, an audio decoding method, an audio encoding apparatus, and an audio decoding apparatus using dynamic model parameters. The audio encoding method using dynamic model parameters may use a dynamic model parameter corresponding to each of the levels of the encoding network when reducing the dimension of an audio signal in the encoding network. In addition, the audio decoding method using dynamic model parameters may use a dynamic model parameter corresponding to each of the levels of the decoding network when extending the dimension of an audio signal in the decoding network.

29.

Invention application
METHOD AND APPARATUS FOR ENCODING/DECODING NEURAL NETWORK-BASED PERSONALIZED SPEECH

Legal status: In force

Publication number: US20250104724A1

Publication date: 2025-03-27

Application number: US18886296

Application date: 2024-09-16

Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University

Inventor: Inseon JANG, Soo Young PARK, Seung Kwon BEACK, Jongmo SUNG, Woo-taek LIM, Byeongho CHO, Jung Won KANG, Tae Jin LEE, Minje KIM, Haici YANG

IPC: G10L19/16

Abstract: A method and apparatus for encoding/decoding a neural network-based personalized speech are provided. The method includes outputting a first bit stream in which an input speech signal is encrypted, based on the input speech signal, and outputting a second bit stream in which speaker information of the input speech signal is encrypted, based on the input speech signal.

30.

Invention application
METHOD AND DEVICE FOR ENCODING/DECODING AUDIO SIGNAL BASED ON DEQUANTIZATION THROUGH POTENTIAL DIFFUSION

Legal status: In force

Publication number: US20250104722A1

Publication date: 2025-03-27

Application number: US18886765

Application date: 2024-09-16

Applicant: Electronics and Telecommunications Research Institute, The Trustees of Indiana University

Inventor: Inseon JANG, Woo-taek LIM, Soo Young PARK, Seung Kwon BEACK, Jongmo SUNG, Byeongho CHO, Jung Won KANG, Tae Jin LEE, Minje KIM, Haici YANG

IPC: G10L19/038, G10L21/0208, G10L25/30

Abstract: A method and device for encoding/decoding an audio signal based on dequantization through potential diffusion are provided. The method of decoding an audio signal includes obtaining a discrete latent vector in which a speech signal is quantized and, based on the discrete latent vector, outputting a continuous latent vector in which the discrete latent vector is dequantized.
