Patent search ap:("Electronics AND Telecommunications Research Institute") AND inv:"Tae Jin LEE" Page 5

41.

发明申请
UNIFIED SPEECH/AUDIO CODEC (USAC) PROCESSING WINDOWS SEQUENCE BASED MODE SWITCHING 审中-公开

公开(公告)号：US20160314798A1

公开(公告)日：2016-10-27

申请号：US15200404

申请日：2016-07-01

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION

Inventor： Seungkwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jeongil SEO , Jin Woo HONG , Chieteuk AHN , Ho Chong PARK , Young-cheol PARK

IPC: G10L19/22 , G10L19/022 , G10L19/06

CPC classification number: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18

Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.

42.

发明申请
BINAURAL RENDERING METHOD AND APPARATUS FOR DECODING MULTI CHANNEL AUDIO 有权

公开(公告)号：US20160232902A1

公开(公告)日：2016-08-11

申请号：US15131623

申请日：2016-04-18

Applicant: Electronics and Telecommunications Research Institute

Inventor： Yong Ju LEE , Jeong Il SEO , Jae Hyoun YOO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Kyeong Ok KANG , Jin Woong KIM , Tae Jin PARK , Dae Young JANG , Keun Woo CHOI

IPC: G10L19/008 , H04S7/00

CPC classification number: G10L19/008 , H04S7/00 , H04S7/30 , H04S2400/01 , H04S2400/03

Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.

43.

发明申请
APPARATUS AND METHOD FOR TRANSMITTING AUDIO OBJECT 有权
Title translation: 用于传输音频对象的装置和方法

公开(公告)号：US20130170646A1

公开(公告)日：2013-07-04

申请号：US13729303

申请日：2012-12-28

Applicant: Electronics and Telecommunications Research Institute

Inventor： Jae Hyoun YOO , Jeong Il SEO , Tae Jin LEE , Keun Woo CHOI , Kyeong Ok KANG

IPC: H04H20/88

CPC classification number: H04H20/88 , G10L19/00 , G10L19/008 , G10L19/20 , H04R5/00 , H04S2400/00 , H04S2400/01 , H04S2420/13

Abstract: An apparatus and method for transmitting a plurality of audio objects using a multichannel encoder and a multichannel decoder are provided. The audio object encoder includes a multichannel encoder determination unit to determine a multichannel encoder to be used for encoding of a plurality of audio objects according to the number of the audio objects, an encoding unit to generate an encoded signal by encoding the plurality of audio objects using the determined multichannel encoder, and a multichannel audio object signal generation unit to generating a multichannel audio object signal, by multiplexing sound image localization information of the plurality of audio objects along with the encoded signal.

Abstract translation: 提供了一种使用多声道编码器和多声道解码器发送多个音频对象的装置和方法。音频对象编码器包括多声道编码器确定单元，用于根据音频对象的数量确定要用于多个音频对象的编码的多声道编码器;编码单元，用于通过对多个音频对象进行编码来生成编码信号使用所确定的多声道编码器和多声道音频对象信号生成单元，通过将多个音频对象的声音图像定位信息与编码信号一起多路复用来生成多声道音频对象信号。

44.

发明申请
METHOD AND APPARATUS FOR ENCODING/DECODING NEURAL NETWORK-BASED PERSONALIZED SPEECH 有权

公开(公告)号：US20250104724A1

公开(公告)日：2025-03-27

申请号：US18886296

申请日：2024-09-16

Applicant: Electronics and Telecommunications Research Institute , The Trustees of Indiana University

Inventor： Inseon JANG , Soo Young PARK , Seung Kwon BEACK , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jung Won KANG , Tae Jin LEE , Minje KIM , Haici YANG

IPC: G10L19/16

Abstract: A method and apparatus for encoding/decoding a neural network-based personalized speech are provided. The method includes outputting a first bit stream in which an input speech signal is encrypted, based on the input speech signal, and outputting a second bit stream in which speaker information of the input speech signal is encrypted, based on the input speech signal.

45.

发明申请
METHOD AND DEVICE FOR ENCODING/DECODING AUDIO SIGNAL BASED ON DEQUANTIZATION THROUGH POTENTIAL DIFFUSION 有权

公开(公告)号：US20250104722A1

公开(公告)日：2025-03-27

申请号：US18886765

申请日：2024-09-16

Applicant: Electronics and Telecommunications Research Institute , The Trustees of Indiana University

Inventor： Inseon JANG , Woo-taek LIM , Soo Young PARK , Seung Kwon BEACK , Jongmo SUNG , Byeongho CHO , Jung Won KANG , Tae Jin LEE , Minje KIM , Haici YANG

IPC: G10L19/038 , G10L21/0208 , G10L25/30

Abstract: A method and device for encoding/decoding an audio signal based on dequantization through potential diffusion are provided. The method of decoding an audio signal includes obtaining a discrete latent vector in which a speech signal is quantized and based on the discrete latent vector, outputting a continuous latent vector in which the discrete latent vector is dequantized.

46.

发明申请
METHOD AND APPARATUS FOR ENCODING/DECODING AUDIO SIGNAL 有权

公开(公告)号：US20240371383A1

公开(公告)日：2024-11-07

申请号：US18653233

申请日：2024-05-02

Applicant: Electronics and Telecommunications Research Institute , UIF (University Industry Foundation), Yonsei University

Inventor： Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Byeongho CHO , Hong-Goo KANG , Byeong Hyeon KIM , Jihyun LEE , Hyungseob LIM

IPC: G10L19/038 , G10L19/02

Abstract: A method and apparatus for encoding/decoding audio signal are provided. The encoding method includes transforming an input audio signal in a time domain into an audio signal in a frequency domain, quantizing energy of a frequency band of the audio signal in the frequency domain, generating a normal signal by normalizing the audio signal in the frequency domain according to quantized energy, obtaining a feature vector including information on the energy of the frequency band based on the normal signal and the input audio signal, quantizing the feature vector, obtaining a scale factor used to scale the normal signal based on the quantized feature vector, quantizing an adjustment signal into which the normal signal has been scaled based on the scale factor, and outputting bitstreams based on the quantized energy, the quantized feature vector, and the quantized adjustment signal.

47.

发明公开
AUDIO SIGNAL ENCODING/DECODING METHOD AND APPARATUS FOR PERFORMING THE SAME 审中-公开

公开(公告)号：US20240144943A1

公开(公告)日：2024-05-02

申请号：US18473791

申请日：2023-09-25

Applicant: Electronics and Telecommunications Research Institute , The Trustees of Indiana University

Inventor： Woo-taek LIM , Seung Kwon BEACK , Inseon JANG , Jongmo SUNG , Tae Jin LEE , Byeongho CHO , Minje KIM , Darius Petermann

IPC: G10L19/038 , G10L25/18

CPC classification number: G10L19/038 , G10L25/18

Abstract: An audio signal encoding/decoding method and an apparatus for performing the same are disclosed. The audio signal encoding method includes obtaining a full-band input signal, extracting a first feature vector corresponding to a first sub-band signal and a second feature vector corresponding to a second sub-band signal using an encoder neural network including a plurality of encoding layers, generating a first code vector corresponding to the first feature vector and a second code vector corresponding to the second feature vector by compressing the first feature vector and the second feature vector, and generating a bitstream by quantizing the first code vector and the second code vector.

48.

发明公开
APPARATUS AND METHOD FOR AUDIO ENCODING/DECODING ROBUST TO TRANSITION SEGMENT ENCODING DISTORTION 审中-公开

公开(公告)号：US20240087577A1

公开(公告)日：2024-03-14

申请号：US18014924

申请日：2021-07-02

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Kwon BEACK , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG

IPC: G10L19/005 , G10L19/16

CPC classification number: G10L19/005 , G10L19/167

Abstract: Disclosed is an apparatus and method for audio encoding/decoding that is robust against coding distortion in a transition section. An audio encoding method includes outputting a frequency domain signal by time-to-frequency (T/F) transform of an input signal, outputting a frequency domain residual signal in which a frequency axis envelope is removed from the frequency domain signal by applying frequency domain noise shaping (FDNS) encoding to the frequency domain signal, outputting a time domain residual signal in which a time axis envelope is removed by performing linear prediction coefficient (LPC) analysis based on the frequency domain residual signal, and quantizing and transmitting the time domain residual signal.

49.

发明公开
SPEECH CODING METHOD AND APPARATUS FOR PERFORMING THE SAME 审中-公开

公开(公告)号：US20240013796A1

公开(公告)日：2024-01-11

申请号：US18474997

申请日：2023-09-26

Applicant: Electronics and Telecommunications Research Institute , The Trustees of Indiana University

Inventor： Woo-taek LIM , Seung Kwon BEACK , Inseon JANG , Jongmo SUNG , Tae Jin LEE , Byeongho CHO , Minje KIM , Haici YANG

IPC: G10L19/038

CPC classification number: G10L19/038

Abstract: A method of encoding a speech signal includes predicting a feature vector of each of a plurality of frames included in the speech signal based on a ground-truth feature vector of a previous frame of each of the plurality of frames, calculating a residual signal corresponding to each of the plurality of frames based on a ground-truth feature vector of each of the plurality of frames and a predicted feature vector of each of the plurality of frames, and generating a bitstring corresponding to each of the plurality of frames by quantizing the residual signal.

50.

发明公开
SIGNAL COMPRESSION METHOD AND APPARATUS, AND SIGNAL RESTORATION METHOD AND APPARATUS 审中-公开

公开(公告)号：US20230335145A1

公开(公告)日：2023-10-19

申请号：US18118604

申请日：2023-03-07

Applicant: Electronics and Telecommunications Research Institute , Kyungpook National University Industry-Academic Cooperation Foundation

Inventor： Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Inseon JANG , Min Han KIM , Seung Hyeon SHIN , Dae Ho LEE , Seok Jin LEE

IPC: G10L19/06 , G10L19/032

CPC classification number: G10L19/06 , G10L19/032

Abstract: A signal compression method and apparatus and a signal restoration method and apparatus are provided. The signal compression method includes outputting an input signal, obtained by processing an audio signal, which is input, based on a human auditory perception characteristic, using an auditory perception model, extracting a feature vector from the input signal using a feature extraction module, and outputting a code obtained by compressing the feature vector using a trained signal compression model.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification