Patent search ap:("Electronics AND Telecommunications Research Institute") AND inv:"Tae Jin LEE" Page 3

21.

发明申请
UNIFIED SPEECH/AUDIO CODEC (USAC) PROCESSING WINDOWS SEQUENCE BASED MODE SWITCHING 审中-公开
Title translation: 统一语音/音频编解码器（USAC）处理窗口序列模式切换

公开(公告)号：US20150134328A1

公开(公告)日：2015-05-14

申请号：US14588638

申请日：2015-01-02

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION

Inventor： Seungkwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jeongil SEO , Jin Woo HONG , Chieteuk AHN , Ho Chong PARK , Young-cheol PARK

IPC: G10L19/04

CPC classification number: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18

Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.

Abstract translation: 提供了可以基于模式切换处理窗口序列的统一语音和音频编解码器（USAC）。 USAC可以通过在发生模式切换时基于折叠点在帧之间重叠来执行编码或解码。 USAC可以针对每种情况处理不同的窗口序列以执行编码或解码，从而可以提高编码效率。

22.

发明申请
BINAURAL RENDERING METHOD AND APPARATUS FOR DECODING MULTI CHANNEL AUDIO 有权
Title translation: 用于解码多通道音频的双向渲染方法和装置

公开(公告)号：US20150030160A1

公开(公告)日：2015-01-29

申请号：US14341554

申请日：2014-07-25

Applicant: Electronics and Telecommunications Research Institute

Inventor： Yong Ju LEE , Jeong Il SEO , Jae Hyoun YOO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Kyeong Ok KANG , Jin Woong KIM , Tae Jin PARK , Dae Young JANG , Keun Woo CHOI

IPC: H04S5/00 , G10K15/08 , H04R5/04

CPC classification number: G10L19/008 , H04S7/00 , H04S7/30 , H04S2400/01 , H04S2400/03

Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.

Abstract translation: 公开了一种用于对多声道音频信号进行解码的双耳渲染方法和装置。双耳渲染方法可以包括：从双耳滤波器提取早期反射分量和后期混响分量; 通过基于早期反射分量执行多声道音频信号的双耳渲染来产生立体声音频信号; 以及将后期混响分量应用于所生成的立体声音频信号。

23.

发明申请
AUDIO PROCESSING METHOD USING COMPLEX NUMBER DATA, AND APPARATUS FOR PERFORMING SAME 有权

公开(公告)号：US20250104721A1

公开(公告)日：2025-03-27

申请号：US18686568

申请日：2022-12-15

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor： Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO

IPC: G10L19/032 , G10L19/03

Abstract: Disclosed are a device and method for audio signal processing. The audio signal processing device according to an embodiment includes a receiver configured to receive a bitstream corresponding to a compressed audio signal and a processor. The processor may be configured to generate a real restoration signal or a complex restoration signal by performing inverse quantization on real data of the bitstream or complex data of the bitstream, generate a result of real Frequency Domain Noise Shaping (FDNS) synthesis or a result of complex FDNS synthesis by performing FDNS synthesis on the real restoration signal or the complex restoration signal, and generate a restored audio signal by performing frequency-to-time transform on the result of the real FDNS synthesis or the result of the complex FDNS synthesis.

24.

发明公开
UNIFIED SPEECH/AUDIO CODEC (USAC) PROCESSING WINDOWS SEQUENCE BASED MODE SWITCHING 审中-公开

公开(公告)号：US20240212698A1

公开(公告)日：2024-06-27

申请号：US18426726

申请日：2024-01-30

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION

Inventor： Seungkwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jeongil SEO , Jin Woo HONG , Chieteuk AHN , Ho Chong PARK , Young-cheol PARK

IPC: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18

CPC classification number: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18

Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.

25.

发明公开
APPARATUS FOR ENCODING AND DECODING AUDIO SIGNALS AND METHOD OF OPERATION THEREOF 审中-公开

公开(公告)号：US20240135941A1

公开(公告)日：2024-04-25

申请号：US18358646

申请日：2023-07-24

Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology

Inventor： Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN

IPC: G10L19/02

CPC classification number: G10L19/02

Abstract: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.

26.

发明公开
APPARATUS FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO 审中-公开

公开(公告)号：US20240119948A1

公开(公告)日：2024-04-11

申请号：US18212364

申请日：2023-06-21

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Kwangwoon University Industry-Academic Collaboration Foundation

Inventor： Tae Jin LEE , Seung-Kwon BAEK , Min Je KIM , Dae Young JANG , Jeongil SEO , Kyeongok KANG , Jin-Woo HONG , Hochong PARK , Young-Cheol PARK

IPC: G10L19/008 , G10L19/02 , G10L19/04 , G10L19/12 , G10L19/20

CPC classification number: G10L19/008 , G10L19/02 , G10L19/04 , G10L19/12 , G10L19/20 , G10L19/00

Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and a audio signal, and may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristics signal; a audio signal encoder to encode the input signal using a audio encoding module when the input signal is a audio characteristic signal; and a bitstream generator to generate a bitstream.

27.

发明公开
METHOD OF RENDERING OBJECT-BASED AUDIO AND ELECTRONIC DEVICE FOR PERFORMING THE METHOD 审中-公开

公开(公告)号：US20230345197A1

公开(公告)日：2023-10-26

申请号：US18304257

申请日：2023-04-20

Applicant: Electronics and Telecommunications Research Institute

Inventor： Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Soo Young PARK , Tae Jin LEE , Young Ho JEONG

IPC: H04S7/00

CPC classification number: H04S7/303 , H04S7/307 , H04S2400/11 , H04S2400/13

Abstract: A method of rendering object-based audio and an electronic device for performing the method are disclosed. The method includes identifying metadata of the object-based audio, determining whether the metadata includes a parameter set for an atmospheric absorption effect for each distance, and rendering the object-based audio, using a distance between the object-based audio and a listener obtained using the metadata and the atmospheric absorption effect according to an effect of a medium attenuation based on the parameter, when the metadata includes the parameter.

28.

发明公开
AUDIO SIGNAL GENERATION MODEL AND TRAINING METHOD USING GENERATIVE ADVERSARIAL NETWORK 审中-公开

公开(公告)号：US20230267950A1

公开(公告)日：2023-08-24

申请号：US18097062

申请日：2023-01-13

Applicant: Electronics and Telecommunications Research Institute , Industry-Academic Cooperation Foundation, Yonsei University

Inventor： In Seon JANG , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Woo Taek LIM , Byeong Ho CHO , Hong Goo KANG , Ji Hyun LEE , Chan Woo LEE , Hyung Seob LIM

IPC: G10L25/51 , G10L25/30

CPC classification number: G10L25/51 , G10L25/30

Abstract: A generative adversarial network-based audio signal generation model for generating a high quality audio signal may comprise: a generator generating an audio signal with an external input; a harmonic-percussive separation model separating the generated audio signal into a harmonic component signal and a percussive component signal; and at least one discriminator evaluating whether each of the harmonic component signal and the percussive component signal is real or fake.

29.

发明公开
METHOD OF TRAINING NEURAL NETWORK MODEL, METHOD OF RECOGNIZING ACOUSTIC EVENT AND ACOUSTIC DIRECTION, AND ELECTRONIC DEVICE FOR PERFORMING THE METHODS 审中-公开

公开(公告)号：US20230224656A1

公开(公告)日：2023-07-13

申请号：US17869171

申请日：2022-07-20

Applicant: Electronics and Telecommunications Research Institute

Inventor： Soo Young PARK , Tae Jin LEE , Young Ho JEONG

IPC: H04R29/00 , G06N3/08

CPC classification number: H04R29/008 , G06N3/08

Abstract: Provided are a method of training a neural network model, a method of recognizing an acoustic event and an acoustic direction, and an electronic device for performing the methods. A method of training a neural network model according to an example embodiment includes generating a heatmap indicating an acoustic event and an acoustic direction in which the acoustic event occurs by using training data, outputting a result of recognizing the acoustic event and the acoustic direction by inputting a feature extracted using the training data into a neural network model for recognizing the acoustic event and the acoustic direction of the training data, and training the neural network model by using the result and the heatmap.

30.

发明公开
METHOD OF TRAINING SOUND RECOGNITION MODEL, METHOD OF RECOGNIZING SOUND, AND ELECTRONIC DEVICE FOR PERFORMING THE METHODS 审中-公开

公开(公告)号：US20230214647A1

公开(公告)日：2023-07-06

申请号：US17869242

申请日：2022-07-20

Applicant: Electronics and Telecommunications Research Institute

Inventor： Soo Young PARK , Tae Jin LEE , Young Ho JEONG

IPC: G06N3/08 , G10L25/51

CPC classification number: G06N3/08 , G10L25/51

Abstract: Provided are a method of recognizing sound, a method of training a sound recognition model, and an electronic device performing the same methods. A method of training a sound recognition model according to an example embodiment may include converting training data labeled with a sound class into a feature vector, storing the feature vector in a feature queue, transferring the feature vector stored in the feature queue to a block queue according to an operation of a feature vector transfer timer, inputting the feature vector of the block queue into a sound recognition model trained to predict the sound class and storing an output result in a result queue, transferring the feature vector stored in the feature queue corresponding to timing at which the result is output to the block queue by the feature vector transfer timer when the result is output.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification