AUDIO SIGNAL ENCODING AND DECODING METHOD USING NEURAL NETWORK MODEL, AND ENCODER AND DECODER FOR PERFORMING THE SAME

Invention Application

US20220335963A1 AUDIO SIGNAL ENCODING AND DECODING METHOD USING NEURAL NETWORK MODEL, AND ENCODER AND DECODER FOR PERFORMING THE SAME 有权

Please log in to see more content

Patent Title: AUDIO SIGNAL ENCODING AND DECODING METHOD USING NEURAL NETWORK MODEL, AND ENCODER AND DECODER FOR PERFORMING THE SAME
Application No.: US17670172

Application Date: 2022-02-11
Publication No.: US20220335963A1

Publication Date: 2022-10-20
Inventor: Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Hong-Goo KANG , Jihyun LEE , Chanwoo LEE , Hyungseob LIM
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
Applicant Address: KR Daejeon; KR Seoul
Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE,INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
Current Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE,INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
Current Assignee Address: KR Daejeon; KR Seoul
Priority: KR10-2021-0049104 20210415
Main IPC: G10L19/038
IPC: G10L19/038 ; G10L25/30

AUDIO SIGNAL ENCODING AND DECODING METHOD USING NEURAL NETWORK MODEL, AND ENCODER AND DECODER FOR PERFORMING THE SAME

Abstract:

An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model, the method may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.

Public/Granted literature

US12205605B2 Audio signal encoding and decoding method using a neural network model to generate a quantized latent vector, and encoder and decoder for performing the same Public/Granted day:2025-01-21

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L19/00	用于冗余度下降情形（例如在声码器中）的语音或音频信号分析-合成技术；语音或音频信号编码或解码，采用源滤波器模型或心理声学分析（乐器中的入G10H）
G10L19/02	.利用频谱分析，例如变换声码器或子频带声码器
G10L19/032	..频谱分量的量化或非量化
G10L19/038	...矢量量化，例如TwinVQ音频