Invention Application
- Patent Title: AUDIO SIGNAL ENCODING AND DECODING METHOD USING NEURAL NETWORK MODEL, AND ENCODER AND DECODER FOR PERFORMING THE SAME
- Application No.: US17670172
- Application Date: 2022-02-11
- Publication No.: US20220335963A1
- Publication Date: 2022-10-20
- Inventor: Inseon JANG, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Hong-Goo KANG, Jihyun LEE, Chanwoo LEE, Hyungseob LIM
- Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE; INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Applicant Address: KR Daejeon; KR Seoul
- Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE; INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Current Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE; INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Current Assignee Address: KR Daejeon; KR Seoul
- Priority: KR10-2021-0049104 (2021-04-15)
- Main IPC: G10L19/038
- IPC: G10L19/038; G10L25/30

Abstract:
Disclosed are an audio signal encoding and decoding method using a neural network model, and an encoder and a decoder for performing the same. A method of encoding an audio signal using a neural network model may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model that encodes the input signal, and generating a bitstream corresponding to the quantized latent vector. The neural network model may include i) a feature extraction layer that generates a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks that downsample the latent vector, and iii) a plurality of quantization blocks that quantize the downsampled latent vector.
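
The encoding flow in the abstract (feature extraction, repeated downsampling, quantization, bitstream generation) can be illustrated with a toy sketch. This is not the claimed implementation: the abstract does not specify the layers, so the fixed smoothing kernel, the average-pooling downsampler, the uniform scalar quantizer, the one-quantizer-per-downsampling-stage pairing, and the fixed-width bit packing below are all assumptions chosen only to make the data flow concrete. Every function name is hypothetical, and a real model would use learned neural network layers in place of these stand-ins.

```python
from typing import List, Sequence


def extract_features(signal: Sequence[float],
                     kernel: Sequence[float] = (0.25, 0.5, 0.25)) -> List[float]:
    """Assumed stand-in for the feature extraction layer: a fixed 1-D
    convolution without padding, producing the initial latent vector."""
    k = len(kernel)
    return [sum(kernel[j] * signal[i + j] for j in range(k))
            for i in range(len(signal) - k + 1)]


def downsample(latent: Sequence[float], factor: int = 2) -> List[float]:
    """Assumed stand-in for one downsampling block: average each
    non-overlapping group of `factor` values."""
    n = len(latent) // factor
    return [sum(latent[i * factor:(i + 1) * factor]) / factor for i in range(n)]


def quantize(latent: Sequence[float], step: float = 0.1) -> List[int]:
    """Assumed stand-in for one quantization block: uniform scalar
    quantization to integer indices."""
    return [round(x / step) for x in latent]


def encode(signal: Sequence[float], num_blocks: int = 2,
           step: float = 0.1) -> List[List[int]]:
    """Run feature extraction, then pair each downsampling block with a
    quantization block (the pairing is an assumption of this sketch)."""
    latent = extract_features(signal)
    quantized_per_stage = []
    for _ in range(num_blocks):
        latent = downsample(latent)
        quantized_per_stage.append(quantize(latent, step))
    return quantized_per_stage


def to_bitstream(quantized_per_stage: Sequence[Sequence[int]],
                 bits: int = 8) -> str:
    """Pack every index as a fixed-width two's-complement bit string.
    Indices outside the representable range are clipped."""
    lo, hi = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    mask = (1 << bits) - 1
    chunks = []
    for indices in quantized_per_stage:
        for idx in indices:
            clipped = max(lo, min(hi, idx))
            chunks.append(format(clipped & mask, f"0{bits}b"))
    return "".join(chunks)


if __name__ == "__main__":
    stages = encode([1.0] * 10, num_blocks=2, step=0.5)
    print(stages)
    print(to_bitstream(stages))
```

With a 10-sample input, the sketch yields 8 latent values, then 4 and 2 values after the two downsampling stages, so the quantized output shrinks at each stage before being packed into the bitstream.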