Invention Grant
- Patent Title: Audio signal encoding and decoding method using a neural network model to generate a quantized latent vector, and encoder and decoder for performing the same
-
Application No.: US17670172Application Date: 2022-02-11
-
Publication No.: US12205605B2Publication Date: 2025-01-21
- Inventor: Inseon Jang , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Woo-Taek Lim , Hong-Goo Kang , Jihyun Lee , Chanwoo Lee , Hyungseob Lim
- Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Applicant Address: KR Daejeon; KR Seoul
- Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE,INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Current Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE,INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Current Assignee Address: KR Daejeon; KR Seoul
- Priority: KR10-2021-0049104 20210415
- Main IPC: G10L19/038
- IPC: G10L19/038 ; G10L19/00 ; G10L25/30

Abstract:
An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model, the method may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
Public/Granted literature
Information query