Invention Publication
- Patent Title: AUDIO SIGNAL PROCESSING APPARATUS AND METHOD FOR DEEP NEURAL NETWORK-BASED AUDIO ENCODER AND DECODER
- Application No.: US18505970
- Application Date: 2023-11-09
- Publication No.: US20240169997A1
- Publication Date: 2024-05-23
- Inventor: Jong Mo SUNG, Seung Kwon BEACK, Young Cheol Park, Joon BYUN, Seung Min SHIN
- Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Applicant Address: KR Daejeon
- Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Current Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
- Current Assignee Address: KR Daejeon
- Priority: KR 20220149392, 2022-11-10
- Main IPC: G10L19/005
- IPC: G10L19/005

Abstract:
An audio signal processing method, which is executed by a processor electronically communicating with a deep neural network within a computing system, may comprise: acquiring, by the processor, an input signal before encoding and an output signal after quantization and decoding; calculating, by the processor, a perceptual global loss for a frame corresponding to the input and output signals; acquiring, by the processor, a plurality of subframes corresponding to the input and output signals by applying a windowing function to the frame of the input and output signals; calculating, by the processor, perceptual local losses for the plurality of subframes corresponding to the input and output signals; and acquiring, by the processor, a multi-time scale perceptual loss based on the perceptual global and local losses.
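
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the abstract does not define the perceptual loss, the windowing function, or how the global and local losses are combined. The sketch therefore assumes a log-spectral distance as a stand-in perceptual loss, a Hann window for the subframes, and a weighted sum with a hypothetical weight `alpha`. All function names, and the parameters `sub_len` and `hop`, are illustrative only.

```python
import cmath
import math


def log_spectrum(x, eps=1e-8):
    """Log-magnitude of a naive DFT, non-negative frequency bins only."""
    n = len(x)
    return [
        math.log(abs(sum(x[t] * cmath.exp(-2j * math.pi * k * t / n)
                         for t in range(n))) + eps)
        for k in range(n // 2 + 1)
    ]


def spectral_loss(x, y):
    """Stand-in 'perceptual' loss (an assumption, not from the source):
    mean squared difference of the log-magnitude spectra."""
    lx, ly = log_spectrum(x), log_spectrum(y)
    return sum((a - b) ** 2 for a, b in zip(lx, ly)) / len(lx)


def hann(n):
    """Symmetric Hann window of length n (assumed window choice)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def split_subframes(frame, sub_len, hop):
    """Cut one frame into overlapping, Hann-windowed subframes."""
    if len(frame) < sub_len:
        raise ValueError("frame is shorter than one subframe")
    w = hann(sub_len)
    return [[frame[s + i] * w[i] for i in range(sub_len)]
            for s in range(0, len(frame) - sub_len + 1, hop)]


def multi_time_scale_loss(x, y, sub_len=16, hop=8, alpha=0.5):
    """Combine a frame-level (global) loss with subframe-level (local) losses.

    x: input frame before encoding; y: output frame after quantization and
    decoding. The weighted sum with `alpha` is an assumed combination rule.
    """
    global_loss = spectral_loss(x, y)
    local_losses = [
        spectral_loss(a, b)
        for a, b in zip(split_subframes(x, sub_len, hop),
                        split_subframes(y, sub_len, hop))
    ]
    local_loss = sum(local_losses) / len(local_losses)
    total = alpha * global_loss + (1 - alpha) * local_loss
    return total, global_loss, local_losses
```

In a training setup, `x` would be one frame of the encoder input and `y` the matching frame of the decoder output. A real system would likely replace the naive DFT with an FFT and the stand-in loss with a psychoacoustically motivated one.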