-
公开(公告)号:US11783844B2
公开(公告)日:2023-10-10
申请号:US17527351
申请日:2021-11-16
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Gwangju Institute of Science and Technology
Inventor: Woo-taek Lim , Seung Kwon Beack , Jongmo Sung , Tae Jin Lee , Inseon Jang , Jong Won Shin , Soojoong Hwang , Youngju Cheon , Sangwook Han
Abstract: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.
-
公开(公告)号:US11778376B2
公开(公告)日:2023-10-03
申请号:US17582209
申请日:2022-01-24
Inventor: Yong Ju Lee , Jae-hyoun Yoo , Dae Young Jang , Kyeongok Kang , Tae Jin Lee
Abstract: An apparatus and method for pitch-shifting an audio signal with low complexity are disclosed. The method includes identifying a distance between an audio object included in the audio signal and a listener, checking whether the distance between the audio object and the listener decreases, and performing stepwise stretching pitch-shifting of repeatedly using at least one of frequency components of the audio signal when the distance between the audio object and the listener decreases.
-
63.
公开(公告)号:US11581000B2
公开(公告)日:2023-02-14
申请号:US17105835
申请日:2020-11-27
Inventor: Woo-Taek Lim , Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee
IPC: G10L19/00 , G10L25/30 , G10L19/16 , G06N3/08 , G10L19/038
Abstract: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
-
64.
公开(公告)号:US11580999B2
公开(公告)日:2023-02-14
申请号:US17331416
申请日:2021-05-26
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang
IPC: G10L19/022 , G10L19/06 , G10L19/16 , G10L19/035
Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
-
公开(公告)号:US11545163B2
公开(公告)日:2023-01-03
申请号:US16729112
申请日:2019-12-27
Inventor: Seung Kwon Beack , Woo-taek Lim , Tae Jin Lee
IPC: G10L19/032 , G10L25/30
Abstract: A loss function of a signal including an audio signal is determined. A loss function determining system for an audio signal is provided. A loss function is determined by: determining a reference quantization index by quantizing an original input signal; inputting the original input signal to a neural network classifier and applying an activation function to an output layer of the neural network classifier; and determining a total loss function for the neural network classifier using an output of the activation function and the reference quantization index.
-
公开(公告)号:US11430457B2
公开(公告)日:2022-08-30
申请号:US16846272
申请日:2020-04-10
Inventor: Seung Kwon Beack , Tae Jin Lee , Min Je Kim , Kyeongok Kang , Dae Young Jang , Jin Woo Hong , Jeongil Seo , Chieteuk Ahn , Hochong Park , Young-Cheol Park
IPC: G10L19/087 , G10L19/22 , G10L19/125 , G10L19/26
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
67.
公开(公告)号:US11330387B2
公开(公告)日:2022-05-10
申请号:US16647458
申请日:2019-10-01
Applicant: Electronics and Telecommunications Research Institute , CHUNG ANG UNIVERSITY INDUSTRY ACADEMIC COOPERATION FOUNDATION
Inventor: Dae Young Jang , Jae-hyoun Yoo , Yong Ju Lee , Tae Jin Lee , Sang Wook Kim
Abstract: An audio signal controlling method includes identifying whether an audio zooming effect is used for at least one audio object present in a virtual reality (VR) through an audio zooming effect field included in metadata, and controlling an audio signal corresponding to the audio object based on a preset method when the audio zooming effect is identified as being used.
-
公开(公告)号:US11328734B2
公开(公告)日:2022-05-10
申请号:US16735522
申请日:2020-01-06
Inventor: Seung Kwon Beack , Jeong Il Seo , Jong Mo Sung , Tae Jin Lee , Jin Soo Choi
IPC: G10L19/008 , G10L19/24 , H04S3/00
Abstract: An encoding method for a multi-channel audio signal, an encoding apparatus for performing the encoding method, and a decoding method for a multi-channel audio signal and a decoding apparatus for performing the decoding method are disclosed. A method and apparatus of bypassing an MPEG Surround (MPS) standard operation and using an arbitrary tree when a number of audio signals of N channels exceeds a channel number defined in an MPS standard, is disclosed.
-
公开(公告)号:US10904689B2
公开(公告)日:2021-01-26
申请号:US16797523
申请日:2020-02-21
Inventor: Jae Hyoun Yoo , Tae Jin Lee , Seok Jin Lee
IPC: H04S3/02 , H04S3/00 , G10L19/16 , G10L19/008
Abstract: An audio metadata providing apparatus and method and a multichannel audio data playback apparatus and method to support a dynamic format conversion are provided. Dynamic format conversion information may include information about a plurality of format conversion schemes that are used to convert a first format set by a writer of multichannel audio data into a second format that is based on a playback environment of the multichannel audio data and that are set for each of playback periods of the multichannel audio data. The audio metadata providing apparatus may provide audio metadata including the dynamic format conversion information. The multichannel audio data playback apparatus may identify the dynamic format conversion information from the audio metadata, may convert the first format of the multichannel audio data into the second format based on the identified dynamic format conversion information, and may play back the multichannel audio data with the second format.
-
公开(公告)号:US10783892B2
公开(公告)日:2020-09-22
申请号:US16404334
申请日:2019-05-06
Inventor: Seung Kwon Beack , Tae Jin Lee , Jong Mo Sung , Kyeong Ok Kang , Keun Woo Choi
IPC: G10L19/00 , G10L19/008 , G10L19/02 , G10L19/002 , G10L19/22
Abstract: An audio encoding apparatus to encode an audio signal using lossless coding or lossy coding and an audio decoding apparatus to decode an encoded audio signal are disclosed. An audio encoding apparatus according to an exemplary embodiment may include an input signal type determination unit to determine a type of an input signal based on characteristics of the input signal, a residual signal generation unit to generate a residual signal based on an output signal from the input signal type determination unit, and a coding unit to perform lossless coding or lossy coding using the residual signal.
-
-
-
-
-
-
-
-
-