-
Publication No.: US20240233738A9
Publication date: 2024-07-11
Application No.: US18358646
Filing date: 2023-07-25
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN
IPC: G10L19/02
CPC classification number: G10L19/02
Abstract: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
Publication No.: US20240055009A1
Publication date: 2024-02-15
Application No.: US18349680
Filing date: 2023-07-10
Inventor: Byeongho CHO , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032
CPC classification number: G10L19/032
Abstract: Provided are an apparatus for encoding an audio signal and a method of an operation thereof. An audio signal encoding method includes obtaining quantized linear prediction (LP) coefficients by performing a linear predictive coding (LPC) analysis and quantization on an input audio signal, generating a reference signal by applying discrete Fourier transform (DFT) to the input audio signal, obtaining LP residual coefficients from the reference signal, scaling magnitudes of the LP residual coefficients using the quantized LP coefficients and the reference signal, and quantizing phases of the LP residual coefficients and the scaled magnitudes of the LP residual coefficients.
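The LPC analysis step named in this abstract can be illustrated with a minimal sketch. It assumes the autocorrelation method with the Levinson-Durbin recursion, which is a common choice but is not stated in the abstract. Quantization, the DFT-domain reference signal, and the magnitude scaling are not modeled, and all function names are hypothetical.

```python
from typing import List, Sequence


def autocorrelation(x: Sequence[float], order: int) -> List[float]:
    """Autocorrelation values r[0..order] of a finite frame."""
    return [sum(x[n] * x[n - k] for n in range(k, len(x))) for k in range(order + 1)]


def levinson_durbin(r: Sequence[float], order: int) -> List[float]:
    """Solve the normal equations for the prediction-error filter.

    Returns [1, a1, ..., ap] such that e[n] = x[n] + a1*x[n-1] + ... + ap*x[n-p].
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # degenerate frame (e.g. silence): keep remaining coefficients at 0
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this order
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a


def lp_residual(x: Sequence[float], a: Sequence[float]) -> List[float]:
    """Time-domain LP residual; samples before the frame start are treated as zero."""
    return [
        sum(a[k] * x[n - k] for k in range(len(a)) if n - k >= 0)
        for n in range(len(x))
    ]
```

For a decaying first-order signal such as `x[n] = 0.9**n`, the order-1 analysis yields a coefficient close to `-0.9`, and the residual is close to zero after the first sample.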
-
Publication No.: US20230308828A1
Publication date: 2023-09-28
Application No.: US18191695
Filing date: 2023-03-28
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
IPC: H04S7/00
CPC classification number: H04S7/305 , H04S7/303 , H04S2400/11
Abstract: An audio signal processing apparatus and an audio signal processing method are disclosed. The audio signal processing method performed by the audio signal processing apparatus includes determining, based on a bitstream, whether a line of sight between a render item (RI) corresponding to an audio element and a listener is visible; in response to the line of sight not being visible, generating an audio signal by rendering a diffraction-type RI corresponding to the RI; and outputting the audio signal.
-
Publication No.: US20230230604A1
Publication date: 2023-07-20
Application No.: US18099119
Filing date: 2023-01-19
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Tae Jin LEE , Seung Kwon BEACK , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN , Soojoong HWANG , Eunkyun LEE , Youngwon CHOI , Sangwook HAN
CPC classification number: G10L19/0204 , G10L25/30
Abstract: A method of encoding an audio signal and an encoder and a method of decoding an audio signal and a decoder are provided. The method of encoding an audio signal includes outputting a decoded signal by using a bitstream that encodes an audio signal, separating the decoded signal into a low-band signal and a high-band signal by using a sound source separator, upsampling the low-band signal, upsampling the high-band signal, and restoring the audio signal by synthesizing the upsampled low-band signal with the upsampled high-band signal, wherein the bitstream is generated by encoding a superimposed signal in which a signal in a high frequency band of the audio signal is superimposed on a low frequency band of the audio signal.
-
Publication No.: US20230224662A1
Publication date: 2023-07-13
Application No.: US18146685
Filing date: 2022-12-27
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
IPC: H04S7/00
CPC classification number: H04S7/302
Abstract: Provided is a method and apparatus for generating an impulse response using ray tracing. The method of generating an impulse response may include calculating a number of rays reaching a receiver from a transmitter based on acoustic geometry information including a position of the transmitter and a position of the receiver disposed in a sound space, a maximum ray length or a sound space volume, and a radius of the receiver, tracing the rays using a path of the calculated rays, and generating an impulse response based on the traced rays.
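One way to reason about a ray count that depends on the maximum ray length and the receiver radius is the geometric hit probability of a spherical receiver. The sketch below is an illustration under stated assumptions, not the formula claimed in the application: rays are emitted uniformly in all directions, the receiver is a sphere, and the worst case is a receiver at the full ray length. The function names and the `min_hits` parameter are hypothetical.

```python
import math


def hit_probability(receiver_radius: float, distance: float) -> float:
    """Fraction of uniformly emitted rays that cross a spherical receiver.

    Assumption: receiver cross-section (pi * r^2) divided by the area of the
    sphere at that distance (4 * pi * d^2), a good approximation when r << d.
    """
    if receiver_radius <= 0 or distance <= 0:
        raise ValueError("radius and distance must be positive")
    return min(1.0, receiver_radius ** 2 / (4.0 * distance ** 2))


def rays_needed(receiver_radius: float, max_ray_length: float, min_hits: int = 1) -> int:
    """Rays to emit so that about `min_hits` rays reach a receiver at max_ray_length."""
    if receiver_radius <= 0 or max_ray_length <= 0 or min_hits < 1:
        raise ValueError("arguments must be positive")
    count = math.ceil(min_hits * 4.0 * max_ray_length ** 2 / receiver_radius ** 2)
    return max(count, min_hits)
```

Under these assumptions, a receiver of radius 0.5 at a maximum ray length of 10 needs about 1600 emitted rays for one expected hit, because the required count grows with the square of the ray length.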
-
Publication No.: US20230224661A1
Publication date: 2023-07-13
Application No.: US17713059
Filing date: 2022-04-04
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
CPC classification number: H04S7/302 , H04S5/005 , H04S2400/11 , H04S2420/01
Abstract: A method and apparatus for rendering an object-based audio signal considering an obstacle are disclosed. A method for rendering an object-based audio signal according to an example embodiment includes identifying an object-based input signal and metadata for the input signal, generating a binaural filter based on the metadata using a binaural room impulse response (BRIR), determining, based on the metadata, whether an obstacle is present between a listener and an object, modifying the generated binaural filter when it is determined that the obstacle is present, and generating a rendered output signal by convolving the modified binaural filter and the input signal.
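The last two steps of this abstract (modify the binaural filter when an obstacle is present, then convolve it with the input signal) can be sketched as follows. The abstract does not say how the filter is modified, so the attenuation plus one-pole low-pass used here is purely an assumption, as are the function names and the `gain` and `smoothing` parameters. Deriving the filter from metadata and a BRIR is not modeled; the filter is taken as a given pair of tap lists.

```python
from typing import List, Sequence, Tuple


def convolve(x: Sequence[float], h: Sequence[float]) -> List[float]:
    """Direct-form linear convolution of two finite sequences."""
    if not x or not h:
        return []
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def occlude(taps: Sequence[float], gain: float = 0.5, smoothing: float = 0.6) -> List[float]:
    """Hypothetical obstacle model: attenuate and low-pass the filter taps.

    The one-pole low-pass runs along the taps; its tail beyond the original
    filter length is truncated.
    """
    out, state = [], 0.0
    for tap in taps:
        state = (1.0 - smoothing) * tap + smoothing * state
        out.append(gain * state)
    return out


def render(
    signal: Sequence[float],
    binaural_filter: Tuple[Sequence[float], Sequence[float]],
    obstacle_present: bool,
) -> Tuple[List[float], List[float]]:
    """Return (left, right) output, using the modified filter only when occluded."""
    left, right = binaural_filter
    if obstacle_present:
        left, right = occlude(left), occlude(right)
    return convolve(signal, left), convolve(signal, right)
```

Modifying the filter once and then convolving keeps the per-sample cost the same with or without an obstacle, which is the main appeal of this arrangement.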
-
Publication No.: US20230112342A1
Publication date: 2023-04-13
Application No.: US17582209
Filing date: 2022-01-24
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
Abstract: An apparatus and method for pitch-shifting an audio signal with low complexity are disclosed. The method includes identifying a distance between an audio object included in the audio signal and a listener, checking whether the distance between the audio object and the listener decreases, and, when the distance decreases, performing stepwise stretching pitch-shifting that repeatedly uses at least one of the frequency components of the audio signal.
-
Publication No.: US20220358940A1
Publication date: 2022-11-10
Application No.: US17527351
Filing date: 2021-11-16
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Gwangju Institute of Science and Technology
Inventor: Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Inseon JANG , Jong Won SHIN , Soojoong HWANG , Youngju CHEON , Sangwook HAN
Abstract: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.
-
Publication No.: US20220216881A1
Publication date: 2022-07-07
Application No.: US17484284
Filing date: 2021-09-24
Inventor: Young Ho JEONG , Soo Young PARK , Tae Jin LEE
Abstract: Disclosed are a training method for a learning model for recognizing an acoustic signal, a method of recognizing an acoustic signal using the learning model, and devices for performing the methods. The method of recognizing an acoustic signal using a learning model includes identifying an acoustic signal including an acoustic event or acoustic scene, determining an acoustic feature of the acoustic signal, dividing the determined acoustic feature for each of a plurality of frequency band intervals, and determining the acoustic event or acoustic scene included in the acoustic signal by inputting the divided acoustic features to a trained learning model.
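The band-division step in this abstract can be sketched as a split of a time-frequency feature along its frequency axis. This is a minimal illustration assuming the feature is a list of frames, each holding one value per frequency bin, and that the bands are contiguous and nearly equal in width. The abstract specifies neither, and `split_into_bands` is a hypothetical name. Feeding the bands to a trained model is not shown.

```python
from typing import List, Sequence


def split_into_bands(feature: Sequence[Sequence[float]], num_bands: int) -> List[List[List[float]]]:
    """Split a (frames x bins) feature into contiguous frequency bands.

    Returns one (frames x band_bins) sub-feature per band, in ascending
    frequency order. Band edges use integer division, so widths differ by at
    most one bin.
    """
    if not feature:
        raise ValueError("feature must contain at least one frame")
    num_bins = len(feature[0])
    if not 1 <= num_bands <= num_bins:
        raise ValueError("num_bands must be between 1 and the number of bins")
    edges = [b * num_bins // num_bands for b in range(num_bands + 1)]
    return [
        [list(frame[lo:hi]) for frame in feature]
        for lo, hi in zip(edges, edges[1:])
    ]
```

Each returned band keeps every frame, so a model could consume the bands separately or in parallel branches.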
-
Publication No.: US20210398547A1
Publication date: 2021-12-23
Application No.: US17331416
Filing date: 2021-05-26
Inventor: Seung Kwon BEACK , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/035 , G10L19/022 , G10L19/06 , G10L19/16
Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
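The first two steps of this abstract (identify the signal in blocks, then combine each current block with the chronologically previous one) can be sketched as below. The sketch assumes fixed-size blocks, a zero-filled block standing in for the missing predecessor of the first block, and that a trailing partial block is dropped. None of these details comes from the abstract, and `combined_blocks` is a hypothetical name. The linear-prediction and quantization stages are not modeled.

```python
from typing import List, Sequence


def combined_blocks(samples: Sequence[float], block_size: int) -> List[List[float]]:
    """Return one combined block (previous block + current block) per input block."""
    if block_size < 1:
        raise ValueError("block_size must be positive")
    blocks = [
        list(samples[i:i + block_size])
        for i in range(0, len(samples) - block_size + 1, block_size)
    ]
    previous = [0.0] * block_size  # assumption: silence before the first block
    combined = []
    for current in blocks:
        combined.append(previous + current)
        previous = current
    return combined
```

Every combined block has twice the block length and shares half of its samples with its neighbor, which is what makes the later overlapping of adjacent residual signals possible.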
-