-
111.
Publication No.: US20230245666A1
Publication date: 2023-08-03
Application No.: US18102472
Application date: 2023-01-27
Inventor: Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Inseon JANG , Byeongho CHO
IPC: G10L19/035 , G10L19/038 , G10L19/00
CPC classification number: G10L19/035 , G10L19/038 , G10L19/0017
Abstract: Provided are an encoding method, an encoding device, a decoding method, and a decoding device using a scalar quantization and a vector quantization. The encoding method includes converting an input signal of a time domain into a frequency domain, generating a first residual signal from an input signal of a frequency domain by using a scale factor, performing a scalar quantization of the first residual signal, generating a second residual signal from the scalar-quantized first residual signal, performing a lossless encoding of the scalar-quantized first residual signal, performing a vector quantization of the second residual signal, and transmitting a bitstream including the lossless-encoded first residual signal and the vector-quantized second residual signal.
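The two quantization stages named in this abstract can be pictured with a minimal sketch. It assumes the first residual is the spectrum divided by a single scale factor and that the second residual is the scalar-quantization error; the abstract does not spell out either definition. The names `scalar_quantize`, `nearest_codeword`, and the toy codebook are hypothetical, and the time-to-frequency transform and lossless coding steps are omitted.

```python
from typing import List, Sequence, Tuple


def scalar_quantize(coeffs: Sequence[float], scale: float) -> Tuple[List[float], List[int]]:
    """Normalize by the scale factor (assumed first residual), then round to integers."""
    first_residual = [c / scale for c in coeffs]
    quantized = [round(r) for r in first_residual]
    return first_residual, quantized


def nearest_codeword(vec: Sequence[float], codebook: Sequence[Sequence[float]]) -> int:
    """Vector quantization: index of the codeword with the smallest squared error."""
    def dist2(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: dist2(vec, codebook[i]))


# Hypothetical frequency-domain coefficients and scale factor.
coeffs = [4.6, -2.2, 0.9, 7.4]
scale = 2.0

first_residual, sq = scalar_quantize(coeffs, scale)

# Assumption: the second residual is what scalar quantization left behind.
second_residual = [r - q for r, q in zip(first_residual, sq)]

# Hypothetical toy codebook; a real codec would use a trained one.
codebook = [
    [0.0, 0.0, 0.0, 0.0],
    [0.25, -0.25, 0.5, -0.25],
    [-0.25, 0.25, -0.5, 0.25],
]
vq_index = nearest_codeword(second_residual, codebook)

# The bitstream would carry the lossless-coded `sq` plus `vq_index`.
print(sq, vq_index)
```

In this sketch the decoder would rebuild each coefficient as `(sq[i] + codebook[vq_index][i]) * scale`, so the vector-quantized stage refines what the integer stage could not represent.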
-
112.
Publication No.: US20230224665A1
Publication date: 2023-07-13
Application No.: US18091966
Application date: 2022-12-30
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
IPC: H04S7/00
CPC classification number: H04S7/303 , H04S2400/11 , H04S7/305
Abstract: A method and apparatus for processing acoustic spatial information are provided. The method of processing acoustic spatial information includes identifying at least one mesh disposed in an acoustic space, setting a minimum cuboid surrounding the mesh as a bounding box, and generating acoustic spatial information including information about the bounding box.
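As a rough illustration of the bounding-box step, the sketch below assumes the "minimum cuboid" is axis-aligned, which the abstract does not state. The function name `bounding_box`, the sample vertices, and the dictionary layout are hypothetical.

```python
from typing import Iterable, Tuple

Vec3 = Tuple[float, float, float]


def bounding_box(vertices: Iterable[Vec3]) -> Tuple[Vec3, Vec3]:
    """Return (min_corner, max_corner) of the smallest axis-aligned cuboid
    that contains every vertex of the mesh."""
    pts = list(vertices)
    if not pts:
        raise ValueError("mesh has no vertices")
    xs, ys, zs = zip(*pts)
    return (min(xs), min(ys), min(zs)), (max(xs), max(ys), max(zs))


# Hypothetical mesh vertices in the acoustic space.
mesh = [(0.0, 1.0, 2.0), (3.0, -1.0, 0.5), (1.5, 4.0, -2.0)]
box_min, box_max = bounding_box(mesh)

# Assumed container for the acoustic spatial information.
acoustic_spatial_info = {"bounding_box": {"min": box_min, "max": box_max}}
print(acoustic_spatial_info)
```

Two corner points are enough to describe an axis-aligned cuboid, which keeps the generated spatial information compact; an oriented box would also need a rotation.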
-
113.
Publication No.: US20230039546A1
Publication date: 2023-02-09
Application No.: US17711908
Application date: 2022-04-01
Applicant: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
Inventor: Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Jongwon SHIN , Youngju CHEON , Sangwook HAN , Soojoong HWANG
IPC: G10L19/038 , G06N3/04
Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
-
114.
Publication No.: US20220406320A1
Publication date: 2022-12-22
Application No.: US17895233
Application date: 2022-08-25
Inventor: Seung Kwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jin Woo HONG , Jeongil SEO , Chieteuk AHN , Hochong PARK , Young-Cheol PARK
IPC: G10L19/087 , G10L19/22 , G10L19/125 , G10L19/26
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encodes the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
115.
Publication No.: US20220360932A1
Publication date: 2022-11-10
Application No.: US17681429
Application date: 2022-02-25
Inventor: Dae Young JANG , Kyeongok KANG , Jae-hyoun YOO , Yong Ju LEE , Tae Jin LEE
Abstract: A method and apparatus for rendering a volume sound source are disclosed. The method of rendering a volume sound source may include identifying information about a listener and information about the volume sound source, determining a corresponding area in which a source element is disposed in the volume sound source in consideration of the information about the listener, determining an angle between the listener and the corresponding area based on the information about the listener and the information about the volume sound source, determining a number of source elements disposed in the corresponding area according to the angle, determining a position and a gain of the source element using i) the number of source elements and ii) a distance between the listener and the volume sound source, and rendering the volume sound source according to the position and the gain of the source element.
-
116.
Publication No.: US20220335963A1
Publication date: 2022-10-20
Application No.: US17670172
Application date: 2022-02-11
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
Inventor: Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Hong-Goo KANG , Jihyun LEE , Chanwoo LEE , Hyungseob LIM
IPC: G10L19/038 , G10L25/30
Abstract: An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
-
117.
Publication No.: US20220005488A1
Publication date: 2022-01-06
Application No.: US17368484
Application date: 2021-07-06
Inventor: Jongmo SUNG , Seung Kwon BEACK , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032 , G10L19/16 , H04B17/309 , G06N3/08
Abstract: The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, quantizing the first feature information and producing the first feature bitstream, computing the first output signal from the quantized first feature information using a recurrent decoding model, computing the second feature information of the input signal using a nonrecurrent encoding model, quantizing the second feature information and producing the second feature bitstream, computing the second output signal from the quantized second feature information using a nonrecurrent decoding model, determining an encoding mode based on the input signal, the first and second output signals, and the first and second feature bitstreams, and outputting an overall bitstream by multiplexing an encoding mode bit and one of the first feature bitstream and the second feature bitstream depending on the encoding mode.
-
118.
Publication No.: US20210258709A1
Publication date: 2021-08-19
Application No.: US16647458
Application date: 2019-10-01
Applicant: Electronics and Telecommunications Research Institute , CHUNG ANG UNIVERSITY INDUSTRY ACADEMIC COOPERATION FOUNDATION
Inventor: Dae Young JANG , Jae-hyoun YOO , Yong Ju LEE , Tae Jin LEE , Sang Wook KIM
Abstract: An audio signal controlling method includes identifying whether an audio zooming effect is used for at least one audio object present in a virtual reality (VR) through an audio zooming effect field included in metadata, and controlling an audio signal corresponding to the audio object based on a preset method when the audio zooming effect is identified as being used.
-
119.
Publication No.: US20210201923A1
Publication date: 2021-07-01
Application No.: US17201943
Application date: 2021-03-15
Inventor: Yong Ju LEE , Jeong Il SEO , Jae Hyoun YOO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Kyeong Ok KANG , Jin Woong KIM , Tae Jin PARK , Dae Young JANG , Keun Woo CHOI
IPC: G10L19/008 , H04S7/00
Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal based on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
-
120.
Publication No.: US20200349959A1
Publication date: 2020-11-05
Application No.: US16843649
Application date: 2020-04-08
Applicant: Electronics and Telecommunications Research Institute , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Hochong PARK , Seung Kwon BEACK , Jongmo SUNG , Seong-Hyeon SHIN , Mi Suk LEE , Tae Jin LEE , Jin Soo CHOI
IPC: G10L19/032 , G10L25/18 , G10L25/21 , G06N3/08 , G06N20/00
Abstract: An inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. The inventive concept transmits a part of the frequency spectral coefficients generated in transform coding to a decoder, and the decoder recovers the frequency spectral coefficients not transmitted. Furthermore, the signs of the frequency spectral coefficients are transmitted from an encoder to the decoder depending on a sign transmission rule.
-