-
公开(公告)号:US20240135941A1
公开(公告)日:2024-04-25
申请号:US18358646
申请日:2023-07-24
申请人: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
发明人: Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN
IPC分类号: G10L19/02
CPC分类号: G10L19/02
摘要: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
公开(公告)号:US20240233738A9
公开(公告)日:2024-07-11
申请号:US18358646
申请日:2023-07-25
申请人: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
发明人: Inseon JANG , Seung Kwon BEACK , Tae Jin LEE , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN
IPC分类号: G10L19/02
CPC分类号: G10L19/02
摘要: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
3.
公开(公告)号:US20230230604A1
公开(公告)日:2023-07-20
申请号:US18099119
申请日:2023-01-19
申请人: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
发明人: Inseon JANG , Tae Jin LEE , Seung Kwon BEACK , Jongmo SUNG , Woo-taek LIM , Byeongho CHO , Jongwon SHIN , Soojoong HWANG , Eunkyun LEE , Youngwon CHOI , Sangwook HAN
CPC分类号: G10L19/0204 , G10L25/30
摘要: A method of encoding an audio signal and an encoder and a method of decoding an audio signal and a decoder are provided. The method of encoding an audio signal includes outputting a decoded signal by using a bitstream that encodes an audio signal, separating the decoded signal into a low-band signal and a high-band signal by using a sound source separator, upsampling the low-band signal, upsampling the high-band signal, and restoring the audio signal by synthesizing the upsampled low-band signal with the upsampled high-band signal, wherein the bitstream is generated by encoding a superimposed signal in which a signal in a high frequency band of the audio signal is superimposed on a low frequency band of the audio signal.
-
公开(公告)号:US20220358940A1
公开(公告)日:2022-11-10
申请号:US17527351
申请日:2021-11-16
申请人: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Gwangju Institute of Science and Technology
发明人: Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Inseon JANG , Jong Won SHIN , Soojoong HWANG , Youngju CHEON , Sangwook HAN
摘要: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.
-
5.
公开(公告)号:US20230039546A1
公开(公告)日:2023-02-09
申请号:US17711908
申请日:2022-04-01
申请人: Electronics and Telecommunications Research Institute , Gwangju Institute of Science and Technology
发明人: Inseon JANG , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE , Woo-taek LIM , Jongwon SHIN , Youngju CHEON , Sangwook HAN , Soojoong HWANG
IPC分类号: G10L19/038 , G06N3/04
摘要: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
-
6.
公开(公告)号:US20230267950A1
公开(公告)日:2023-08-24
申请号:US18097062
申请日:2023-01-13
申请人: Electronics and Telecommunications Research Institute , Industry-Academic Cooperation Foundation, Yonsei University
发明人: In Seon JANG , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Woo Taek LIM , Byeong Ho CHO , Hong Goo KANG , Ji Hyun LEE , Chan Woo LEE , Hyung Seob LIM
摘要: A generative adversarial network-based audio signal generation model for generating a high quality audio signal may comprise: a generator generating an audio signal with an external input; a harmonic-percussive separation model separating the generated audio signal into a harmonic component signal and a percussive component signal; and at least one discriminator evaluating whether each of the harmonic component signal and the percussive component signal is real or fake.
-
公开(公告)号:US20220005487A1
公开(公告)日:2022-01-06
申请号:US17368390
申请日:2021-07-06
发明人: Jongmo SUNG , Seung Kwon BEACK , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC分类号: G10L19/032
摘要: An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
-
公开(公告)号:US20190306577A1
公开(公告)日:2019-10-03
申请号:US16025217
申请日:2018-07-02
发明人: Young Ho JEONG , Seung Kwon BEACK , Tae Jin LEE , Hui Yong KIM
IPC分类号: H04N21/466 , H04N21/442 , H04N21/435 , G10L19/018
摘要: Provided are a signal processing method for determining an audience rating of media, and an additional information inserting apparatus, a media reproducing apparatus and an audience rating determining apparatus for performing the same method. In detail, the signal processing method for determining an audience rating of media is a method that may determine an audience rating of media with respect to a whole section of an audio signal by inserting additional information into a silence section through a noise signal.
-
公开(公告)号:US20190180763A1
公开(公告)日:2019-06-13
申请号:US16180298
申请日:2018-11-05
发明人: Seung Kwon BEACK , Woo-taek LIM , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE , Hui Yong KIM
摘要: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
-
公开(公告)号:US20190147894A1
公开(公告)日:2019-05-16
申请号:US16245024
申请日:2019-01-10
发明人: Yong Ju LEE , Jeong Il SEO , Jae Hyoun YOO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Kyeong Ok KANG , Jin Woong KIM , Tae Jin PARK , Dae Young JANG , Keun Woo CHOI
IPC分类号: G10L19/008 , H04S7/00
摘要: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
-
-
-
-
-
-
-
-
-