-
1.
Publication No.: US20240135941A1
Publication date: 2024-04-25
Application No.: US18358646
Filing date: 2023-07-24
Applicant: Electronics and Telecommunications Research Institute; Gwangju Institute of Science and Technology
Inventors: Inseon JANG, Seung Kwon BEACK, Tae Jin LEE, Jongmo SUNG, Woo-taek LIM, Byeongho CHO, Jongwon SHIN
IPC classification: G10L19/02
CPC classification: G10L19/02
Abstract: Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
-
2.
Publication No.: US20230267950A1
Publication date: 2023-08-24
Application No.: US18097062
Filing date: 2023-01-13
Applicant: Electronics and Telecommunications Research Institute; Industry-Academic Cooperation Foundation, Yonsei University
Inventors: In Seon JANG, Seung Kwon BEACK, Jong Mo SUNG, Tae Jin LEE, Woo Taek LIM, Byeong Ho CHO, Hong Goo KANG, Ji Hyun LEE, Chan Woo LEE, Hyung Seob LIM
Abstract: A generative adversarial network-based audio signal generation model for generating a high quality audio signal may comprise: a generator generating an audio signal with an external input; a harmonic-percussive separation model separating the generated audio signal into a harmonic component signal and a percussive component signal; and at least one discriminator evaluating whether each of the harmonic component signal and the percussive component signal is real or fake.
-
3.
Publication No.: US20220005487A1
Publication date: 2022-01-06
Application No.: US17368390
Filing date: 2021-07-06
Inventors: Jongmo SUNG, Seung Kwon BEACK, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG
IPC classification: G10L19/032
Abstract: An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
-
4.
Publication No.: US20190306577A1
Publication date: 2019-10-03
Application No.: US16025217
Filing date: 2018-07-02
Inventors: Young Ho JEONG, Seung Kwon BEACK, Tae Jin LEE, Hui Yong KIM
IPC classification: H04N21/466, H04N21/442, H04N21/435, G10L19/018
Abstract: Provided are a signal processing method for determining an audience rating of media, and an additional information inserting apparatus, a media reproducing apparatus and an audience rating determining apparatus for performing the same method. In detail, the signal processing method for determining an audience rating of media is a method that may determine an audience rating of media with respect to a whole section of an audio signal by inserting additional information into a silence section through a noise signal.
-
5.
Publication No.: US20190180763A1
Publication date: 2019-06-13
Application No.: US16180298
Filing date: 2018-11-05
Inventors: Seung Kwon BEACK, Woo-taek LIM, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Hui Yong KIM
Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
-
6.
Publication No.: US20190147894A1
Publication date: 2019-05-16
Application No.: US16245024
Filing date: 2019-01-10
Inventors: Yong Ju LEE, Jeong Il SEO, Jae Hyoun YOO, Seung Kwon BEACK, Jong Mo SUNG, Tae Jin LEE, Kyeong Ok KANG, Jin Woong KIM, Tae Jin PARK, Dae Young JANG, Keun Woo CHOI
IPC classification: G10L19/008, H04S7/00
Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal based on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
-
7.
Publication No.: US20180144755A1
Publication date: 2018-05-24
Application No.: US15710353
Filing date: 2017-09-20
Inventors: Mi Suk LEE, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE
IPC classification: G10L19/018, H04H20/31, H04H60/37, H04H60/58, H04N21/2389, H04N21/8358, H04N5/067
CPC classification: G10L19/018, G06F16/683, G06F16/955, H04H20/31, H04H60/37, H04H60/58, H04H2201/50, H04N5/0675, H04N21/23892, H04N21/4394, H04N21/8358
Abstract: Disclosed is an audio watermark insertion method. The audio watermark insertion method includes performing a modulated complex lapped transform (MCLT) on a first audio signal, inserting a bit string of a watermark in the first audio signal obtained by performing the MCLT, performing an inverse modified discrete cosine transform (IMDCT) on the first audio signal in which the bit string is inserted, and obtaining a second audio signal, which is the first audio signal in which the watermark is inserted, by performing an overlap-add on a signal obtained by performing the IMDCT and a neighbor frame signal.
-
8.
Publication No.: US20170337929A1
Publication date: 2017-11-23
Application No.: US15669262
Filing date: 2017-08-04
Inventors: Seung Kwon BEACK, Tae Jin LEE, Min Je KIM, Kyeongok KANG, Dae Young JANG, Jin Woo HONG, Jeongil SEO, Chieteuk AHN, Hochong PARK, Young-cheol PARK
IPC classification: G10L19/087, G10L19/26
CPC classification: G10L19/087, G10L19/125, G10L19/22, G10L19/26
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encodes the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
9.
Publication No.: US20170194010A1
Publication date: 2017-07-06
Application No.: US15388408
Filing date: 2016-12-22
Inventors: Jong Mo SUNG, Tae Jin PARK, Seung Kwon BEACK, Tae Jin LEE, Jin Soo CHOI
IPC classification: G10L19/018, G06F17/30, G10L19/02
CPC classification: G10L19/018, G06F16/683, G10L19/0204, G10L25/54
Abstract: Disclosed are a content identifying method and apparatus, and an audio signal processing apparatus and method for identifying content. The audio signal processing method for registration includes splitting an original audio signal into a lower band signal and a higher band signal; modifying the higher band signal using metadata associated with the original audio signal; storing a reference lower band fingerprint extracted from the lower band signal, a reference higher band fingerprint extracted from the modified higher band signal, and the associated metadata in a database; and generating a reference audio signal synthesized using the lower band signal and the modified higher band signal.
-
10.
Publication No.: US20160232902A1
Publication date: 2016-08-11
Application No.: US15131623
Filing date: 2016-04-18
Inventors: Yong Ju LEE, Jeong Il SEO, Jae Hyoun YOO, Seung Kwon BEACK, Jong Mo SUNG, Tae Jin LEE, Kyeong Ok KANG, Jin Woong KIM, Tae Jin PARK, Dae Young JANG, Keun Woo CHOI
IPC classification: G10L19/008, H04S7/00
CPC classification: G10L19/008, H04S7/00, H04S7/30, H04S2400/01, H04S2400/03
Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal based on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.