-
公开(公告)号:US20220406321A1
公开(公告)日:2022-12-22
申请号:US17895256
申请日:2022-08-25
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Seungkwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jeongil SEO , Jin Woo HONG , Chieteuk AHN , Ho Chong PARK , Young-cheol PARK
IPC: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18
Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
-
公开(公告)号:US20220301571A1
公开(公告)日:2022-09-22
申请号:US17672403
申请日:2022-02-15
Inventor: Young Ho JEONG , Soo Young PARK , Tae Jin LEE
IPC: G10L19/018 , G06N3/08
Abstract: Disclosed is a method and apparatus for label encoding in a multi-sound event interval. The method includes identifying an event interval in which a plurality of sound events occurs in a sound signal, separating a sound source into sound event signals corresponding to each sound event by performing sound source separation on the event interval, determining energy information for each of the sound event signals, and performing label encoding based on the energy information.
-
公开(公告)号:US20220005487A1
公开(公告)日:2022-01-06
申请号:US17368390
申请日:2021-07-06
Inventor: Jongmo SUNG , Seung Kwon BEACK , Mi Suk LEE , Tae Jin LEE , Woo-taek LIM , Inseon JANG
IPC: G10L19/032
Abstract: An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
-
公开(公告)号:US20190306577A1
公开(公告)日:2019-10-03
申请号:US16025217
申请日:2018-07-02
Inventor: Young Ho JEONG , Seung Kwon BEACK , Tae Jin LEE , Hui Yong KIM
IPC: H04N21/466 , H04N21/442 , H04N21/435 , G10L19/018
Abstract: Provided are a signal processing method for determining an audience rating of media, and an additional information inserting apparatus, a media reproducing apparatus and an audience rating determining apparatus for performing the same method. In detail, the signal processing method for determining an audience rating of media is a method that may determine an audience rating of media with respect to a whole section of an audio signal by inserting additional information into a silence section through a noise signal.
-
公开(公告)号:US20190180763A1
公开(公告)日:2019-06-13
申请号:US16180298
申请日:2018-11-05
Inventor: Seung Kwon BEACK , Woo-taek LIM , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE , Hui Yong KIM
Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
-
公开(公告)号:US20190147894A1
公开(公告)日:2019-05-16
申请号:US16245024
申请日:2019-01-10
Inventor: Yong Ju LEE , Jeong Il SEO , Jae Hyoun YOO , Seung Kwon BEACK , Jong Mo SUNG , Tae Jin LEE , Kyeong Ok KANG , Jin Woong KIM , Tae Jin PARK , Dae Young JANG , Keun Woo CHOI
IPC: G10L19/008 , H04S7/00
Abstract: Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
-
公开(公告)号:US20190074022A1
公开(公告)日:2019-03-07
申请号:US16179120
申请日:2018-11-02
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Tae Jin LEE , Seung-Kwon BAEK , Min Je KIM , Dae Young JANG , Jeongil SEO , Kyeongok KANG , Jin-Woo HONG , Hochong PARK , Young-cheol PARK
IPC: G10L19/12
CPC classification number: G10L19/12 , G10L19/008 , G10L19/02 , G10L19/20 , G10L19/22 , G11C2207/16
Abstract: Provided are an apparatus and a method for integrally encoding and decoding a speech signal and a audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is a audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; and a bitstream generator to generate a bitstream using an output
-
38.
公开(公告)号:US20180144755A1
公开(公告)日:2018-05-24
申请号:US15710353
申请日:2017-09-20
Inventor: Mi Suk LEE , Seung Kwon BEACK , Jongmo SUNG , Tae Jin LEE
IPC: G10L19/018 , H04H20/31 , H04H60/37 , H04H60/58 , H04N21/2389 , H04N21/8358 , H04N5/067
CPC classification number: G10L19/018 , G06F16/683 , G06F16/955 , H04H20/31 , H04H60/37 , H04H60/58 , H04H2201/50 , H04N5/0675 , H04N21/23892 , H04N21/4394 , H04N21/8358
Abstract: Disclosed is an audio watermark insertion method. The audio watermark insertion method includes performing a modulated complex lapped transform (MCLT) on a first audio signal, inserting a bit string of a watermark in the first audio signal obtained by performing the MCLT, performing an inverse modified discrete cosine transform (IMDCT) on the first audio signal in which the bit string is inserted, and obtaining a second audio signal, which is the first audio signal in which the watermark is inserted, by performing an overlap-add on a signal obtained by performing the IMDCT and a neighbor frame signal.
-
39.
公开(公告)号:US20170337929A1
公开(公告)日:2017-11-23
申请号:US15669262
申请日:2017-08-04
Inventor: Seung Kwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jin Woo HONG , Jeongil SEO , Chieteuk AHN , Hochong PARK , Young-cheol PARK
IPC: G10L19/087 , G10L19/26
CPC classification number: G10L19/087 , G10L19/125 , G10L19/22 , G10L19/26
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
40.
公开(公告)号:US20170194010A1
公开(公告)日:2017-07-06
申请号:US15388408
申请日:2016-12-22
Inventor: Jong Mo SUNG , Tae Jin PARK , Seung Kwon BEACK , Tae Jin LEE , Jin Soo CHOI
IPC: G10L19/018 , G06F17/30 , G10L19/02
CPC classification number: G10L19/018 , G06F16/683 , G10L19/0204 , G10L25/54
Abstract: Disclosed are a content identifying method and apparatus, and an audio signal processing apparatus and method for identifying content. The audio signal processing method for registration includes splitting an original audio signal into a lower band signal and a higher band signal; modifying the higher band signal using an metadata associated to the original audio signal; storing a reference lower band fingerprint extracted from the lower band signal, a reference higher band fingerprint extracted from the modified higher band signal, and the associated metadata in database; and generating a reference audio signal synthesized using the lower band signal and the modified higher band signal.
-
-
-
-
-
-
-
-
-