-
公开(公告)号:US20230224668A1
公开(公告)日:2023-07-13
申请号:US18096439
申请日:2023-01-12
Inventor: Dae Young JANG , Kyeongok KANG , Jae-hyoun YOO , Yong Ju LEE
IPC: H04S7/00 , G10L19/008
CPC classification number: H04S7/304 , H04S7/305 , G10L19/008 , H04S2420/01 , H04S2400/13
Abstract: Disclosed is an apparatus for immersive spatial audio modeling and rendering for effectively transmitting and playing immersive spatial audio content. The apparatus for immersive spatial audio modeling and rendering disclosed herein may model a spatial audio scene, generate and transmit parameters necessary for spatial audio rendering, and generate various spatial audio effects using the spatial audio parameters, to provide an immersive three-dimensional (3D) audio source coinciding with visual experience in a virtual reality space in response to free changes in the position and direction of a remote user in the space.
-
公开(公告)号:US20230224662A1
公开(公告)日:2023-07-13
申请号:US18146685
申请日:2022-12-27
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
IPC: H04S7/00
CPC classification number: H04S7/302
Abstract: Provided is a method and apparatus for generating an impulse response using ray tracing. The method of generating an impulse response may include calculating a number of rays reaching a receiver from a transmitter based on acoustic geometry information including a position of the transmitter and a position of the receiver disposed in a sound space, a maximum ray length or a sound space volume, and a radius of the receiver, tracing the rays using a path of the calculated rays, and generating an impulse response based on the traced rays.
-
公开(公告)号:US20230224661A1
公开(公告)日:2023-07-13
申请号:US17713059
申请日:2022-04-04
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
CPC classification number: H04S7/302 , H04S5/005 , H04S2400/11 , H04S2420/01
Abstract: A method and apparatus for rendering an object-based audio signal considering an obstacle are disclosed. A method for rendering an object-based audio signal according to an example embodiment, the method includes identifying an object-based input signal and metadata for the input signal, generating a binaural filter based on the metadata using a binaural room impulse response (BRIR), determining, based on the metadata, whether an obstacle is present between a listener and an object, modifying the generated binaural filter when it is determined that the obstacle is present, and generating a rendered output signal by convolving the modified binaural filter and the input signal.
-
公开(公告)号:US20230112342A1
公开(公告)日:2023-04-13
申请号:US17582209
申请日:2022-01-24
Inventor: Yong Ju LEE , Jae-hyoun YOO , Dae Young JANG , Kyeongok KANG , Tae Jin LEE
Abstract: An apparatus and method for pitch-shifting an audio signal with low complexity are disclosed. The method includes identifying a distance between an audio object included in the audio signal and a listener, checking whether the distance between the audio object and the listener decreases, and performing stepwise stretching pitch-shifting of repeatedly using at least one of frequency components of the audio signal when the distance between the audio object and the listener decreases.
-
公开(公告)号:US20200349958A1
公开(公告)日:2020-11-05
申请号:US16925946
申请日:2020-07-10
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , Kwangwoon University Industry-Academic Collaboration Foundation
Inventor: Tae Jin LEE , Seung-Kwon BAEK , Min Je KIM , Dae Young JANG , Jeongil SEO , Kyeongok KANG , Jin-Woo HONG , Hochong PARK , Young-Cheol PARK
IPC: G10L19/008 , G10L19/02 , G10L19/04 , G10L19/20 , G10L19/12
Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and a audio signal, and may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate ; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristics signal; a audio signal encoder to encode the input signal using a audio encoding module when the input signal is a audio characteristic signal; and a bitstream generator to generate a bitstream.
-
46.
公开(公告)号:US20200243099A1
公开(公告)日:2020-07-30
申请号:US16846272
申请日:2020-04-10
Inventor: Seung Kwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jin Woo HONG , Jeongil SEO , Chieteuk AHN , Hochong PARK , Young-Cheol PARK
IPC: G10L19/087 , G10L19/26 , G10L19/125 , G10L19/22
Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
-
公开(公告)号:US20200227060A1
公开(公告)日:2020-07-16
申请号:US16835728
申请日:2020-03-31
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
Inventor: Seungkwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jeongil SEO , Jin Woo HONG , Chieteuk AHN , Ho Chong PARK , Young-cheol PARK
IPC: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18
Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
-
48.
公开(公告)号:US20200176002A1
公开(公告)日:2020-06-04
申请号:US16786817
申请日:2020-02-10
Inventor: Seung Kwon BEACK , Tae Jin LEE , Jong Mo SUNG , Jeong Il SEO , Kyeong Ok KANG , Dae Young JANG , Jin Woong KIM
IPC: G10L19/008 , H04S3/00
Abstract: An encoder and an encoding method for a multi-channel signal, and a decoder and a decoding method for a multi-channel signal are disclosed. A multi-channel signal may be efficiently processed by consecutive downmixing or upmixing.
-
49.
公开(公告)号:US20190215631A1
公开(公告)日:2019-07-11
申请号:US16354890
申请日:2019-03-15
Inventor: Seung Kwon BEACK , Tae Jin LEE , Jong Mo SUNG , Kyeong Ok KANG , Jeong Il SEO , Dae Young JANG , Yong Ju LEE , Jin Woong KIM
IPC: H04S3/00
CPC classification number: H04S3/008 , G10L19/008 , H04S2420/03
Abstract: An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
-
50.
公开(公告)号:US20190200150A1
公开(公告)日:2019-06-27
申请号:US16290469
申请日:2019-03-01
Inventor: Seung Kwon BEACK , Jeong Il SEO , Jong Mo SUNG , Tae Jin LEE , Dae Young JANG , Jin Soo CHOI
IPC: H04S1/00 , H04S3/00 , G10L19/008 , H04S5/00
CPC classification number: H04S1/007 , G10L19/008 , H04S3/008 , H04S3/02 , H04S5/00 , H04S2400/01 , H04S2400/03 , H04S2400/07 , H04S2420/03
Abstract: Provided are an encoding method of a multichannel signal, an encoding apparatus to perform the encoding method, a multichannel signal processing method, and a decoding apparatus to perform the decoding method. The decoding method may include identifying an N/2-channel downmix signal derived from an N-channel input signal; and generating an N-channel output signal from the identified N/2-channel downmix signal using a plurality of one-to-two (OTT) boxes. If a low frequency effect (LFE) channel is absent in the output signal, the number of OTT boxes may be equal to N/2 where N/2 denotes the number of channels of the downmix signal.
-
-
-
-
-
-
-
-
-