-
公开(公告)号:US20240274141A1
公开(公告)日:2024-08-15
申请号:US18640790
申请日:2024-04-19
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen , Siddhartha Goutham Swaminathan , S M Akramus Salehin , Jason Filos
CPC classification number: G10L19/02 , H04S7/303 , H04S2420/01
Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
-
公开(公告)号:US11967329B2
公开(公告)日:2024-04-23
申请号:US17180255
申请日:2021-02-19
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen , Siddhartha Goutham Swaminathan , S M Akramus Salehin , Jason Filos
CPC classification number: G10L19/02 , H04S7/303 , H04S2420/01
Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
-
公开(公告)号:US11240623B2
公开(公告)日:2022-02-01
申请号:US16058760
申请日:2018-08-08
Applicant: QUALCOMM Incorporated
Inventor: Nils Gunther Peters , S M Akramus Salehin , Shankar Thagadur Shivappa , Moo Young Kim , Dipanjan Sen
Abstract: One or more processors may obtain a first distance between a first audio zone of the two or more audio zones associated with the one or more interest points within the first audio zone, and a first device position of a device, obtain a second distance between a second audio zone of the two or more audio zones associated with the one or more interest points within the second audio zone, and the first device position of the device, and obtain an updated first distance and updated second distance after movement of the device has changed from the first device position to a second device position. The one or more processor(s) may independently control the first audio zone and the second audio zone, such that the audio data within the first audio zone and the second audio zone are adjusted based on the updated first distance and updated second distance.
-
公开(公告)号:US20200260210A1
公开(公告)日:2020-08-13
申请号:US16863626
申请日:2020-04-30
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
IPC: H04S7/00 , G10L19/008
Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
-
公开(公告)号:US10659906B2
公开(公告)日:2020-05-19
申请号:US15868656
申请日:2018-01-11
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim , Nils Günther Peters , Dipanjan Sen
Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
-
公开(公告)号:US20190116440A1
公开(公告)日:2019-04-18
申请号:US15782252
申请日:2017-10-12
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters , S M Akramus Salehin , Shankar Thagadur Shivappa , Moo Young Kim , Dipanjan Sen
IPC: H04S3/00 , G02B27/00 , H04S3/02 , G10L19/008
Abstract: In general, techniques are described for adapting higher order ambisonic audio data to include three degrees of freedom plus effects. An example device configured to perform the techniques includes a memory, and a processor coupled to the memory. The memory may be configured to store higher order ambisonic audio data representative of a soundfield. The processor may be configured to obtain a translational distance representative of a translational head movement of a user interfacing with the device. The processor may further be configured to adapt, based on the translational distance, higher order ambisonic audio data to provide three degrees of freedom plus effects that adapt the soundfield to account for the translational head movement, and generate speaker feeds based on the adapted higher order ambient audio data.
-
公开(公告)号:US20180082694A1
公开(公告)日:2018-03-22
申请号:US15823284
申请日:2017-11-27
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim
IPC: G10L19/008 , H04S3/00 , G10L19/002
CPC classification number: G10L19/008 , G10L19/002 , H04S3/008 , H04S2420/11
Abstract: Systems and techniques for compression and decoding of audio data are generally disclosed. An example device for compressing higher order ambisonic (HOA) coefficients representative of a soundfield includes a memory configured to store audio data and one or more processors configured to: determine when to use ambient HOA coefficients of the HOA coefficients to augment one or more foreground audio objects obtained through decomposition of the HOA coefficients based on one or more singular values also obtained through the decomposition of the HOA coefficients, the ambient HOA coefficients representative of an ambient component of the soundfield.
-
公开(公告)号:US09847087B2
公开(公告)日:2017-12-19
申请号:US14712661
申请日:2015-05-14
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim
IPC: G10L19/008 , G10L19/002 , H04S3/00
CPC classification number: G10L19/008 , G10L19/002 , H04S3/008 , H04S2420/11
Abstract: Systems and techniques for compression and decoding of audio data are generally disclosed. An example device for compressing higher order ambisonic (HOA) coefficients representative of a soundfield includes a memory configured to store audio data and one or more processors configured to: determine when to use ambient HOA coefficients of the HOA coefficients to augment one or more foreground audio objects obtained through decomposition of the HOA coefficients based on one or more singular values also obtained through the decomposition of the HOA coefficients, the ambient HOA coefficients representative of an ambient component of the soundfield.
-
公开(公告)号:US20170194014A1
公开(公告)日:2017-07-06
申请号:US15266929
申请日:2016-09-15
Applicant: QUALCOMM Incorporated
Inventor: Moo Young Kim
CPC classification number: G10L19/20 , G10L19/008 , G10L19/167 , H04S7/308 , H04S2400/01 , H04S2400/15 , H04S2420/11
Abstract: In one example, a method includes obtaining an audio signal comprising a plurality of elements; generating a first Higher-Order Ambisonics (HOA) soundfield that represents the audio signal; selecting a set of elements of the audio signal for encoding in a non-Higher-Order Ambisonics (HOA) domain; generating, based on the selected set of elements and a set of spatial positioning vectors, a second HOA soundfield that represents the selected set of elements; generating a third HOA soundfield that represents a difference between the first HOA soundfield and the second HOA soundfield; and generate a coded audio bitstream that includes a representation of the selected set of elements in the non-HOA domain, an indication of the set of spatial positioning vectors, and a representation of the third HOA soundfield.
-
公开(公告)号:US20170110140A1
公开(公告)日:2017-04-20
申请号:US15290229
申请日:2016-10-11
Applicant: QUALCOMM Incorporated
Inventor: Nils Günther Peters , Dipanjan Sen , Moo Young Kim
CPC classification number: H04S5/00 , G10L19/008 , H04R2499/15 , H04S3/02 , H04S2400/01 , H04S2420/11
Abstract: In general, techniques are described for coding higher-order ambisonic coefficients during multiple transitions. A device comprising a processor and a memory coupled to the processor may be configured to perform the techniques. The processor may be configured to obtain a multi-transition indication of whether an ambient HOA coefficient is in transition during a same frame of the bitstream as a foreground audio signal is in transition. The processor may also be configured to obtain a vector that describes a spatial characteristic of a corresponding foreground audio signal based on the multi-transition indication, both the vector and the corresponding HOA audio signal decomposed from the HOA audio data. The memory may be configured to store the vector.
-
-
-
-
-
-
-
-
-