-
Publication number: US20240305952A1
Publication date: 2024-09-12
Application number: US18606301
Application date: 2024-03-15
Inventor: Michael William Mason , Juan Felix Torres , Antonio Mateos Sole , Daniel Arteaga , Adam J. Mills , Mark David de Burgh , Andrew Robert Owen
CPC classification number: H04S7/307 , H04R3/04 , H04R3/12 , H04R5/04 , H04S7/303 , H04S2400/07 , H04S2420/11
Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
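The divergence step in this abstract can be illustrated with a minimal sketch. It assumes a horizontal-only layout with positions given as azimuths in degrees, a hypothetical `spread_deg` offset for the two additional objects, and a hypothetical power-preserving weight law; none of these details come from the abstract, and the function name `diverge` is illustrative only.

```python
import math


def diverge(azimuth_deg, divergence, spread_deg=30.0):
    """Return three (azimuth, weight) pairs for one audio object.

    Assumptions (not taken from the abstract): `divergence` lies in
    [0, 1], the two additional objects sit `spread_deg` to either side
    of the original, and the weights keep the total power at 1.
    """
    if not 0.0 <= divergence <= 1.0:
        raise ValueError("divergence must be within [0, 1]")
    theta = divergence * math.pi / 2.0
    w_center = math.cos(theta)               # weight of the original object
    w_side = math.sin(theta) / math.sqrt(2)  # weight of each additional object
    return [
        (azimuth_deg - spread_deg, w_side),
        (azimuth_deg, w_center),
        (azimuth_deg + spread_deg, w_side),
    ]


if __name__ == "__main__":
    for azimuth, weight in diverge(0.0, 0.5):
        print(f"azimuth={azimuth:+.1f} deg, weight={weight:.3f}")
```

Each weighted object would then be passed to an ordinary point-source panner; that rendering stage is outside the scope of this sketch.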
-
Publication number: US20240265927A1
Publication date: 2024-08-08
Application number: US18605733
Application date: 2024-03-14
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. McGrath , Stefanie Brown , Juan Felix Torres
IPC: G10L19/008 , G10L19/26 , G10L21/0208 , G10L25/18
CPC classification number: G10L19/008 , G10L19/26 , G10L21/0208 , G10L25/18 , G10L2021/02082
Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
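As a rough illustration of the smoothing idea, the sketch below applies first-order recursive smoothing to a per-band covariance matrix. The rule for deriving the forgetting factor from the bin count (`n_bins / n_min`, capped at 1) and the names `forgetting_factor`, `smooth_covariance`, and `n_min` are assumptions made for this example; the abstract does not specify them.

```python
def forgetting_factor(n_bins, n_min=8):
    """Hypothetical rule: bands with fewer bins are smoothed more heavily."""
    return min(1.0, n_bins / n_min)


def smooth_covariance(prev, current, n_bins, transient=False, n_min=8):
    """Blend the current covariance estimate with the previous smoothed one.

    `prev` and `current` are square matrices given as nested lists.
    When there is no history, or a transient was detected, the smoothing
    is reset and the current estimate is returned unchanged.
    """
    if prev is None or transient:
        return [row[:] for row in current]
    a = forgetting_factor(n_bins, n_min)
    return [
        [a * c + (1.0 - a) * p for c, p in zip(cur_row, prev_row)]
        for cur_row, prev_row in zip(current, prev)
    ]


if __name__ == "__main__":
    state = None
    frames = [[[1.0, 0.2], [0.2, 1.0]], [[4.0, 0.0], [0.0, 4.0]]]
    for cov in frames:
        state = smooth_covariance(state, cov, n_bins=2)
    print(state)
```

Transient detection itself and the resampling at banding transitions are not modelled here; the `transient` flag is simply supplied by the caller.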
-
Publication number: US12014745B2
Publication date: 2024-06-18
Application number: US17882900
Application date: 2022-08-08
Inventor: Stefan Bruhn , Michael Eckert , Juan Felix Torres , Stefanie Brown , David S. McGrath
IPC: G10L19/008 , H04S3/00
CPC classification number: G10L19/008 , H04S3/008 , H04S2400/01
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported by an encoding unit of the audio device. Based on the determining, the simplification unit converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding unit.
-
Publication number: US10805723B2
Publication date: 2020-10-13
Application number: US16433933
Application date: 2019-06-06
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Timothy Alan Port , Sebastian P. B. Holzapfel , Juan Felix Torres
IPC: H04R3/04 , G06F3/16 , H04R29/00 , H04B17/12 , G10L21/0232
Abstract: Systems and methods for automatic characterization of perceived transducer distortion are described. The system includes a controller configured to receive a distortion level; a signal generator configured to generate a test signal for a frequency band in response to the distortion level, the test signal including at least two simultaneous tones, the at least two simultaneous tones having different frequencies within the frequency band; an audio transducer configured to generate an audio signal based on the test signal; and a distortion tuner configured to receive the audio signal and to determine the distortion level of the system based on a detected amount of distortion in the audio signal.
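The test-signal part of this abstract (two simultaneous tones at different frequencies within one band) can be sketched as follows. The tone placement at one third and two thirds of the band, the mapping from `level` to amplitude, and the name `two_tone_signal` are assumptions for illustration, not details from the abstract.

```python
import math


def two_tone_signal(f_low, f_high, level=0.5, sample_rate=48000, num_samples=480):
    """Generate two simultaneous sine tones inside the band [f_low, f_high].

    Assumptions (hypothetical): the tones sit at 1/3 and 2/3 of the band,
    and `level` is the peak amplitude bound of the summed signal.
    """
    if not 0 < f_low < f_high < sample_rate / 2:
        raise ValueError("band must satisfy 0 < f_low < f_high < Nyquist")
    f1 = f_low + (f_high - f_low) / 3.0
    f2 = f_low + 2.0 * (f_high - f_low) / 3.0
    samples = []
    for i in range(num_samples):
        t = i / sample_rate
        tone1 = math.sin(2.0 * math.pi * f1 * t)
        tone2 = math.sin(2.0 * math.pi * f2 * t)
        # Halve the sum so the result never exceeds `level` in magnitude.
        samples.append(level * 0.5 * (tone1 + tone2))
    return (f1, f2), samples


if __name__ == "__main__":
    (f1, f2), signal = two_tone_signal(1000.0, 2000.0)
    print(f"tones at {f1:.1f} Hz and {f2:.1f} Hz, {len(signal)} samples")
```

Measuring the distortion in the transducer output and feeding the result back into the level is the job of the distortion tuner and controller, which this sketch does not attempt.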
-
Publication number: US20240179485A1
Publication date: 2024-05-30
Application number: US18535192
Application date: 2023-12-11
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S7/302 , H04R5/02 , H04S7/308 , H04S2400/11 , H04S2400/13
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
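The three stages named in this abstract (initial gains, speaker selection, non-negative optimized gains) can be sketched in two dimensions. The clipped-cosine initial gains, the least-squares cost, the projected gradient descent solver, and all names below are assumptions chosen for a compact example; the abstract does not state which cost function or solver is used.

```python
import math


def unit(azimuth_deg):
    """Unit vector on the horizontal plane for an azimuth in degrees."""
    a = math.radians(azimuth_deg)
    return (math.cos(a), math.sin(a))


def render_gains(object_az, speaker_az, keep=2, steps=500, lr=0.1):
    """Return {speaker index: gain} with all gains >= 0.

    Hypothetical pipeline:
      1. initial gains = cosine similarity clipped at zero,
      2. keep the `keep` speakers with the largest initial gains,
      3. minimise |sum(g_i * s_i) - p|^2 subject to g_i >= 0
         by projected gradient descent.
    """
    p = unit(object_az)
    spk = [unit(a) for a in speaker_az]
    init = [max(0.0, p[0] * s[0] + p[1] * s[1]) for s in spk]
    chosen = sorted(range(len(spk)), key=lambda i: init[i], reverse=True)[:keep]
    gains = {i: init[i] for i in chosen}
    for _ in range(steps):
        # Error between the panned direction and the target direction.
        ex = sum(gains[i] * spk[i][0] for i in chosen) - p[0]
        ey = sum(gains[i] * spk[i][1] for i in chosen) - p[1]
        for i in chosen:
            grad = 2.0 * (ex * spk[i][0] + ey * spk[i][1])
            gains[i] = max(0.0, gains[i] - lr * grad)  # project onto g >= 0
    return gains


if __name__ == "__main__":
    print(render_gains(0.0, [30.0, -30.0, 110.0, -110.0]))
```

The fixed step size `lr=0.1` is adequate for the small selected sets used here; a production solver would normally use a dedicated non-negative least-squares routine instead.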
-
Publication number: US11972767B2
Publication date: 2024-04-30
Application number: US17632225
Application date: 2020-07-31
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. McGrath , Stefanie Brown , Juan Felix Torres
IPC: G10L19/008 , G10L19/26 , G10L21/0208 , G10L25/18
CPC classification number: G10L19/008 , G10L19/26 , G10L21/0208 , G10L25/18 , G10L2021/02082
Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
-
Publication number: US20230343346A1
Publication date: 2023-10-26
Application number: US18008445
Application date: 2021-06-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. McGrath , Rishabh Tyagi , Stefanie Brown , Juan Felix Torres
IPC: G10L19/032 , G10L19/008
CPC classification number: G10L19/032 , G10L19/008
Abstract: Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.
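The per-frame loop in this abstract can be pictured with a simplified sketch that tries processing strategies in turn until the encoded size fits a bit budget. The strategy fields (`name`, `order`, `bits`), the uniform quantizer for parameters in [0, 1], and the fixed-length bit count are all hypothetical; the abstract only states that each strategy carries an ordering indication and that the choice depends on at least one bitrate threshold.

```python
def encode_frame(params, strategies, bit_budget):
    """Try each strategy in turn and return the first that fits the budget.

    `params` maps parameter names to values in [0, 1] (an assumption).
    Each strategy is a dict with:
      "name"  - label for the strategy,
      "order" - ordering in which parameters are quantized,
      "bits"  - bits spent per parameter (hypothetical fixed-length code).
    Returns (strategy name, quantized indices, bits used).
    """
    for strategy in strategies:
        bits = strategy["bits"]
        levels = 2 ** bits - 1
        # Quantize the parameters in the order this strategy prescribes.
        quantized = [round(params[name] * levels) for name in strategy["order"]]
        used = bits * len(strategy["order"])
        if used <= bit_budget:
            return strategy["name"], quantized, used
    raise ValueError("no strategy fits the bit budget")


if __name__ == "__main__":
    strategies = [
        {"name": "fine", "order": ["a", "b"], "bits": 8},
        {"name": "coarse", "order": ["b", "a"], "bits": 4},
    ]
    print(encode_frame({"a": 0.4, "b": 1.0}, strategies, bit_budget=10))
```

In this toy version the parameters are independent, so the ordering only affects the output layout; with interrelated parameters, as in the abstract, the ordering would also change which values are computed from already quantized ones.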
-
Publication number: US10861467B2
Publication date: 2020-12-08
Application number: US15902608
Application date: 2018-02-22
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Juan Felix Torres , David S. McGrath , Michael William Mason
IPC: G10L19/008 , H04S5/00
Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
-
Publication number: US10405120B2
Publication date: 2019-09-03
Application number: US15647121
Application date: 2017-07-11
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
Publication number: US09949052B2
Publication date: 2018-04-17
Application number: US15451241
Application date: 2017-03-06
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S3/002 , H04R5/02 , H04R5/04 , H04S7/30 , H04S7/308 , H04S2400/11 , H04S2400/13 , H04S2420/03
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.