-
公开(公告)号:US20250142282A1
公开(公告)日:2025-05-01
申请号:US18690787
申请日:2022-09-15
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. MCGRATH , Jeremy Grant STODDARD
Abstract: The present disclosure relates to a method and system for predicting a future orientation of an orientation tracker (100). The method comprising obtaining a sequence of angular velocity samples, each angular velocity sample indicating an angular velocity at a point in time and obtaining a sequence of angular acceleration samples, each angular acceleration sample indicating an acceleration or deceleration of the angular velocity at each point in time. Wherein said method further comprises determining (S5a), for each point in time where the angular velocity is accelerating, a predicted orientation of the orientation tracker (100) based on a first order prediction of an accumulated rotation of the orientation tracker (100) and determining (S5c), for each point in time where the angular velocity is decelerating, a predicted orientation of the orientation tracker (100) based on a second order prediction of the accumulated rotation of the orientation tracker (100).
-
公开(公告)号:US20250071479A1
公开(公告)日:2025-02-27
申请号:US18826115
申请日:2024-09-05
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: David S. MCGRATH
Abstract: Disclosed are methods and systems which convert a multi-microphone input signal to a multichannel output signal making use of a time-and frequency-varying matrix. For each time and frequency tile, the matrix is derived as a function of a dominant direction of arrival and a steering strength parameter. Likewise, the dominant direction and steering strength parameter are derived from characteristics of the multi-microphone signals, where those characteristics include values representative of the inter-channel amplitude and group-delay differences.
-
公开(公告)号:US20240114306A1
公开(公告)日:2024-04-04
申请号:US17683762
申请日:2020-09-02
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. MCGRATH
Abstract: An multi-input, multi-output audio process is implemented as a linear system for use in an audio filterbank to convert a set of frequency-domain input audio signals into a set of frequency-domain output audio signals. A transfer function from one input to one output is defined as a frequency dependent gain function. In some implementations, the transfer function includes a direct component that is substantially defined as a frequency dependent gain, and one or more decorrelated components that have frequency-varying group phase response. The transfer function is formed from a set of sub-band functions, with each sub-band function being formed from a set of corresponding component transfer functions including direct component and one or more decorrelated components.
-
公开(公告)号:US20230326469A1
公开(公告)日:2023-10-12
申请号:US18042518
申请日:2021-08-26
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. MCGRATH , Hao LUO
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: Embodiments are disclosed for a matrix coded stereo signal with periphonic elements. A mixing matrix, suitable for processing a multi-channel audio input signal, is constructed so that the resulting multi-channel output signal contains the same audio elements from the input signal, wherein the spatial relationships between audio elements, as defined by panning rules associated with the input signal format, are preserved in the output signal, as defined by matrix encoding rules associated with the output signal format. The choice of the coefficients of the mixing matrix is governed by a phase-preference rule that is used to determine the appropriate phase to apply to each input signal channel.
-
35.
公开(公告)号:US20230215444A1
公开(公告)日:2023-07-06
申请号:US18000841
申请日:2021-06-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. MCGRATH
IPC: G10L19/008 , G10L19/02 , G10L19/002
CPC classification number: G10L19/008 , G10L19/002 , G10L19/0204
Abstract: Systems, methods, and computer program products are disclosed for adaptive downmixing of audio signals with improved continuity. An audio encoding system receives an input multi-channel audio signal including a primary input audio channel and L non-primary input audio channels. The system determines a set of L input gains. For each of the channels and gains, the system forms a respective scaled non-primary input audio channel. The system forms a primary output audio channel from the sum of the primary input audio channel and the scaled non-primary input audio channels. The system determines a set of L prediction gains. The system forms a prediction channel from the primary output audio channel. The system forms L non-primary output audio channels. The system forms an output multi-channel audio signal from the primary output audio channel and the L non-primary output audio channels.
-
公开(公告)号:US20220375482A1
公开(公告)日:2022-11-24
申请号:US17882900
申请日:2022-08-08
Inventor: Stefan BRUHN , Michael ECKERT , Juan Felix TORRES , Stefanie BROWN , David S. MCGRATH
IPC: G10L19/008
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
-
公开(公告)号:US20210272574A1
公开(公告)日:2021-09-02
申请号:US16973030
申请日:2019-10-07
Inventor: Stefan BRUHN , Michael ECKERT , Juan Felix TORRES , Stefanie BROWN , David S. MCGRATH
IPC: G10L19/008 , H04S3/00
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
-
38.
公开(公告)号:US20200322751A1
公开(公告)日:2020-10-08
申请号:US16851656
申请日:2020-04-17
Inventor: Nicolas R. TSINGOS , David S. MCGRATH , Freddie SANCHEZ , Antonio MATEOS SOLE
IPC: H04S7/00
Abstract: The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. Gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. Gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at a media consumption site.
-
公开(公告)号:US20180308507A1
公开(公告)日:2018-10-25
申请号:US15776718
申请日:2017-01-13
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Zhiwei SHUANG , David S. MCGRATH , Michael William MASON
CPC classification number: G10L25/18 , G10L21/00 , G10L25/21 , H03G5/005 , H03G5/165 , H04R3/04 , H04R2430/03
Abstract: Example embodiments disclosed herein relate to audio signal processing with low latency. A method of processing an audio signal is disclosed. The method includes obtaining frequency parameters of a current frame of the audio signal. The method also includes generating intermediate frequency domain outputs for a set of predefined frequency bands based on the frequency parameters using predefined frequency band filter banks, a frequency band filter bank being specific to a respective frequency band in the set. The method further includes determining frequency band energies for the set of predefined frequency bands based on the intermediate frequency domain outputs, and processing the current frame based on the determined frequency band energies. Corresponding system, computer program product, and device for processing an audio signal are also disclosed.
-
公开(公告)号:US20180018977A1
公开(公告)日:2018-01-18
申请号:US15546258
申请日:2016-03-02
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David S. MCGRATH
IPC: G10L19/008
CPC classification number: G10L19/008 , H04S3/008
Abstract: Some methods involve receiving an input audio signal that includes N input audio channels, the input audio signal representing a first soundfield format having a first soundfield format resolution, N being an integer ≧2. A first decorrelation process may be applied to two or more of the input audio channels to produce a first set of decorrelated channels, the first decorrelation process maintaining an inter-channel correlation of the set of input audio channels. A first modulation process may be applied to the first set of decorrelated channels to produce a first set of decorrelated and modulated output channels. The first set of decorrelated and modulated output channels may be combined with two or more undecorrelated output channels to produce an output audio signal that includes O output audio channels representing a second and relatively higher-resolution soundfield format than the first soundfield format, O being an integer ≧3.
-
-
-
-
-
-
-
-
-