-
Publication No.: US12143797B2
Publication date: 2024-11-12
Application No.: US18309145
Filing date: 2023-04-28
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Louis D. Fielder, Zhiwei Shuang, Grant A. Davidson, Xiguang Zheng, Mark S. Vinton
Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
-
2.
Publication No.: US11576004B2
Publication date: 2023-02-07
Application No.: US17688744
Filing date: 2022-03-07
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Grant A. Davidson, Kuan-Chieh Yen, Dirk Jeroen Breebaart
IPC: H04S7/00
Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
-
3.
Publication No.: US09489956B2
Publication date: 2016-11-08
Application No.: US14767565
Filing date: 2014-01-22
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Matthew Fellers, Vinay Melkote, Kuan-Chieh Yen, Grant A. Davidson, Mark F. Davis
IPC: G10L19/008, G10L19/02, G10L19/012, G10L25/06, G10L25/18
CPC classification number: G10L19/008, G10L19/012, G10L19/0204, G10L25/06, G10L25/18
Abstract: Received audio data may include a first set of frequency coefficients and a second set of frequency coefficients. Spatial parameters for at least part of the second set of frequency coefficients may be estimated, based at least in part on the first set of frequency coefficients. The estimated spatial parameters may be applied to the second set of frequency coefficients to generate a modified second set of frequency coefficients. The first set of frequency coefficients may correspond to a first frequency range (for example, an individual channel frequency range) and the second set of frequency coefficients may correspond to a second frequency range (for example, a coupled channel frequency range). Combined frequency coefficients of a composite coupling channel may be based on frequency coefficients of two or more channels. Cross-correlation coefficients, between frequency coefficients of a first channel and the combined frequency coefficients, may be computed.
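The cross-correlation step mentioned in the abstract above can be illustrated with a minimal sketch. It is not the patented method: it assumes real-valued frequency coefficients, assumes the composite coupling channel is a plain per-bin average of the channels, and uses a normalized correlation. The function names and sample values are hypothetical.

```python
import math


def composite_coupling_channel(channels):
    """Combine channels bin by bin. Averaging is an assumption made here;
    the abstract only says the composite is based on two or more channels."""
    count = len(channels)
    return [sum(bins) / count for bins in zip(*channels)]


def normalized_cross_correlation(x, y):
    """Return sum(x*y) / sqrt(sum(x^2) * sum(y^2)), or 0.0 if either input has no energy."""
    if len(x) != len(y):
        raise ValueError("coefficient lists must have the same length")
    numerator = sum(a * b for a, b in zip(x, y))
    denominator = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return numerator / denominator if denominator > 0 else 0.0


# Hypothetical coefficients for two channels in one frequency band.
left = [1.0, 0.5, -0.25, 0.0]
right = [0.8, 0.4, -0.5, 0.2]
composite = composite_coupling_channel([left, right])
rho_left = normalized_cross_correlation(left, composite)
rho_right = normalized_cross_correlation(right, composite)
```

A value near 1 indicates that a channel closely follows the composite in that band; values near 0 indicate little linear relationship.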
-
4.
Publication No.: US20230394287A1
Publication date: 2023-12-07
Application No.: US18248805
Filing date: 2021-10-12
Inventor: Cong Zhou, Mark S. Vinton, Grant A. Davidson, Lars Villemoes
IPC: G06N3/0475, G06N3/044, G06N3/045
CPC classification number: G06N3/0475, G06N3/044, G06N3/045
Abstract: A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including at least one neural network trained to predict a second set of output variables representing a specific frequency band given coefficients of one or several frequency bands adjacent to the specific frequency band in said current time frame. Such a neural network system forms a predictor capable of capturing both temporal and frequency dependencies occurring in time-frequency tiles of a media signal.
-
5.
Publication No.: US11817111B2
Publication date: 2023-11-14
Application No.: US17046284
Filing date: 2019-04-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Roy M. Fejgin, Grant A. Davidson, Chih-Wei Wu, Vivek Kumar
IPC: G10L19/022, G06F3/16, G06N3/084, G06N3/048
CPC classification number: G10L19/022, G06F3/16, G06N3/048, G06N3/084
Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.
-
Publication No.: US20230328469A1
Publication date: 2023-10-12
Application No.: US18309145
Filing date: 2023-04-28
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Louis D. Fielder, Zhiwei Shuang, Grant A. Davidson, Xiguang Zheng, Mark S. Vinton
CPC classification number: H04S3/004, H04S7/302, G10K15/08, H04S2420/01, H04S7/304, H04S2400/01, H04S5/005
Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
-
Publication No.: US20220415334A1
Publication date: 2022-12-29
Application No.: US17779258
Filing date: 2020-12-03
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Grant A. Davidson, Louis D. Fielder, Mark S. Vinton
IPC: G10L19/032, G01L19/04
Abstract: The present disclosure relates to the field of audio coding, in particular, it relates to a method for encoding audio signals through a masking model based on a hearing threshold of frequency intervals of the audio signal and a measured energy of the audio signal for the corresponding frequency intervals. The disclosure further relates to an encoder that is capable of carrying out the audio encoding method.
-
Publication No.: US10425763B2
Publication date: 2019-09-24
Application No.: US15109541
Filing date: 2014-12-18
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Kuan-Chieh Yen, Dirk Jeroen Breebaart, Grant A. Davidson, Rhonda Wilson, David M. Cooper, Zhiwei Shuang
IPC: H04S7/00, H04S3/00, G10L19/008
Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feed-back delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
-
Publication No.: US10149082B2
Publication date: 2018-12-04
Application No.: US15550424
Filing date: 2016-02-11
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Louis D. Fielder, Zhiwei Shuang, Grant A. Davidson, Xiguang Zheng, Mark S. Vinton
Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
-
Publication No.: US09830916B2
Publication date: 2017-11-28
Application No.: US14766371
Filing date: 2014-01-22
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Vinay Melkote, Kuan-Chieh Yen, Grant A. Davidson, Matthew Fellers, Mark S. Vinton, Vivek Kumar
Abstract: Audio processing methods may involve receiving audio data corresponding to a plurality of audio channels. The audio data may include a frequency domain representation corresponding to filterbank coefficients of an audio encoding or processing system. A decorrelation process may be performed with the same filterbank coefficients used by the audio encoding or processing system. The decorrelation process may be performed without converting coefficients of the frequency domain representation to another frequency domain or time domain representation. The decorrelation process may involve selective or signal-adaptive decorrelation of specific channels and/or specific frequency bands. The decorrelation process may involve applying a decorrelation filter to a portion of the received audio data to produce filtered audio data. The decorrelation process may involve using a non-hierarchal mixer to combine a direct portion of the received audio data with the filtered audio data according to spatial parameters.
-