-
Publication No.: US20220386053A1
Publication date: 2022-12-01
Application No.: US17833761
Filing date: 2022-06-06
Inventor: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
Publication No.: US10904688B2
Publication date: 2021-01-26
Application No.: US16878616
Filing date: 2020-05-20
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jun Wang
Abstract: Embodiments of source separation for reverberant environments are disclosed. According to a method, first microphone signals for each individual one of at least one source are captured respectively by at least two microphones for a period during which only the individual one produces sounds. Mixing parameters for modeling acoustic paths between the at least one source and the at least two microphones are learned by a processor based on the first microphone signals. Second microphone signals are captured respectively by the at least two microphones for a period during which all of the at least one source produce sounds. A reconstruction model is estimated by the processor based on the mixing parameters and the second microphone signals. The processor performs the source separation by applying the reconstruction model.
-
Publication No.: US10897682B2
Publication date: 2021-01-19
Application No.: US16555126
Filing date: 2019-08-29
Inventor: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
Publication No.: US10885923B2
Publication date: 2021-01-05
Application No.: US16869477
Filing date: 2020-05-07
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
IPC: G10L19/02, G10L19/008, G10L21/0308, H04S3/00, G10L25/21
Abstract: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method comprises obtaining a set of components that are weakly correlated, the set of components generated based on the plurality of audio signals. The method comprises extracting a feature from the set of components, and determining a set of gains associated with the set of components at least in part based on the extracted feature, each of the gains indicating a proportion of a diffuse part in the associated component. The method further comprises decomposing the plurality of audio signals by applying the set of gains to the set of components. A corresponding system and computer program product are also disclosed.
-
Publication No.: US10818302B2
Publication date: 2020-10-27
Application No.: US16561836
Filing date: 2019-09-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jun Wang, Lie Lu, Qingyuan Bin
IPC: G10L19/008, G10L21/0232, G10L25/21, H04S7/00, G10L21/0272, G10L25/18
Abstract: The present document describes a method for extracting J audio sources from I audio channels. The method includes updating a Wiener filter matrix based on a mixing matrix from a source matrix and based on a power matrix of the J audio sources. Furthermore, the method includes updating a cross-covariance matrix of the I audio channels and of the J audio sources and an auto-covariance matrix of the J audio sources, based on the updated Wiener filter matrix and based on an auto-covariance matrix of the I audio channels. In addition, the method includes updating the mixing matrix and the power matrix based on the updated cross-covariance matrix of the I audio channels and of the J audio sources, and/or based on the updated auto-covariance matrix of the J audio sources.
-
Publication No.: US10362426B2
Publication date: 2019-07-23
Application No.: US15538892
Filing date: 2016-02-09
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun Wang, Lie Lu, Lianwu Chen, Mingqing Hu
Abstract: Example embodiments disclosed herein relate to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object, and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. A corresponding system and computer program product are described as well.
-
Publication No.: US09842605B2
Publication date: 2017-12-12
Application No.: US14779322
Filing date: 2014-03-25
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie Lu, Alan J. Seefeldt, Jun Wang
Abstract: Apparatus and methods for audio classifying and processing are disclosed. In one embodiment, an audio processing apparatus includes an audio classifier for classifying an audio signal into at least one audio type in real time; an audio improving device for improving the experience of the audience; and an adjusting unit for adjusting at least one parameter of the audio improving device in a continuous manner based on the confidence value of the at least one audio type.
-
Publication No.: US09830896B2
Publication date: 2017-11-28
Application No.: US14282654
Filing date: 2014-05-20
Applicant: Dolby Laboratories Licensing Corporation
CPC classification number: G10H1/40, G10H2210/041, G10H2210/051, G10H2210/076, G10H2240/075, G10H2250/015
Abstract: An audio processing method, an audio processing apparatus, and a training method are described. According to embodiments of the application, an accent identifier is used to identify accent frames from a plurality of audio frames, resulting in an accent sequence comprised of probability scores of accent and/or non-accent decisions with respect to the plurality of audio frames. Then a tempo estimator is used to estimate a tempo sequence of the plurality of audio frames based on the accent sequence. The embodiments adapt well to changes in tempo and can further be used to track beats properly.
-
Publication No.: US09756445B2
Publication date: 2017-09-05
Application No.: US14900117
Filing date: 2014-06-17
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jun Wang, Lie Lu, Mingqing Hu, Dirk Jeroen Breebaart, Nicolas R. Tsingos
IPC: H04R5/00, H04S7/00, G10L19/008, H04S3/00, G10L21/0272, G10L19/20, G10L19/02, H04S5/00
CPC classification number: H04S7/30, G10L19/008, G10L19/0204, G10L19/20, G10L21/0272, H04S3/002, H04S5/005, H04S2400/11, H04S2400/13, H04S2400/15, H04S2420/07
Abstract: Embodiments of the present invention relate to adaptive audio content generation. Specifically, a method for generating adaptive audio content is provided. The method comprises extracting at least one audio object from channel-based source audio content, and generating the adaptive audio content at least partially based on the at least one audio object. A corresponding system and computer program product are also disclosed.
-
Publication No.: US09621124B2
Publication date: 2017-04-11
Application No.: US14780485
Filing date: 2014-03-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie Lu, Jun Wang, Alan Seefeldt, Mingqing Hu
Abstract: An equalizer controller and a controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.
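
Illustrative note (not part of the patent record): the idea of adjusting an equalizer "in a continuous manner based on the confidence value" can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration, including the function names scale_eq_profile and smooth, the per-band gain profile, and the smoothing constant; the abstract does not specify how the adjustment is computed, and this is not the patented method.

```python
def scale_eq_profile(profile_db, confidence):
    """Scale each band gain (in dB) by a confidence clamped to [0, 1].

    Hypothetical helper: a low confidence leaves the equalizer close to
    flat, a high confidence applies the full profile.
    """
    c = min(max(confidence, 0.0), 1.0)
    return [c * gain for gain in profile_db]


def smooth(previous, current, alpha=0.9):
    """One-pole smoothing so the setting drifts instead of jumping."""
    return [alpha * p + (1.0 - alpha) * x for p, x in zip(previous, current)]


# Made-up three-band profile for an assumed "music" audio type.
music_profile_db = [4.0, 0.0, -2.0]

# Feed a rising confidence from an assumed classifier, frame by frame.
state = [0.0, 0.0, 0.0]
for confidence in (0.2, 0.6, 1.0):
    state = smooth(state, scale_eq_profile(music_profile_db, confidence))
```

Here the confidence acts as a blend weight rather than a hard switch, so a classifier that hesitates between audio types produces a gradual change in the equalizer setting instead of an abrupt one.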