-
公开(公告)号:US12166460B2
公开(公告)日:2024-12-10
申请号:US18356044
申请日:2023-07-20
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun Wang , Lie Lu , Alan J. Seefeldt
IPC: H03G3/00 , G10L21/0364 , G10L25/30 , G10L25/51 , H03G3/30 , H03G3/32 , H03G5/16 , H03G7/00 , H04M7/00
Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
-
公开(公告)号:US11356787B2
公开(公告)日:2022-06-07
申请号:US17149683
申请日:2021-01-14
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US11218126B2
公开(公告)日:2022-01-04
申请号:US16920254
申请日:2020-07-02
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun Wang , Lie Lu , Alan J. Seefeldt
IPC: H03G3/00 , H03G7/00 , H03G3/30 , H03G3/32 , H03G5/16 , G10L25/30 , G10L25/51 , G10L21/0364 , H04M7/00
Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
-
公开(公告)号:US20190387342A1
公开(公告)日:2019-12-19
申请号:US16555126
申请日:2019-08-29
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US10453464B2
公开(公告)日:2019-10-22
申请号:US15326378
申请日:2015-07-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
IPC: G10L19/02 , G10L19/008 , H04S3/00 , G10L21/0308 , G10L25/21
Abstract: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method obtains a set of components that are weakly correlated wherein the set of components generated based on the plurality of audio signals. The method extract a feature from the set of components and determining determines a set of gains associated with the set of components at least in part based on the extracted feature. Each of the gains indicate a proportion of a diffuse part in the associated component and decompose the plurality of audio signals by applying the set of gains to the set of components. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US10410641B2
公开(公告)日:2019-09-10
申请号:US16091069
申请日:2017-04-06
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jun Wang , Lie Lu , Qingyuan Bin
IPC: G10L19/008 , G10L21/0232 , G10L25/21 , H04S7/00
Abstract: The present document describes a method (100) for extracting audio sources (301) from audio channels (302). The method (100) includes updating (102) a Wiener filter matrix based on a mixing matrix from a source matrix and based on a power matrix of the audio sources (301). Furthermore, the method (100) includes updating (103) a cross-covariance matrix of the audio channels (302) and of the audio sources (301) and an auto-covariance matrix of the audio sources (301), based on the updated Wiener filter matrix and based on an auto-covariance matrix of the audio channels (302). In addition, the method (100) includes updating (104) the mixing matrix and the power matrix based on the updated cross-covariance matrix of the audio channels (302) and of the audio sources (301), and/or based on the updated auto-covariance matrix of the audio sources (301).
-
公开(公告)号:US10339959B2
公开(公告)日:2019-07-02
申请号:US15321741
申请日:2015-06-24
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Claus Bauer , Lie LU , Mingqing Hu , Jun Wang , Poppy Crum , Rhonda Wilson , Regunathan Radhakrishnan
Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.
-
公开(公告)号:US20170353810A1
公开(公告)日:2017-12-07
申请号:US15647121
申请日:2017-07-11
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S3/002 , H04R5/02 , H04R5/04 , H04S7/30 , H04S7/308 , H04S2400/11 , H04S2400/13 , H04S2420/03
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US20170280264A1
公开(公告)日:2017-09-28
申请号:US15451241
申请日:2017-03-06
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S3/002 , H04R5/02 , H04R5/04 , H04S7/30 , H04S7/308 , H04S2400/11 , H04S2400/13 , H04S2420/03
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US20240179485A1
公开(公告)日:2024-05-30
申请号:US18535192
申请日:2023-12-11
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S7/302 , H04R5/02 , H04S7/308 , H04S2400/11 , H04S2400/13
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
-
-
-
-
-
-
-
-