-
公开(公告)号:US20180152803A1
公开(公告)日:2018-05-31
申请号:US15577510
申请日:2016-05-26
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alan J. SEEFELDT , Lie LU , Chen ZHANG
CPC classification number: H04S7/302 , G10L19/008 , H04S3/008 , H04S7/30 , H04S2400/11
Abstract: An audio processing system and method which calculates, based on spatial metadata of the audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. Converts the audio signal into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects. Each of the submixes indicating a sum of components of the plurality of the audio objects in relation to one of the predefined channel coverage zones. Generating a submix gain by applying an audio processing to each of the submix and controls an object gain applied to each of the audio objects. The object gain being as a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
-
32.
公开(公告)号:US20180144759A1
公开(公告)日:2018-05-24
申请号:US15572067
申请日:2016-05-12
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie LU , Mingqing HU
IPC: G10L21/0308 , G10L25/18 , G10L19/008
CPC classification number: G10L21/0308 , G10L19/008 , G10L21/0264 , G10L21/0272 , G10L25/18
Abstract: Example embodiments disclosed herein relate to audio source separation with source direction determined based on iterative weighted component analysis. A method of separating audio sources in audio content is disclosed. The audio content includes a plurality of channels. The method includes obtaining multiple data samples from multiple time-frequency tiles of the audio content. The method also includes analyzing the data samples to generate multiple components in a plurality of iterations, wherein each of the components indicates a direction with a variance of the data samples, and wherein in each of the plurality of iterations, each of the data samples is weighted with a weight that is determined based on a selected component from the multiple components. The method further includes determining a source direction of the audio content based on the selected component for separating an audio source from the audio content. Corresponding system and computer program product of separating audio sources in audio content are also disclosed.
-
公开(公告)号:US20170230024A1
公开(公告)日:2017-08-10
申请号:US15433486
申请日:2017-02-15
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lie LU , Jun WANG , Alan J. SEEFELDT , Mingqing HU
Abstract: Equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.
-
公开(公告)号:US20160150343A1
公开(公告)日:2016-05-26
申请号:US14900117
申请日:2014-06-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Mingqing HU , Dirk Jeroen BREEBAART , Nicolas R. TSINGOS
IPC: H04S7/00 , G10L19/008 , G10L19/02
CPC classification number: H04S7/30 , G10L19/008 , G10L19/0204 , G10L19/20 , G10L21/0272 , H04S3/002 , H04S5/005 , H04S2400/11 , H04S2400/13 , H04S2400/15 , H04S2420/07
Abstract: Embodiments of the present invention relate to adaptive audio content generation. Specifically, a method for generating adaptive audio content is provided. The method comprises extracting at least one audio object from channel-based source audio content, and generating the adaptive audio content at least partially based on the at least one audio object. Corresponding system and computer program product are also disclosed.
Abstract translation: 本发明的实施例涉及自适应音频内容生成。 具体地,提供了一种用于产生自适应音频内容的方法。 所述方法包括从基于频道的源音频内容中提取至少一个音频对象,以及至少部分地基于所述至少一个音频对象生成所述自适应音频内容。 还公开了相应的系统和计算机程序产品。
-
公开(公告)号:US20240205629A1
公开(公告)日:2024-06-20
申请号:US18391426
申请日:2023-12-20
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alan J. SEEFELDT , Lie LU , Chen ZHANG
IPC: H04S7/00 , G10L19/008 , H04S3/00
CPC classification number: H04S7/302 , H04S3/008 , H04S7/30 , G10L19/008 , H04S2400/11
Abstract: An audio processing system and method which calculates, based on spatial metadata of the audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. Converts the audio signal into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects. Each of the submixes indicating a sum of components of the plurality of the audio objects in relation to one of the predefined channel coverage zones. Generating a submix gain by applying an audio processing to each of the submix and controls an object gain applied to each of the audio objects. The object gain being as a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
-
公开(公告)号:US20220272474A1
公开(公告)日:2022-08-25
申请号:US17737184
申请日:2022-05-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lianwu CHEN , Lie LU , Nicolas R. TSINGOS
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20220199074A1
公开(公告)日:2022-06-23
申请号:US17604379
申请日:2020-04-13
Applicant: Dolby Laboratories Licensing Corporation
Abstract: The present application relates to a method of extracting audio features in a dialog detector in response to an input audio signal, the method comprising dividing the input audio signal into a plurality of frames, extracting frame audio features from each frame, determining a set of context windows, each context window including a number of frames surrounding a current frame, deriving, for each context window, a relevant context audio feature for the current frame based on the frame audio features of the frames in each respective context, and concatenating each context audio feature to form a combined feature vector to represent the current frame. The context windows with the different length can improve the response speed and improve robustness.
-
公开(公告)号:US20190392848A1
公开(公告)日:2019-12-26
申请号:US16561836
申请日:2019-09-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jun WANG , Lie LU , Qingyuan BIN
IPC: G10L19/008 , H04S7/00 , G10L25/21 , G10L21/0232
Abstract: The present document describes a method for extracting J audio sources from I audio channels. The method includes updating a Wiener filter matrix based on a mixing matrix from a source matrix and based on a power matrix of the J audio sources. Furthermore, the method includes updating a cross-covariance matrix of the I audio channels and of the J audio sources and an auto-covariance matrix of the J audio sources, based on the updated Wiener filter matrix and based on an auto-covariance matrix of the I audio channels. In addition, the method includes updating the mixing matrix and the power matrix based on the updated cross-covariance matrix of the I audio channels and of the J audio sources, and/or based on the updated auto-covariance matrix of the J audio sources.
-
公开(公告)号:US20190334497A1
公开(公告)日:2019-10-31
申请号:US16509791
申请日:2019-07-12
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Alan J. SEEFELDT
Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
-
公开(公告)号:US20190325894A1
公开(公告)日:2019-10-24
申请号:US16455178
申请日:2019-06-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Claus BAUER , Lie LU , Mingqing HU , Jun WANG , Poppy CRUM , Rhonda WILSON , Regunathan RADHAKRISHNAN
Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.
-
-
-
-
-
-
-
-
-