-
公开(公告)号:US20250045585A1
公开(公告)日:2025-02-06
申请号:US18716895
申请日:2022-12-08
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jundai SUN , Lie LU , Zhiwei SHUANG , Yuanxing MA
IPC: G06N3/082
Abstract: The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21: 1, 21: 2, . . . 21: n−1) to a supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n−1) which outputs a second prediction of target data based on latent variables and determining a first and second loss measure by comparing the predictions of target data with the ground truth target data. The method further comprises training the processor (20) and the supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n−1) based on the first and second loss measure and adjusting the processor by at least one of removing, replacing and adding a processor module.
-
公开(公告)号:US20230215423A1
公开(公告)日:2023-07-06
申请号:US17921564
申请日:2021-05-03
Inventor: Aaron Steven MASTER , Lie LU , Heidi-Maria LEHTONEN
IPC: G10L15/08 , G10L21/0272
CPC classification number: G10L15/08 , G10L21/0272
Abstract: Computer-implemented methods and devices for combined audio separation and classification are provided. An estimated separated signal is time gated based on a determination of an audio classifier of, at least in part, the original mix of signals before separation. Combined separation, classification, and time gating of both the estimated signal and a residual signal are also provided.
-
公开(公告)号:US20200288260A1
公开(公告)日:2020-09-10
申请号:US16825776
申请日:2020-03-20
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Alan J. SEEFELDT , Lie LU , Chen ZHANG
Abstract: An audio processing system and method which calculates, based on spatial metadata of the audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. Converts the audio signal into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects. Each of the submixes indicating a sum of components of the plurality of the audio objects in relation to one of the predefined channel coverage zones. Generating a submix gain by applying an audio processing to each of the submix and controls an object gain applied to each of the audio objects. The object gain being as a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
-
公开(公告)号:US10339959B2
公开(公告)日:2019-07-02
申请号:US15321741
申请日:2015-06-24
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Claus Bauer , Lie LU , Mingqing Hu , Jun Wang , Poppy Crum , Rhonda Wilson , Regunathan Radhakrishnan
Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.
-
公开(公告)号:US20180324540A1
公开(公告)日:2018-11-08
申请号:US15772383
申请日:2016-11-02
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Xin Liu , Lie LU , Alan J. Seefeldt
CPC classification number: H04S5/005 , G06F17/18 , G06F17/30743 , H04R1/26 , H04S5/00 , H04S7/30 , H04S7/307 , H04S2400/03 , H04S2420/01
Abstract: Example embodiments disclosed herein relate to content-adaptive surround sound virtualization. A method of virtualizing surround sound is disclosed. The method includes receiving a set of input audio signals, each of the input audio signals being indicative of sound from one of different sound sources, and determining a probability of the set of input audio signals belonging to a predefined audio content category. The method also includes determining a virtualization amount based on the determined probability, the virtualization amount indicating to which extent the set of input audio signals is virtualized as surround sound. The method further includes performing surround sound virtualization on two or more input audio signals in the set based on the determined virtualization amount and generating output audio signals based on the virtualized input audio signals and other input audio signals in the set. Corresponding system and computer program product for virtualizing surround sound are also disclosed.
-
公开(公告)号:US20170206907A1
公开(公告)日:2017-07-20
申请号:US15326378
申请日:2015-07-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
IPC: G10L19/02 , G10L25/21 , G10L19/008
CPC classification number: G10L19/0204 , G10L19/008 , G10L21/0308 , G10L25/21 , H04S3/008
Abstract: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method comprises obtaining a set of components that are weakly correlated, the set of components generated based on the plurality of audio signals. The method comprises extracting a feature from the set of components, and determining a set of gains associated with the set of components at least in part based on the extracted feature, each of the gains indicating a proportion of a diffuse part in the associated component. The method further comprises decomposing the plurality of audio signals by applying the set of gains to the set of components. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20170155369A1
公开(公告)日:2017-06-01
申请号:US15432679
申请日:2017-02-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Alan J. SEEFELDT
IPC: H03G7/00 , G10L25/30 , G10L25/51 , H03G5/16 , G10L21/0364
CPC classification number: H03G7/002 , G10L21/0364 , G10L25/30 , G10L25/51 , H03G3/3089 , H03G3/32 , H03G5/165 , H03G7/007 , H04M7/006 , H04M2203/305
Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
-
公开(公告)号:US20170133039A1
公开(公告)日:2017-05-11
申请号:US15321741
申请日:2015-06-24
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Claus BAUER , Lie LU , Mingqing HU , Jun WANG , Poppy CRUM , Rhonda WILSON , Regunathan RADHAKRISHNAN
IPC: G10L25/54
CPC classification number: G10L25/54 , G06K9/6259 , G06K9/6261 , G10L25/03
Abstract: Example embodiments disclosed herein relate to perception based multimedia processing. There is provided a method for processing multimedia data, the method includes automatically determining user perception on a segment of the multimedia data based on a plurality of clusters, the plurality of clusters obtained in association with predefined user perceptions and processing the segment of the multimedia data at least in part based on determined user perception on the segment. Corresponding system and computer program products are disclosed as well.
-
公开(公告)号:US20160337776A1
公开(公告)日:2016-11-17
申请号:US15110371
申请日:2015-01-05
Inventor: Dirk Jeroen BREEBAART , Lianwu CHEN , Lie LU , Antonio Mateos SOLE , Nicolas R. TSINGOS
Abstract: Audio objects that are present in input audio content in one or more frames are determined. Output clusters that are present in output audio content in the one or more frames are also determined. Here, the audio objects in the input audio content are converted to the output clusters in the output audio content. One or more spatial error metrics are computed based at least in part on positional metadata of the audio objects and positional metadata of the output clusters.
Abstract translation: 确定存在于一个或多个帧中的输入音频内容中的音频对象。 还确定存在于一个或多个帧中的输出音频内容中的输出簇。 这里,输入音频内容中的音频对象被转换为输出音频内容中的输出群集。 至少部分地基于音频对象的位置元数据和输出簇的位置元数据来计算一个或多个空间误差度量。
-
公开(公告)号:US20160056787A1
公开(公告)日:2016-02-25
申请号:US14780485
申请日:2014-03-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lie LU , Jun WANG , Alan SEEFELDT , Mingqing HU
Abstract: Equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.
Abstract translation: 公开了均衡器控制器和控制方法。 在一个实施例中,均衡器控制器包括用于实时地识别音频信号的音频类型的音频分类器; 以及调整单元,用于基于识别的音频类型的置信度值以连续的方式调整均衡器。
-
-
-
-
-
-
-
-
-