-
Publication No.: US20160150343A1
Publication date: 2016-05-26
Application No.: US14900117
Filing date: 2014-06-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Mingqing HU , Dirk Jeroen BREEBAART , Nicolas R. TSINGOS
IPC: H04S7/00 , G10L19/008 , G10L19/02
CPC classification number: H04S7/30 , G10L19/008 , G10L19/0204 , G10L19/20 , G10L21/0272 , H04S3/002 , H04S5/005 , H04S2400/11 , H04S2400/13 , H04S2400/15 , H04S2420/07
Abstract: Embodiments of the present invention relate to adaptive audio content generation. Specifically, a method for generating adaptive audio content is provided. The method comprises extracting at least one audio object from channel-based source audio content, and generating the adaptive audio content at least partially based on the at least one audio object. Corresponding system and computer program product are also disclosed.
-
Publication No.: US20230232176A1
Publication date: 2023-07-20
Application No.: US18008431
Filing date: 2021-06-10
Inventor: Aaron Steven MASTER , Lie LU , Heiko PURNHAGEN
IPC: H04S7/00 , G10L21/0308 , H04S1/00 , G10L25/18
CPC classification number: H04S7/30 , G10L21/0308 , G10L25/18 , H04S1/007 , H04S2400/11
Abstract: A method comprises: obtaining softmask values for frequency bins of time-frequency tiles representing an audio signal; reducing, or expanding and limiting, the softmask values; and applying the reduced, or expanded and limited, softmask values to the frequency bins to create a time-frequency representation of an estimated target source. An alternative method comprises, for each time-frequency tile: obtaining softmask values; applying the softmask values to the frequency bins to create a time-frequency domain representation of an estimated target source; obtaining a panning parameter estimate and a source phase concentration estimate for the target source; determining, using the panning parameter estimate and the softmask values, a magnitude for the time-frequency representation of the estimated target source; determining, using the panning parameter estimate and the source phase concentration estimate, a phase for the time-frequency representation of the estimated target source; and combining the magnitude and the phase.
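The first method in this abstract can be illustrated with a minimal sketch. The abstract does not say how the softmask values are reduced or expanded, so the power-law reduction, the gain-based expansion, the default parameter values, and all function names below are assumptions made only for illustration.

```python
def reduce_softmask(mask, exponent=2.0):
    """Reduce softmask values in [0, 1] by raising them to a power > 1 (assumed rule)."""
    return [m ** exponent for m in mask]


def expand_and_limit_softmask(mask, gain=1.5, limit=1.0):
    """Expand softmask values by a gain, then limit them to at most `limit` (assumed rule)."""
    return [min(m * gain, limit) for m in mask]


def apply_softmask(bins, mask):
    """Scale each complex frequency bin of a tile by its softmask value."""
    return [b * m for b, m in zip(bins, mask)]


# Hypothetical frequency bins of one time-frequency tile and their softmask values.
tile = [1 + 1j, 0.5 - 0.2j, 2 + 0j]
softmask = [0.9, 0.5, 0.1]

reduced = reduce_softmask(softmask)
expanded = expand_and_limit_softmask(softmask)
estimate = apply_softmask(tile, expanded)  # estimated target source for this tile
```

Reducing pushes uncertain bins toward zero, while expanding and limiting keeps more of the target at the cost of more leakage; which trade-off the patent prefers is not stated in the abstract.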
-
Publication No.: US20220366933A1
Publication date: 2022-11-17
Application No.: US17683662
Filing date: 2022-03-01
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Chunmao ZHANG , Lianwu CHEN , Ziyu YANG , Joshua Brandon LANDO , David Matthew FISCHER , Lie LU
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
-
Publication No.: US20200265849A1
Publication date: 2020-08-20
Application No.: US16869477
Filing date: 2020-05-07
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
IPC: G10L19/02 , G10L19/008 , G10L25/21
Abstract: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method comprises obtaining a set of components that are weakly correlated, the set of components generated based on the plurality of audio signals. The method comprises extracting a feature from the set of components, and determining a set of gains associated with the set of components at least in part based on the extracted feature, each of the gains indicating a proportion of a diffuse part in the associated component. The method further comprises decomposing the plurality of audio signals by applying the set of gains to the set of components. Corresponding system and computer program product are also disclosed.
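The final step of this abstract, applying per-component gains, might look like the sketch below. It assumes the weakly correlated components and the diffuse gains are already available, and it treats the direct part as the complement `1 - g` of the diffuse gain; that complement, the function name `decompose`, and the sample data are illustrative assumptions, not details from the patent.

```python
def decompose(components, diffuse_gains):
    """Split each component into a diffuse part and a direct part.

    components:    list of sample lists, assumed weakly correlated.
    diffuse_gains: one gain in [0, 1] per component, read as the
                   proportion of diffuse content in that component.
    """
    diffuse, direct = [], []
    for samples, g in zip(components, diffuse_gains):
        diffuse.append([g * s for s in samples])
        # Assumption: whatever is not diffuse is treated as direct.
        direct.append([(1.0 - g) * s for s in samples])
    return diffuse, direct


# Two hypothetical components of two samples each.
components = [[1.0, -2.0], [4.0, 0.0]]
gains = [0.25, 1.0]  # second component is treated as fully diffuse

diffuse, direct = decompose(components, gains)
```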
-
Publication No.: US20190182612A1
Publication date: 2019-06-13
Application No.: US16310569
Filing date: 2017-07-13
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lianwu CHEN , Lie LU , Dirk Jeroen BREEBAART
CPC classification number: H04S7/303 , H04R5/02 , H04S3/008 , H04S7/30 , H04S7/308 , H04S2400/01 , H04S2400/11 , H04S2400/13 , H04S2420/01
Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.
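One way to picture renderer-aware clustering is sketched below. It assumes each object's per-speaker rendering gains for the given renderer configuration are already known, measures the rendering difference as the Euclidean distance between gain vectors, and groups objects greedily under a threshold. The distance measure, the greedy strategy, the threshold, and all names are assumptions for illustration; the abstract does not define them.

```python
import math


def rendering_difference(gains_a, gains_b):
    """Distance between two objects' per-speaker rendering gains (assumed metric)."""
    return math.dist(gains_a, gains_b)


def cluster_objects(render_gains, threshold):
    """Greedily group objects whose rendering difference to a cluster's
    first member is at most `threshold`. Returns lists of object indices."""
    clusters = []
    for index, gains in enumerate(render_gains):
        for cluster in clusters:
            representative = render_gains[cluster[0]]
            if rendering_difference(gains, representative) <= threshold:
                cluster.append(index)
                break
        else:
            # No existing cluster renders similarly enough: start a new one.
            clusters.append([index])
    return clusters


# Hypothetical gains of three objects on a two-speaker renderer.
render_gains = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
clusters = cluster_objects(render_gains, threshold=0.2)
```

With this data the first two objects render almost identically and share a cluster, while the third stays on its own.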
-
Publication No.: US20190052991A9
Publication date: 2019-02-14
Application No.: US15538892
Filing date: 2016-02-09
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Lianwu CHEN , Mingqing HU
Abstract: Example embodiments disclosed herein relate to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal; generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel; extracting an audio object from the direct signal; estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. A corresponding system and computer program product are described as well.
-
Publication No.: US20190037333A1
Publication date: 2019-01-31
Application No.: US16143351
Filing date: 2018-09-26
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alan J. SEEFELDT , Lie LU , Chen ZHANG
IPC: H04S7/00 , H04S3/00 , G10L19/008
Abstract: An audio processing system and method that calculate, based on spatial metadata of a plurality of audio objects, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each of the submixes indicating a sum of components of the plurality of audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each of the submixes, and an object gain applied to each of the audio objects is controlled, the object gain being a function of the panning coefficients for that audio object and the submix gains in relation to each of the predefined channel coverage zones.
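The relationship between panning coefficients, submixes, and object gains can be sketched as follows. The abstract only says the object gain is a function of the panning coefficients and the submix gains; the panning-weighted average used here, along with the function names and sample values, is an assumption chosen for illustration.

```python
def build_submixes(object_samples, panning):
    """Sum each object's sample into every zone, weighted by its panning coefficient.

    object_samples: one sample per object (a single time instant, for brevity).
    panning:        panning[i][z] is object i's coefficient for zone z.
    """
    zone_count = len(panning[0])
    return [
        sum(p[z] * x for p, x in zip(panning, object_samples))
        for z in range(zone_count)
    ]


def object_gains(panning, submix_gains):
    """Object gain as a panning-weighted average of the submix gains (assumed form)."""
    gains = []
    for p in panning:
        total = sum(p)
        weighted = sum(pz * gz for pz, gz in zip(p, submix_gains))
        gains.append(weighted / total if total else 1.0)
    return gains


# Two hypothetical objects panned across two channel coverage zones.
panning = [[1.0, 0.0], [0.5, 0.5]]
samples = [2.0, 4.0]

submixes = build_submixes(samples, panning)   # one value per zone
submix_gains = [0.5, 1.0]                     # e.g. from per-zone audio processing
gains = object_gains(panning, submix_gains)   # one gain per object
```

An object panned entirely into one zone inherits that zone's submix gain, while an object spread across zones receives a blend of their gains.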
-
Publication No.: US20170215019A1
Publication date: 2017-07-27
Application No.: US15328631
Filing date: 2015-07-23
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lianwu CHEN , Lie LU
IPC: H04S7/00 , G10L21/038 , H04S3/00
CPC classification number: H04S7/302 , G10L19/008 , G10L21/0308 , G10L21/038 , H04S3/008 , H04S2400/01 , H04S2400/11 , H04S2400/13 , H04S2420/07
Abstract: Example embodiments relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. A corresponding system and computer program product are also disclosed.
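The splitting step in this abstract could be sketched as below, assuming the sub-band object probability has already been determined. Using the probability directly as a linear gain is an assumption made for illustration, as are the function name and the sample values; the abstract does not state how the probability is mapped to the split.

```python
def split_sub_band(sub_band, object_probability):
    """Split sub-band samples into an object portion and a residual portion.

    The probability is clamped to [0, 1] and used as a linear gain
    (an illustrative assumption, not a detail from the patent).
    """
    p = min(max(object_probability, 0.0), 1.0)
    object_part = [p * s for s in sub_band]
    residual_part = [(1.0 - p) * s for s in sub_band]
    return object_part, residual_part


# Hypothetical sub-band samples in one frame with a 75% object probability.
object_part, residual_part = split_sub_band([2.0, -4.0], 0.75)
```

Under this assumption the two portions always sum back to the original sub-band, so no signal energy is discarded by the split itself.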
-
Publication No.: US20170026017A1
Publication date: 2017-01-26
Application No.: US15284953
Filing date: 2016-10-04
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jun WANG , Lie LU , Alan SEEFELDT
CPC classification number: H03G7/002 , G10L21/0364 , G10L25/30 , G10L25/51 , H03G3/3089 , H03G3/32 , H03G5/165 , H03G7/007 , H04M7/006 , H04M2203/305
Abstract: A volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
-
Publication No.: US20160267914A1
Publication date: 2016-09-15
Application No.: US15031887
Filing date: 2014-11-25
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Mingqing HU , Lie LU , Jun WANG
IPC: G10L19/02 , G10L19/038 , H04S3/00 , G10L19/008
CPC classification number: G10L19/02 , G10L19/008 , G10L19/038 , H04S3/008 , H04S2400/11
Abstract: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. Corresponding system and computer program product are also disclosed.