-
公开(公告)号:US10200804B2
公开(公告)日:2019-02-05
申请号:US15553536
申请日:2016-02-24
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lianwu Chen , Xuejing Sun , Lie Lu
Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20170171687A1
公开(公告)日:2017-06-15
申请号:US15375488
申请日:2016-12-12
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dirk Jeroen Breebaart , Lianwu Chen , Lie Lu
CPC classification number: H04S7/30 , H04S2400/11 , H04S2400/13
Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.
-
公开(公告)号:US11929091B2
公开(公告)日:2024-03-12
申请号:US17683662
申请日:2022-03-01
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Chunmao Zhang , Lianwu Chen , Ziyu Yang , Joshua Brandon Lando , David Matthew Fischer , Lie Lu
CPC classification number: G10L25/78 , G06N3/08 , G10L25/30 , H04R5/033 , H04R5/04 , H04S1/002 , H04R2420/07
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
-
公开(公告)号:US11264050B2
公开(公告)日:2022-03-01
申请号:US17050786
申请日:2019-04-24
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Chunmao Zhang , Lianwu Chen , Ziyu Yang , Joshua Brandon Lando , David Matthew Fischer , Lie Lu
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
-
公开(公告)号:US09830922B2
公开(公告)日:2017-11-28
申请号:US15117647
申请日:2015-02-23
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lianwu Chen , Lie Lu , Dirk Jeroen Breebaart
Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.
-
公开(公告)号:US11363398B2
公开(公告)日:2022-06-14
申请号:US15535398
申请日:2015-12-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lianwu Chen , Lie Lu , Nicolas R. Tsingos
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US10779106B2
公开(公告)日:2020-09-15
申请号:US16310569
申请日:2017-07-13
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Lianwu Chen , Lie Lu , Dirk Jeroen Breebaart
Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.
-
公开(公告)号:US20220159395A1
公开(公告)日:2022-05-19
申请号:US17427665
申请日:2020-02-12
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lianwu Chen , Lie Lu
IPC: H04S7/00
Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
-
公开(公告)号:US10278000B2
公开(公告)日:2019-04-30
申请号:US15375488
申请日:2016-12-12
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Dirk Jeroen Breebaart , Lianwu Chen , Lie Lu
IPC: H04S7/00
Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.
-
公开(公告)号:US11937064B2
公开(公告)日:2024-03-19
申请号:US17737184
申请日:2022-05-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lianwu Chen , Lie Lu , Nicolas R. Tsingos
IPC: H04S3/00 , G06F18/2321 , H04S7/00
CPC classification number: H04S3/008 , H04S7/30 , G06F18/2321 , H04S2400/01 , H04S2400/09 , H04S2400/11 , H04S2420/03
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
-
-
-
-
-
-
-
-
-