Patent search ap:("Dolby Laboratories Licensing Corporation") AND inv:"Lie LU" Page 5

41.

发明申请
Video Content Assisted Audio Object Extraction 审中-公开

公开(公告)号：US20180054689A1

公开(公告)日：2018-02-22

申请号：US15553536

申请日：2016-02-24

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Lianwu CHEN , Xuejing SUN , Lie LU

IPC: H04S3/00 , H04S3/02 , G06K9/00

CPC classification number: H04S3/008 , G06K9/00718 , G10L19/20 , H04S3/02 , H04S2400/01 , H04S2400/11 , H04S2420/11

Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.

42.

发明申请
Projection-Based Audio Object Extraction from Audio Content 审中-公开

公开(公告)号：US20170344852A1

公开(公告)日：2017-11-30

申请号：US15538306

申请日：2015-12-18

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Mingqing HU , Lie LU , Lianwu CHEN

IPC: G06K9/62 , H03H21/00 , G06F17/15 , H04S5/00

CPC classification number: G06K9/624 , G06F17/15 , H03H21/00 , H03H2021/0034 , H04S5/00 , H04S2400/03 , H04S2400/11

Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.

43.

发明申请
METADATA-PRESERVED AUDIO OBJECT CLUSTERING 审中-公开

公开(公告)号：US20170339506A1

公开(公告)日：2017-11-23

申请号：US15535398

申请日：2015-12-10

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Lianwu CHEN , Lie LU , Nicolas R. TSINGOS

IPC: H04S3/00

CPC classification number: H04S3/008 , G06K9/6226 , H04S7/30 , H04S2400/01 , H04S2400/09 , H04S2400/11 , H04S2420/03

Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.

44.

发明申请
Generating Metadata for Audio Object 审中-公开

公开(公告)号：US20170238117A1

公开(公告)日：2017-08-17

申请号：US15508065

申请日：2015-08-31

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Mingqing HU , Lie LU

IPC: H04S7/00 , G11B27/10

CPC classification number: H04S7/302 , G01S5/18 , G10L19/008 , G11B27/10 , H04S7/30 , H04S7/301 , H04S2400/01 , H04S2400/11 , H04S2420/01

Abstract: Example embodiments disclosed herein relate to audio object processing. A method for processing audio content, which includes at least one audio object of a multi-channel format, is disclosed. The method includes generating metadata associated with the audio object, the metadata including at least one of an estimated trajectory of the audio object and an estimated perceptual size of the audio object, the perceptual size being a perceived area of a phantom of the audio object produced by at least two transducers. Corresponding system and computer program product are also disclosed.

45.

发明申请
AUDIO OBJECT CLUSTERING BY UTILIZING TEMPORAL VARIATIONS OF AUDIO OBJECTS 有权
Title translation: 使用音频对象的时间变化的音频对象聚类

公开(公告)号：US20160358618A1

公开(公告)日：2016-12-08

申请号：US15117647

申请日：2015-02-23

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Lianwu CHEN , Lie LU , Dirk Jeroen BREEBAART

IPC: G10L19/20 , G10L25/21 , G10L19/022

CPC classification number: G10L19/20 , G10L19/022 , G10L25/03 , G10L25/21 , G10L25/48 , H04S7/30

Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.

Abstract translation: 本发明的实施例涉及通过利用音频对象的时间变化的音频对象聚类。提供了一种估计用于音频对象聚类的音频对象的时间变化的方法。所述方法包括获得与所述音频对象相关联的音轨的至少一个段，所述至少一个段包含所述音频对象; 基于所述音频对象的至少一个属性来估计所述音频对象在所述至少一个段的持续时间上的变化，并且至少部分地基于所估计的所述音频对象的变化来调整所述音频对象对所述音频对象的贡献确定音频对象聚类中的质心。披露了相应的系统和计算机程序产品。

46.

发明公开
DETERMINING DIALOG QUALITY METRICS OF A MIXED AUDIO SIGNAL 审中-公开

公开(公告)号：US20240071411A1

公开(公告)日：2024-02-29

申请号：US18259848

申请日：2022-01-04

Applicant: Dolby Laboratories Licensing Corporation

Inventor： Jundai SUN , Lie LU , Shaofan YANG , Rhonda J. WILSON , Dirk Jeroen BREEBAART

IPC: G10L25/60 , G10L21/0272

CPC classification number: G10L25/60 , G10L21/0272

Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.

47.

发明公开
VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD 审中-公开

公开(公告)号：US20240039499A1

公开(公告)日：2024-02-01

申请号：US18356044

申请日：2023-07-20

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Jun WANG , Lie LU , Alan J. SEEFELDT

IPC: H03G7/00 , H03G3/30 , H03G3/32 , H03G5/16 , G10L25/30 , G10L25/51 , G10L21/0364

CPC classification number: H03G7/002 , H03G3/3089 , H03G7/007 , H03G3/32 , H03G5/165 , G10L25/30 , G10L25/51 , G10L21/0364 , H04M7/006

Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

48.

发明公开
METHOD, APPARATUS OR SYSTEMS FOR PROCESSING AUDIO OBJECTS 审中-公开

公开(公告)号：US20230353970A1

公开(公告)日：2023-11-02

申请号：US18349704

申请日：2023-07-10

Applicant: Dolby Laboratories Licensing Corporation , DOLBY INTERNATIONAL AB

Inventor： Dirk Jeroen BREEBAART , Lie LU , Nicolas R. TSINGOS , Antonio MATEOS SOLE

IPC: G10L19/008 , G10L19/20 , H04S3/00 , H04S7/00 , G10L19/00 , G10L19/018

CPC classification number: H04S7/308 , G10L19/00 , G10L19/008 , G10L19/018 , G10L19/20 , H04S3/002 , H04S2400/11 , H04S2400/13 , H04S2400/15 , H04S2420/03 , H04S2420/07

Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

49.

发明公开
METHODS, APPARATUS, AND SYSTEMS FOR DETECTION AND EXTRACTION OF SPATIALLY-IDENTIFIABLE SUBBAND AUDIO SOURCES 审中-公开

公开(公告)号：US20230245671A1

公开(公告)日：2023-08-03

申请号：US18009501

申请日：2021-06-11

Applicant: Dolby Laboratories Licensing Corporation , DOLBY INTERNATIONAL AB

Inventor： Aaron Steven MASTER , Lie LU , Harald MUNDT

IPC: G10L21/0272

CPC classification number: G10L21/0272

Abstract: In an embodiment, a method comprises: transforming one or more frames of a two-channel time domain audio signal into a time-frequency domain representation including a plurality of time-frequency tiles, wherein the frequency domain of the time-frequency domain representation includes a plurality of frequency bins grouped into subbands. For each time-frequency tile, the method comprises: calculating spatial parameters and a level for the time-frequency tile; modifying the spatial parameters using shift and squeeze parameters; obtaining a softmask value for each frequency bin using the modified spatial parameters, the level and subband information; and applying the softmask values to the time-frequency tile to generate a modified time-frequency tile of an estimated audio source. In an embodiment, a plurality of frames of the time-frequency tiles are assembled into a plurality of chunks, wherein each chunk includes a plurality of subbands, and the method described above is performed on each subband of each chunk.

50.

发明申请
VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD 有权

公开(公告)号：US20220116006A1

公开(公告)日：2022-04-14

申请号：US17556722

申请日：2021-12-20

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： Jun WANG , Lie LU , Alan J. SEEFELDT

IPC: H03G7/00 , G10L21/0364 , G10L25/30 , G10L25/51 , H03G3/32 , H03G5/16 , H03G3/30

Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification