Patent search ap:("Dolby Laboratories Licensing Corporation") AND inv:"Zhiwei SHUANG"

1.

Invention application
SOURCE SEPARATION AND REMIXING IN SIGNAL PROCESSING
Legal status: In force

Publication (announcement) number: US20250046328A1

Publication (announcement) date: 2025-02-06

Application number: US18709129

Application date: 2022-10-26

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Jundai SUN, Zhiwei SHUANG, Yuanxing MA

IPC: G10L21/028, G10L25/84, G10L25/93

Abstract: The present disclosure relates to a method and audio processing system (1) for performing source separation. The method comprises obtaining (S1) an audio signal ($S_{in}$) including a mixture of speech content and noise content, and determining (S2a, S2b, S2c), from the audio signal, speech content ($\hat{S}_1$), stationary noise content ($\hat{N}_2$) and non-speech content ($\hat{N}_1$). The stationary noise content ($\hat{N}_2$) is a true subset of the non-speech content ($\hat{N}_1$), and the method further comprises determining (S3), based on a difference between the stationary noise content ($\hat{N}_2$) and the non-speech content ($\hat{N}_1$), a non-stationary noise content ($\hat{N}_{NS}$), obtaining (S5) a set of weighting factors, and forming (S6) a processed audio signal based on a combination of the speech content ($\hat{S}_1$), the stationary noise content ($\hat{N}_2$), and the non-stationary noise content ($\hat{N}_{NS}$), each weighted with its respective weighting factor.
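
The remixing step described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the three separated components are already available as equal-length sequences of samples, and it reads "difference" as a plain element-wise subtraction. The function name `remix`, the weight keys and the sample values are hypothetical.

```python
from typing import Dict, List, Sequence


def remix(
    speech: Sequence[float],
    non_speech: Sequence[float],
    stationary: Sequence[float],
    weights: Dict[str, float],
) -> List[float]:
    """Combine separated components into one processed signal."""
    # Assumption: non-stationary noise is the non-speech content
    # minus its stationary part, element by element.
    non_stationary = [n - s for n, s in zip(non_speech, stationary)]
    # Weighted sum of the three components, one weight per component.
    return [
        weights["speech"] * sp
        + weights["stationary"] * st
        + weights["non_stationary"] * ns
        for sp, st, ns in zip(speech, stationary, non_stationary)
    ]


# Hypothetical separated components (three samples each).
speech = [0.5, -0.25, 0.0]
non_speech = [0.25, 0.5, -0.5]
stationary = [0.125, 0.25, -0.25]

# Keep speech, halve the stationary noise, remove non-stationary noise.
processed = remix(
    speech,
    non_speech,
    stationary,
    {"speech": 1.0, "stationary": 0.5, "non_stationary": 0.0},
)
```

With all three weights set to 1.0, the sketch returns the sum of the speech and non-speech content, which is a quick sanity check that the decomposition loses nothing.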

2.

Invention publication
CONTEXT AWARE SOUNDSCAPE CONTROL
Legal status: Under examination (published)

Publication (announcement) number: US20240155289A1

Publication (announcement) date: 2024-05-09

Application number: US18548791

Application date: 2022-04-28

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Zhiwei SHUANG, Yuanxing MA, Yang LIU

IPC: H04R3/00, G10L21/0216, H04R1/10, H04S7/00

CPC classification number: H04R3/005, G10L21/0216, H04R1/1091, H04S7/304, H04R1/1016, H04R2420/01, H04R2499/11

Abstract: Embodiments are disclosed for context aware soundscape control. In an embodiment, an audio processing method comprises: capturing, using a first set of microphones on a mobile device, a first audio signal from an audio scene; capturing, using a second set of microphones on a pair of earbuds, a second audio signal from the audio scene; capturing, using a camera on the mobile device, a video signal from a video scene; generating, with at least one processor, a processed audio signal from the first audio signal and the second audio signal, the processed audio signal generated with adaptive soundscape control based on context information; and combining, with the at least one processor, the processed audio signal and the captured video signal as multimedia output.

3.

Invention application
CONTROL OF A VOLUME LEVELING UNIT USING TWO-STAGE NOISE CLASSIFIER
Legal status: In force

Publication (announcement) number: US20250166652A1

Publication (announcement) date: 2025-05-22

Application number: US18835248

Application date: 2023-02-06

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Ziyu YANG, Lie LU, Zhiwei SHUANG

IPC: G10L21/0224, G06F3/16, G10L21/034

Abstract: Volume leveling of an audio signal using a volume leveling control signal. The method comprises determining a noise reliability ratio $w(n)$ as a ratio of noise-like frames over all frames in a current time segment, determining a PGC noise confidence score $X_{PGN}(n)$ indicating a likelihood that professionally generated content (PGC) noise is present in the time segment, and determining, for the time segment, whether the noise reliability ratio is above a predetermined threshold. When the noise reliability ratio is above the predetermined threshold, the volume leveling control signal is updated based on the PGC noise confidence score, and when the noise reliability ratio is below the predetermined threshold, the volume leveling control signal is left unchanged. Volume leveling is improved by preventing boosting of e.g. phone-recorded environmental noise in user-generated content (UGC), while keeping original behavior for other types of content.
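
The gating logic in this abstract can be sketched in a few lines of Python. The abstract only says the control signal is updated "based on" the confidence score. Simply adopting the score as the new control value is an assumption made here for illustration, as are the function names and the default threshold of 0.5.

```python
from typing import Sequence


def noise_reliability_ratio(frame_is_noise_like: Sequence[bool]) -> float:
    """Ratio of noise-like frames over all frames in the time segment."""
    if not frame_is_noise_like:
        raise ValueError("time segment must contain at least one frame")
    return sum(frame_is_noise_like) / len(frame_is_noise_like)


def update_control(
    control: float,
    frame_is_noise_like: Sequence[bool],
    pgc_noise_confidence: float,
    threshold: float = 0.5,  # hypothetical threshold value
) -> float:
    """Return the control signal value for the current time segment."""
    w = noise_reliability_ratio(frame_is_noise_like)
    if w > threshold:
        # Assumption: the update adopts the confidence score directly.
        return pgc_noise_confidence
    # Ratio not above the threshold: leave the control signal unchanged.
    return control


# Three of four frames are noise-like, so the control signal is updated.
updated = update_control(0.2, [True, True, True, False], 0.9)
# One of four frames is noise-like, so the previous value is kept.
kept = update_control(0.2, [True, False, False, False], 0.9)
```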

4.

Invention application
METHOD AND APPARATUS FOR SPEECH SOURCE SEPARATION BASED ON A CONVOLUTIONAL NEURAL NETWORK
Legal status: In force

Publication (announcement) number: US20220223144A1

Publication (announcement) date: 2022-07-14

Application number: US17611121

Application date: 2020-05-13

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Jundai SUN, Zhiwei SHUANG, Lie LU, Shaofan YANG, Jia DAI

IPC: G10L15/20, G10L15/16, G10L15/22, G10L21/0308, G10L25/18, G06N3/08

Abstract: Described herein is a method for Convolutional Neural Network (CNN) based speech source separation, wherein the method includes the steps of: (a) providing multiple frames of a time-frequency transform of an original noisy speech signal; (b) inputting the time-frequency transform of said multiple frames into an aggregated multi-scale CNN having a plurality of parallel convolution paths; (c) extracting and outputting, by each parallel convolution path, features from the input time-frequency transform of said multiple frames; (d) obtaining an aggregated output of the outputs of the parallel convolution paths; and (e) generating an output mask for extracting speech from the original noisy speech signal based on the aggregated output. Described herein are further an apparatus for CNN based speech source separation as well as a respective computer program product comprising a computer-readable storage medium with instructions adapted to carry out said method when executed by a device having processing capability.

5.

Invention application
Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network
Legal status: In force

Publication (announcement) number: US20210051435A1

Publication (announcement) date: 2021-02-18

Application number: US17012076

Application date: 2020-09-04

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Kuan-Chieh YEN, Dirk Jeroen BREEBAART, Grant A. DAVIDSON, Rhonda WILSON, David M. COOPER, Zhiwei SHUANG

IPC: H04S7/00, G10L19/008, H04S3/00

Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
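
The second processing path described here (a downmix of the channels fed through a feedback delay network) can be illustrated with a generic four-line FDN in Python. This is a textbook-style sketch, not the structure claimed in the application: it produces a mono tail rather than a binaural one, omits the direct-response and early-reflection path, and uses hypothetical delay lengths and a hypothetical loop gain.

```python
from typing import List, Sequence

# Scaled by 0.5, this 4x4 Hadamard matrix is orthogonal, so a loop
# gain below 1 keeps the feedback network stable.
HADAMARD_4 = [
    [1, 1, 1, 1],
    [1, -1, 1, -1],
    [1, 1, -1, -1],
    [1, -1, -1, 1],
]


def downmix(channels: Sequence[Sequence[float]]) -> List[float]:
    """Average all channels sample by sample into one signal."""
    return [sum(frame) / len(channels) for frame in zip(*channels)]


def fdn_late_reverb(
    signal: Sequence[float],
    delays: Sequence[int] = (149, 211, 263, 293),  # hypothetical lengths
    gain: float = 0.7,  # hypothetical loop gain, must stay below 1
) -> List[float]:
    """Run a signal through a four-line feedback delay network."""
    if len(delays) != 4:
        raise ValueError("this sketch uses exactly four delay lines")
    lines = [[0.0] * d for d in delays]  # one circular buffer per line
    pos = [0] * 4
    out = []
    for sample in signal:
        # Read the oldest sample of each delay line.
        taps = [lines[i][pos[i]] for i in range(4)]
        out.append(sum(taps) / 4)
        for i in range(4):
            # Mix the taps through the feedback matrix and re-inject.
            feedback = 0.5 * gain * sum(
                HADAMARD_4[i][j] * taps[j] for j in range(4)
            )
            lines[i][pos[i]] = sample + feedback
            pos[i] = (pos[i] + 1) % delays[i]
    return out


# Two-channel impulse, downmixed and passed through the network.
impulse_left = [1.0] + [0.0] * 399
impulse_right = [1.0] + [0.0] * 399
tail = fdn_late_reverb(downmix([impulse_left, impulse_right]))
```

Applying one network to the downmix, instead of one per channel, is what makes the late reverberation "common" to all channels in the sense of the abstract.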

6.

Invention application
Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network
Legal status: Under examination (published)

Publication (announcement) number: US20190373397A1

Publication (announcement) date: 2019-12-05

Application number: US16541079

Application date: 2019-08-14

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Kuan-Chieh YEN, Dirk Jeroen BREEBAART, Grant A. DAVIDSON, Rhonda WILSON, David M. COOPER, Zhiwei SHUANG

IPC: H04S7/00, G10L19/008

Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.

7.

Invention application
Reverberation Generation for Headphone Virtualization
Legal status: Under examination (published)

Publication (announcement) number: US20190052989A1

Publication (announcement) date: 2019-02-14

Application number: US16163863

Application date: 2018-10-18

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Louis D. FIELDER, Zhiwei SHUANG, Grant A. DAVIDSON, Xiguang ZHENG, Mark S. VINTON

IPC: H04S3/00, H04S7/00, H04S5/00, G10K15/08

CPC classification number: H04S3/004, G10K15/08, H04S5/005, H04S7/302, H04S7/304, H04S2400/01, H04S2420/01

Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.

8.

Invention application
Method for Generating a Surround Sound Field, Apparatus and Computer Program Product Thereof
Legal status: In force

Publication (announcement) number: US20160142851A1

Publication (announcement) date: 2016-05-19

Application number: US14899505

Application date: 2014-06-17

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Xuejing SUN, Bin CHENG, Sen XU, Zhiwei SHUANG, Jun WANG

IPC: H04S7/00, H04S3/02, H04R29/00

CPC classification number: H04S7/301, H04R29/002, H04R29/005, H04R2430/20, H04S3/02, H04S7/308, H04S2400/03, H04S2400/15, H04S2420/01, H04S2420/11

Abstract: Embodiments of the present invention relate to adaptive audio content generation. Specifically, a method for generating adaptive audio content is provided. The method comprises extracting at least one audio object from channel-based source audio content, and generating the adaptive audio content at least partially based on the at least one audio object. Corresponding system and computer program product are also disclosed.


9.

Invention application
IMPROVING NOISE COMPENSATION IN MASK-BASED SPEECH ENHANCEMENT
Legal status: In force

Publication (announcement) number: US20250054508A1

Publication (announcement) date: 2025-02-13

Application number: US18705446

Application date: 2022-11-07

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Jundai SUN, Zhiwei SHUANG

IPC: G10L21/0208, G10L15/04, G10L25/51, G10L25/78

Abstract: Methods and apparatus for improving noise compensation in mask-based speech enhancement are described. A method of processing an audio signal, which includes one or more speech segments, includes obtaining a mask for mask-based speech enhancement of the audio signal and obtaining a magnitude of the audio signal. An estimate of residual noise is determined in the audio signal after mask-based speech enhancement, based on the mask and the magnitude of the audio signal. A modified mask is determined based on the estimate of the residual noise. Further described are corresponding programs and computer-readable storage media.

10.

Invention application
METHOD FOR NEURAL NETWORK TRAINING WITH MULTIPLE SUPERVISORS
Legal status: In force

Publication (announcement) number: US20250045585A1

Publication (announcement) date: 2025-02-06

Application number: US18716895

Application date: 2022-12-08

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Jundai SUN, Lie LU, Zhiwei SHUANG, Yuanxing MA

IPC: G06N3/082

Abstract: The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21:1, 21:2, ... 21:n−1) to a supervisor module (22:1, 22:2, 22:3, ... 22:n−1) which outputs a second prediction of target data based on latent variables and determining a first and second loss measure by comparing the predictions of target data with the ground truth target data. The method further comprises training the processor (20) and the supervisor module (22:1, 22:2, 22:3, ... 22:n−1) based on the first and second loss measure and adjusting the processor by at least one of removing, replacing and adding a processor module.
