-
公开(公告)号:US20210343308A1
公开(公告)日:2021-11-04
申请号:US17281006
申请日:2019-09-26
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Yuanxing MA
IPC: G10L21/034 , H03G9/18 , G10L25/18 , G10L25/21 , G10L25/51 , G10L21/0364
Abstract: The present application relates to a method, system, and computer program product of dynamically adjusting thresholds of a compressor responsive to an input audio signal. A scene switch analyzer receives an input audio signal having a plurality of frequency band components. The scene switch analyzer determines whether a scene switch has occurred in the input audio signal. The frequency band components of the input audio signal are processed. In response to determine that scene switch has not occurred, a distortion audibility system applies slow smoothing to compressor thresholds of the frequency band components. In response to determine that scene switch has occurred, the distortion audibility system applies fast smoothing or no smoothing to the compressor thresholds of the frequency band components.
-
公开(公告)号:US20250046328A1
公开(公告)日:2025-02-06
申请号:US18709129
申请日:2022-10-26
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Jundai SUN , Zhiwei SHUANG , Yuanxing MA
IPC: G10L21/028 , G10L25/84 , G10L25/93
Abstract: The present disclosure relates to a method and audio processing system (1) for performing source separation. The method comprises obtaining (S1) an audio signal (Sin) including a mixture of speech content and noise content, determining (S2a, S2b, S2c), from the audio signal, speech content (formula A), stationary noise content (formula C) and non-speech content (formula B). The stationary noise content (formula C) is a true subset of the non-speech content (formula B) and the method further comprises determining (S3), based on a difference between the stationary noise content (formula C) and the non-speech content (formula B) a non-stationary noise content formula D), obtaining (S5) a set of weighting factors and forming (S6) a processed audio signal based on a combination of the speech content (formula A), the stationary noise content (formula C), and the non-stationary noise content (formula D) weighted with their respective weighting factor. (Ŝ1) formula A ({circumflex over (N)}1) formula B ({circumflex over (N)}2) formula C ({circumflex over (N)}NS) formula D
-
公开(公告)号:US20250045585A1
公开(公告)日:2025-02-06
申请号:US18716895
申请日:2022-12-08
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Jundai SUN , Lie LU , Zhiwei SHUANG , Yuanxing MA
IPC: G06N3/082
Abstract: The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21: 1, 21: 2, . . . 21: n−1) to a supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n−1) which outputs a second prediction of target data based on latent variables and determining a first and second loss measure by comparing the predictions of target data with the ground truth target data. The method further comprises training the processor (20) and the supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n−1) based on the first and second loss measure and adjusting the processor by at least one of removing, replacing and adding a processor module.
-
公开(公告)号:US20240170001A1
公开(公告)日:2024-05-23
申请号:US18548854
申请日:2022-03-09
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Yuanxing MA , Kai LI
IPC: G10L21/0232 , G10L21/0208 , H04S7/00
CPC classification number: G10L21/0232 , H04S7/305 , G10L2021/02082
Abstract: A method for reverberation suppression may involve receiving an input audio signal. The method may involve calculating an initial reverberation suppression gain for the input audio signal for at least one frame of the input audio signal. The method may involve calculating at least one adjusted reverberation suppression gain, where the at least one adjusted reverberation suppression gain adjusts at least one of: 1) a reverberation suppression decay based on a reverberation intensity detected in the input audio signal; 2) gains applied to different frequency bands of the input audio signal based on an amount of room resonance detected in the input audio signal; or 3) a loudness of the input audio signal based on a direct part of the input audio signal. The method may involve generating an output audio signal by applying the at least one adjusted reverberation suppression gain to the input audio signal.
-
公开(公告)号:US20240155289A1
公开(公告)日:2024-05-09
申请号:US18548791
申请日:2022-04-28
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Zhiwei SHUANG , Yuanxing MA , Yang LIU
IPC: H04R3/00 , G10L21/0216 , H04R1/10 , H04S7/00
CPC classification number: H04R3/005 , G10L21/0216 , H04R1/1091 , H04S7/304 , H04R1/1016 , H04R2420/01 , H04R2499/11
Abstract: Embodiments are disclosed for context aware soundscape control. In an embodiment, an audio processing method comprises: capturing, using a first set of microphones on a mobile device, a first audio signal from an audio scene; capturing, using a second set of microphones on a pair of earbuds, a second audio signal from the audio scene; capturing, using a camera on the mobile device, a video signal from a video scene; generating, with at least one processor, a processed audio signal from the first audio signal and the second audio signal, the processed audio signal generated with adaptive soundscape control based on context information; and combining, with the at least one processor, the processed audio signal and the captured video signal as multimedia output.
-
公开(公告)号:US20240170004A1
公开(公告)日:2024-05-23
申请号:US18548750
申请日:2022-04-28
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Zhiwei SHUANG , Yuanxing MA , Yang LIU
IPC: G10L21/0364 , G10L21/034 , G10L25/18 , G10L25/30 , G10L25/78
CPC classification number: G10L21/0364 , G10L21/034 , G10L25/18 , G10L25/30 , G10L25/78
Abstract: Embodiments are disclosed for context aware audio processing. In an embodiment, an audio processing method comprises: receiving, with one or more sensors of a device, environment information about an audio recording captured by the device; detecting, with at least one processor of the device, a context of the audio recording based on the audio recording and the environment information; determining, with the at least one processor, a model based on the context; processing, with the at least one processor, the audio recording based on the model to produce a processed audio recording with suppressed noise; determining, with the at least one processor, an audio processing profile based on the context; and combining, with the at least one processor, the audio recording and the processed audio recording based on the audio processing profile.
-
公开(公告)号:US20240170002A1
公开(公告)日:2024-05-23
申请号:US18549575
申请日:2022-03-10
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Kai LI , Shaofan YANG , Yuanxing MA
IPC: G10L21/0232 , G10L21/0208 , G10L21/028 , G10L25/18 , G10L25/21 , G10L25/51
CPC classification number: G10L21/0232 , G10L21/028 , G10L25/18 , G10L25/21 , G10L25/51 , G10L2021/02082
Abstract: A method for reverberation suppression may involve receiving an input audio signal. The method may involve classifying a media type of the input audio signal as one of a group comprising at least: 1) speech; 2) music; or 3) speech over music. The method may involve determining whether to perform dereverberation on the input audio signal based at least on a determination that the media type of the input audio signal has been classified as speech. The method may involve generating an output audio signal by performing dereverberation on the input audio signal in response to determining that dereverberation is to be performed on the input audio signal.
-
公开(公告)号:US20240080608A1
公开(公告)日:2024-03-07
申请号:US18257862
申请日:2021-12-14
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Yuanxing MA , Zhiwei SHUANG , Yang LIU
CPC classification number: H04R1/1083 , H04R5/04 , H04S7/301 , H04R2430/03 , H04R2499/11 , H04S2400/01 , H04S2400/15 , H04S2420/07
Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.
-
-
-
-
-
-
-