-
公开(公告)号:US20230410829A1
公开(公告)日:2023-12-21
申请号:US18251876
申请日:2021-11-04
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Ning WANG
IPC: G10L21/0232 , G10L25/84 , G10L25/30
CPC classification number: G10L21/0232 , G10L25/84 , G10L25/30 , G10L2021/02166
Abstract: In an embodiment, a method comprises: receiving bands of power spectra of an input audio signal and a microphone covariance, and for each band: estimating, using a classifier, respective probabilities of speech and noise; estimating, using a directionality model, a set of means for speech and noise, or a set of means and covariances for speech and noise, based on the microphone covariance for the band and the probabilities; estimating, using a level model, a mean and covariance of noise power based on the probabilities and the power spectra; determining a first noise suppression gain based on the directionality model; determining a second noise suppression gain based on the level model; selecting the first or second noise suppression gain or their sum based on a signal-to-noise ratio of the input audio signal; and scaling a time-frequency representation of the input signal by the selected noise suppression gain.
-
公开(公告)号:US20180336902A1
公开(公告)日:2018-11-22
申请号:US15546109
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Kai LI , Xuejing SUN
Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve analyzing conversational dynamics of the conference recording. Some examples may involve searching the conference recording to determine instances of segment classifications. The segment classifications may be based, at least in part, on conversational dynamics data. Some implementations may involve segmenting the conference recording into a plurality of segments, each of the segments corresponding with a time interval and at least one of the segment classifications. Some implementations allow a listener to scan through a conference recording quickly according to segments, words, topics and/or talkers of interest.
-
公开(公告)号:US20180191912A1
公开(公告)日:2018-07-05
申请号:US15548265
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Xuejing SUN
Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. In some examples, only a portion of the received audio data will be selected as playback audio data. The selection process may involve a topic selection process, a talkspurt filtering process and/or an acoustic feature selection process. Some examples involve receiving an indication of a target playback time duration. Selecting the portion of audio data may involve making a time duration of the playback audio data within a threshold time difference of the target playback time duration.
-
公开(公告)号:US20180014139A1
公开(公告)日:2018-01-11
申请号:US15547043
申请日:2016-02-02
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Glenn N. DICKINS , Richard J. CARTWRIGHT
CPC classification number: H04S7/303 , H04R3/005 , H04R5/027 , H04S7/00 , H04S2400/11 , H04S2400/15
Abstract: Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.
-
5.
公开(公告)号:US20180006837A1
公开(公告)日:2018-01-04
申请号:US15546925
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Glenn N. DICKINS
Abstract: Some aspects of the present disclosure involve the recording, processing and playback of audio data corresponding to conferences, such as teleconferences. In some teleconference implementations, the audio experience heard when a recording of the conference is played back may be substantially different from the audio experience of an individual conference participant during the original teleconference. In some implementations, the recorded audio data may include at least some audio data that was not available during the teleconference. In some examples, the spatial characteristics of the played-back audio data may be different from that of the audio heard by participants of the teleconference.
-
公开(公告)号:US20240098435A1
公开(公告)日:2024-03-21
申请号:US18469498
申请日:2023-09-18
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , David S. MCGRATH , Glenn N. DICKINS
CPC classification number: H04S1/005 , H04M3/568 , H04R3/12 , H04R5/033 , H04R5/04 , H04S7/30 , H04S7/304 , H04S2400/01 , H04S2400/03 , H04S2400/11 , H04S2420/01 , H04S2420/11
Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
公开(公告)号:US20220030370A1
公开(公告)日:2022-01-27
申请号:US17397887
申请日:2021-08-09
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , David S. MCGRATH , Glenn N. DICKINS
Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
公开(公告)号:US20200021935A1
公开(公告)日:2020-01-16
申请号:US16518666
申请日:2019-07-22
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , David S. MCGRATH , Glenn N. DICKINS
Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
公开(公告)号:US20180295459A1
公开(公告)日:2018-10-11
申请号:US16009154
申请日:2018-06-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , David S. MCGRATH , Glenn N. DICKINS
CPC classification number: H04S1/005 , H04M3/568 , H04S7/30 , H04S7/304 , H04S2400/01 , H04S2400/03 , H04S2400/11 , H04S2420/01 , H04S2420/11
Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
公开(公告)号:US20180054688A1
公开(公告)日:2018-02-22
申请号:US15677995
申请日:2017-08-15
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Richard J. CARTWRIGHT , Peter MARTIN , Christopher Stanley MCGRATH , Glenn N. DICKINS
CPC classification number: H04S3/002 , A61B5/1118 , A61B5/14546 , A61B5/165 , G10L25/48 , G10L25/51 , H04M2203/252 , H04R1/028 , H04R1/1083 , H04R1/1091 , H04R3/005 , H04R29/008 , H04R2420/01 , H04S7/30 , H04S2400/15
Abstract: Some disclosed implementations include an interface system and a control system. The control system may be capable of receiving, via the interface system, microphone data. The control system may be capable of determining, based at least in part on the microphone data, instances of one or more acoustic events. The instances of one or more acoustic events may, in some examples, include conversational dynamics data. The control system may be capable of providing behavior modification feedback, via the interface system, corresponding with the instances of the one or more acoustic events.
-
-
-
-
-
-
-
-
-