-
公开(公告)号:US20220124128A1
公开(公告)日:2022-04-21
申请号:US17423061
申请日:2020-01-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Timothy Alan PORT , Richard J. CARTWRIGHT
IPC: H04L65/403 , G06V20/40 , H04M3/42 , H04N7/01 , H04N19/593 , G10L15/26
Abstract: An apparatus and method relating to use of a physical writing surface (132) during a videoconference or presentation. Snapshots of a whiteboard (132) are identified by applying a difference measure to the video data (e.g., as a way of comparing frames at different times). Audio captured by a microphone may be processed to generate textual data, wherein a portion of the textual data is associated with each snapshot. The writing surface may be identified (enrolled) using gestures. Image processing techniques may be used to transform views of a writing surface.
-
公开(公告)号:US20180295241A1
公开(公告)日:2018-10-11
申请号:US15956470
申请日:2018-04-18
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT
Abstract: Embodiments are described for a soundfield system that receives a transmitting soundfield, wherein the transmitting soundfield includes a sound source at a location in the transmitting soundfield. The system determines a rotation angle for rotating the transmitting soundfield based on a desired location for the sound source. The transmitting soundfield is rotated by the determined angle and the system obtains a listener's soundfield based on the rotated transmitting soundfield. The listener's soundfield is transmitted for rendering to a listener.
-
公开(公告)号:US20180027351A1
公开(公告)日:2018-01-25
申请号:US15546576
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Hannes MUESCH
CPC classification number: H04S7/303 , G10L25/78 , H04M3/42221 , H04M3/56 , H04M3/568 , H04R1/1016 , H04R2420/07 , H04S3/008 , H04S7/302 , H04S2400/01 , H04S2400/11
Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations involve receiving or determining conversational dynamics data. One or more variables of a cost function may be based, at least in part, on the conversational dynamics data. The cost function may be a spatial optimization cost function of a vector describing a virtual conference participant position for each of the conference participants in a virtual acoustic space. The virtual acoustic space may be determined relative to a listener's head. The virtual conference participant positions may be assigned according to a solution of the cost function.
-
公开(公告)号:US20180027123A1
公开(公告)日:2018-01-25
申请号:US15548245
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Shen HUANG
CPC classification number: H04M3/568 , G10L15/26 , G10L25/48 , H04M3/42221 , H04M2203/305 , H04S2400/11
Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. The audio data may include conference participant speech data from multiple endpoints, recorded separately and/or conference participant speech data from a single endpoint corresponding to multiple conference participants and including spatial information for each conference participant of the multiple conference participants. A search of the audio data may be based on one or more search parameters. The search may be a concurrent search for multiple features of the audio data. Instances of conference participant speech may be rendered to at least two different virtual conference participant positions of a virtual acoustic space.
-
公开(公告)号:US20170208409A1
公开(公告)日:2017-07-20
申请号:US15480163
申请日:2017-04-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Richard J. CARTWRIGHT , David S. MCGRATH , Glenn N. DICKINS
CPC classification number: H04S1/005 , H04M3/568 , H04S7/30 , H04S7/304 , H04S2400/01 , H04S2400/03 , H04S2400/11 , H04S2420/01 , H04S2420/11
Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
26.
公开(公告)号:US20170078488A1
公开(公告)日:2017-03-16
申请号:US15121859
申请日:2015-02-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. CARTWRIGHT , Glenn N. DICKINS
CPC classification number: H04M3/568 , G10L15/08 , G10L21/02 , G10L25/78 , G10L25/87 , H04M3/563 , H04M2201/14 , H04R3/005 , H04R2420/01 , H04W52/0229 , Y02D70/23 , Y02D70/25
Abstract: In an audio conferencing mixing system of the type taking a plurality of audio input streams of input audio information of conference participants, including mixing transition events and outputting a plurality of audio output streams including output audio information, a method of mixing the audio output streams so as to reduce the detectability of the mixing transition events, the method including the steps of (a) determining that a transition event is to occur; (b) determining that a masking trigger is to occur; (c) scheduling the transition event to substantially occur when the masking event occurs. Change blindness mechanism to mask changes in audio conference mix and maintain perceptual continuity.
Abstract translation: 在采用会议参与者的输入音频信息的多个音频输入流的类型的音频会议混合系统中,包括混合转换事件并输出包括输出音频信息的多个音频输出流,一种将音频输出流混合的方法 为了降低混合转移事件的可检测性,该方法包括以下步骤:(a)确定发生转移事件; (b)确定将发生掩蔽触发; (c)当所述屏蔽事件发生时,调度所述转换事件基本上发生。 改变失明机制来掩盖音频会议组合的变化,并维持感知连续性。
-
公开(公告)号:US20250022465A1
公开(公告)日:2025-01-16
申请号:US18901697
申请日:2024-09-30
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark R. P. THOMAS , Richard J. CARTWRIGHT
Abstract: A method for estimating a user's location in an environment may involve receiving output signals from each microphone of a plurality of microphones in the environment. At least two microphones of the plurality of microphones may be included in separate devices at separate locations in the environment and the output signals may correspond to a current utterance of a user. The method may involve determining multiple current acoustic features from the output signals of each microphone and applying a classifier to the multiple current acoustic features. Applying the classifier may involve applying a model trained on previously-determined acoustic features derived from a plurality of previous utterances made by the user in a plurality of user zones in the environment. The method may involve determining, based at least in part on output from the classifier, an estimate of the user zone in which the user is currently located.
-
公开(公告)号:US20230319190A1
公开(公告)日:2023-10-05
申请号:US17628732
申请日:2020-07-29
Inventor: Glenn N. DICKINS , Christopher Graham HINES , David GUNAWAN , Richard J. CARTWRIGHT , Alan J. SEEFELDT , Daniel Arteaga , Mark R.P. THOMAS , Joshua B. LANDO
CPC classification number: H04M9/082 , G10L2015/223 , G10L15/22
Abstract: An audio processing method may involve receiving output signals from each microphone of a plurality of microphones in an audio environment, the output signals corresponding to a current utterance of a person and determining, based on the output signals, one or more aspects of context information relating to the person, including an estimated current proximity of the person to one or more microphone locations. The method may involve selecting two or more loudspeaker-equipped audio devices based, at least in part, on the one or more aspects of the context information, determining one or more types of audio processing changes to apply to audio data being rendered to loudspeaker feed signals for the audio devices and causing one or more types of audio processing changes to be applied. In some examples, the audio processing changes have the effect of increasing a speech to echo ratio at one or more microphones.
-
公开(公告)号:US20220351724A1
公开(公告)日:2022-11-03
申请号:US17626619
申请日:2020-07-29
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Mark R. P. THOMAS , Richard J. CARTWRIGHT
Abstract: A method for selecting a device for audio processing may involve receiving a first wakeword confidence metric from a first device that includes at least a first microphone and receiving a second wakeword confidence metric from a second device that includes at least a second microphone. The first and second wakeword confidence metrics may correspond to a first local maximum of a first plurality of wakeword confidence values determined by the first device and a second local maximum of a second plurality of wakeword confidence values determined by the second device. The method may involve comparing the first wakeword confidence metric and the second wakeword confidence metric and selecting a device for subsequent audio processing based, at least in part, on a comparison of the first wakeword confidence metric and the second wakeword confidence metric.
-
公开(公告)号:US20220345820A1
公开(公告)日:2022-10-27
申请号:US17631024
申请日:2020-07-27
Inventor: Glenn N. DICKINS , Richard J. CARTWRIGHT , David GUNAWAN , Christopher Graham HINES , Mark R. P. THOMAS , Alan J. SEEFELDT , Joshua B. LANDO , Carlos Eduardo Medaglia DYONISIO , Daniel ARTEAGA
Abstract: An audio session management method for an audio environment having multiple audio devices may involve receiving, from a first device implementing a first application and by a device implementing an audio session manager, a first route initiation request to initiate a first route for a first audio session. The first route initiation request may indicate a first audio source and a first audio environment destination. The first audio environment destination may correspond with at least a first person in the audio environment, but in some instances will not indicate an audio device. The method may involve establishing a first route corresponding to the first route initiation request. Establishing the first route may involve determining a first location of at least the first person in the audio environment, determining at least one audio device for a first stage of the first audio session and initiating or scheduling the first audio session.
-
-
-
-
-
-
-
-
-