-
公开(公告)号:US11076052B2
公开(公告)日:2021-07-27
申请号:US15548265
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. Cartwright , Xuejing Sun
Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. In some examples, only a portion of the received audio data will be selected as playback audio data. The selection process may involve a topic selection process, a talkspurt filtering process and/or an acoustic feature selection process. Some examples involve receiving an indication of a target playback time duration. Selecting the portion of audio data may involve making a time duration of the playback audio data within a threshold time difference of the target playback time duration.
-
公开(公告)号:US10812401B2
公开(公告)日:2020-10-20
申请号:US16084932
申请日:2017-03-16
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Richard J. Cartwright , Hannes Muesch
IPC: H04L12/841 , H04L29/06 , H04L1/20 , H04L1/00 , H04L12/26 , H04L12/835 , H04L12/939
Abstract: Disclosed is an apparatus and method operative to receive packets of media from a network including a receiver unit operative to receive the packets from the network, a jitter buffer data structure for receiving the packets in an ordered queue, the jitter buffer data structure having a tail into which the packets are input; a plurality of heads defining points in the jitter buffer data structure from which the ordered queue of packets are to be played back, the heads comprise an adjustable actual playback head coupled to an actual playback unit and at least one prototype head, each prototype head having associated therewith a target latency a processor having decision logic operable to determine a cost of achieving the associated target latency for each prototype head, wherein the decision logic compares the costs determined for each prototype head to identify a particular target latency and head location for the actual playback head of the buffer and a playback unit coupled to the processor for actual playback of the playback head of the buffer, such that the particular target latency of the jitter buffer data structure is determined at playback of the buffer rather than upon input of the packets into the jitter buffer data structure.
-
公开(公告)号:US10728688B2
公开(公告)日:2020-07-28
申请号:US16424409
申请日:2019-05-28
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Glenn N. Dickins , Richard J. Cartwright
Abstract: Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.
-
公开(公告)号:US10522151B2
公开(公告)日:2019-12-31
申请号:US15546109
申请日:2016-02-03
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. Cartwright , Kai Li , Xuejing Sun
IPC: G10L17/00 , G06N20/00 , G06F16/61 , G06F16/68 , H04M3/42 , H04M3/56 , G10L25/48 , G06F17/27 , G10L17/02 , G10L25/78 , G10L15/26
Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve analyzing conversational dynamics of the conference recording. Some examples may involve searching the conference recording to determine instances of segment classifications. The segment classifications may be based, at least in part, on conversational dynamics data. Some implementations may involve segmenting the conference recording into a plurality of segments, each of the segments corresponding with a time interval and at least one of the segment classifications. Some implementations allow a listener to scan through a conference recording quickly according to segments, words, topics and/or talkers of interest.
-
公开(公告)号:US10362420B2
公开(公告)日:2019-07-23
申请号:US16009154
申请日:2018-06-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. Cartwright , David S. McGrath , Glenn N. Dickins
Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
-
公开(公告)号:US10009475B2
公开(公告)日:2018-06-26
申请号:US15121744
申请日:2015-02-17
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. Cartwright
Abstract: In an audio teleconference mixing system, of the type mixing a first plurality of audio uplink input streams containing audio information including sensed audio and associated control information, to produce at least one audio downlink output stream for downlinking to at least one conference participants, wherein the audio uplink input streams potentially can include continuous transmission (CTX) and discontinuous transmission streams (DTX), a method of mixing multiple current audio uplink streams together to produce the at least one audio output stream, the method including the steps of: (a) determining a verbosity measure indicative of the likely importance of each current audio uplink streams; (b) where at least one current audio uplink stream can comprise a CTX stream, utilizing at least one CTX stream in the mix to produce at least one current downlink output stream.
-
公开(公告)号:US09979829B2
公开(公告)日:2018-05-22
申请号:US14776322
申请日:2014-03-13
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Richard J. Cartwright
CPC classification number: H04M3/568 , H04S7/304 , H04S2400/11 , H04S2420/01
Abstract: Embodiments are described for a soundfield system that receives a transmitting soundfield, wherein the transmitting soundfield includes a sound source at a location in the transmitting soundfield. The system determines a rotation angle for rotating the transmitting soundfield based on a desired location for the sound source. The transmitting soundfield is rotated by the determined angle and the system obtains a listener's soundfield based on the rotated transmitting soundfield. The listener's soundfield is transmitted for rendering to a listener.
-
-
-
-
-
-