-
公开(公告)号:US11803351B2
公开(公告)日:2023-10-31
申请号:US17601199
申请日:2020-04-03
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Glenn N. Dickins , Feng Deng , Michael Eckert , Craig Johnston , Paul Holmberg
IPC: G06F3/16 , H04L65/403 , H04M3/56
CPC classification number: G06F3/165 , H04L65/403 , H04M3/568
Abstract: A communication system, method, and computer-readable medium therefor comprise a media server configured to receive a plurality of audio streams from a corresponding plurality of client devices, the media server including circuitry configured to rank the plurality of audio streams based on a predetermined metric, group a first portion of the plurality of audio streams into a first set, the first portion of the plurality of audio streams being the N highest-ranked audio streams, group a second portion of the plurality of audio streams into a second set, the second portion of the plurality of audio streams being the M lowest-ranked audio streams, forward respective audio streams of the first set to a receiver device, and discard respective audio streams of the second set, wherein N and M are independent integers.
-
2.
公开(公告)号:US20150049868A1
公开(公告)日:2015-02-19
申请号:US14382847
申请日:2013-03-21
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Paul Boustead , Craig Johnston , Vinh Nguyen , Gary Spittle
Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place L upstream audio signals (123, 173) within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene; assign L upstream audio signals (123, 173) to X talker locations (212); determine a maximum number N of downstream audio signals (124, 174) to be transmitted to the listener (211); determine N downstream audio signals (124, 174) from the L assigned upstream audio signals (123, 173); determine N updated talker locations for the N downstream audio signals (124, 174); and generate metadata identifying the updated talker locations and enabling an audio processing unit (121, 171) to generate a spatialized audio signal.
Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将L上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为建立X点会议场景; 将L个上行音频信号(123,173)分配给X个讲话者位置(212); 确定要发送到收听者(211)的下游音频信号(124,174)的最大数量N; 从所述L个分配的上游音频信号(123,173)确定N个下游音频信号(124,174); 确定N个下游音频信号(124,174)的N个更新的讲话者位置; 并生成识别更新的讲话者位置的元数据,并且使音频处理单元(121,171)能够产生空间化音频信号。
-
公开(公告)号:US09552827B2
公开(公告)日:2017-01-24
申请号:US14409948
申请日:2013-06-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Craig Johnston , Dong Shi , Xuejing Sun , Glenn N. Dickins
IPC: H04B3/20 , G10L21/0208 , H04M9/08 , G10L21/0232
CPC classification number: G10L21/0208 , G10L21/0232 , G10L2021/02082 , H04M9/082
Abstract: A method (800) for determining an estimate (215, 261) of an echo path property of an electronic device (200, 250, 300, 600). The electronic device is configured to render a total audio signal using a loudspeaker (102), and the electronic device is configured to record an echo of the rendered audio signal using a microphone (103), thereby yielding a recorded audio signal (112). The method comprises inserting (801), in an inaudible manner, an auxiliary audio signal (212) into the total audio signal to be rendered; wherein the auxiliary audio signal (212) comprises a tonal audio signal at a first frequency; isolating (803) the echo of the auxiliary audio signal (212) from the recorded audio signal (112); and determining (804) the estimate (215, 261) of the echo path property based on the inserted auxiliary audio signal (212) and based on the isolated echo of the auxiliary audio signal (212).
Abstract translation: 一种用于确定电子设备(200,250,300,600)的回波路径属性的估计(215,261)的方法(800)。 电子设备被配置为使用扬声器(102)呈现总音频信号,并且电子设备被配置为使用麦克风(103)记录所呈现的音频信号的回声,从而产生记录的音频信号(112)。 该方法包括以无法听见的方式将辅助音频信号(212)插入(801)到要渲染的总音频信号中; 其中所述辅助音频信号(212)包括第一频率的音调音频信号; 从所记录的音频信号(112)隔离(803)辅助音频信号(212)的回波; 以及基于所插入的辅助音频信号(212)并且基于所述辅助音频信号(212)的隔离回波来确定(804)所述回波路径属性的估计(215,261)。
-
4.
公开(公告)号:US09420109B2
公开(公告)日:2016-08-16
申请号:US14382847
申请日:2013-03-21
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Paul Boustead , Craig Johnston , Vinh Nguyen , Gary Spittle
Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place L upstream audio signals (123, 173) within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene; assign L upstream audio signals (123, 173) to X talker locations (212); determine a maximum number N of downstream audio signals (124, 174) to be transmitted to the listener (211); determine N downstream audio signals (124, 174) from the L assigned upstream audio signals (123, 173); determine N updated talker locations for the N downstream audio signals (124, 174); and generate metadata identifying the updated talker locations and enabling an audio processing unit (121, 171) to generate a spatialized audio signal.
Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将L上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为建立X点会议场景; 将L个上行音频信号(123,173)分配给X个讲话者位置(212); 确定要发送到收听者(211)的下游音频信号(124,174)的最大数量N; 从所述L个分配的上游音频信号(123,173)确定N个下游音频信号(124,174); 确定N个下游音频信号(124,174)的N个更新的讲话者位置; 并生成识别更新的讲话者位置的元数据,并且使音频处理单元(121,171)能够产生空间化音频信号。
-
公开(公告)号:US09445053B2
公开(公告)日:2016-09-13
申请号:US14166065
申请日:2014-01-28
Inventor: Richard J Cartwright , Craig Johnston , Glenn N Dickins , Heiko Purnhagen
IPC: H04N7/14 , H04N7/15 , G10L19/008 , H04M3/56 , G10L25/78
CPC classification number: H04N7/152 , G10L19/008 , G10L25/78 , H04M3/567 , H04M3/569
Abstract: A conferencing server (100) receives incoming bitstreams (I1, I2, I3, I4, I5) carrying media data from respective conferencing endpoints (110, 120, 130, 140, 150); receives a mixing strategy (M) specifying properties of at least one outgoing bitstream (O1, O2, O3, O4, O5) and requiring at least one additive media mixing step; and supplies at least one outgoing bitstream by executing, in a processor (103) and a memory (102) with a plurality of memory spaces, a run list of operations selected from a predefined collection of primitives and realizing the received mixing strategy. A pre-processor (104) in the server derives said run list repeatedly and dynamically while taking into consideration determined momentary activity in each incoming bitstream. In embodiments, the run list may be derived by (a) pruning of an initial run list, (b) constrained or non-constrained minimization of a cost function, or (c) automatic code generation.
Abstract translation: 会议服务器(100)从相应的会议端点(110,120,130,140,150)接收携带媒体数据的传入比特流(I1,I2,I3,I4,I5); 接收指定至少一个传出比特流(O1,O2,O3,O4,O5)的属性并需要至少一个添加介质混合步骤的混合策略(M) 并且通过在具有多个存储器空间的处理器(103)和存储器(102)中执行从预定义的图元集合中选择的操作的运行列表并实现所接收的混合策略来提供至少一个输出比特流。 服务器中的预处理器(104)在考虑每个输入比特流中确定的瞬时活动的同时,重复地和动态地导出所述运行列表。 在实施例中,可以通过(a)修剪初始运行列表,(b)成本函数的约束或非约束最小化,或(c)自动代码生成来导出运行列表。
-
公开(公告)号:US20140240447A1
公开(公告)日:2014-08-28
申请号:US14166065
申请日:2014-01-28
Inventor: Richard J. Cartwright , Craig Johnston , Glenn N. Dickins , Heiko Purnhagen
IPC: H04N7/15
CPC classification number: H04N7/152 , G10L19/008 , G10L25/78 , H04M3/567 , H04M3/569
Abstract: A conferencing server (100) receives incoming bitstreams (I1, I2, I3, I4, I5) carrying media data from respective conferencing endpoints (110, 120, 130, 140, 150); receives a mixing strategy (M) specifying properties of at least one outgoing bitstream (O1, O2, O3, O4, O5) and requiring at least one additive media mixing step; and supplies at least one outgoing bitstream by executing, in a processor (103) and a memory (102) with a plurality of memory spaces, a run list of operations selected from a predefined collection of primitives and realizing the received mixing strategy. A pre-processor (104) in the server derives said run list repeatedly and dynamically while taking into consideration determined momentary activity in each incoming bitstream. In embodiments, the run list may be derived by (a) pruning of an initial run list, (b) constrained or non-constrained minimization of a cost function, or (c) automatic code generation.
Abstract translation: 会议服务器(100)从相应的会议端点(110,120,130,140,150)接收携带媒体数据的传入比特流(I1,I2,I3,I4,I5); 接收指定至少一个传出比特流(O1,O2,O3,O4,O5)的属性并需要至少一个添加介质混合步骤的混合策略(M) 并且通过在具有多个存储器空间的处理器(103)和存储器(102)中执行从预定义的图元集合中选择的操作的运行列表并实现所接收的混合策略来提供至少一个输出比特流。 服务器中的预处理器(104)在考虑每个输入比特流中确定的瞬时活动的同时,重复地和动态地导出所述运行列表。 在实施例中,可以通过(a)修剪初始运行列表,(b)成本函数的约束或非约束最小化,或(c)自动代码生成来导出运行列表。
-
-
-
-
-