Schemes for emphasizing talkers in a 2D or 3D conference scene

    公开(公告)号:US09961208B2

    公开(公告)日:2018-05-01

    申请号:US14387301

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/0484

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place a plurality of upstream audio signals (123, 173) associated with a plurality of conference participants within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene with X different spatial talker locations (212) within the conference scene; assign the plurality of upstream audio signals (123, 173) to respective ones of the talker locations (212); determine a degree of activity of the plurality of upstream audio signals (123, 173); determine a dominant one of the plurality of upstream audio signals (123, 173); and emphasize the dominant upstream audio signal (123, 173).

    Clustering of audio streams in a 2D / 3D conference scene
    2.
    发明授权
    Clustering of audio streams in a 2D / 3D conference scene 有权
    2D / 3D会议场景中音频流的聚类

    公开(公告)号:US09420109B2

    公开(公告)日:2016-08-16

    申请号:US14382847

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/048 H04S7/302

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place L upstream audio signals (123, 173) within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene; assign L upstream audio signals (123, 173) to X talker locations (212); determine a maximum number N of downstream audio signals (124, 174) to be transmitted to the listener (211); determine N downstream audio signals (124, 174) from the L assigned upstream audio signals (123, 173); determine N updated talker locations for the N downstream audio signals (124, 174); and generate metadata identifying the updated talker locations and enabling an audio processing unit (121, 171) to generate a spatialized audio signal.

    Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将L上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为建立X点会议场景; 将L个上行音频信号(123,173)分配给X个讲话者位置(212); 确定要发送到收听者(211)的下游音频信号(124,174)的最大数量N; 从所述L个分配的上游音频信号(123,173)确定N个下游音频信号(124,174); 确定N个下游音频信号(124,174)的N个更新的讲话者位置; 并生成识别更新的讲话者位置的元数据,并且使音频处理单元(121,171)能够产生空间化音频信号。

    Schemes for Emphasizing Talkers in a 2D or 3D Conference Scene
    3.
    发明申请
    Schemes for Emphasizing Talkers in a 2D or 3D Conference Scene 有权
    在2D或3D会议场景中强调演讲者的方案

    公开(公告)号:US20150052455A1

    公开(公告)日:2015-02-19

    申请号:US14387301

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/0484

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place a plurality of upstream audio signals (123, 173) associated with a plurality of conference participants within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene with X different spatial talker locations (212) within the conference scene; assign the plurality of upstream audio signals (123, 173) to respective ones of the talker locations (212); determine a degree of activity of the plurality of upstream audio signals (123, 173); determine a dominant one of the plurality of upstream audio signals (123, 173); and emphasize the dominant upstream audio signal (123, 173).

    Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将与多个会议参与者相关联的多个上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为在会议场景内设置具有X个不同空间讲话者位置(212)的X点会议场景; 将多个上游音频信号(123,173)分配给各个讲话者位置(212); 确定所述多个上游音频信号(123,173)的活动程度; 确定多个上游音频信号(123,173)中的主要一个; 并强调主要的上游音频信号(123,173)。

    Clustering of Audio Streams in a 2D / 3D Conference Scene
    4.
    发明申请
    Clustering of Audio Streams in a 2D / 3D Conference Scene 有权
    音频流在2D / 3D会议场景中的聚类

    公开(公告)号:US20150049868A1

    公开(公告)日:2015-02-19

    申请号:US14382847

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/048 H04S7/302

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place L upstream audio signals (123, 173) within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene; assign L upstream audio signals (123, 173) to X talker locations (212); determine a maximum number N of downstream audio signals (124, 174) to be transmitted to the listener (211); determine N downstream audio signals (124, 174) from the L assigned upstream audio signals (123, 173); determine N updated talker locations for the N downstream audio signals (124, 174); and generate metadata identifying the updated talker locations and enabling an audio processing unit (121, 171) to generate a spatialized audio signal.

    Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将L上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为建立X点会议场景; 将L个上行音频信号(123,173)分配给X个讲话者位置(212); 确定要发送到收听者(211)的下游音频信号(124,174)的最大数量N; 从所述L个分配的上游音频信号(123,173)确定N个下游音频信号(124,174); 确定N个下游音频信号(124,174)的N个更新的讲话者位置; 并生成识别更新的讲话者位置的元数据,并且使音频处理单元(121,171)能够产生空间化音频信号。

Patent Agency Ranking