Talker collisions in an auditory scene
    21.
    发明授权
    Talker collisions in an auditory scene 有权
    讲话者在听觉场合的碰撞

    公开(公告)号:US09502047B2

    公开(公告)日:2016-11-22

    申请号:US14373336

    申请日:2013-03-21

    Abstract: From a plurality of received voice signals, a signal interval in which there is a talker collision between at least a first and a second voice signal is detected. A processor receives a positive detection result and processes, in response to this, at least one of the voice signals with the aim of making it perceptually distinguishable. A mixer mixes the voice signals to supply an output signal, wherein the processed signal(s) replaces the corresponding received signals. In example embodiments, signal content is shifted away from the talker collision in frequency or in time. The invention may be useful in a conferencing system.

    Abstract translation: 从多个接收到的语音信号中检测到在至少第一和第二语音信号之间存在讲话者冲突的信号间隔。 处理器接收到正的检测结果,并且响应于此,处理至少一个语音信号,目的在于使其感知上可区分。 混合器混合语音信号以提供输出信号,其中所处理的信号替换相应的接收信号。 在示例实施例中,信号内容在频率或时间上偏离说话人的冲突。 本发明在会议系统中可能是有用的。

    Matching Reverberation in Teleconferencing Environments
    22.
    发明申请
    Matching Reverberation in Teleconferencing Environments 有权
    电话会议环境中的匹配混响

    公开(公告)号:US20150092950A1

    公开(公告)日:2015-04-02

    申请号:US14489907

    申请日:2014-09-18

    Abstract: A system and method of matching reverberation in teleconferencing environments. When the two ends of a conversation are in environments with differing reverberations, the method filters the reverberation so that when both signals are output at the near end (e.g., the audio signal from the far end and the sidetone from the near end), the reverberations match. In this manner, the user does not perceive an annoying difference in reverberations, and the user experience is improved.

    Abstract translation: 在电话会议环境中匹配混响的系统和方法。 当对话的两端处于不同混响的环境中时,该方法对混响进行滤波,以便当两端信号在近端输出(例如,远端的音频信号和近端的侧音)时, 混响匹配。 以这种方式,用户不会感觉到混响中的烦人的差异,并且提高了用户体验。

    Schemes for Emphasizing Talkers in a 2D or 3D Conference Scene
    23.
    发明申请
    Schemes for Emphasizing Talkers in a 2D or 3D Conference Scene 有权
    在2D或3D会议场景中强调演讲者的方案

    公开(公告)号:US20150052455A1

    公开(公告)日:2015-02-19

    申请号:US14387301

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/0484

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place a plurality of upstream audio signals (123, 173) associated with a plurality of conference participants within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene with X different spatial talker locations (212) within the conference scene; assign the plurality of upstream audio signals (123, 173) to respective ones of the talker locations (212); determine a degree of activity of the plurality of upstream audio signals (123, 173); determine a dominant one of the plurality of upstream audio signals (123, 173); and emphasize the dominant upstream audio signal (123, 173).

    Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将与多个会议参与者相关联的多个上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为在会议场景内设置具有X个不同空间讲话者位置(212)的X点会议场景; 将多个上游音频信号(123,173)分配给各个讲话者位置(212); 确定所述多个上游音频信号(123,173)的活动程度; 确定多个上游音频信号(123,173)中的主要一个; 并强调主要的上游音频信号(123,173)。

    Clustering of Audio Streams in a 2D / 3D Conference Scene
    24.
    发明申请
    Clustering of Audio Streams in a 2D / 3D Conference Scene 有权
    音频流在2D / 3D会议场景中的聚类

    公开(公告)号:US20150049868A1

    公开(公告)日:2015-02-19

    申请号:US14382847

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/048 H04S7/302

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place L upstream audio signals (123, 173) within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene; assign L upstream audio signals (123, 173) to X talker locations (212); determine a maximum number N of downstream audio signals (124, 174) to be transmitted to the listener (211); determine N downstream audio signals (124, 174) from the L assigned upstream audio signals (123, 173); determine N updated talker locations for the N downstream audio signals (124, 174); and generate metadata identifying the updated talker locations and enabling an audio processing unit (121, 171) to generate a spatialized audio signal.

    Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将L上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为建立X点会议场景; 将L个上行音频信号(123,173)分配给X个讲话者位置(212); 确定要发送到收听者(211)的下游音频信号(124,174)的最大数量N; 从所述L个分配的上游音频信号(123,173)确定N个下游音频信号(124,174); 确定N个下游音频信号(124,174)的N个更新的讲话者位置; 并生成识别更新的讲话者位置的元数据,并且使音频处理单元(121,171)能够产生空间化音频信号。

Patent Agency Ranking