Audio Capture and Render Device Having a Visual Display and User Interface for Audio Conferencing
    11.
    发明申请
    Audio Capture and Render Device Having a Visual Display and User Interface for Audio Conferencing 审中-公开
    具有可视显示和音频会议用户界面的音频捕获和渲染设备

    公开(公告)号:US20160006879A1

    公开(公告)日:2016-01-07

    申请号:US14788963

    申请日:2015-07-01

    CPC classification number: H04M9/085 H04M3/567 H04M3/568

    Abstract: A method in a soundfield-capturing endpoint and the capturing endpoint that comprises a microphone array capturing soundfield, and an input processor pre-processing and performing auditory scene analysis to detect local sound objects and positions, de-clutter the sound objects, and integrate with auxiliary audio signals to form a de-cluttered local auditory scene that has a measure of plausibility and perceptual continuity. The input processor also codes the resulting de-cluttered auditory scene to form coded scene data comprising mono audio and additional scene data to send to others. The endpoint includes an output processor generating signals for a display unit that displays a summary of the de-cluttered local auditory scene and/or a summary of activity in the communication system from received data, the display including a shaped ribbon display element that has an extent with locations on the extent representing locations and other properties of different sound objects.

    Abstract translation: 声场捕获端点中的方法和包括麦克风阵列捕获声场的捕获端点,以及输入处理器预处理和执行听觉场景分析以检测局部声音对象和位置,使声音对象变得杂乱,并与 辅助音频信号,形成一个整洁的本地听觉场景,具有可信度和感知连续性的度量。 输入处理器还对所产生的去混乱的听觉场景进行编码,以形成包括单声道音频和附加场景数据的编码场景数据,以发送给他人。 端点包括输出处理器,用于产生用于显示单元的信号,该显示单元从接收到的数据显示去杂乱的本地听觉场景和/或通信系统中的活动概要,该显示器包括成形的带状显示元件,其具有 范围与表示不同声音对象的位置和其他属性的区域的位置。

    Placement of Sound Signals in a 2D or 3D Audio Conference
    12.
    发明申请
    Placement of Sound Signals in a 2D or 3D Audio Conference 有权
    声音信号在2D或3D音频会议中的放置

    公开(公告)号:US20150055770A1

    公开(公告)日:2015-02-26

    申请号:US14382825

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/048 H04S7/302 H04S2400/11

    Abstract: A conference controller (111, 175) configured to place an upstream audio signal (123, 173) associated with a conference participant and a sound signal within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene with X different spatial talker locations (212) within the conference scene, X being an integer, X>0; assign the upstream audio signal (123, 173) to one of the talker locations (212); place a sound signal at a spatial sound location (503) within the X-point conference scene; and generate metadata identifying the assigned talker location (212) and the spatial sound location and enabling an audio processing unit (121, 171) to generate a spatialized audio signal based on a set of downstream audio signals (124, 174) comprising the upstream audio signal (123, 173) and the sound signal.

    Abstract translation: 被配置为将与会议参与者相关联的上游音频信号(123,173)和声音信号放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为在会议场景内设置具有X个不同空间讲话者位置(212)的X点会议场景,X是整数X> 0; 将所述上游音频信号(123,173)分配给所述讲话者位置(212)中的一个; 在X点会议场景内的空间声音位置(503)放置声音信号; 并且产生识别所分配的讲话者位置(212)和所述空间声音位置的元数据,并且使得音频处理单元(121,171)能够基于包括所述上游音频的一组下游音频信号(124,174)生成空间化音频信号 信号(123,173)和声音信号。

    Audio capture and render device having a visual display and user interface for use for audio conferencing

    公开(公告)号:US10079941B2

    公开(公告)日:2018-09-18

    申请号:US14788963

    申请日:2015-07-01

    CPC classification number: H04M9/085 G10L21/0272 H04M3/567 H04M3/568

    Abstract: A method in a soundfield-capturing endpoint and the capturing endpoint that comprises a microphone array capturing soundfield, and an input processor pre-processing and performing auditory scene analysis to detect local sound objects and positions, de-clutter the sound objects, and integrate with auxiliary audio signals to form a de-cluttered local auditory scene that has a measure of plausibility and perceptual continuity. The input processor also codes the resulting de-cluttered auditory scene to form coded scene data comprising mono audio and additional scene data to send to others. The endpoint includes an output processor generating signals for a display unit that displays a summary of the de-cluttered local auditory scene and/or a summary of activity in the communication system from received data, the display including a shaped ribbon display element that has an extent with locations on the extent representing locations and other properties of different sound objects.

    Placement of sound signals in a 2D or 3D audio conference

    公开(公告)号:US09654644B2

    公开(公告)日:2017-05-16

    申请号:US14382825

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/048 H04S7/302 H04S2400/11

    Abstract: A conference controller (111, 175) configured to place an upstream audio signal (123, 173) associated with a conference participant and a sound signal within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene with X different spatial talker locations (212) within the conference scene, X being an integer, X>0; assign the upstream audio signal (123, 173) to one of the talker locations (212); place a sound signal at a spatial sound location (503) within the X-point conference scene; and generate metadata identifying the assigned talker location (212) and the spatial sound location and enabling an audio processing unit (121, 171) to generate a spatialized audio signal based on a set of downstream audio signals (124, 174) comprising the upstream audio signal (123, 173) and the sound signal.

    Clustering of audio streams in a 2D / 3D conference scene
    17.
    发明授权
    Clustering of audio streams in a 2D / 3D conference scene 有权
    2D / 3D会议场景中音频流的聚类

    公开(公告)号:US09420109B2

    公开(公告)日:2016-08-16

    申请号:US14382847

    申请日:2013-03-21

    CPC classification number: H04M3/568 G06F3/048 H04S7/302

    Abstract: The present document relates to methods and systems for setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place L upstream audio signals (123, 173) within a 2D or 3D conference scene to be rendered to a listener (211) is described. The conference controller (111, 175) is configured to set up a X-point conference scene; assign L upstream audio signals (123, 173) to X talker locations (212); determine a maximum number N of downstream audio signals (124, 174) to be transmitted to the listener (211); determine N downstream audio signals (124, 174) from the L assigned upstream audio signals (123, 173); determine N updated talker locations for the N downstream audio signals (124, 174); and generate metadata identifying the updated talker locations and enabling an audio processing unit (121, 171) to generate a spatialized audio signal.

    Abstract translation: 本文件涉及用于设置和管理用于音频会议的二维或三维场景的方法和系统。 被配置为将L上游音频信号(123,173)放置在要呈现给收听者(211)的2D或3D会议场景内的会议控制器(111,175)。 会议控制器(111,175)被配置为建立X点会议场景; 将L个上行音频信号(123,173)分配给X个讲话者位置(212); 确定要发送到收听者(211)的下游音频信号(124,174)的最大数量N; 从所述L个分配的上游音频信号(123,173)确定N个下游音频信号(124,174); 确定N个下游音频信号(124,174)的N个更新的讲话者位置; 并生成识别更新的讲话者位置的元数据,并且使音频处理单元(121,171)能够产生空间化音频信号。

    Nearby talker obscuring, duplicate dialogue amelioration and automatic muting of acoustically proximate participants

    公开(公告)号:US10142484B2

    公开(公告)日:2018-11-27

    申请号:US15549581

    申请日:2016-02-08

    Abstract: In an audio conferencing environment, including multiple users participating by means of a series of associated audio input devices for the provision of audio input, and a series of audio output devices for the output of audio output streams to the multiple users, with the audio input and output devices being interconnected to a mixing control server for the control and mixing of the audio inputs from each audio input devices to present a series of audio streams to the audio output devices, a method of reducing the effects of cross talk pickup of at least a first audio conversation by multiple audio input devices, the method including the steps of: (a) monitoring the series of audio input devices for the presence of a duplicate audio conversation input from at least two input audio sources in an audio output stream; and (b) where a duplicate audio conversation input is detected, suppressing the presence of the duplicate audio conversation input in the audio output stream.

    Placement of talkers in 2D or 3D conference scene

    公开(公告)号:US09749473B2

    公开(公告)日:2017-08-29

    申请号:US14384780

    申请日:2013-03-21

    CPC classification number: H04M3/568 H04S5/00 H04S2400/11

    Abstract: The present document relates to setting up and managing two-dimensional or three-dimensional scenes for audio conferences. A conference controller (111, 175) configured to place an upstream audio signal (123, 173) associated with a conference participant within a 2D or 3D conference scene to be rendered to a listener (211) is described. An X-point conference scene with X different spatial talker locations (212) is set up within the conference scene, wherein the X talker locations (212) are positioned within a cone around a midline (215) in front of a head of the listener (211). A generatrix (216) of the cone and the midline (215) form an angle which is smaller than or equal to a pre-determined maximum cone angle. The upstream audio signal (123, 173) is assigned to one of the talker locations (212) and metadata identifying the assigned talker location (212) are generated, thus enabling a spatialized audio signal.

Patent Agency Ranking