NETWORK ENTITY, METHOD AND COMPUTER PROGRAM PRODUCT FOR MIXING SIGNALS DURING A CONFERENCE SESSION
    1.
    发明申请
    NETWORK ENTITY, METHOD AND COMPUTER PROGRAM PRODUCT FOR MIXING SIGNALS DURING A CONFERENCE SESSION 有权
    在会议期间混合信号的网络实体,方法和计算机程序产品

    公开(公告)号:US20080162127A1

    公开(公告)日:2008-07-03

    申请号:US11616351

    申请日:2006-12-27

    IPC分类号: G10L15/08

    摘要: A network entity, method and computer program product are provided for effectuating a conference session. The method may include receiving a plurality of signals representative of voice communication of the participants. In this regard, the signals may be received from a plurality of terminals of a respective plurality of participants at one of the locations, each of at least some of the terminals otherwise being configured for voice communication independent of at least some of the other terminals. The method of this aspect also includes classifying speech activity of the conference session according to a speech pause, or one or more actively-speaking participants, during the conference session. The signals of the respective participants may then be mixed into a at least one mixed signal for output to one or more other participants at one or more other locations, the signals being mixed based upon classification of the speech activity.

    摘要翻译: 提供网络实体,方法和计算机程序产品用于实现会议会话。 该方法可以包括接收代表参与者的语音通信的多个信号。 在这点上,信号可以从一个位置处的相应多个参与者的多个终端接收,至少一些终端中的每一个终端被配置为独立于至少一些其他终端的语音通信。 该方面的方法还包括在会议会话期间根据语音暂停或一个或多个主动参与的参与者对会议的语音活动进行分类。 然后可以将各参与者的信号混合成至少一个混合信号,以输出到一个或多个其他位置处的一个或多个其他参与者,该信号基于语音活动的分类进行混合。

    Network entity, method and computer program product for mixing signals during a conference session
    2.
    发明授权
    Network entity, method and computer program product for mixing signals during a conference session 有权
    在会议期间混合信号的网络实体,方法和计算机程序产品

    公开(公告)号:US08218460B2

    公开(公告)日:2012-07-10

    申请号:US11616351

    申请日:2006-12-27

    IPC分类号: H04L12/16 H04Q11/00

    摘要: A network entity, method and computer program product are provided for effectuating a conference session. The method may include receiving a plurality of signals representative of voice communication of the participants. In this regard, the signals may be received from a plurality of terminals of a respective plurality of participants at one of the locations, each of at least some of the terminals otherwise being configured for voice communication independent of at least some of the other terminals. The method of this aspect also includes classifying speech activity of the conference session according to a speech pause, or one or more actively-speaking participants, during the conference session. The signals of the respective participants may then be mixed into a at least one mixed signal for output to one or more other participants at one or more other locations, the signals being mixed based upon classification of the speech activity.

    摘要翻译: 提供网络实体,方法和计算机程序产品用于实现会议会话。 该方法可以包括接收代表参与者的语音通信的多个信号。 在这点上,信号可以从一个位置处的相应多个参与者的多个终端接收,至少一些终端中的每一个终端被配置为独立于至少一些其他终端的语音通信。 该方面的方法还包括在会议会话期间根据语音暂停或一个或多个主动参与的参与者对会议的语音活动进行分类。 然后可以将各参与者的信号混合成至少一个混合信号,以输出到一个或多个其他位置处的一个或多个其他参与者,该信号基于语音活动的分类进行混合。

    DISTRIBUTED TELECONFERENCE MULTICHANNEL ARCHITECTURE, SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT
    3.
    发明申请
    DISTRIBUTED TELECONFERENCE MULTICHANNEL ARCHITECTURE, SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT 审中-公开
    分布式电信多媒体架构,系统,方法和计算机程序产品

    公开(公告)号:US20080159507A1

    公开(公告)日:2008-07-03

    申请号:US11616638

    申请日:2006-12-27

    IPC分类号: H04M3/42

    摘要: Provided are multichannel architectures, systems, methods, and computer program products for distributed teleconferencing using one or more master devices and/or a centralized conferencing switch. Multichannels enhance functionality of a master device in distributed teleconferencing and allow for compatibility with 3D capable teleconferencing. Multichannel distributed teleconferencing involves multichannel, monophonic, and/or a fixed number of uplink and downlink channels. A multichannel distributed teleconferencing system may perform active talker detection of near-end participants and communicate an ID signal on an uplink channel identifying the active near-end participants. A multichannel distributed teleconferencing system may also receive an ID signal on a downlink channel identifying the active far-end participants. A multichannel distributed teleconferencing system may perform various uplink and downlink processing. Uplink processing may involve multimixing and spatialization. Multimixing may be used to separate speech signals of near-end participants. Spatialization, also used in downlink processing, introduces spatial separation of active participants.

    摘要翻译: 提供了使用一个或多个主设备和/或集中式会议交换机的分布式电话会议的多通道架构,系统,方法和计算机程序产品。 多通道增强分布式电话会议中主设备的功能,并允许与3D功能的电话会议兼容。 多通道分布式电话会议涉及多通道,单声道和/或固定数量的上行链路和下行链路信道。 多通道分布式电话会议系统可以执行近端参与者的主动讲话者检测,并且在标识活跃的近端参与者的上行链路信道上传送ID信号。 多通道分布式电话会议系统还可以在识别活动的远端参与者的下行链路信道上接收ID信号。 多通道分布式电话会议系统可以执行各种上行链路和下行链路处理。 上行链路处理可能涉及多重混合和空间化。 多重混合可用于分离近端参与者的语音信号。 也用于下行处理的空间化引入了主动参与者的空间分离。

    Artificial Bandwidth Expansion Method For A Multichannel Signal
    4.
    发明申请
    Artificial Bandwidth Expansion Method For A Multichannel Signal 审中-公开
    多通道信号的人造带宽扩展方法

    公开(公告)号:US20080004866A1

    公开(公告)日:2008-01-03

    申请号:US11427856

    申请日:2006-06-30

    IPC分类号: G10L19/14

    CPC分类号: G10L21/038 G10L19/008

    摘要: Techniques for applying artificial bandwidth expansion to a multichannel signal are described. Aspects of a system for applying artificial bandwidth expansion to a multichannel signal include an estimation component for receiving a multichannel signal and estimating delay and energy level differences for each channel of the multichannel signal. An artificial bandwidth expansion component artificially expands the bandwidth of each of the channels of the multichannel signal separately. Each one of a plurality of adjustment components are configured to modify a different one of the artificial bandwidth expanded channels of the multichannel signal based upon the estimated delay and energy level differences. The multichannel signal may be a binaural speech signal.

    摘要翻译: 描述了将人造带宽扩展应用于多信道信号的技术。 用于将人造带宽扩展应用于多信道信号的系统的方面包括用于接收多信道信号并估计多信道信号的每个信道的延迟和能级差的估计组件。 人造带宽扩展组件分别人工地扩展多信道信号的每个信道的带宽。 多个调整组件中的每一个被配置为基于所估计的延迟和能量水平差来修改多通道信号中的不同的一个人造带宽扩展通道。 多声道信号可以是双耳语音信号。

    Positional disambiguation in spatial audio
    5.
    发明授权
    Positional disambiguation in spatial audio 有权
    空间音频中的位置消歧

    公开(公告)号:US09351070B2

    公开(公告)日:2016-05-24

    申请号:US13380514

    申请日:2009-06-30

    摘要: A method including: obtaining phase information dependent upon a time-varying phase difference between captured audio channels; obtaining sampling information relating to time-varying spatial sampling of the captured audio channels; and processing the phase information and the sampling information to determine audio control information for controlling spatial rendering of the captured audio channels.

    摘要翻译: 一种方法,包括:取决于捕获的音频通道之间的时变相位差的相位信息; 获取与所捕获的音频频道的时变空间采样相关的采样信息; 以及处理相位信息和采样信息以确定用于控制所捕获的音频频道的空间渲染的音频控制信息。

    Time scaling of multi-channel audio signals
    6.
    发明授权
    Time scaling of multi-channel audio signals 失效
    多声道音频信号的时间缩放

    公开(公告)号:US07647229B2

    公开(公告)日:2010-01-12

    申请号:US11584011

    申请日:2006-10-18

    IPC分类号: G10L21/00

    摘要: A method and related apparatus comprising: buffering an encoded audio input signal comprising at least one combined signal of a plurality of audio channels and one or more corresponding sets of side information parameters describing a multi-channel sound image; changing the length of at least one audio frame of said combined signal by adding or removing a segment of said combined signal; modifying said one or more sets of side information parameters with a change corresponding to the change in the length of said at least one audio frame of said combined signal; and transferring said at least one audio frame of said combined signal with a changed length and said modified one or more sets of side information parameters to a further processing unit.

    摘要翻译: 一种方法和相关装置,包括:缓冲编码的音频输入信号,其包括多个音频通道的至少一个组合信号和描述多声道声音图像的一个或多个对应的侧面信息参数组; 通过添加或去除所述组合信号的段来改变所述组合信号的至少一个音频帧的长度; 用对应于所述组合信号的所述至少一个音频帧的长度变化的改变来修改所述一组或多组侧信息参数; 以及将具有改变长度的所述组合信号的所述至少一个音频帧和所述修改的一组或多组侧信息参数传送到另一处理单元。

    Automatic participant placement in conferencing
    7.
    发明申请
    Automatic participant placement in conferencing 审中-公开
    自动参与者放置在会议中

    公开(公告)号:US20070263823A1

    公开(公告)日:2007-11-15

    申请号:US11393685

    申请日:2006-03-31

    IPC分类号: H04M3/42

    摘要: Techniques for positioning participants of a conference call in a three dimensional (3D) audio space are described. Aspects of a system for positioning include a client component that extracts speech frames of a currently speaking participant of a conference call from a transmission signal. A speech analysis component determines a voice fingerprint of the currently speaking participant based upon any of a number of factors, such as a pitch value of the participant. A control component determines a category position of the currently speaking participant in a three dimensional audio space based upon the voice fingerprint. An audio engine outputs audio signals of the speech frame based upon the determined category position of the currently speaking participant. The category position of one or more participants may be changed as new participants are added to the conference call.

    摘要翻译: 描述用于在三维(3D)音频空间中定位电话会议的参与者的技术。 用于定位的系统的方面包括从传输信号提取会议呼叫的当前会话的参与者的语音帧的客户端组件。 语音分析组件基于诸如参与者的音调值的多个因素中的任何一个来确定当前说话的参与者的语音指纹。 控制组件基于语音指纹确定三维音频空间中当前讲话的参与者的类别位置。 音频引擎基于当前说话的参与者的确定的类别位置来输出语音帧的音频信号。 一个或多个参与者的类别位置可以随着新的参与者被添加到电话会议而改变。

    Spatialization arrangement for conference call
    8.
    发明申请
    Spatialization arrangement for conference call 失效
    电话会议空间化安排

    公开(公告)号:US20070025538A1

    公开(公告)日:2007-02-01

    申请号:US11179347

    申请日:2005-07-11

    IPC分类号: H04M3/42

    摘要: A method for distinguishing speakers in a conference call of a plurality of participants, in which method speech frames of the conference call are received in a receiving unit, which speech frames include encoded speech parameters. At least one parameter of the received speech frames is examined in an audio codec of the receiving unit, and the speech frames are classified to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter. These functions may be carried out in a speaker identification block, which is applicable in various positions of a teleconferencing processing chain. Finally, a spatialization effect is created in a terminal reproducing the audio signal according to notified differences by placing the participants at distinct positions in an acoustical space of the audio signal.

    摘要翻译: 一种用于区分多个参与者的电话会议中的扬声器的方法,其中在接收单元中接收会议呼叫的语音帧,哪些语音帧包括编码语音参数。 在接收单元的音频编解码器中检查接收到的语音帧的至少一个参数,并且将语音帧分类为属于其中一个参与者,根据所检查的至少一个语音参数的差异进行分类 。 这些功能可以在可应用于电话会议处理链的各种位置的扬声器识别块中进行。 最后,通过将参与者放置在音频信号的声音空间中的不同位置,根据所通知的差异在终端再生音频信号的终端中产生空间化效果。

    Method, apparatus and computer program product for utilizing spatial information for audio signal enhancement in a distributed network environment

    公开(公告)号:US08457328B2

    公开(公告)日:2013-06-04

    申请号:US12107491

    申请日:2008-04-22

    IPC分类号: H04B1/00

    CPC分类号: H04M3/56

    摘要: An apparatus for utilizing spatial information for audio signal enhancement in a multiple distributed network may include a processor. The processor may be configured to receive representations of a plurality of audio signals including at least one audio signal received at a first device and at least a second audio signal received at a second device. The first and second devices may be part of a common acoustic space network and may be arbitrarily positioned with respect to each other. The processor may be further configured to combine the first and second audio signals to form a composite audio signal, and provide for communication of the composite audio signal along with spatial information relating to a sound source of at least one of the plurality of audio signals to another device.