Hierarchical Active Voice Detection
    1.
    Patent application
    Legal status: in force

    Publication No.: US20150051906A1

    Publication date: 2015-02-19

    Application No.: US14386304

    Filing date: 2013-03-21

    CPC classification number: G10L25/78 G10L19/173 G10L2025/786

    Abstract: One or more audio signals are processed using a multi-stage (hierarchical) voice and/or signal activity detector (VAD/SAD). A first stage is capable of reducing the workload bandwidth by employing an inexpensive VAD/SAD processor. One or more subsequent stages may further process the audio signals from the first stage. Other implementations may include a first stage that also performs continuity preservation between last blocks of audio signal and the first blocks of audio after it is detected that relevant audio signals are resumed. In yet other implementations, the first stage may extract features from audio signals when they are presented in their coded domain, and possibly with little or no decoding of the audio signal.
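The staged arrangement described in this abstract can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the patent: the block-energy threshold used as the inexpensive first stage, the zero-crossing check standing in for a costlier later stage, and all function and parameter names. Continuity preservation and coded-domain feature extraction are not modeled.

```python
import math
from typing import Callable, List, Sequence, Tuple

Block = Sequence[float]


def block_energy_db(block: Block) -> float:
    """Mean power of a block in dB relative to a full scale of 1.0."""
    power = sum(x * x for x in block) / max(len(block), 1)
    return 10.0 * math.log10(power + 1e-12)


def cheap_stage(block: Block, threshold_db: float = -40.0) -> bool:
    """Hypothetical first stage: a plain energy gate (an assumption)."""
    return block_energy_db(block) > threshold_db


def zero_crossing_stage(block: Block, max_rate: float = 0.3) -> bool:
    """Hypothetical later stage: reject blocks whose zero-crossing rate is high."""
    crossings = sum(1 for a, b in zip(block, block[1:]) if (a < 0) != (b < 0))
    return crossings / max(len(block) - 1, 1) <= max_rate


def hierarchical_vad(
    blocks: Sequence[Block],
    second_stage: Callable[[Block], bool],
    threshold_db: float = -40.0,
) -> Tuple[List[bool], int]:
    """Run the cheap gate on every block; only survivors reach second_stage.

    Returns the per-block decisions and the number of blocks forwarded,
    which shows how much work the first stage removed from later stages.
    """
    decisions: List[bool] = []
    forwarded = 0
    for block in blocks:
        if not cheap_stage(block, threshold_db):
            decisions.append(False)  # dropped early, later stage never runs
            continue
        forwarded += 1
        decisions.append(bool(second_stage(block)))
    return decisions, forwarded
```

Calling `hierarchical_vad(blocks, zero_crossing_stage)` on a list of sample blocks returns the decisions together with the count of blocks that reached the second stage, so the reduction in downstream workload can be inspected directly.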

    Spatial comfort noise
    2.
    Granted patent

    Publication No.: US10224046B2

    Publication date: 2019-03-05

    Application No.: US14774966

    Filing date: 2014-03-04

    Abstract: A method, an apparatus, logic (e.g., executable instructions encoded in a non-transitory computer-readable medium to carry out a method), and a non-transitory computer-readable medium configured with such instructions. The method is to generate and spatially render spatial comfort noise at a receiving endpoint of a conference system, such that the comfort noise has target spectral characteristics typical of comfort noise, and at least one spatial property that at least substantially matches at least one target spatial property. One version includes receiving one or more audio signals from other endpoints, combining the received audio signals with the spatial comfort noise signals, and rendering the combination of the received audio signals and the spatial comfort noise signals to a set of output signals for loudspeakers, such that the spatial comfort noise signals are continually in the output signals in addition to output from the received audio signals.
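As a rough illustration of the idea in this abstract, the sketch below generates two-channel noise with a shaped spectrum and a chosen inter-channel correlation, then adds it to received audio. The choices here are assumptions for illustration only and are not taken from the patent: a one-pole low-pass filter as the "target spectral characteristic", inter-channel correlation `rho` as the "target spatial property", and all function and parameter names.

```python
import math
import random
from typing import List, Tuple


def shaped_noise(n: int, rng: random.Random, pole: float = 0.9) -> List[float]:
    """White Gaussian noise passed through a one-pole low-pass filter."""
    out: List[float] = []
    y = 0.0
    for _ in range(n):
        y = pole * y + (1.0 - pole) * rng.gauss(0.0, 1.0)
        out.append(y)
    return out


def spatial_comfort_noise(
    n: int, rho: float, level: float, seed: int = 0
) -> Tuple[List[float], List[float]]:
    """Two noise channels whose correlation coefficient is approximately rho.

    Both channels use the same shaping filter, so mixing two independent
    shaped sources keeps the spectrum and sets the correlation.
    """
    rng = random.Random(seed)
    a = shaped_noise(n, rng)
    b = shaped_noise(n, rng)  # independent of a
    k = math.sqrt(1.0 - rho * rho)
    left = [level * x for x in a]
    right = [level * (rho * x + k * y) for x, y in zip(a, b)]
    return left, right


def mix(received: List[float], noise: List[float]) -> List[float]:
    """Add comfort noise to received audio so it is present at all times."""
    return [r + c for r, c in zip(received, noise)]
```

Because the noise is added unconditionally in `mix`, it stays in the output even while the received signal is silent, which is the behavior the abstract describes for the rendered output.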

    Hierarchical active voice detection
    3.
    Granted patent
    Legal status: in force

    Publication No.: US09064503B2

    Publication date: 2015-06-23

    Application No.: US14386304

    Filing date: 2013-03-21

    CPC classification number: G10L25/78 G10L19/173 G10L2025/786

    Abstract: One or more audio signals are processed using a multi-stage (hierarchical) voice and/or signal activity detector (VAD/SAD). A first stage is capable of reducing the workload bandwidth by employing an inexpensive VAD/SAD processor. One or more subsequent stages may further process the audio signals from the first stage. Other implementations may include a first stage that also performs continuity preservation between last blocks of audio signal and the first blocks of audio after it is detected that relevant audio signals are resumed. In yet other implementations, the first stage may extract features from audio signals when they are presented in their coded domain, and possibly with little or no decoding of the audio signal.

    Long term monitoring of transmission and voice activity patterns for regulating gain control
    4.
    Granted patent
    Legal status: in force

    Publication No.: US09521263B2

    Publication date: 2016-12-13

    Application No.: US14419924

    Filing date: 2013-09-09

    Abstract: The present document relates to audio communication systems. In particular, the present document relates to the control of the level of audio signals within audio communication systems. A method for leveling a near-end audio signal (211) using a leveling gain (214) is described. The near-end audio signal (211) comprises a sequence of segments, wherein the sequence of segments comprises a current segment and one or more preceding segments. The method comprises determining a nuisance measure (416) which is indicative of an amount of aberrant voice activity within the sequence of segments of the near-end audio signal (211); and determining the leveling gain (214) for the current segment of the near-end audio signal (211), at least based on the leveling gain (214) for the one or more preceding segments of the near-end audio signal (211), and by taking into account—according to a variable degree—an estimate of the level of the current segment of the near-end audio signal (211); wherein the variable degree is dependent on the nuisance measure (416).
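The gain update described in this abstract can be sketched as a recursive rule in which the current level estimate is weighted by a factor that shrinks as the nuisance measure grows. The specifics below are assumptions for illustration and are not taken from the patent: the nuisance measure is treated as a value in [0, 1] supplied by the caller, the weighting is linear, and `target_db`, `max_step`, and the function name are hypothetical.

```python
def update_leveling_gain(
    prev_gain_db: float,
    level_db: float,
    nuisance: float,
    target_db: float = -26.0,
    max_step: float = 0.1,
) -> float:
    """Return the leveling gain (dB) for the current segment.

    prev_gain_db: gain used for the preceding segment.
    level_db:     level estimate of the current segment.
    nuisance:     0.0 (normal voice activity) to 1.0 (fully aberrant).
    """
    nuisance = min(max(nuisance, 0.0), 1.0)
    # Variable degree: the more aberrant the activity, the less the
    # current level estimate is allowed to move the gain.
    degree = max_step * (1.0 - nuisance)
    desired_gain_db = target_db - level_db
    return prev_gain_db + degree * (desired_gain_db - prev_gain_db)
```

With `nuisance` at 1.0 the gain simply holds its previous value, while lower values let the gain track the level of the current segment more quickly.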
