THRESHOLD ADAPTATION IN TWO-CHANNEL NOISE ESTIMATION AND VOICE ACTIVITY DETECTION
    1.
    发明申请
    THRESHOLD ADAPTATION IN TWO-CHANNEL NOISE ESTIMATION AND VOICE ACTIVITY DETECTION 有权
    两通道噪声估计和语音活动检测中的阈值适应

    公开(公告)号:US20150221322A1

    公开(公告)日:2015-08-06

    申请号:US14170136

    申请日:2014-01-31

    Applicant: Apple Inc.

    CPC classification number: G10L25/84 G10L2021/02165 G10L2025/786

    Abstract: A method for adapting a threshold used in multi-channel audio voice activity detection. Strengths of primary and secondary sound pick up channels are computed. A separation, being a measure of difference between the strengths of the primary and secondary channels, is also computed. An analysis of the peaks in separation is performed, e.g. using a leaky peak capture function that captures a peak in the separation and then decays over time, or using a sliding window min-max detector. A threshold that is to be used in a voice activity detection (VAD) process is adjusted, in accordance with the analysis of the peaks. Other embodiments are also described and claimed.

    Abstract translation: 一种用于调整在多声道音频语音活动检测中使用的阈值的方法。 计算一次和二次声音拾取通道的强度。 还计算出分离,作为主要和次要信道强度差异的量度。 进行分离峰的分析,例如 使用泄漏峰值捕获功能,捕获分离中的峰值,然后随时间衰减,或使用滑动窗口最小 - 最大检测器。 根据峰值的分析,调整在语音活动检测(VAD)过程中使用的阈值。 还描述和要求保护其他实施例。

Patent Agency Ranking