Speech/music discrimination
    68.
    Granted invention patent

    Publication No.: US09613640B1

    Publication date: 2017-04-04

    Application No.: US14995509

    Filing date: 2016-01-14

    CPC classification: G10L25/81 G10L25/21

    Abstract: A speech/music discrimination method evaluates the standard deviation between envelope peaks, the loudness ratio, and the smoothed energy difference. The envelope is searched for peaks above a threshold, and the standard deviation of the separations between peaks is calculated. A lower standard deviation is indicative of speech; a higher standard deviation is indicative of non-speech. The ratio between minimum and maximum loudness in recent input signal data frames is calculated. If this ratio corresponds to the dynamic range characteristic of speech, it is another indication that the input signal is speech content. Smoothed energies of the frames from the left and right input channels are computed and compared. Similar (e.g., highly correlated) left and right channel smoothed energies are indicative of speech. Dissimilar (e.g., uncorrelated) left and right channel smoothed energies are indicative of non-speech material. The results of the three tests are compared to make a speech/music decision.
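
The three tests named in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the function names, every threshold value, the exponential smoothing, and the majority vote used to combine the tests are hypothetical, since the abstract gives none of these details.

```python
from statistics import pstdev


def peak_spacing_std(envelope, threshold):
    """Standard deviation of the gaps between envelope peaks above threshold."""
    peaks = [
        i for i in range(1, len(envelope) - 1)
        if envelope[i] > threshold
        and envelope[i] > envelope[i - 1]
        and envelope[i] >= envelope[i + 1]
    ]
    gaps = [b - a for a, b in zip(peaks, peaks[1:])]
    # At least two gaps are needed for the spread to mean anything.
    return pstdev(gaps) if len(gaps) >= 2 else None


def loudness_ratio(frame_loudness):
    """Ratio of minimum to maximum loudness over the recent frames."""
    loudest = max(frame_loudness)
    return min(frame_loudness) / loudest if loudest > 0 else 1.0


def smooth(values, alpha=0.5):
    """Exponential smoothing; alpha is an assumed constant."""
    out, state = [], values[0]
    for v in values:
        state = alpha * v + (1 - alpha) * state
        out.append(state)
    return out


def channel_energy_difference(left_energy, right_energy, alpha=0.5):
    """Normalised mean absolute difference of smoothed per-frame energies."""
    left, right = smooth(left_energy, alpha), smooth(right_energy, alpha)
    total = sum(left) + sum(right)
    if total == 0:
        return 0.0
    return sum(abs(a - b) for a, b in zip(left, right)) / total


def classify(envelope, frame_loudness, left_energy, right_energy,
             peak_threshold=0.5, max_spacing_std=2.0,
             ratio_range=(0.0, 0.3), max_energy_diff=0.1):
    """Combine the three tests with a majority vote (assumed rule)."""
    votes = 0
    std = peak_spacing_std(envelope, peak_threshold)
    if std is not None and std <= max_spacing_std:
        votes += 1  # regular peak spacing: speech-like per the abstract
    low, high = ratio_range
    if low <= loudness_ratio(frame_loudness) <= high:
        votes += 1  # dynamic range falls in the assumed speech range
    if channel_energy_difference(left_energy, right_energy) <= max_energy_diff:
        votes += 1  # left and right channels carry similar energy
    return "speech" if votes >= 2 else "non-speech"
```

In practice the thresholds would have to be tuned on labelled audio; the values above exist only to make the sketch runnable.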

    Harmonic bandwidth extension of audio signals
    70.
    Granted invention patent
    Legal status: in force

    Publication No.: US09564141B2

    Publication date: 2017-02-07

    Application No.: US14617524

    Filing date: 2015-02-09

    Abstract: A method includes separating, at a device, an input audio signal into at least a low-band signal and a high-band signal. The low-band signal corresponds to a low-band frequency range and the high-band signal corresponds to a high-band frequency range. The method also includes selecting a non-linear processing function from a plurality of non-linear processing functions. The method further includes generating a first extended signal based on the low-band signal and the selected non-linear processing function. The method also includes generating at least one adjustment parameter based on the first extended signal, the high-band signal, or both.
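
The steps in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the one-pole filter used for band splitting, the three candidate non-linear functions, selection by a caller-supplied name, and an energy-matching gain as the adjustment parameter. The abstract does not say how the patent performs any of these steps.

```python
import math

# Hypothetical set of candidate non-linear processing functions.
NONLINEAR_FUNCTIONS = {
    "abs": abs,
    "square": lambda x: x * x,
    "tanh": math.tanh,
}


def split_bands(signal, alpha=0.2):
    """Split into low band (one-pole low-pass) and high band (the residual)."""
    low, state = [], 0.0
    for x in signal:
        state += alpha * (x - state)
        low.append(state)
    high = [x - l for x, l in zip(signal, low)]
    return low, high


def energy(signal):
    """Sum of squared samples."""
    return sum(x * x for x in signal)


def extend(signal, function_name="abs", alpha=0.2):
    """Return the extended signal and a gain-type adjustment parameter."""
    low, high = split_bands(signal, alpha)
    nonlinear = NONLINEAR_FUNCTIONS[function_name]  # selection step
    # The non-linearity applied to the low band generates new harmonics.
    extended = [nonlinear(x) for x in low]
    # Assumed adjustment parameter: the gain that matches the energy of
    # the extended signal to the energy of the original high band.
    ext_energy = energy(extended)
    gain = math.sqrt(energy(high) / ext_energy) if ext_energy > 0 else 0.0
    return extended, gain
```

Scaling `extended` by `gain` gives it the same energy as the original high band, which is one plausible way an adjustment parameter could depend on both signals.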
