CLASSIFICATION OF FAST AND SLOW SIGNALS
    41.
    发明申请
    CLASSIFICATION OF FAST AND SLOW SIGNALS 有权
    快速和慢速信号的分类

    公开(公告)号:US20150221318A1

    公开(公告)日:2015-08-06

    申请号:US14687689

    申请日:2015-04-15

    Inventor: Yang Gao

    CPC classification number: G10L19/025 G10L19/022 G10L19/22

    Abstract: Low bit rate audio coding such as BWE algorithm often encounters conflict goal of achieving high time resolution and high frequency resolution at the same time. In order to achieve best possible quality, input signal can be first classified into fast signal and slow signal. This invention focuses on classifying signal into fast signal and slow signal, based on at least one of the following parameters or a combination of the following parameters: spectral sharpness, temporal sharpness, pitch correlation (pitch gain), and/or spectral envelope variation. This classification information can help to choose different BWE algorithms, different coding algorithms, and different post-processing algorithms respectively for fast signal and slow signal.

    Abstract translation: 诸如BWE算法之类的低比特率音频编码经常遇到同时实现高时间分辨率和高频分辨率的冲突目标。 为了达到最佳质量,输入信号可以先分为快速信号和慢信号。 本发明集中于基于以下参数中的至少一个或以下参数的组合将信号分类为快速信号和慢信号:频谱清晰度,时间清晰度,音调相关(音调增益)和/或频谱包络变化。 该分类信息有助于分别为快速信号和慢信号选择不同的BWE算法,不同的编码算法和不同的后处理算法。

    Very short pitch detection and coding
    42.
    发明授权
    Very short pitch detection and coding 有权
    非常短的音高检测和编码

    公开(公告)号:US09099099B2

    公开(公告)日:2015-08-04

    申请号:US13724769

    申请日:2012-12-21

    Inventor: Yang Gao Fengyan Qi

    Abstract: System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.

    Abstract translation: 提供了用于语音或音频信号的非常短的音调检测和编码的系统和方法实施例。 该系统和方法包括使用时域和频域音调检测技术的组合来检测语音或音频信号中是否存在比常规最小音调限制短的音调滞后。 音调检测技术包括在时域中使用音调相关性,并检测频域中的语音或音频信号中缺少低频能量。 检测到的非常短的音调滞后使用小于传统的最小间距限制的预定的最小非常短的间距限制的音调范围进行编码。

    System and method for post excitation enhancement for low bit rate speech coding
    43.
    发明授权
    System and method for post excitation enhancement for low bit rate speech coding 有权
    用于低比特率语音编码的后激励增强的系统和方法

    公开(公告)号:US09082398B2

    公开(公告)日:2015-07-14

    申请号:US13779589

    申请日:2013-02-27

    Inventor: Yang Gao

    CPC classification number: G10L19/04 G10L19/12 G10L19/26

    Abstract: In accordance with an embodiment, a method of decoding an audio/speech signal includes decoding an excitation signal based on an incoming audio/speech information, determining a stability of a high frequency portion of the excitation signal, smoothing an energy of the high frequency portion of the excitation signal based on the stability of the high frequency portion of the excitation signal, and producing an audio signal based on smoothing the high frequency portion of the excitation signal.

    Abstract translation: 根据实施例,对音频/语音信号进行解码的方法包括基于输入音频/语音信息来解码激励信号,确定激励信号的高频部分的稳定性,平滑高频部分的能量 基于激励信号的高频部分的稳定性的激励信号,并且基于使激励信号的高频部分平滑来产生音频信号。

    Adaptive encoding pitch lag for voiced speech
    44.
    发明授权
    Adaptive encoding pitch lag for voiced speech 有权
    自适应编码浊音的音调滞后

    公开(公告)号:US09015039B2

    公开(公告)日:2015-04-21

    申请号:US13724700

    申请日:2012-12-21

    Inventor: Yang Gao

    CPC classification number: G10L25/90 G10L19/09 G10L19/18

    Abstract: System and method embodiments for dual modes pitch coding are provided. The system and method embodiments are configured to adaptively code pitch lags of a voiced speech signal using one of two pitch coding modes according to a pitch length, stability, or both. The two pitch coding modes include a first pitch coding mode with relatively high precision and reduced dynamic range, and a second pitch coding mode with relatively large dynamic range and reduced precision. The first pitch coding mode is used upon determining that the voiced speech signal has a relatively short or substantially stable pitch. The second pitch coding mode is used upon determining that the voiced speech signal has a relatively long or less stable pitch or is a substantially noisy signal.

    Abstract translation: 提供了用于双模音调编码的系统和方法实施例。 系统和方法实施例被配置为根据间距长度,稳定性或两者来使用两种音调编码模式之一自适应地编码有声语音信号的音调滞后。 两个音调编码模式包括具有相对较高精度和降低的动态范围的第一音调编码模式,以及具有相对大的动态范围和精度降低的第二音调编码模式。 在确定有声语音信号具有相对较短或基本上稳定的音调时,使用第一音调编码模式。 第二音调编码模式在确定有声语音信号具有相对较长或较小的稳定音调或者是基本上噪声的信号时被使用。

    Packet Loss Concealment for Speech Coding
    45.
    发明申请
    Packet Loss Concealment for Speech Coding 有权
    语音编码的丢包隐藏

    公开(公告)号:US20140156267A1

    公开(公告)日:2014-06-05

    申请号:US14175195

    申请日:2014-02-07

    Inventor: Yang Gao

    CPC classification number: G10L19/005 G10L19/083 G10L19/09 G10L19/22

    Abstract: A speech coding method of reducing error propagation due to voice packet loss, is achieved by limiting or reducing a pitch gain only for the first subframe or the first two subframes within a speech frame. The method is used for a voiced speech class. A pitch cycle length is compared to a subframe size to decide to reduce the pitch gain for the first subframe or the first two subframes within the frame. A strongly voiced class is decided by checking if the pitch lags are stable and the pitch gains are high enough with the frame; for the strongly voiced frame, the pitch lags and the pitch gains can be encoded more efficiently than other speech classes.

    Abstract translation: 通过仅对语音帧内的第一子帧或前两个子帧限制或减小音调增益来实现减少由于语音分组丢失引起的误差传播的语音编码方法。 该方法用于有声语音类。 将音调周期长度与子帧尺寸进行比较,以决定减小帧内的第一子帧或前两个子帧的音调增益。 通过检查音调滞后是否稳定并且音高增益足够高的帧来决定强音阶。 对于强有声的帧,音调滞后,音调增益可以比其他语音类别更有效地编码。

    Method for encoding signal, and method for decoding signal
    46.
    发明授权
    Method for encoding signal, and method for decoding signal 有权
    信号编码方法,信号解码方法

    公开(公告)号:US08712763B2

    公开(公告)日:2014-04-29

    申请号:US13943812

    申请日:2013-07-17

    CPC classification number: G10L19/09 G10L19/0017 H04N19/50

    Abstract: The present disclosure relates to a method, apparatus, and system for encoding and decoding signals. The encoding method includes: converting a first-domain signal into a second-domain signal; performing Linear Prediction (LP) processing and Long-Term Prediction (LTP) processing for the second-domain signal; obtaining a long-term flag value according to a decision criterion; obtaining a second-domain predictive signal according to the LP processing result and the LTP processing result when the long-term flag value is a first value; obtaining a second-domain predictive signal according to the LP processing result when the long-term flag value is a second value; converting the second-domain predictive signal into a first-domain predictive signal, and calculating a first-domain predictive residual signal; and outputting a bit stream that includes the first-domain predictive residual signal.

    Abstract translation: 本公开涉及用于对信号进行编码和解码的方法,装置和系统。 编码方法包括:将第一域信号转换为第二域信号; 对第二域信号执行线性预测(LP)处理和长期预测(LTP)处理; 根据决策标准获得长期标志值; 当长期标志值为第一值时,根据LP处理结果和LTP处理结果获得第二域预测信号; 当长期标志值是第二值时,根据LP处理结果获得第二域预测信号; 将所述第二域预测信号转换为第一域预测信号,以及计算第一域预测残差信号; 并输出包括第一域预测残差信号的比特流。

    Very Short Pitch Detection and Coding

    公开(公告)号:US20130166288A1

    公开(公告)日:2013-06-27

    申请号:US13724769

    申请日:2012-12-21

    Inventor: Yang Gao Fengyan Qi

    Abstract: System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.

    EFFICIENT TEMPORAL ENVELOPE CODING APPROACH BY PREDICTION BETWEEN LOW BAND SIGNAL AND HIGH BAND SIGNAL
    48.
    发明申请
    EFFICIENT TEMPORAL ENVELOPE CODING APPROACH BY PREDICTION BETWEEN LOW BAND SIGNAL AND HIGH BAND SIGNAL 审中-公开
    低频信号与高频信号之间预测的有效的时间包络编码方法

    公开(公告)号:US20130030797A1

    公开(公告)日:2013-01-31

    申请号:US13625874

    申请日:2012-09-25

    Inventor: Yang Gao

    CPC classification number: G10L19/0204 G10L19/002 G10L19/025 G10L19/04

    Abstract: This invention provides a more efficient way to quantize temporal envelope shaping of high band signal by benefiting from energy relationship between low band signal and high band signal; if low band signal is well coded or it is coded with time domain codec such as CELP, temporal envelope shaping information of low band signal can be used to predict temporal envelope shaping of high band signal; the temporal envelope shaping prediction can bring significant saving of bits to precisely quantize temporal envelope shaping of high band signal. This prediction approach can be combined with other specific approach to further increase the efficiency and save mores bits.

    Abstract translation: 本发明通过受益于低频带信号与高频带信号之间的能量关系,提供了一种更高效的量化高频带信号时间包络整形的方法; 如果低频带信号被良好编码,或者用诸如CELP的时域编解码器编码,则可以使用低频带信号的时间包络整形信息来预测高频带信号的时间包络整形; 时间包络整形预测可以显着节省位以精确量化高频带信号的时间包络整形。 这种预测方法可以与其他具体方法结合起来,以进一步提高效率并节省毛利位。

Patent Agency Ranking