Adaptive Bandwidth Extension and Apparatus for the Same

    公开(公告)号:US20170221498A1

    公开(公告)日:2017-08-03

    申请号:US15491181

    申请日:2017-04-19

    Inventor: Yang Gao

    Abstract: A method of decoding an encoded audio bitstream and generating frequency bandwidth extension is disclosed. The method includes decoding the audio bitstream to produce a decoded low band audio signal and generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is identified within the low frequency band using a parameter which indicates energy information of a low band spectral envelope. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the identified sub-band area to a high sub-band area corresponding to the high frequency band. Using the generated high band excitation spectrum, an extended high band audio signal is generated by applying a high band spectral envelope. The extended high band audio signal is added to the decoded low band audio signal to generate an audio output signal having an extended frequency bandwidth.

    Audio classification based on perceptual quality for low or medium bit rates
    23.
    发明授权
    Audio classification based on perceptual quality for low or medium bit rates 有权
    基于低或中比特率的感知质量的音频分类

    公开(公告)号:US09589570B2

    公开(公告)日:2017-03-07

    申请号:US14027052

    申请日:2013-09-13

    Inventor: Yang Gao

    Abstract: The quality of encoded signals can be improved by reclassifying AUDIO signals carrying non-speech data as VOICE signals when periodicity parameters of the signal satisfy one or more criteria. In some embodiments, only low or medium bit rate signals are considered for re-classification. The periodicity parameters can include any characteristic or set of characteristics indicative of periodicity. For example, the periodicity parameter may include pitch differences between subframes in the audio signal, a normalized pitch correlation for one or more subframes, an average normalized pitch correlation for the audio signal, or combinations thereof. Audio signals which are re-classified as VOICED signals may be encoded in the time-domain, while audio signals that remain classified as AUDIO signals may be encoded in the frequency-domain.

    Abstract translation: 当信号的周期参数满足一个或多个标准时,通过将携带非语音数据的AUDIO信号重新分类为VOICE信号可以改善编码信号的质量。 在一些实施例中,仅考虑低或中比特率信号用于重新分类。 周期性参数可以包括指示周期性的任何特征或特征集合。 例如,周期性参数可以包括音频信号中的子帧之间的音调差,一个或多个子帧的归一化音调相关性,音频信号的平均归一化音调相关性,或其组合。 重新分类为VOICED信号的音频信号可以在时域中被编码,而保持分类为AUDIO信号的音频信号可以在频域中进行编码。

    Unvoiced/voiced decision for speech processing
    24.
    发明授权
    Unvoiced/voiced decision for speech processing 有权
    用于语音处理的清音/浊音决定

    公开(公告)号:US09570093B2

    公开(公告)日:2017-02-14

    申请号:US14476547

    申请日:2014-09-03

    Inventor: Yang Gao

    CPC classification number: G10L25/78 G10L19/22 G10L25/93

    Abstract: In accordance with an embodiment of the present invention, a method for speech processing includes determining an unvoicing/voicing parameter reflecting a characteristic of unvoiced/voicing speech in a current frame of a speech signal comprising a plurality of frames. A smoothed unvoicing/voicing parameter is determined to include information of the unvoicing/voicing parameter in a frame prior to the current frame of the speech signal. A difference between the unvoicing/voicing parameter and the smoothed unvoicing/voicing parameter is computed. The method further includes generating an unvoiced/voiced decision point for determining whether the current frame comprises unvoiced speech or voiced speech using the computed difference as a decision parameter.

    Abstract translation: 根据本发明的实施例,一种用于语音处理的方法包括:确定反映在包括多个帧的语音信号的当前帧中的清音/发声语音的特征的清音/发声参数。 平滑的清音/发声参数被确定为包括语音信号的当前帧之前的帧中的清音/发声参数的信息。 计算出浊音/浊音参数与平滑的浊音/浊音参数之间的差异。 该方法还包括生成清音/有声决定点,用于使用所计算的差分作为判定参数来确定当前帧是否包括无声语音或浊音。

    Spectral envelope coding of energy attack signal
    25.
    发明授权
    Spectral envelope coding of energy attack signal 有权
    能量攻击信号的光谱信封编码

    公开(公告)号:US09020815B2

    公开(公告)日:2015-04-28

    申请号:US13888550

    申请日:2013-05-07

    Inventor: Yang Gao

    Abstract: MDCT or FFT-based audio coding algorithms often have the problem named here spectral pre-echoes when coding an energy attack signal. This invention presents several possibilities to avoid the spectral pre-echoes existing in decoded signal segment before the energy attack point. The spectral envelope before the attack point can be improved by performing spectrum smoothing, replacing the segment of having spectral pre-echoes or filtering the segment with a combined filter obtained by doing LPC analysis.

    Abstract translation: 当编码能量攻击信号时,基于MDCT或基于FFT的音频编码算法通常具有这里称为频谱预回波的问题。 本发明提出了几种可能性,以避免在能量攻击点之前存在于解码信号段中的频谱预回波。 攻击点之前的频谱包络可以通过进行频谱平滑处理来改善,用具有频谱预回波的段来替换,或者用通过LPC分析获得的组合滤波器对该段进行滤波。

    SPECTRAL ENVELOPE CODING OF ENERGY ATTACK SIGNAL
    26.
    发明申请
    SPECTRAL ENVELOPE CODING OF ENERGY ATTACK SIGNAL 审中-公开
    能量攻击信号的光谱包络编码

    公开(公告)号:US20130308792A1

    公开(公告)日:2013-11-21

    申请号:US13868806

    申请日:2013-04-23

    Inventor: Yang Gao

    Abstract: MDCT or FFT-based audio coding algorithms often have the problem named here spectral pre-echoes when coding an energy attack signal. This invention presents several possibilities to avoid the spectral pre-echoes existing in decoded signal segment before the energy attack point. The spectral envelope before the attack point can be improved by performing spectrum smoothing, replacing the segment of having spectral pre-echoes or filtering the segment with a combined filter obtained by doing LPC analysis.

    Abstract translation: 当编码能量攻击信号时,基于MDCT或基于FFT的音频编码算法通常具有这里称为频谱预回波的问题。 本发明提出了几种可能性,以避免在能量攻击点之前存在于解码信号段中的频谱预回波。 攻击点之前的频谱包络可以通过进行频谱平滑处理来改善,用具有频谱预回波的段来替换,或者用通过LPC分析获得的组合滤波器对该段进行滤波。

    Method for Encoding Signal, and Method for Decoding Signal
    27.
    发明申请
    Method for Encoding Signal, and Method for Decoding Signal 有权
    信号编码方法以及解码信号的方法

    公开(公告)号:US20130304460A1

    公开(公告)日:2013-11-14

    申请号:US13943812

    申请日:2013-07-17

    CPC classification number: G10L19/09 G10L19/0017 H04N19/50

    Abstract: The present disclosure relates to a method, apparatus, and system for encoding and decoding signals. The encoding method includes: converting a first-domain signal into a second-domain signal; performing Linear Prediction (LP) processing and Long-Term Prediction (LTP) processing for the second-domain signal; obtaining a long-term flag value according to a decision criterion; obtaining a second-domain predictive signal according to the LP processing result and the LTP processing result when the long-term flag value is a first value; obtaining a second-domain predictive signal according to the LP processing result when the long-term flag value is a second value; converting the second-domain predictive signal into a first-domain predictive signal, and calculating a first-domain predictive residual signal; and outputting a bit stream that includes the first-domain predictive residual signal.

    Abstract translation: 本公开涉及用于对信号进行编码和解码的方法,装置和系统。 编码方法包括:将第一域信号转换为第二域信号; 对第二域信号执行线性预测(LP)处理和长期预测(LTP)处理; 根据决策标准获得长期标志值; 当长期标志值为第一值时,根据LP处理结果和LTP处理结果获得第二域预测信号; 当长期标志值是第二值时,根据LP处理结果获得第二域预测信号; 将所述第二域预测信号转换为第一域预测信号,以及计算第一域预测残差信号; 并输出包括第一域预测残差信号的比特流。

    System and Method for Post Excitation Enhancement for Low Bit Rate Speech Coding
    28.
    发明申请
    System and Method for Post Excitation Enhancement for Low Bit Rate Speech Coding 有权
    用于低比特率语音编码的后激励增强的系统和方法

    公开(公告)号:US20130246055A1

    公开(公告)日:2013-09-19

    申请号:US13779589

    申请日:2013-02-27

    Inventor: Yang Gao

    CPC classification number: G10L19/04 G10L19/12 G10L19/26

    Abstract: In accordance with an embodiment, a method of decoding an audio/speech signal includes decoding an excitation signal based on an incoming audio/speech information, determining a stability of a high frequency portion of the excitation signal, smoothing an energy of the high frequency portion of the excitation signal based on the stability of the high frequency portion of the excitation signal, and producing an audio signal based on smoothing the high frequency portion of the excitation signal.

    Abstract translation: 根据实施例,对音频/语音信号进行解码的方法包括基于输入音频/语音信息来解码激励信号,确定激励信号的高频部分的稳定性,平滑高频部分的能量 基于激励信号的高频部分的稳定性的激励信号,并且基于使激励信号的高频部分平滑来产生音频信号。

    System and Method for Mixed Codebook Excitation for Speech Coding
    29.
    发明申请
    System and Method for Mixed Codebook Excitation for Speech Coding 有权
    用于语音编码的混合码本激励的系统和方法

    公开(公告)号:US20130218578A1

    公开(公告)日:2013-08-22

    申请号:US13768814

    申请日:2013-02-15

    Inventor: Yang Gao

    CPC classification number: G10L19/00 G10L19/12

    Abstract: In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determined mixed codebook vector, and transmitting a coded excitation index of the determined mixed codebook vector.

    Abstract translation: 根据实施例,对音频/语音信号进行编码的方法包括基于输入音频/语音信号来确定混合码本矢量,其中混合码本向量包括来自第一码本的第一码本条目和第二码本矢量的和 第二码本的码本条目。 该方法还包括基于所确定的混合码本矢量生成编码音频信号,以及发送所确定的混合码本矢量的编码的激励索引。

    File Generation Method and Electronic Device
    30.
    发明公开

    公开(公告)号:US20230376459A1

    公开(公告)日:2023-11-23

    申请号:US18027035

    申请日:2021-09-06

    Inventor: Shen Wang Yang Gao

    CPC classification number: G06F16/16 G06F16/957 G06F16/13

    Abstract: A file generation method includes obtaining first information based on a resource index file, where the resource index file comprises indexes, file names, and first path information of a plurality of resource files, where the resource files include a first resource file and a second resource file, where the first information includes second path information and a file name of a first file; loading the first file based on the first information; obtaining the first resource file and the second resource file from the first file based on second information; and displaying the first page based on the first resource file and the second resource file.

Patent Agency Ranking