Abstract:
A method for processing speech signals prior to encoding a digital signal comprising audio data includes selecting frequency domain coding or time domain coding based on the coding bit rate to be used for coding the digital signal and on short pitch lag detection of the digital signal.
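A minimal sketch of the kind of selection rule the abstract describes. The threshold value and the exact decision logic are illustrative assumptions, not taken from the patent; the idea is that a detected short pitch lag favors time-domain coding except when the bit rate is high:

```python
def select_coding_mode(bit_rate_bps, has_short_pitch_lag):
    """Choose time-domain or frequency-domain coding.

    Hypothetical rule: signals with a detected short pitch lag are
    routed to time-domain coding, except at high bit rates where
    frequency-domain coding has enough bits to model them well.
    """
    HIGH_RATE_THRESHOLD = 24000  # bps; illustrative, not from the patent
    if has_short_pitch_lag and bit_rate_bps < HIGH_RATE_THRESHOLD:
        return "time_domain"
    return "frequency_domain"
```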
Abstract:
A method of decoding an encoded audio bitstream and generating frequency bandwidth extension is disclosed. The method includes decoding the audio bitstream to produce a decoded low band audio signal and to generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is identified within the low frequency band using a parameter that indicates energy information of a low band spectral envelope. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the identified sub-band area to a high sub-band area corresponding to the high frequency band. An extended high band audio signal is then generated by applying a high band spectral envelope to the generated high band excitation spectrum. The extended high band audio signal is added to the decoded low band audio signal to produce an audio output signal having an extended frequency bandwidth.
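The copy-up step can be sketched as follows. The selection rule (pick the most energetic low-band sub-band) and all names are illustrative assumptions; the abstract only specifies that an energy-related envelope parameter identifies the source sub-band:

```python
import numpy as np

def extend_bandwidth(low_band_excitation, low_env_energy, high_band_envelope):
    """Sketch of sub-band copy-up bandwidth extension.

    low_band_excitation: spectrum bins of the decoded low band.
    low_env_energy: per-sub-band energies of the low-band spectral
        envelope, used here to pick the most energetic source sub-band.
    high_band_envelope: gains shaping the copied excitation.
    """
    n = len(high_band_envelope)            # width of the high band
    sub_bands = len(low_env_energy)
    band_width = len(low_band_excitation) // sub_bands
    src = int(np.argmax(low_env_energy))   # identified sub-band area
    start = src * band_width
    # tile the chosen sub-band excitation across the high band
    copied = np.resize(low_band_excitation[start:start + band_width], n)
    return copied * high_band_envelope     # apply high-band envelope
```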
Abstract:
The quality of encoded signals can be improved by reclassifying AUDIO signals carrying non-speech data as VOICED signals when periodicity parameters of the signal satisfy one or more criteria. In some embodiments, only low or medium bit rate signals are considered for reclassification. The periodicity parameters can include any characteristic or set of characteristics indicative of periodicity. For example, the periodicity parameters may include pitch differences between subframes in the audio signal, a normalized pitch correlation for one or more subframes, an average normalized pitch correlation for the audio signal, or combinations thereof. Audio signals that are reclassified as VOICED signals may be encoded in the time domain, while audio signals that remain classified as AUDIO signals may be encoded in the frequency domain.
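The reclassification test above can be sketched as a simple per-frame decision. All thresholds are illustrative assumptions; the patent leaves the exact criteria open:

```python
def reclassify_audio_frame(bit_rate_bps, pitch_lags, norm_pitch_corr):
    """Sketch of AUDIO -> VOICED reclassification on periodicity.

    Hypothetical criteria: small pitch-lag differences between adjacent
    subframes plus a high average normalized pitch correlation indicate
    a periodic signal worth re-coding in the time domain.
    """
    LOW_MEDIUM_RATE = 24000   # only low/medium rates are reconsidered
    MAX_PITCH_DIFF = 2        # samples, between adjacent subframes
    MIN_AVG_CORR = 0.7
    if bit_rate_bps >= LOW_MEDIUM_RATE:
        return "AUDIO"
    diffs = [abs(b - a) for a, b in zip(pitch_lags, pitch_lags[1:])]
    avg_corr = sum(norm_pitch_corr) / len(norm_pitch_corr)
    if max(diffs) <= MAX_PITCH_DIFF and avg_corr >= MIN_AVG_CORR:
        return "VOICED"       # encode in the time domain
    return "AUDIO"            # keep frequency-domain coding
```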
Abstract:
In accordance with an embodiment of the present invention, a method for speech processing includes determining an unvoicing/voicing parameter reflecting a characteristic of unvoiced/voiced speech in a current frame of a speech signal comprising a plurality of frames. A smoothed unvoicing/voicing parameter is determined to include information of the unvoicing/voicing parameter in a frame prior to the current frame of the speech signal. A difference between the unvoicing/voicing parameter and the smoothed unvoicing/voicing parameter is computed. The method further includes generating an unvoiced/voiced decision point for determining whether the current frame comprises unvoiced or voiced speech, using the computed difference as a decision parameter.
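A minimal sketch of this differential decision, assuming first-order recursive smoothing; the smoothing factor and decision threshold are illustrative, not values from the patent:

```python
def voicing_decision(voicing_param, smoothed_prev, alpha=0.9, threshold=0.1):
    """Sketch of the differential unvoiced/voiced decision.

    smoothed_prev carries information from prior frames; the current
    frame is classified from the difference between the raw and smoothed
    parameters. Returns (decision, updated smoothed parameter).
    """
    diff = voicing_param - smoothed_prev
    decision = "voiced" if diff > threshold else "unvoiced"
    # update the running smoothed parameter for the next frame
    smoothed_new = alpha * smoothed_prev + (1.0 - alpha) * voicing_param
    return decision, smoothed_new
```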
Abstract:
MDCT- or FFT-based audio coding algorithms often suffer from a problem referred to here as spectral pre-echo when coding an energy attack signal. This invention presents several possibilities for avoiding the spectral pre-echoes that appear in the decoded signal segment before the energy attack point. The spectral envelope before the attack point can be improved by performing spectrum smoothing, by replacing the segment having spectral pre-echoes, or by filtering the segment with a combined filter obtained through LPC analysis.
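The first option, smoothing the segment before the attack point, can be sketched as below. The moving-average filter and its width are illustrative stand-ins for whatever smoothing the codec actually applies:

```python
import numpy as np

def suppress_pre_echo(signal, attack_index, window=3):
    """Sketch: smooth only the segment preceding the attack point.

    A moving-average filter of width `window` attenuates pre-echo
    fluctuation before attack_index, leaving the attack itself and
    everything after it untouched.
    """
    out = np.asarray(signal, dtype=float).copy()
    kernel = np.ones(window) / window
    pre = np.convolve(out[:attack_index], kernel, mode="same")
    out[:attack_index] = pre
    return out
```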
Abstract:
The present disclosure relates to a method, apparatus, and system for encoding and decoding signals. The encoding method includes: converting a first-domain signal into a second-domain signal; performing Linear Prediction (LP) processing and Long-Term Prediction (LTP) processing for the second-domain signal; obtaining a long-term flag value according to a decision criterion; obtaining a second-domain predictive signal according to the LP processing result and the LTP processing result when the long-term flag value is a first value; obtaining a second-domain predictive signal according to the LP processing result when the long-term flag value is a second value; converting the second-domain predictive signal into a first-domain predictive signal, and calculating a first-domain predictive residual signal; and outputting a bit stream that includes the first-domain predictive residual signal.
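The prediction step can be sketched per sample as follows. The filter values, the single-tap LTP form, and all names are illustrative assumptions; the abstract only states that the LTP contribution is used when the long-term flag takes the first value:

```python
def predict_second_domain(signal, lp_coeffs, ltp_lag, ltp_gain, use_ltp):
    """Sketch of combining LP and (optionally) LTP prediction.

    Short-term prediction from lp_coeffs; when the long-term flag
    (use_ltp) is set, a single pitch-lag term ltp_gain * s[n - ltp_lag]
    is added. Returns (prediction, residual).
    """
    order = len(lp_coeffs)
    pred = [0.0] * len(signal)
    for i in range(len(signal)):
        # short-term (LP) part
        p = sum(lp_coeffs[k] * signal[i - 1 - k]
                for k in range(order) if i - 1 - k >= 0)
        # long-term (LTP) part, only when the flag is set
        if use_ltp and i - ltp_lag >= 0:
            p += ltp_gain * signal[i - ltp_lag]
        pred[i] = p
    residual = [s - p for s, p in zip(signal, pred)]
    return pred, residual
```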
Abstract:
In accordance with an embodiment, a method of decoding an audio/speech signal includes decoding an excitation signal based on incoming audio/speech information, determining a stability of a high frequency portion of the excitation signal, smoothing an energy of the high frequency portion of the excitation signal based on the stability of the high frequency portion, and producing an audio signal based on the smoothed high frequency portion of the excitation signal.
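A sketch of stability-driven energy smoothing over a group of frames. Measuring stability as low energy variance, and smoothing by pulling each frame's gain toward the mean energy, are illustrative choices not specified by the abstract:

```python
def smooth_high_band_energy(hf_excitation_frames):
    """Sketch of stability-based high-band energy smoothing.

    Stability is estimated from the variance of per-frame high-band
    energies; when the high band is stable, each frame is rescaled
    toward the mean energy, reducing audible energy fluctuation.
    """
    energies = [sum(x * x for x in f) for f in hf_excitation_frames]
    mean_e = sum(energies) / len(energies)
    var = sum((e - mean_e) ** 2 for e in energies) / len(energies)
    if var >= 0.1 * mean_e ** 2:       # illustrative stability test
        return hf_excitation_frames    # unstable: leave energies alone
    smoothed = []
    for f, e in zip(hf_excitation_frames, energies):
        gain = (mean_e / e) ** 0.5 if e > 0 else 1.0
        smoothed.append([x * gain for x in f])
    return smoothed
```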
Abstract:
In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determined mixed codebook vector, and transmitting a coded excitation index of the determined mixed codebook vector.
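An exhaustive mixed-codebook search can be sketched as below. Using squared error as the selection criterion is an illustrative assumption; real CELP-style searches typically use a weighted-domain criterion and faster search structures:

```python
def best_mixed_codebook_vector(target, codebook_a, codebook_b):
    """Sketch of a mixed-codebook search.

    Each candidate is the element-wise sum of one entry from each
    codebook; the pair closest to the target is kept, and its index
    pair stands in for the coded excitation index to transmit.
    """
    best = None
    for i, a in enumerate(codebook_a):
        for j, b in enumerate(codebook_b):
            cand = [x + y for x, y in zip(a, b)]
            err = sum((t - c) ** 2 for t, c in zip(target, cand))
            if best is None or err < best[0]:
                best = (err, (i, j), cand)
    _, index_pair, vector = best
    return index_pair, vector
```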
Abstract:
A file generation method includes obtaining first information based on a resource index file, where the resource index file includes indexes, file names, and first path information of a plurality of resource files, where the resource files include a first resource file and a second resource file, and where the first information includes second path information and a file name of a first file; loading the first file based on the first information; obtaining the first resource file and the second resource file from the first file based on second information; and displaying a first page based on the first resource file and the second resource file.
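The two-step lookup above can be sketched as follows. The JSON index layout, field names, and offset/length form of the "second information" are all illustrative assumptions; the abstract does not fix a concrete format:

```python
import json

def load_page_resources(resource_index_json, file_contents):
    """Sketch of the two-step resource lookup.

    resource_index_json: a resource index giving the path and name of a
    bundled first file, plus per-resource offsets/lengths (standing in
    for the "second information"). file_contents: a dict keyed on path,
    standing in for the file system.
    """
    index = json.loads(resource_index_json)
    bundle_path = index["bundle"]["path"] + "/" + index["bundle"]["name"]
    bundle = file_contents[bundle_path]      # "load the first file"
    resources = {}
    for entry in index["resources"]:         # slice each resource out
        off, ln = entry["offset"], entry["length"]
        resources[entry["name"]] = bundle[off:off + ln]
    return resources
```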