-
公开(公告)号:US11715478B2
公开(公告)日:2023-08-01
申请号:US17372951
申请日:2021-07-12
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yang Gao
IPC: G10L19/06 , G10L19/032
CPC classification number: G10L19/06 , G10L19/032
Abstract: Methods, systems, apparatuses, and computer programs encoded on computer storage media for performing linear predictive coding (LPC) are described. At least one of a differential spectrum tilt and an energy difference between a current frame and a previous frame of the audio signal is determined. A spectral stability of the audio signal is detected based on at least one of the differential spectrum tilt and an energy difference between the current frame and the previous frame of the audio signal. In response to detecting the spectral stability of the audio signal, quantized LPC parameters for the previous frame are copied into the current frame of the audio signal.
-
公开(公告)号:US20220230647A1
公开(公告)日:2022-07-21
申请号:US17667891
申请日:2022-02-09
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yang Gao , Fengyan Qi
IPC: G10L21/003 , G10L25/21 , G10L25/06 , G10L25/90 , G10L19/00
Abstract: A method includes detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in a time domain and detecting a lack of low frequency energy in the speech or audio signal in a frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
-
公开(公告)号:US10249313B2
公开(公告)日:2019-04-02
申请号:US15491181
申请日:2017-04-19
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yang Gao
Abstract: A method of decoding an encoded audio bitstream and generating frequency bandwidth extension is disclosed. The method includes decoding the audio bitstream to produce a decoded low band audio signal and generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is identified within the low frequency band using a parameter which indicates energy information of a low band spectral envelope. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the identified sub-band area to a high sub-band area corresponding to the high frequency band. Using the generated high band excitation spectrum, an extended high band audio signal is generated by applying a high band spectral envelope. The extended high band audio signal is added to the decoded low band audio signal to generate an audio output signal having an extended frequency bandwidth.
-
公开(公告)号:US10128910B2
公开(公告)日:2018-11-13
申请号:US15567844
申请日:2015-05-15
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhonghui Peng , Heng Wang , Yonggang Wang , Dongjin Chen , Yang Gao
IPC: H04B5/00 , H04B1/3816
Abstract: A mobile device with the NFC function includes an NFC chip, multiple SIM card slots, a power supply unit, and an eSE integrated into the NFC chip. One SIM card slot is connected to a first power port on the NFC chip. The power supply unit is connected to a second power port on the NFC chip. When the mobile device performs near field communication, the second power port on the NFC chip is triggered to output a first level signal. Each of the rest SIM card slots is connected to the power supply unit. The eSE is connected to the power supply unit. The power supply unit is configured to supply power to the eSE and the SIM card slot that is connected to the power supply unit, when the first level signal is received.
-
公开(公告)号:US20180012606A1
公开(公告)日:2018-01-11
申请号:US15677027
申请日:2017-08-15
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yang Gao
IPC: G10L19/005 , G10L19/083 , G10L19/09 , G10L19/22
CPC classification number: G10L19/005 , G10L19/083 , G10L19/09 , G10L19/22
Abstract: A speech coding method of reducing error propagation due to voice packet loss, is achieved by limiting or reducing a pitch gain only for the first subframe or the first two subframes within a speech frame, the excitation of a next frame is obtained according to the reduced or limited pitch gain value of the first subframe, and the next frame is encoded according to the obtained excitation. The method is used for a voiced speech class.
-
公开(公告)号:US09685166B2
公开(公告)日:2017-06-20
申请号:US14511943
申请日:2014-10-10
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yang Gao
IPC: G10L21/00 , G10L19/125 , G10L19/22
CPC classification number: G10L19/125 , G10L19/002 , G10L19/22 , G10L2019/0002 , G10L2019/0011 , G10L2019/0016
Abstract: A method for processing speech signals prior to encoding a digital signal comprising audio data includes selecting frequency domain coding or time domain coding based on a coding bit rate to be used for coding the digital signal and a short pitch lag detection of the digital signal.
-
公开(公告)号:US09672835B2
公开(公告)日:2017-06-06
申请号:US14687689
申请日:2015-04-15
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yang Gao
IPC: G10L19/00 , G10L19/025 , G10L19/022 , G10L19/22
CPC classification number: G10L19/025 , G10L19/022 , G10L19/22
Abstract: Low bit rate audio coding such as BWE algorithm often encounters conflict goal of achieving high time resolution and high frequency resolution at the same time. In order to achieve best possible quality, input signal can be first classified into fast signal and slow signal. This invention focuses on classifying signal into fast signal and slow signal, based on at least one of the following parameters or a combination of the following parameters: spectral sharpness, temporal sharpness, pitch correlation (pitch gain), and/or spectral envelope variation. This classification information can help to choose different BWE algorithms, different coding algorithms, and different post-processing algorithms respectively for fast signal and slow signal.
-
公开(公告)号:US09666202B2
公开(公告)日:2017-05-30
申请号:US14478839
申请日:2014-09-05
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yang Gao
IPC: G10L19/12 , G10L21/038 , G10L19/08 , G10L19/02
CPC classification number: G10L19/22 , G10L19/0204 , G10L19/08 , G10L19/12 , G10L19/167 , G10L19/265 , G10L21/038
Abstract: In one embodiment of the present invention, a method of decoding an encoded audio bitstream and generating frequency bandwidth extension includes decoding the audio bitstream to produce a decoded low band audio signal and generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is selected from within the low frequency band using a parameter which indicates energy information of a spectral envelope of the decoded low band audio signal. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the selected sub-band area to a high sub-band area corresponding to the high frequency band. Using the generated high band excitation spectrum, an extended high band audio signal is generated by applying a high band spectral envelope. The extended high band audio signal is added to the decoded low band audio signal to generate an audio output signal having an extended frequency bandwidth.
-
公开(公告)号:US20160240197A1
公开(公告)日:2016-08-18
申请号:US15136968
申请日:2016-04-24
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Yang Gao
IPC: G10L19/005 , G10L19/22 , G10L19/083
CPC classification number: G10L19/005 , G10L19/083 , G10L19/09 , G10L19/22
Abstract: A speech coding method of reducing error propagation due to voice packet loss, is achieved by limiting or reducing a pitch gain only for the first subframe or the first two subframes within a speech frame. The method is used for a voiced speech class. A pitch cycle length is compared to a subframe size to decide to reduce the pitch gain for the first subframe or the first two subframes within the frame. A strongly voiced class is decided by checking if the pitch lags are stable and the pitch gains are high enough with the frame; for the strongly voiced frame, the pitch lags and the pitch gains can be encoded more efficiently than other speech classes.
Abstract translation: 通过仅对语音帧内的第一子帧或前两个子帧限制或减小音调增益来实现减少由于语音分组丢失引起的误差传播的语音编码方法。 该方法用于有声语音类。 将音调周期长度与子帧尺寸进行比较,以决定减小帧内的第一子帧或前两个子帧的音调增益。 通过检查音调滞后是否稳定并且音高增益足够高的帧来决定强音阶。 对于强有声的帧,音调滞后,音调增益可以比其他语音类别更有效地编码。
-
公开(公告)号:US20150287420A1
公开(公告)日:2015-10-08
申请号:US14744452
申请日:2015-06-19
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Yang Gao , Fengyan Qi
IPC: G10L21/003 , G10L19/00
Abstract: System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
Abstract translation: 提供了用于语音或音频信号的非常短的音调检测和编码的系统和方法实施例。 该系统和方法包括使用时域和频域音调检测技术的组合来检测语音或音频信号中是否存在比常规最小音调限制短的音调滞后。 音调检测技术包括在时域中使用音调相关性,并检测频域中的语音或音频信号中缺少低频能量。 检测到的非常短的音调滞后使用小于传统的最小间距限制的预定的最小非常短的间距限制的音调范围进行编码。
-
-
-
-
-
-
-
-
-