-
公开(公告)号:US20210201920A1
公开(公告)日:2021-07-01
申请号:US17204073
申请日:2021-03-17
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zexin Liu , Fengyan Qi , Lei Miao
IPC: G10L19/002 , G10L19/028 , G10L19/02 , G10L19/005
Abstract: An audio signal decoding device includes a non-transitory memory storage stores audio data in a form of a bitstream; and an audio decoder, by which a first spectral coefficient of a first sub-band of a current frame of an audio signal by decoding the bitstream is obtained; a first average quantity of allocated bits per spectral coefficient of the first sub-band is obtained; a first noise filling gain for the first sub-band is obtained when the first average quantity is less than a threshold; a second spectral coefficient is reconstructed according to the first noise filling gain; a frequency domain audio signal is obtained according to the first spectral coefficient and the second spectral coefficient; and a time domain audio signal is generated according to the frequency domain signal.
-
公开(公告)号:US10607621B2
公开(公告)日:2020-03-31
申请号:US16502332
申请日:2019-07-03
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zexin Liu , Lei Miao , Fengyan Qi
IPC: G10L19/12 , G10L19/02 , G10L21/038 , G10L19/08
Abstract: A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.
-
公开(公告)号:US10600430B2
公开(公告)日:2020-03-24
申请号:US15864147
申请日:2018-01-08
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zexin Liu , Lei Miao , Fengyan Qi
Abstract: In an audio signal decoding method, a decoded frequency domain signal of a current frame of the audio signal is obtained by decoding a received bitstream; a predicted frequency domain signal of the current frame is obtained according to the decoded frequency domain signal the current frame when the decoded frequency domain signal meets anyone of two given conditions; and a time domain signal of the current frame is obtained according to the decoded frequency domain signal and the predicted frequency domain signal.
-
公开(公告)号:US10546592B2
公开(公告)日:2020-01-28
申请号:US15981645
申请日:2018-05-16
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Fengyan Qi , Zexin Liu , Lei Miao
IPC: G10L19/002 , G10L19/032 , G10L19/028 , G10L19/02
Abstract: A audio signal encoding method includes: dividing a frequency band of an audio signal into a plurality of sub-bands, and quantifying a sub-band normalization factor of each sub-band; determining signal bandwidth of bit allocation according to the quantified sub-band normalization factor, or according to the quantified sub-band normalization factor and bit rate information; allocating bits for a sub-band within the determined signal bandwidth; and coding a spectrum coefficient of the audio signal according to the bits allocated for each sub-band. According to embodiments of the present disclosure, during coding and decoding, signal bandwidth of bit allocation is determined according to the quantified sub-band normalization factor and bit rate information. In this manner, the determined signal bandwidth is effectively coded and decoded by centralizing the bits, and audio quality is improved.
-
公开(公告)号:US10249315B2
公开(公告)日:2019-04-02
申请号:US15467356
申请日:2017-03-23
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Fengyan Qi , Lei Miao
IPC: G10L19/00 , G10L25/00 , G10L25/90 , G10L21/013 , G10L21/028
Abstract: A method and an apparatus for detecting correctness of a pitch period. The method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal; determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal; and determining correctness of the initial pitch period according to the pitch period correctness decision parameter. The method and apparatus for detecting correctness of a pitch period according to the embodiments of the present invention can improve, based on a relatively less complex algorithm, accuracy of detecting correctness of a pitch period.
-
公开(公告)号:US20170323652A1
公开(公告)日:2017-11-09
申请号:US15662302
申请日:2017-07-28
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Yang Gao , Fengyan Qi
Abstract: System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
-
公开(公告)号:US09786293B2
公开(公告)日:2017-10-10
申请号:US15358649
申请日:2016-11-22
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zexin Liu , Lei Miao , Fengyan Qi
CPC classification number: G10L19/167 , G10L19/002 , G10L19/0204 , G10L19/06 , G10L19/26 , H04L5/06 , H04L27/2602
Abstract: In a signal coding method, bits for coding allocated to different bands of a frequency domain signal obtained from an input signal are adjusted to improve the coding quality. The total available bits for coding are first allocated to the bands of the frequency domain signal according to a predetermined allocation rule. The numbers of bits allocated to the respective bands of the frequency domain signal are then adjusted when a highest frequency of the frequency domain signal to which bits are allocated is greater than a predetermined value. The frequency domain signal is coded according to the adjusted bit allocation for the bands of the frequency domain signal.
-
公开(公告)号:US09741357B2
公开(公告)日:2017-08-22
申请号:US14744452
申请日:2015-06-19
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yang Gao , Fengyan Qi
Abstract: System and method embodiments are provided for very short pitch detection and coding for speech or audio signals. The system and method include detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in time domain and detecting a lack of low frequency energy in the speech or audio signal in frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
-
公开(公告)号:US09672830B2
公开(公告)日:2017-06-06
申请号:US13632905
申请日:2012-10-01
Applicant: Huawei Technologies Co., Ltd.
Inventor: Fengyan Qi , Lei Miao
CPC classification number: G10L19/0017 , G10L19/24 , H04B7/0647
Abstract: A voice signal encoding and decoding method, device, and codec system are provided. The coding method includes: encoding an input voice signal to obtain a broadband code stream, where the broadband code stream includes a core layer bit stream and an extension enhancement layer bit stream (101); compressing the core layer bit stream to obtain a compressed code stream (102); and packing the compressed code stream and the extension enhancement layer bit stream to obtain a packed code stream (103). The core layer bit stream is compressed, and the compressed code stream and the extension enhancement layer bit stream are packed, thereby reducing transmission bandwidth occupied by the input voice signal. Since the broadband voice encoding is performed on the input voice signal, a broadband voice code stream is transmitted by using narrowband transmission bandwidth, thereby improving the cost performance of voice signal transmission.
-
10.
公开(公告)号:US09530420B2
公开(公告)日:2016-12-27
申请号:US14675031
申请日:2015-03-31
Applicant: Huawei Technologies Co., Ltd.
Inventor: Fengyan Qi , Zexin Liu , Lei Miao
IPC: G10L19/002 , G10L19/035 , G10L19/02
CPC classification number: G10L19/002 , G10L19/0204 , G10L19/032 , G10L19/035
Abstract: A method and an apparatus for allocating bits of an audio signal. The method includes dividing a frequency band of an audio signal into multiple sub-bands, and quantizing a sub-band normalization factor of each sub-band; classifying the multiple sub-bands into multiple groups, and acquiring a sum of intra-group sub-band normalization factors of each group; performing initial inter-group bit allocation to determine the initial number of bits of each group; performing secondary inter-group bit allocation to allocate coding bits of the audio signal to at least one group; and allocating the bits of the audio signal to sub-bands in the group. The present invention can, by means of grouping, ensure relatively stable allocation in a previous frame and a next frame and reduce an impact of global allocation on local discontinuity in a case of low and medium bit rates.
Abstract translation: 一种用于分配音频信号的位的方法和装置。 该方法包括将音频信号的频带划分成多个子带,并量化每个子带的子带归一化因子; 将多个子带分为多个组,并获取每组的组内子带归一化因子之和; 执行初始组间比特分配以确定每组的初始比特数; 执行次要组间比特分配以将音频信号的编码比特分配给至少一个组; 并将音频信号的比特分配给组中的子带。 本发明可以通过分组确保在先前帧和下一帧中的相对稳定的分配,并且在低和中比特率的情况下减少全局分配对局部不连续性的影响。
-
-
-
-
-
-
-
-
-