-
公开(公告)号:US20210176583A1
公开(公告)日:2021-06-10
申请号:US17179619
申请日:2021-02-19
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Gavin KEARNEY , Cal ARMSTRONG , Bin WANG , Zexin LIU
Abstract: An audio processing method includes: M audio signals are obtained by processing an audio signal by M virtual speakers; M first HRTFs and M second HRTFs are obtained, where the M first HRTFs corresponding to a left ear position, and the M second HRTFs corresponding to a right ear position; high-band impulse responses of some of the M first HRTFs are modified to obtain modified first target HRTFs, and high-band impulse responses of some of the M second HRTFs are modified to obtain modified second target HRTFs; a first target audio signal corresponding to the left ear position is obtained based on the modified first target HRTFs and un-modified first HRTFs, and the M audio signals; and a second target audio signal corresponding to the right ear position is obtained based on the modified second HRTFs, un-modified second target HRTFs, and the M audio signals.
-
公开(公告)号:US20190066698A1
公开(公告)日:2019-02-28
申请号:US16149758
申请日:2018-10-02
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
IPC: G10L19/002 , G10L19/032 , G10L19/02
Abstract: The present disclosure provide a signal processing method and apparatus. The method includes: determining a total quantity of to-be-allocated bits corresponding to a current frame; implementing primary bit allocation on to-be-processed sub-bands; performing a primary information unit quantity determining operation for each sub-band that has undergone the primary bit allocation; selecting sub-bands for secondary bit allocation from the to-be-processed sub-bands according to at least one of a sub-band characteristic of each sub-band of the to-be-processed sub-bands or the total quantity of surplus bits; implementing secondary bit allocation on the sub-bands for secondary bit allocation; and performing a secondary information unit quantity determining operation for each sub-band of the sub-bands for secondary bit allocation.
-
公开(公告)号:US20160118055A1
公开(公告)日:2016-04-28
申请号:US14985831
申请日:2015-12-31
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
IPC: G10L19/005 , G10L21/0388 , G10L21/0232 , G10L19/02
CPC classification number: G10L19/005 , G10L19/0208 , G10L21/0232 , G10L21/0388
Abstract: Embodiments of the present disclosure provide a decoding method and a decoding apparatus. The decoding method includes: in a case in which it is determined that a current frame is a lost frame, synthesizing a high frequency band signal; determining subframe gains of multiple subframes of the current frame; determining a global gain of the current frame; and adjusting, according to the global gain and the subframe gains of the multiple subframes, the synthesized high frequency band signal to obtain a high frequency band signal of the current frame. A subframe gain of the current frame is obtained according to a gradient between subframe gains of subframes previous to the current frame, so that transition before and after frame loss is more continuous, thereby reducing noise during signal reconstruction, and improving speech quality.
Abstract translation: 本公开的实施例提供了一种解码方法和解码装置。 解码方法包括:在确定当前帧是丢失帧的情况下,合成高频带信号; 确定当前帧的多个子帧的子帧增益; 确定当前帧的全局增益; 并且根据多个子帧的全局增益和子帧增益来调整合成的高频带信号,以获得当前帧的高频带信号。 根据当前帧之前的子帧的子帧增益之间的梯度获得当前帧的子帧增益,使得在帧丢失之前和之后的转换更连续,从而减少信号重建期间的噪声,并提高语音质量。
-
公开(公告)号:US20150010021A1
公开(公告)日:2015-01-08
申请号:US14496986
申请日:2014-09-25
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
Inventor: Zexin LIU , Lei MIAO , Fengyan QI
CPC classification number: G10L19/167 , G10L19/002 , G10L19/0204 , G10L19/06 , G10L19/26 , H04L5/06 , H04L27/2602
Abstract: In a signal coding method, bits for coding allocated to different bands of a frequency domain signal obtained from an input signal are adjusted to improve the coding quality. The total available bits for coding are first allocated to the bands of the frequency domain signal according to a predetermined allocation rule. The numbers of bits allocated to the respective bands of the frequency domain signal are then adjusted when a highest frequency of the frequency domain signal to which bits are allocated is greater than a predetermined value. The frequency domain signal is coded according to the adjusted bit allocation for the bands of the frequency domain signal.
Abstract translation: 在信号编码方法中,调整分配给从输入信号获得的频域信号的不同频带的编码比特,以提高编码质量。 根据预定的分配规则,首先将用于编码的总可用比特分配给频域信号的频带。 然后,当分配比特的频域信号的最高频率大于预定值时,调整分配给频域信号的各个频带的比特数。 根据针对频域信号的频带的调整后的比特分配对频域信号进行编码。
-
公开(公告)号:US20150006163A1
公开(公告)日:2015-01-01
申请号:US14470559
申请日:2014-08-27
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
IPC: G10L19/00
CPC classification number: G10L19/00 , G10L19/0204 , G10L19/083
Abstract: The present invention discloses a speech/audio signal processing method and apparatus. In an embodiment, the speech/audio signal processing method includes: when a speech/audio signal switches bandwidth, obtaining an initial high frequency signal corresponding to a current frame of speech/audio signal; obtaining a time-domain global gain parameter of the initial high frequency signal; performing weighting processing on an energy ratio and the time-domain global gain parameter, and using an obtained weighted value as a predicted global gain parameter, where the energy ratio is a ratio between energy of a historical frame of high frequency time-domain signal and energy of a current frame of initial high frequency signal; correcting the initial high frequency signal by using the predicted global gain parameter, to obtain a corrected high frequency time-domain signal; and synthesizing a current frame of narrow frequency time-domain signal and the corrected high frequency time-domain signal and outputting the synthesized signal.
Abstract translation: 本发明公开了一种语音/音频信号处理方法和装置。 在一个实施例中,语音/音频信号处理方法包括:当语音/音频信号切换带宽时,获得对应于当前语音/音频信号帧的初始高频信号; 获得初始高频信号的时域全局增益参数; 对能量比进行加权处理和时域全局增益参数,并使用获得的加权值作为预测的全局增益参数,其中能量比是高频时域信号的历史帧的能量与 初始高频信号当前帧的能量; 通过使用预测的全局增益参数来校正初始高频信号,以获得校正的高频时域信号; 并合成窄频时域信号的当前帧和经校正的高频时域信号并输出合成信号。
-
公开(公告)号:US20230131892A1
公开(公告)日:2023-04-27
申请号:US18069573
申请日:2022-12-21
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Xingtao ZHANG , Haiting LI , Zexin LIU , Lei MIAO
IPC: G10L19/008 , G10L19/032 , H04S3/00
Abstract: The present disclosure discloses an inter-channel phase difference parameter encoding method, where a current frame is obtained; a signal type and a previous IPD parameter encoding scheme of a previous frame are obtained; a current IPD parameter encoding scheme is obtained at least based on the signal type of the previous frame and the previous IPD parameter encoding scheme; and an IPD parameter of the current frame is processed based on the current IPD parameter encoding scheme.
-
公开(公告)号:US20210250723A1
公开(公告)日:2021-08-12
申请号:US17240655
申请日:2021-04-26
Applicant: Huawei Technologies Co., Ltd.
Inventor: Bin WANG , Zexin LIU , Risheng XIA
Abstract: This application provides an audio rendering method, including: obtaining a to-be-rendered BRIR signal, where an elevation angle corresponding to the to-be-rendered BRIR signal is 0 degrees; obtaining a direct sound signal based on the to-be-rendered BRIR signal; correcting, based on a target elevation angle, a frequency-domain signal corresponding to the direct sound signal, to obtain a frequency-domain signal corresponding to the target elevation angle; obtaining a time-domain signal based on the frequency-domain signal of the target elevation angle; and superposing the time-domain signal on a signal that is in the to-be-rendered BRIR signal and that is in a second time period after a first time period, to obtain a BRIR signal of the target elevation angle.
-
公开(公告)号:US20190318747A1
公开(公告)日:2019-10-17
申请号:US16457165
申请日:2019-06-28
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
IPC: G10L19/00 , G10L19/083
Abstract: The present invention discloses a speech/audio signal processing method and apparatus. In an embodiment, the speech/audio signal processing method includes: when a speech/audio signal switches bandwidth, obtaining an initial high frequency signal corresponding to a current frame of speech/audio signal; obtaining a time-domain global gain parameter of the initial high frequency signal; performing weighting processing on an energy ratio and the time-domain global gain parameter, and using an obtained weighted value as a predicted global gain parameter, where the energy ratio is a ratio between energy of a historical frame of high frequency time-domain signal and energy of a current frame of initial high frequency signal; correcting the initial high frequency signal by using the predicted global gain parameter, to obtain a corrected high frequency time-domain signal; and synthesizing a current frame of narrow frequency time-domain signal and the corrected high frequency time-domain signal and outputting the synthesized signal.
-
9.
公开(公告)号:US20160111105A1
公开(公告)日:2016-04-21
申请号:US14981923
申请日:2015-12-29
Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
IPC: G10L19/038 , G10L19/06
CPC classification number: G10L19/038 , G10L19/06 , G10L2019/0005
Abstract: Embodiments of the present application proposes a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2B1 portions; determining, according to the code word corresponding to the quantized first vector; determining a second codebook according to the codebook of the ith portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.
Abstract translation: 本申请的实施例提出了一种频率包络矢量量化方法和装置,其中该方法包括:将一帧中的N个频率包络分成N1个矢量; 通过使用第一码本量化N1矢量中的第一矢量,以获得与量化的第一矢量相对应的码字,其中第一码本分为2B1个部分; 根据与量化的第一矢量对应的代码字来确定; 根据第i部分的码本确定第二码本; 并且基于第二码本对N1个矢量中的第二矢量进行量化。 在本申请的实施例中,可以通过使用具有较小位数的码本来对频率包络向量执行矢量量化。 因此,可以减少矢量量化的复杂度,并且也可以确保矢量量化的效果。
-
公开(公告)号:US20240249731A1
公开(公告)日:2024-07-25
申请号:US18603770
申请日:2024-03-13
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Haiting LI , Bin WANG , Zexin LIU
IPC: G10L19/008 , G10L19/00 , G10L19/02 , G10L19/08
CPC classification number: G10L19/008 , G10L19/0208 , G10L19/08 , G10L2019/0001
Abstract: An audio signal encoding method is provided. According to the method, if a current frame is a switching frame, a to-be-encoded downmixed signal and a to-be-encoded residual signal of the subband corresponding to the preset frequency band in the current frame are obtained based on a switch fade-in/fade-out factor of a previous frame, an initial downmixed signal and an initial residual signal of the preset frequency band of the current frame.
-
-
-
-
-
-
-
-
-