-
公开(公告)号:US11887610B2
公开(公告)日:2024-01-30
申请号:US17862712
申请日:2022-07-12
Applicant: Huawei Technologies Co., Ltd.
Inventor: Bingyin Xia , Jiawei Li , Zhe Wang
Abstract: An audio decoding method includes obtaining an encoded bitstream; performing bitstream demultiplexing on the encoded bitstream, to obtain a high frequency band parameter of a current frame of an audio signal, wherein the high frequency band parameter indicates a location, a quantity, and an amplitude or energy of a tone component comprised in a high frequency band signal of the current frame; obtaining a reconstructed high frequency band signal of the current frame based on the high frequency band parameter; and obtaining an audio output signal of the current frame based on the reconstructed high frequency band signal of the current frame.
-
72.
公开(公告)号:US20240029757A1
公开(公告)日:2024-01-25
申请号:US18360675
申请日:2023-07-27
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: An audio signal classification method includes determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory, and updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory, and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.
-
公开(公告)号:US20230186924A1
公开(公告)日:2023-06-15
申请号:US18154486
申请日:2023-01-13
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhi Wang , Jiance Ding , Bin Wang , Zhe Wang
IPC: G10L19/008 , G10L25/21 , G10L19/002
CPC classification number: G10L19/008 , G10L25/21 , G10L19/002
Abstract: A multi-channel audio signal coding method includes obtaining a to-be-encoded first audio frame, pairing at least five channel signals according to a first pairing manner to obtain a first channel pair set, obtaining a first sum of correlation values of the first channel pair set, where one channel pair has one correlation value, pairing the at least five channel signals according to a second pairing manner to obtain a second channel pair set, obtaining a second sum of correlation values of the second channel pair set, determining a target pairing manner of the at least five channel signals based on the first sum of correlation values and the second sum of correlation values, and encoding the at least five channel signals based on a channel pair set corresponding to the target pairing manner, where the target pairing manner is the first pairing manner or the second pairing manner.
-
公开(公告)号:US20230137053A1
公开(公告)日:2023-05-04
申请号:US18072038
申请日:2022-11-30
Applicant: Huawei Technologies Co., Ltd.
Inventor: Bingyin Xia , Jiawei Li , Zhe Wang
IPC: G10L19/02
Abstract: An audio coding method includes obtaining a current frame that includes a high-frequency band signal and a low-frequency band signal; performing first coding on the high-frequency band signal and the low-frequency band signal to obtain a first coding parameter; determining a spectrum reservation flag of each frequency bin of the high-frequency band signal, where the spectrum reservation flag indicates whether a first spectrum corresponding to the frequency bin is reserved in a second spectrum corresponding to the frequency bin; and performing second coding on the high-frequency band signal based on the spectrum reservation flag of each frequency bin of the high-frequency band signal to obtain a second coding parameter, where the second coding parameter indicates information about a target tonal component of the high-frequency band signal.
-
公开(公告)号:US20230040515A1
公开(公告)日:2023-02-09
申请号:US17969454
申请日:2022-10-19
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Bingyin Xia , Jiawei Li , Zhe Wang
Abstract: An audio signal coding method is provided that includes: obtaining a current frame of an audio signal; obtaining a coding parameter based on a power spectrum ratio of a current frequency in a current frequency area of at least a part of signals of the current frame, where the coding parameter indicates tonal component information of the at least a part of signals, the tonal component information includes at least one of location information of a tonal component, quantity information of tonal components, amplitude information of the tonal component, or energy information of the tonal component, and the power spectrum ratio of the current frequency is a ratio of a value of a power spectrum of the current frequency to a mean value of power spectrums of the current frequency area; and performing bitstream multiplexing on the coding parameter to obtain a coded bitstream.
-
公开(公告)号:US11430461B2
公开(公告)日:2022-08-30
申请号:US17027025
申请日:2020-09-21
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: A method for detecting a voice activity in an input audio signal composed of frames includes that a noise characteristic of the input signal is determined based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise characteristic of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.
-
77.
公开(公告)号:US11289113B2
公开(公告)日:2022-03-29
申请号:US16723584
申请日:2019-12-20
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: A linear prediction residual energy tilt-based audio signal classification method and apparatus, where the method includes: determining, according to voice activity of a current audio frame, whether to obtain a linear prediction residual energy tilt of a current audio frame of the current audio frame and store a frequency spectrum fluctuation of the current frame in a frequency spectrum fluctuation memory, where the linear prediction residual energy tilt denotes an extent to which an audio signal's linear prediction residual energy changes as a linear prediction order inscreases; updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory; and classifying the current audio frame as a speech frame or a music frame according to statistics of some or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.
-
公开(公告)号:US11074922B2
公开(公告)日:2021-07-27
申请号:US16439954
申请日:2019-06-13
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: An audio encoding method includes dividing an energy spectrum of a current audio frame into P FFT energy spectrum coefficients; determining a minimum bandwidth of distribution, on spectrum, of first-preset-proportion energy of the current audio frame according to the energy of the P FFT energy spectrum coefficients of the current audio frame, wherein the minimum bandwidth of distribution, on spectrum, of first preset proportion energy of the current audio frame indicates sparseness of distribution, on the spectrum, of energy of the current audio frame; and determining to use a linear-prediction-based encoding method to encode the current audio frame in response to the minimum bandwidth of distribution is greater than a first preset value.
-
公开(公告)号:US20200327045A1
公开(公告)日:2020-10-15
申请号:US16911722
申请日:2020-06-25
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang , Ning Li , Dangfei Zhao
IPC: G06F11/36
Abstract: This application includes a case generation unit and a case execution unit. The case generation unit obtains an access request received by to-be-tested software, determines a value range of a request parameter in the access request based on the access request, and generates test cases for the access request based on the value range. Subsequently, the case execution unit executes the test cases for the to-be-tested software.
-
公开(公告)号:US10796712B2
公开(公告)日:2020-10-06
申请号:US16191914
申请日:2018-11-15
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhe Wang
Abstract: The disclosure provides a method and an apparatus for detecting a voice activity in an input audio signal composed of frames. A noise characteristic of the input signal is determined based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise characteristic of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.
-
-
-
-
-
-
-
-
-