-
Publication number: US08818811B2
Publication date: 2014-08-26
Application number: US13924637
Filing date: 2013-06-24
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
CPC classification number: G10L25/93 , G10L25/78 , G10L2025/786
Abstract: This application relates to a voice activity detection (VAD) apparatus configured to provide a voice activity detection decision for an input audio signal. The VAD apparatus includes a state detector and a voice activity calculator. The state detector is configured to determine, based on the input audio signal, a current working state of the VAD apparatus among at least two different working states. Each of the at least two different working states is associated with a corresponding working state parameter decision set which includes at least one voice activity decision parameter. The voice activity calculator is configured to calculate a voice activity detection parameter value for the at least one voice activity decision parameter of the working state parameter decision set associated with the current working state, and to provide the voice activity detection decision by comparing the calculated voice activity detection parameter value with a threshold.
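Illustrative sketch (not taken from the patent): the abstract above describes choosing a decision parameter according to the current working state and then comparing it with a threshold. The Python below is a minimal sketch of that flow under stated assumptions. The two states ("quiet" and "noisy"), the parameters (`frame_energy`, `peak_amplitude`), the thresholds, and the use of a noise-floor estimate to pick the state are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence

Frame = Sequence[float]


def frame_energy(frame: Frame) -> float:
    """Mean squared amplitude of the frame (hypothetical decision parameter)."""
    return sum(x * x for x in frame) / len(frame)


def peak_amplitude(frame: Frame) -> float:
    """Largest absolute sample value (hypothetical decision parameter)."""
    return max(abs(x) for x in frame)


@dataclass(frozen=True)
class DecisionSet:
    """One working state's decision parameter and its threshold."""
    parameter: Callable[[Frame], float]
    threshold: float


# Hypothetical working states; the abstract only requires at least two.
DECISION_SETS: Dict[str, DecisionSet] = {
    "quiet": DecisionSet(parameter=frame_energy, threshold=0.01),
    "noisy": DecisionSet(parameter=peak_amplitude, threshold=0.5),
}


def detect_state(noise_floor: float) -> str:
    """Assumed state detector: switch on an externally estimated noise floor."""
    return "noisy" if noise_floor > 0.05 else "quiet"


def vad_decision(frame: Frame, noise_floor: float) -> bool:
    """Return True when the frame is judged to contain voice activity."""
    decision_set = DECISION_SETS[detect_state(noise_floor)]
    value = decision_set.parameter(frame)
    return value > decision_set.threshold
```

The same frame can yield different decisions in different working states, because each state applies its own parameter and threshold.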
-
Publication number: US12198706B2
Publication date: 2025-01-14
Application number: US17969454
Filing date: 2022-10-19
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Bingyin Xia , Jiawei Li , Zhe Wang
Abstract: An audio signal coding method is provided that includes: obtaining a current frame of an audio signal; obtaining a coding parameter based on a power spectrum ratio of a current frequency in a current frequency area of at least a part of signals of the current frame, where the coding parameter indicates tonal component information of the at least a part of signals, the tonal component information includes at least one of location information of a tonal component, quantity information of tonal components, amplitude information of the tonal component, or energy information of the tonal component, and the power spectrum ratio of the current frequency is a ratio of a value of a power spectrum of the current frequency to a mean value of power spectrums of the current frequency area; and performing bitstream multiplexing on the coding parameter to obtain a coded bitstream.
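Illustrative sketch (not taken from the patent): the abstract defines the power spectrum ratio of a frequency as its power spectrum value divided by the mean power spectrum value of its frequency area. The Python below computes that ratio for one frequency area. The function names and the `ratio_threshold` used to flag candidate tonal components are assumptions made for illustration; the abstract does not state how the ratio is turned into a coding parameter.

```python
from typing import List, Sequence


def power_spectrum_ratios(power: Sequence[float]) -> List[float]:
    """Ratio of each bin's power to the mean power of the frequency area."""
    mean_power = sum(power) / len(power)
    if mean_power == 0.0:
        # A silent area has no meaningful ratio; report zeros.
        return [0.0] * len(power)
    return [p / mean_power for p in power]


def tonal_candidates(power: Sequence[float], ratio_threshold: float = 4.0) -> List[int]:
    """Indices whose ratio exceeds a hypothetical threshold."""
    ratios = power_spectrum_ratios(power)
    return [k for k, r in enumerate(ratios) if r > ratio_threshold]


# One frequency area with a single strong peak at index 3.
area = [1.0, 1.0, 1.0, 13.0, 1.0, 1.0, 1.0, 1.0]
peaks = tonal_candidates(area)
```

In this example the mean power of the area is 2.5, so the peak bin has a ratio of about 5.2 while every other bin has a ratio of 0.4.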
-
Publication number: US11922954B2
Publication date: 2024-03-05
Application number: US17232679
Filing date: 2021-04-16
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
IPC: G10L19/008 , G10L19/00 , G10L19/012 , H04S3/00 , G10L19/24 , G10L25/78
CPC classification number: G10L19/008 , G10L19/00 , G10L19/012 , H04S3/008 , G10L19/24 , G10L25/78 , H04S2400/03
Abstract: An encoder includes a signal detection circuit and a signal encoding circuit. The signal encoding circuit is configured to encode the Nth-frame downmixed signal when the signal detection circuit detects that an Nth-frame downmixed signal includes a speech signal, or when the signal detection circuit detects that the Nth-frame downmixed signal does not include a speech signal, encode the Nth-frame downmixed signal when the signal detection circuit determines that the Nth-frame downmixed signal satisfies a preset audio frame encoding condition, or skip encoding the Nth-frame downmixed signal when the signal detection circuit determines that the Nth-frame downmixed signal does not satisfy a preset audio frame encoding condition.
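Illustrative sketch (not taken from the patent): the three cases in the abstract above reduce to a small decision rule. A frame with speech is always encoded; a frame without speech is encoded only if it satisfies the preset audio frame encoding condition, and is skipped otherwise. The Python below sketches that rule. The function and parameter names are hypothetical, and the preset condition itself is left as an input because the abstract does not define it.

```python
from typing import Iterable, List, Tuple


def should_encode(contains_speech: bool, satisfies_encoding_condition: bool) -> bool:
    """Decide whether the Nth-frame downmixed signal is encoded or skipped."""
    if contains_speech:
        # Frames that contain speech are always encoded.
        return True
    # Non-speech frames are encoded only when the preset condition holds.
    return satisfies_encoding_condition


def encoding_plan(frames: Iterable[Tuple[bool, bool]]) -> List[str]:
    """Map (contains_speech, satisfies_condition) pairs to 'encode' or 'skip'."""
    return ["encode" if should_encode(s, c) else "skip" for s, c in frames]


plan = encoding_plan([(True, False), (False, True), (False, False)])
```

Here `plan` marks the first two frames for encoding and the third for skipping.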
-
Publication number: US20220343926A1
Publication date: 2022-10-27
Application number: US17862712
Filing date: 2022-07-12
Applicant: Huawei Technologies Co., Ltd.
Inventor: Bingyin Xia , Jiawei Li , Zhe Wang
Abstract: An audio decoding method includes obtaining an encoded bitstream; performing bitstream demultiplexing on the encoded bitstream, to obtain a high frequency band parameter of a current frame of an audio signal, wherein the high frequency band parameter indicates a location, a quantity, and an amplitude or energy of a tone component comprised in a high frequency band signal of the current frame; obtaining a reconstructed high frequency band signal of the current frame based on the high frequency band parameter; and obtaining an audio output signal of the current frame based on the reconstructed high frequency band signal of the current frame.
-
Publication number: US20220199111A1
Publication date: 2022-06-23
Application number: US17692640
Filing date: 2022-03-11
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: An audio signal classification method includes determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory, and updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory, and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.
-
Publication number: US10984807B2
Publication date: 2021-04-20
Application number: US16781421
Filing date: 2020-02-04
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
IPC: G10L19/00 , G10L19/02 , H04S3/00 , G10L19/008 , G10L19/012 , G10L25/78 , G10L19/24
Abstract: An encoder includes a signal detection circuit and a signal encoding circuit. The signal encoding circuit is configured to encode the Nth-frame downmixed signal when the signal detection circuit detects that an Nth-frame downmixed signal includes a speech signal, or when the signal detection circuit detects that the Nth-frame downmixed signal does not include a speech signal, encode the Nth-frame downmixed signal when the signal detection circuit determines that the Nth-frame downmixed signal satisfies a preset audio frame encoding condition, or skip encoding the Nth-frame downmixed signal when the signal detection circuit determines that the Nth-frame downmixed signal does not satisfy a preset audio frame encoding condition.
-
Publication number: US20210074312A1
Publication date: 2021-03-11
Application number: US17027025
Filing date: 2020-09-21
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: A method for detecting voice activity in an input audio signal composed of frames includes determining a noise characteristic of the input audio signal based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise characteristic of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.
-
Publication number: US10827327B2
Publication date: 2020-11-03
Application number: US16414743
Filing date: 2019-05-16
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zhe Wang , Jun Zhang , Guangri Chen
IPC: H04W4/46 , H04W8/24 , H04H20/55 , H04W16/26 , H04L12/18 , H04W40/20 , H04W84/18 , H04W4/12 , H04W40/12 , H04W84/00 , H04W4/06 , H04W88/04 , H04L29/08 , H04L12/26 , H04W4/02
Abstract: Embodiments of this application disclose a relay transmission method and system, and a related device. The method includes: determining, by a relay terminal, a vehicle existing between a vehicle terminal i and a vehicle terminal j based on vehicle location information in received broadcast messages sent by N vehicle terminals; and when it is obtained through calculation that link quality of a communication link Rij used when the vehicle terminal i and the vehicle terminal j communicate with each other is lower than a preset threshold, forwarding, to the vehicle terminal j, a broadcast message sent by the vehicle terminal i, and forwarding, to the vehicle terminal i, a broadcast message sent by the vehicle terminal j. A relay forwarding function of the relay terminal avoids communication interruption or communication distance limitation caused by blocking due to dynamic and unpredictable factors such as a large vehicle within the Internet of vehicles, thereby improving reliability of message transmission within the Internet of vehicles.
-
Publication number: US10734003B2
Publication date: 2020-08-04
Application number: US16168252
Filing date: 2018-10-23
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
IPC: G10L19/00 , G10L19/012 , G10L19/08 , G10L19/06 , G10L19/26 , G10L19/02 , G10L19/032
Abstract: A linear prediction-based noise signal processing method includes obtaining a linear prediction coefficient of the noise signal, filtering a signal derived from the noise signal based on the linear prediction coefficient in order to obtain a linear prediction residual signal, obtaining excitation energy of the linear prediction residual signal and a spectral envelope of the linear prediction residual signal, and encoding the spectral envelope, the excitation energy, and the linear prediction coefficient.
-
Publication number: US20190279657A1
Publication date: 2019-09-12
Application number: US16391893
Filing date: 2019-04-23
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: A method for detecting an audio signal and an apparatus, where the method includes determining an input audio signal as a to-be-determined audio signal, determining an enhanced segmental signal-to-noise ratio (SSNR) of the audio signal, where the enhanced SSNR is greater than a reference SSNR, and comparing the enhanced SSNR with a voice activity detection (VAD) decision threshold to determine whether the audio signal is an active signal. Therefore, the method and the apparatus can accurately distinguish an active voice and an inactive voice.