Method and apparatus for performing voice activity detection
    Granted invention patent (status: in force)

    Publication number: US08818811B2

    Publication date: 2014-08-26

    Application number: US13924637

    Application date: 2013-06-24

    Inventor: Zhe Wang

    CPC classification number: G10L25/93 G10L25/78 G10L2025/786

    Abstract: This application relates to a voice activity detection (VAD) apparatus configured to provide a voice activity detection decision for an input audio signal. The VAD apparatus includes a state detector and a voice activity calculator. The state detector is configured to determine, based on the input audio signal, a current working state of the VAD apparatus among at least two different working states. Each of the at least two different working states is associated with a corresponding working state parameter decision set which includes at least one voice activity decision parameter. The voice activity calculator is configured to calculate a voice activity detection parameter value for the at least one voice activity decision parameter of the working state parameter decision set associated with the current working state, and to provide the voice activity detection decision by comparing the calculated voice activity detection parameter value with a threshold.
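
The state-dependent decision described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the two state names, the noise-level rule that selects the state, the parameters `snr_db` and `peak_over_noise_db`, and all thresholds are hypothetical, and the noise level is assumed to have been estimated from the input signal elsewhere.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

Frame = Sequence[float]


def energy_db(frame: Frame) -> float:
    """Mean-square energy of the frame in dB (floored to avoid log(0))."""
    mean_square = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(mean_square + 1e-12)


def snr_db(frame: Frame, noise_db: float) -> float:
    """Hypothetical decision parameter: frame energy above the noise level."""
    return energy_db(frame) - noise_db


def peak_over_noise_db(frame: Frame, noise_db: float) -> float:
    """Hypothetical decision parameter: peak level above the noise level."""
    peak = max(abs(x) for x in frame)
    return 20.0 * math.log10(peak + 1e-6) - noise_db


@dataclass(frozen=True)
class DecisionParameter:
    name: str
    compute: Callable[[Frame, float], float]
    threshold: float


# One parameter decision set per working state (names and values are assumptions).
DECISION_SETS: Dict[str, List[DecisionParameter]] = {
    "low_noise": [DecisionParameter("snr_db", snr_db, 6.0)],
    "high_noise": [
        DecisionParameter("snr_db", snr_db, 10.0),
        DecisionParameter("peak_over_noise_db", peak_over_noise_db, 18.0),
    ],
}


def detect_state(noise_db: float) -> str:
    """State detector: assumed rule based on an estimated noise level."""
    return "low_noise" if noise_db < -50.0 else "high_noise"


def vad_decision(frame: Frame, noise_db: float) -> bool:
    """Compute every parameter of the current state's set and compare with its threshold."""
    state = detect_state(noise_db)
    return all(p.compute(frame, noise_db) > p.threshold for p in DECISION_SETS[state])
```

With these assumed values, the same frame can be judged active in the low-noise state and inactive in the high-noise state, because each state applies its own parameter set and thresholds.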

    Audio signal coding method and apparatus

    Publication number: US12198706B2

    Publication date: 2025-01-14

    Application number: US17969454

    Application date: 2022-10-19

    Abstract: An audio signal coding method is provided that includes: obtaining a current frame of an audio signal; obtaining a coding parameter based on a power spectrum ratio of a current frequency in a current frequency area of at least a part of signals of the current frame, where the coding parameter indicates tonal component information of the at least a part of signals, the tonal component information includes at least one of location information of a tonal component, quantity information of tonal components, amplitude information of the tonal component, or energy information of the tonal component, and the power spectrum ratio of the current frequency is a ratio of a value of a power spectrum of the current frequency to a mean value of power spectrums of the current frequency area; and performing bitstream multiplexing on the coding parameter to obtain a coded bitstream.
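
The power spectrum ratio defined in this abstract (the power at the current frequency divided by the mean power of the current frequency area) can be sketched as below. The ratio itself follows the abstract; the rule that a bin counts as a tonal component when its ratio exceeds `ratio_threshold`, the threshold value, and the function names are assumptions made for illustration only.

```python
from typing import Dict, List, Sequence


def power_spectrum_ratios(power: Sequence[float], start: int, stop: int) -> List[float]:
    """Ratio of each bin's power to the mean power of the area [start, stop)."""
    area = power[start:stop]
    mean_power = sum(area) / len(area)
    if mean_power == 0.0:
        return [0.0] * len(area)  # silent area: no bin stands out
    return [p / mean_power for p in area]


def tonal_parameters(
    power: Sequence[float], start: int, stop: int, ratio_threshold: float = 4.0
) -> Dict[str, list]:
    """Collect location, quantity, and energy of bins flagged as tonal (assumed rule)."""
    ratios = power_spectrum_ratios(power, start, stop)
    locations = [start + i for i, r in enumerate(ratios) if r > ratio_threshold]
    return {
        "quantity": len(locations),
        "locations": locations,
        "energies": [power[k] for k in locations],
    }
```

For example, `tonal_parameters([1, 1, 1, 20, 1, 1, 1, 1], 0, 8)` flags only bin 3, whose power is well above the area mean. Bitstream multiplexing of the resulting parameters is not shown.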

    Audio Encoding and Decoding Method and Audio Encoding and Decoding Device

    Publication number: US20220343926A1

    Publication date: 2022-10-27

    Application number: US17862712

    Application date: 2022-07-12

    Abstract: An audio decoding method includes obtaining an encoded bitstream; performing bitstream demultiplexing on the encoded bitstream, to obtain a high frequency band parameter of a current frame of an audio signal, wherein the high frequency band parameter indicates a location, a quantity, and an amplitude or energy of a tone component comprised in a high frequency band signal of the current frame; obtaining a reconstructed high frequency band signal of the current frame based on the high frequency band parameter; and obtaining an audio output signal of the current frame based on the reconstructed high frequency band signal of the current frame.
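
As a simplified, frequency-domain illustration of the reconstruction step, the sketch below rebuilds a high-band spectrum from tone locations and amplitudes and appends it to an already decoded low band. The `ToneParams` container, the function names, and the choice to place each tone in a single spectral bin are assumptions; demultiplexing and the inverse transform to a time-domain output signal are omitted.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class ToneParams:
    """Hypothetical container for a demultiplexed high frequency band parameter."""
    locations: List[int]     # bin index of each tone within the high band
    amplitudes: List[float]  # amplitude of each tone

    @property
    def quantity(self) -> int:
        return len(self.locations)


def reconstruct_high_band(params: ToneParams, num_bins: int) -> List[float]:
    """Place each tone at its location; every other high-band bin stays zero."""
    spectrum = [0.0] * num_bins
    for location, amplitude in zip(params.locations, params.amplitudes):
        spectrum[location] = amplitude
    return spectrum


def decode_frame(low_band: Sequence[float], params: ToneParams, high_bins: int) -> List[float]:
    """Full-band spectrum: decoded low band followed by the reconstructed high band."""
    return list(low_band) + reconstruct_high_band(params, high_bins)
```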

    Audio Signal Classification Method and Apparatus

    Publication number: US20220199111A1

    Publication date: 2022-06-23

    Application number: US17692640

    Application date: 2022-03-11

    Inventor: Zhe Wang

    Abstract: An audio signal classification method includes determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory, and updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory, and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.
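
A rough sketch of the memory-based classification follows. Only the overall flow comes from the abstract (store the fluctuation when the frame is active, update stored values for percussive frames, classify from statistics of the stored values). The fluctuation measure (sum of absolute bin differences between consecutive frames), the capping rule applied on percussive frames, the use of the mean as the statistic, and every threshold are assumptions.

```python
from collections import deque
from statistics import mean
from typing import Deque, List, Optional, Sequence


class FluctuationClassifier:
    def __init__(self, memory_size: int = 60, speech_threshold: float = 5.0) -> None:
        self.memory: Deque[float] = deque(maxlen=memory_size)
        self.speech_threshold = speech_threshold
        self.previous: Optional[List[float]] = None

    def push(
        self, spectrum: Sequence[float], voice_active: bool, percussive: bool = False
    ) -> Optional[str]:
        # Store a fluctuation value only for active frames.
        if voice_active and self.previous is not None:
            fluctuation = sum(abs(a - b) for a, b in zip(spectrum, self.previous))
            self.memory.append(fluctuation)
        # Assumed update rule: cap stored values so percussive hits do not look like speech.
        if percussive:
            capped = [min(v, self.speech_threshold) for v in self.memory]
            self.memory = deque(capped, maxlen=self.memory.maxlen)
        self.previous = list(spectrum)
        if not self.memory:
            return None  # nothing stored yet, no decision
        return "speech" if mean(self.memory) > self.speech_threshold else "music"
```

In this sketch a spectrum that barely changes from frame to frame is labelled music, while large frame-to-frame changes push the stored mean above the threshold and yield a speech label.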

    Multichannel audio signal processing method, apparatus, and system

    Publication number: US10984807B2

    Publication date: 2021-04-20

    Application number: US16781421

    Application date: 2020-02-04

    Inventor: Zhe Wang

    Abstract: An encoder includes a signal detection circuit and a signal encoding circuit. The signal encoding circuit is configured to encode the Nth-frame downmixed signal when the signal detection circuit detects that an Nth-frame downmixed signal includes a speech signal, or when the signal detection circuit detects that the Nth-frame downmixed signal does not include a speech signal, encode the Nth-frame downmixed signal when the signal detection circuit determines that the Nth-frame downmixed signal satisfies a preset audio frame encoding condition, or skip encoding the Nth-frame downmixed signal when the signal detection circuit determines that the Nth-frame downmixed signal does not satisfy a preset audio frame encoding condition.
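
The three-way rule in this abstract reduces to a small decision function: encode when speech is detected, otherwise encode only if a preset condition holds, otherwise skip. The sketch below shows that logic. The abstract does not say what the preset audio frame encoding condition is, so the "first non-speech frame, then every `update_interval`-th one" rule used here is purely hypothetical.

```python
from typing import Iterable, List


def should_encode(has_speech: bool, meets_preset_condition: bool) -> bool:
    """Encode speech frames always; encode non-speech frames only if the condition holds."""
    return has_speech or meets_preset_condition


def encode_schedule(speech_flags: Iterable[bool], update_interval: int = 4) -> List[bool]:
    """Per-frame encode/skip decisions for a sequence of downmixed frames."""
    decisions = []
    silent_run = 0  # consecutive non-speech frames seen so far
    for has_speech in speech_flags:
        if has_speech:
            silent_run = 0
            condition = False
        else:
            # Hypothetical preset condition: periodic update during non-speech.
            condition = silent_run % update_interval == 0
            silent_run += 1
        decisions.append(should_encode(has_speech, condition))
    return decisions
```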

    Relay transmission method and system, and related device

    Publication number: US10827327B2

    Publication date: 2020-11-03

    Application number: US16414743

    Application date: 2019-05-16

    Abstract: Embodiments of this application disclose a relay transmission method and system, and a related device. The method includes: determining, by a relay terminal, a vehicle existing between a vehicle terminal i and a vehicle terminal j based on vehicle location information in received broadcast messages sent by N vehicle terminals; and when it is obtained through calculation that link quality of a communication link Rij used when the vehicle terminal i and the vehicle terminal j communicate with each other is lower than a preset threshold, forwarding, to the vehicle terminal j, a broadcast message sent by the vehicle terminal i, and forwarding, to the vehicle terminal i, a broadcast message sent by the vehicle terminal j. A relay forwarding function of the relay terminal avoids communication interruption or communication distance limitation caused by blocking due to dynamic and unpredictable factors such as a large vehicle within the Internet of vehicles, thereby improving reliability of message transmission within the Internet of vehicles.
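
The relay decision can be illustrated with a toy model. The flow (find vehicles located between terminals i and j from broadcast locations, estimate the quality of link Rij, relay in both directions when the quality is below a preset threshold) follows the abstract. The one-dimensional road positions, the link quality formula with a fixed penalty per blocking vehicle, and all numeric values are assumptions introduced only to make the sketch runnable.

```python
from typing import Dict, List, Tuple


def vehicles_between(positions: Dict[str, float], i: str, j: str) -> List[str]:
    """Vehicles whose reported position lies strictly between terminals i and j."""
    low, high = sorted((positions[i], positions[j]))
    return [v for v, x in positions.items() if v not in (i, j) and low < x < high]


def link_quality(positions: Dict[str, float], i: str, j: str, penalty: float = 0.3) -> float:
    """Hypothetical quality model: 1.0 minus a fixed penalty per blocking vehicle."""
    blockers = len(vehicles_between(positions, i, j))
    return max(0.0, 1.0 - penalty * blockers)


def pairs_to_relay(positions: Dict[str, float], threshold: float = 0.5) -> List[Tuple[str, str]]:
    """Pairs (i, j) whose messages the relay terminal forwards in both directions."""
    ids = sorted(positions)
    return [
        (i, j)
        for index, i in enumerate(ids)
        for j in ids[index + 1:]
        if link_quality(positions, i, j) < threshold
    ]
```

With four vehicles spaced evenly along a road, only the outermost pair has two blockers and falls below the assumed threshold, so only that pair is relayed.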

    Method for Detecting Audio Signal and Apparatus

    Publication number: US20190279657A1

    Publication date: 2019-09-12

    Application number: US16391893

    Application date: 2019-04-23

    Inventor: Zhe Wang

    Abstract: A method for detecting an audio signal and an apparatus, where the method includes determining an input audio signal as a to-be-determined audio signal, determining an enhanced segmental signal-to-noise ratio (SSNR) of the audio signal, where the enhanced SSNR is greater than a reference SSNR, and comparing the enhanced SSNR with a voice activity detection (VAD) decision threshold to determine whether the audio signal is an active signal. Therefore, the method and the apparatus can accurately distinguish an active voice and an inactive voice.
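
The comparison of an enhanced SSNR with a VAD decision threshold can be sketched as follows. The abstract states only that the enhanced SSNR is greater than a reference SSNR and is compared with the threshold. The reference SSNR as a sum of clipped sub-band SNRs in dB, the enhancement that multiplies weak sub-band SNRs by a gain above 1, and all numeric values are assumptions.

```python
import math
from typing import List, Sequence


def subband_snrs_db(signal_energy: Sequence[float], noise_energy: Sequence[float]) -> List[float]:
    """Per-sub-band SNR in dB, clipped at 0 dB."""
    eps = 1e-12  # avoids division by zero and log(0)
    return [
        max(0.0, 10.0 * math.log10((s + eps) / (n + eps)))
        for s, n in zip(signal_energy, noise_energy)
    ]


def reference_ssnr(snrs_db: Sequence[float]) -> float:
    """Assumed reference SSNR: plain sum of the sub-band SNRs."""
    return sum(snrs_db)


def enhanced_ssnr(snrs_db: Sequence[float], weak_db: float = 10.0, gain: float = 1.5) -> float:
    """Assumed enhancement: boost sub-bands whose SNR is below weak_db by gain > 1."""
    return sum(s * gain if s < weak_db else s for s in snrs_db)


def is_active(snrs_db: Sequence[float], vad_threshold: float) -> bool:
    """VAD decision: compare the enhanced SSNR with the decision threshold."""
    return enhanced_ssnr(snrs_db) > vad_threshold
```

In this sketch, a frame with a few weak but non-zero sub-band SNRs can cross a threshold with the enhanced SSNR that it would miss with the reference SSNR.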
