LOW-POWER VOICE COMMAND DETECTOR
    1.
    Invention application
    Status: under examination (published)

    Publication No.: WO2016142791A1

    Publication date: 2016-09-15

    Application No.: PCT/IB2016/050068

    Filing date: 2016-01-07

    Abstract: A low-power voice command detection method uses an audio monitoring device to capture sound. The captured sound is analyzed in steps to determine if it fulfills a number of criteria regarding sound level, voice content and identifiable voice commands. For each step the processing is more complex and power demanding. A threshold between the first and subsequent steps is used to gate further processing. This threshold is dynamically adjusted, based on the outcome of the analysis, to avoid unnecessary processing and increase system performance.
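
The staged gating described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the class name, the parameter values, and the specific update rule (raise the level threshold after a wake-up that finds no voice, lower it after a confirmed command) are all assumptions, since the abstract only says the threshold is adjusted based on the outcome of the analysis.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class StagedDetector:
    # All names and default values are hypothetical, chosen for illustration.
    level_threshold: float = 0.01   # gate between stage 1 and later stages
    step: float = 0.002             # size of each threshold adjustment
    min_threshold: float = 0.001
    max_threshold: float = 0.5

    def process(
        self,
        frame: Sequence[float],
        has_voice: Callable[[Sequence[float]], bool],
        has_command: Callable[[Sequence[float]], bool],
    ) -> str:
        """Run the cheap check first; call costlier stages only if it passes."""
        if not frame:
            return "idle"
        # Stage 1 (cheapest): mean power of the frame.
        level = sum(x * x for x in frame) / len(frame)
        if level < self.level_threshold:
            return "idle"
        # Stage 2 (costlier): voice content check supplied by the caller.
        if not has_voice(frame):
            # Assumed rule: a wake-up without voice raises the gate.
            self.level_threshold = min(
                self.max_threshold, self.level_threshold + self.step
            )
            return "no_voice"
        # Stage 3 (costliest): command recognition supplied by the caller.
        if not has_command(frame):
            return "no_command"
        # Assumed rule: a confirmed command lowers the gate again.
        self.level_threshold = max(
            self.min_threshold, self.level_threshold - self.step
        )
        return "command"
```

The two callables stand in for the voice-content and command-recognition stages, which the abstract does not specify; any classifier with the same signature could be plugged in.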

    METHOD AND APPARATUS FOR ADAPTIVELY DETECTING VOICE ACTIVITY IN INPUT AUDIO SIGNAL
    3.
    Invention application
    Status: under examination (published)

    Publication No.: WO2012083555A1

    Publication date: 2012-06-28

    Application No.: PCT/CN2010/080227

    Filing date: 2010-12-24

    Inventor: WANG, Zhe

    Abstract: A method and apparatus for adaptively detecting a voice activity in an input audio signal is provided. The method comprises the steps of: determining a noise characteristic (nc) of the input audio signal based at least on a received input frame of the input audio signal; deriving a voice activity detection (VAD) parameter (vp) adapted to the noise characteristic of the input audio signal; comparing the derived VAD parameter with a threshold to provide a voice activity detection decision.

    METHOD AND VOICE ACTIVITY DETECTOR FOR A SPEECH ENCODER
    4.
    Invention application
    Status: under examination (published)

    Publication No.: WO2011049515A1

    Publication date: 2011-04-28

    Application No.: PCT/SE2010/051117

    Filing date: 2010-10-18

    Abstract: The embodiments of the present invention relate to a primary voice activity detector and a method thereof. By using the method of the embodiments it is possible to determine whether frames of an input signal comprise voice. That is achieved by receiving a frame of the input signal, determining a first SNR of the received frame, comparing the determined first SNR with an adaptive threshold, and detecting whether the received frame comprises voice based on said comparison. The adaptive threshold is at least based on total noise energy of a noise level, an estimate of a second SNR and on energy variation between different frames.

    ADAPTIVE VOICE MODE EXTENSION FOR A VOICE ACTIVITY DETECTOR
    5.
    Invention application
    Status: under examination (published)

    Publication No.: WO2006104576A3

    Publication date: 2007-07-19

    Application No.: PCT/US2006004687

    Filing date: 2006-01-26

    CPC classification number: G10L25/78 G10L2025/786

    Abstract: There is provided a voice activity detection method for indicating an active voice mode and an inactive voice mode. The method comprises receiving a first portion of an input signal; determining that the first portion of the input signal includes an active voice signal; indicating the active voice mode in response to the determining that the first portion of the input signal includes the active voice signal; receiving a second portion of the input signal immediately following the first portion of the input signal; determining that the second portion of the input signal includes an inactive voice signal; extending the indicating the active voice mode for a period of time after determining that the second portion of the input signal includes the inactive voice signal, wherein the period of time varies based on one or more conditions; and indicating the inactive voice mode after expiration of the period of time.
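
The mode extension ("hangover") in this abstract might look like the sketch below. The abstract says only that the extension period varies with one or more conditions; the condition used here (a longer extension after a long run of active frames) and every name and value are assumptions made for illustration.

```python
class HangoverVad:
    """Keeps reporting 'active' for a variable number of frames after speech ends."""

    def __init__(self, base_hangover: int = 3, long_hangover: int = 8,
                 long_burst: int = 5) -> None:
        # Hypothetical parameters, all counted in frames.
        self.base_hangover = base_hangover
        self.long_hangover = long_hangover
        self.long_burst = long_burst
        self.active_run = 0   # consecutive frames with raw voice activity
        self.remaining = 0    # hangover frames still to be reported as active

    def update(self, raw_active: bool) -> bool:
        """Take the raw per-frame decision, return the extended decision."""
        if raw_active:
            self.active_run += 1
            # Assumed condition: long speech bursts earn a longer extension.
            if self.active_run >= self.long_burst:
                self.remaining = self.long_hangover
            else:
                self.remaining = self.base_hangover
            return True
        self.active_run = 0
        if self.remaining > 0:
            self.remaining -= 1
            return True   # still inside the extension period
        return False      # extension expired: inactive voice mode
```

Feeding one active frame followed by inactive frames yields `base_hangover` extra active decisions before the output drops to inactive; a burst of at least `long_burst` frames yields `long_hangover` extra decisions instead.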

    VOICE ACTIVITY DETECTION WITH ADAPTIVE NOISE FLOOR TRACKING
    6.
    Invention application
    Status: under examination (published)

    Publication No.: WO2005038773A1

    Publication date: 2005-04-28

    Application No.: PCT/IB2004/052025

    Filing date: 2004-10-08

    Inventor: BROX, Wolfgang

    CPC classification number: G10L25/78 G10L2025/786

    Abstract: The present invention relates to a method and apparatus for detecting voice activity in a communication signal, wherein filter means are provided for estimating or suppressing an offset component of the level of the communication signal. A filter parameter is controlled based on the output of the filter means. Furthermore, the estimation or suppression of the offset component is limited in response to the output of the filter means. The filter means may be based on a non-linear adaptive notch level filter or a noise floor tracking filter. Thereby, the tracking behavior of noise floor estimation to sudden rises in noise floor can be improved and the voice activity detection can work efficiently over a wide dynamic range.

    SYSTEM AND METHOD FOR NOISE THRESHOLD ADAPTATION FOR VOICE ACTIVITY DETECTION IN NONSTATIONARY NOISE ENVIRONMENTS
    7.
    Invention application
    Status: under examination (published)

    Publication No.: WO99044191A1

    Publication date: 1999-09-02

    Application No.: PCT/US1999/004176

    Filing date: 1999-02-26

    CPC classification number: G10L25/78 G10L2025/786

    Abstract: The system and method of the invention relates to voice detection technology for determining instants of time at which a snapshot of noise characteristics results in improved adaptation of noise floors used in voice detection. The approach is based on the "lower envelope" of the smoothed input signal power. Incorporation of this approach in a simple time domain VAD (Voice Activity Detector) results in an effective low-complexity system which, on the basis of simulations, gives good performance down to SNR values of about 0dB. In the invention the lower envelope also provides the updated value of the noise threshold during the presence of speech. The invention can also be embedded in other, more complex (e.g., frequency domain) VADs at low computational cost.
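
A time-domain VAD built on a lower envelope of the smoothed input power could be sketched as follows. The abstract does not give the envelope update or the decision rule, so the one used here (snap down to the smoothed power whenever it falls below the envelope, otherwise creep upward by a small factor, and flag speech when the power exceeds a fixed multiple of the envelope) is an assumed, simplified form, as are all parameter names and values.

```python
from typing import List, Sequence


def lower_envelope_vad(
    samples: Sequence[float],
    smooth: float = 0.9,    # hypothetical smoothing coefficient for the power
    rise: float = 1.01,     # hypothetical slow upward drift of the envelope
    margin: float = 4.0,    # hypothetical speech/noise power ratio
    floor: float = 1e-12,   # keeps the envelope from sticking at zero
) -> List[bool]:
    """Return one speech/non-speech decision per input sample."""
    power = None
    envelope = None
    decisions = []
    for x in samples:
        inst = x * x
        # First-order recursive smoothing of the instantaneous power.
        power = inst if power is None else smooth * power + (1 - smooth) * inst
        if envelope is None or power < envelope:
            # The envelope follows the smoothed power downward immediately.
            envelope = max(power, floor)
        else:
            # Otherwise it rises slowly, so it keeps adapting during speech.
            envelope *= rise
        decisions.append(power > margin * envelope)
    return decisions
```

Because the envelope is updated on every sample, the noise threshold continues to adapt while speech is present, which is the property the abstract highlights.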

    WAVEFORM-BASED PERIODICITY DETECTOR
    8.
    Invention application
    Status: under examination (published)

    Publication No.: WO99010879A1

    Publication date: 1999-03-04

    Application No.: PCT/SE1998/001444

    Filing date: 1998-08-07

    CPC classification number: G10L25/90 G10L2025/786

    Abstract: A waveform-based technique for generating periodicity information from an input signal includes generating a pre-processed signal by applying low pass and non-linear filtering to the input signal, wherein the pre-processed signal has highlighted speech pitch tracks. An adaptive threshold algorithm is applied to the pre-processed signal to generate a detection signal having waveform segments whose peaks are separated by a pitch period of the input signal. A period between peaks in the detection signal is determined that indicates the periodicity information. Information about the period between the peaks in the detection signal is then used to adapt a scaling value to be used by the adaptive threshold algorithm in a subsequent step. The periodicity information may be utilized in a voice activity detector in a telephonic communications system.

    IMPROVED NOISE SUPPRESSION SYSTEM
    9.
    Invention application
    Status: under examination (published)

    Publication No.: WO1989003141A1

    Publication date: 1989-04-06

    Application No.: PCT/US1988003269

    Filing date: 1988-09-22

    Applicant: MOTOROLA, INC.

    Abstract: An improved noise suppression system (800) is disclosed which performs speech quality enhancement upon the speech-plus-noise signal available at the input (205) to generate a clean speech signal at the output (265) by spectral gain modification. The improvements of the present invention include the addition of a signal-to-noise ratio (SNR) threshold mechanism (830) to reduce background noise flutter by offsetting the gain rise of the gain tables until a certain SNR threshold is reached, the use of a voice metric calculator (810) to produce more accurate background noise estimates via performing the update decision based on the overall voice-like characteristics in the channels and the time interval since the last update, and the use of a channel SNR modifier (820) to provide immunity to narrowband noise bursts through modification of the SNR estimates based on the voice metric calculation and the channel energies.

    ADAPTIVE SPEECH ENDPOINT DETECTOR
    10.
    Invention application
    Status: under examination (published)

    Publication No.: WO2018063652A1

    Publication date: 2018-04-05

    Application No.: PCT/US2017/048980

    Filing date: 2017-08-29

    CPC classification number: G10L15/04 G10L15/07 G10L15/22 G10L25/87 G10L2025/786

    Abstract: An embodiment of a speech endpoint detector apparatus may include a speech detector to detect a presence of speech in an electronic speech signal, a pause duration measurer communicatively coupled to the speech detector to measure a duration of a pause following a period of detected speech, an end of utterance detector communicatively coupled to the pause duration measurer to detect if the pause measured following the period of detected speech is greater than a pause threshold corresponding to an end of an utterance, and a pause threshold adjuster to adaptively adjust the pause threshold corresponding to an end of an utterance based on stored pause information. Other embodiments are disclosed and claimed.
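
The pause threshold adjuster in this abstract could be approximated as in the sketch below. The abstract states only that the threshold adapts to stored pause information; storing recent within-utterance pauses and setting the threshold to a clamped multiple of their mean is an assumption, and the class name, method names, and values are hypothetical.

```python
from collections import deque
from statistics import mean


class PauseThresholdAdjuster:
    """Adapts the end-of-utterance pause threshold to a speaker's recent pauses."""

    def __init__(self, initial_ms: float = 800.0, history: int = 10,
                 factor: float = 2.0, min_ms: float = 300.0,
                 max_ms: float = 2000.0) -> None:
        # Hypothetical defaults; durations are in milliseconds.
        self.threshold_ms = initial_ms
        self.factor = factor
        self.min_ms = min_ms
        self.max_ms = max_ms
        self.pauses = deque(maxlen=history)  # stored pause information

    def record_pause(self, pause_ms: float) -> None:
        """Store a pause that did not end the utterance, then re-derive the threshold."""
        self.pauses.append(pause_ms)
        target = self.factor * mean(self.pauses)
        self.threshold_ms = min(self.max_ms, max(self.min_ms, target))

    def is_end_of_utterance(self, pause_ms: float) -> bool:
        """A pause longer than the current threshold ends the utterance."""
        return pause_ms > self.threshold_ms
```

With these defaults, a speaker who habitually pauses for about 600 ms mid-sentence pushes the threshold to 1200 ms, so a 900 ms pause that would initially have ended the utterance no longer does.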
