Acoustic event detector with reduced resource consumption

    公开(公告)号:US10789941B2

    公开(公告)日:2020-09-29

    申请号:US16146416

    申请日:2018-09-28

    Abstract: Techniques are provided for efficient acoustic event detection with reduced resource consumption. A methodology implementing the techniques according to an embodiment includes calculating frames of power spectra based on segments of received acoustic signals. The method further includes two processes, one for detecting impulsive acoustic events and another for detecting continuous acoustic events. The first process includes generating impulsive acoustic event features associated with first and second power spectrum frames, applying a neural network classifier to the impulsive acoustic event features to generate event scores, and detecting an impulsive acoustic event based on those event scores. The second process includes generating reduced-dimension continuous acoustic event features associated with the first and second power spectrum frames, applying a neural network classifier to the reduced-dimension continuous acoustic event features to generate a second set of event scores, and detecting a continuous acoustic event based on the second set of event scores.

    WAKE ON VOICE KEY PHRASE SEGMENTATION
    34.
    发明申请

    公开(公告)号:US20190043479A1

    公开(公告)日:2019-02-07

    申请号:US15972369

    申请日:2018-05-07

    Abstract: Techniques are provided for segmentation of a key phrase. A methodology implementing the techniques according to an embodiment includes accumulating feature vectors extracted from time segments of an audio signal, and generating a set of acoustic scores based on those feature vectors. Each of the acoustic scores in the set represents a probability for a phonetic class associated with the time segments. The method further includes generating a progression of scored model state sequences, each of the scored model state sequences based on detection of phonetic units associated with a corresponding one of the sets of acoustic scores generated from the time segments of the audio signal. The method further includes analyzing the progression of scored state sequences to detect a pattern associated with the progression, and determining a starting and ending point for segmentation of the key phrase based on alignment of the detected pattern with an expected pattern.

    AUTOMATIC SPEECH RECOGNITION WITH FILLER MODEL PROCESSING

    公开(公告)号:US20210304759A1

    公开(公告)日:2021-09-30

    申请号:US17344165

    申请日:2021-06-10

    Abstract: Methods, apparatus, systems and articles of manufacture for recognizing speech are disclosed. An example system includes one or more processors to execute instructions to: identify a plurality of phonemes in a speech signal; perform a comparison of a subset of the phonemes to a phonetic string, the phonetic string representative of at least a portion of a wake up phrase; determine if one or more of the phonemes of the subset correspond to the wake up phrase based on the comparison; and generate a hypothesis of a command included in the speech signal by excluding the wake up phrase when one or more of the phonemes of the subset correspond to the wake up phrase or a portion of the wake up phrase.

Patent Agency Ranking