Abstract:
Methods, apparatus, systems and articles of manufacture for recognizing speech are disclosed. An example system includes one or more processors to execute instructions to: identify a plurality of phonemes in a speech signal; perform a comparison of a subset of the phonemes to a phonetic string, the phonetic string representative of at least a portion of a wake up phrase; determine if one or more of the phonemes of the subset correspond to the wake up phrase based on the comparison; and generate a hypothesis of a command included in the speech signal by excluding the wake up phrase when one or more of the phonemes of the subset correspond to the wake up phrase or a portion of the wake up phrase.
Abstract:
An automatic speech recognition (ASR) system includes a memory configured to store a filler model. The filler model includes one or more phonetic strings corresponding to one or more portions of a wake up phrase. The ASR system also includes one or more processors operatively coupled to the memory and configured to analyze a speech signal with the filler model to determine whether the speech signal includes the wake up phrase or any portion of the wake up phrase. The one or more processors are also configured to generate, based on the analysis, a hypothesis of underlying speech included in the speech signal. The hypothesis excludes the wake up phrase or any portion of the wake up phrase included in the speech signal.
Abstract:
An automatic speech recognition (ASR) system includes a memory configured to store a filler model. The filler model includes one or more phonetic strings corresponding to one or more portions of a wake up phrase. The ASR system also includes one or more processors operatively coupled to the memory and configured to analyze a speech signal with the filler model to determine whether the speech signal includes the wake up phrase or any portion of the wake up phrase. The one or more processors are also configured to generate, based on the analysis, a hypothesis of underlying speech included in the speech signal. The hypothesis excludes the wake up phrase or any portion of the wake up phrase included in the speech signal.
Abstract:
Methods, apparatus, systems and articles of manufacture for recognizing speech are disclosed. An example system includes one or more processors to execute instructions to: identify a plurality of phonemes in a speech signal; perform a comparison of a subset of the phonemes to a phonetic string, the phonetic string representative of at least a portion of a wake up phrase; determine if one or more of the phonemes of the subset correspond to the wake up phrase based on the comparison; and generate a hypothesis of a command included in the speech signal by excluding the wake up phrase when one or more of the phonemes of the subset correspond to the wake up phrase or a portion of the wake up phrase.
Abstract:
Techniques related to implementing neural networks for speech recognition systems are discussed. Such techniques may include implementing frame skipping with approximated skip frames and/or distances on demand such that only those outputs needed by a speech decoder are provided via the neural network or approximation techniques.