Automatic speech recognition with filler model processing

    公开(公告)号:US11062703B2

    公开(公告)日:2021-07-13

    申请号:US16106852

    申请日:2018-08-21

    Abstract: An automatic speech recognition (ASR) system includes a memory configured to store a filler model. The filler model includes one or more phonetic strings corresponding to one or more portions of a wake up phrase. The ASR system also includes one or more processors operatively coupled to the memory and configured to analyze a speech signal with the filler model to determine whether the speech signal includes the wake up phrase or any portion of the wake up phrase. The one or more processors are also configured to generate, based on the analysis, a hypothesis of underlying speech included in the speech signal. The hypothesis excludes the wake up phrase or any portion of the wake up phrase included in the speech signal.

    Adaptive speech endpoint detector
    14.
    发明授权

    公开(公告)号:US10339918B2

    公开(公告)日:2019-07-02

    申请号:US15277164

    申请日:2016-09-27

    Abstract: An embodiment of a speech endpoint detector apparatus may include a speech detector to detect a presence of speech in an electronic speech signal, a pause duration measurer communicatively coupled to the speech detector to measure a duration of a pause following a period of detected speech, an end of utterance detector communicatively coupled to the pause duration measurer to detect if the pause measured following the period of detected speech is greater than a pause threshold corresponding to an end of an utterance, and a pause threshold adjuster to adaptively adjust the pause threshold corresponding to an end of an utterance based on stored pause information. Other embodiments are disclosed and claimed.

    SCORE TREND ANALYSIS FOR REDUCED LATENCY AUTOMATIC SPEECH RECOGNITION

    公开(公告)号:US20190043476A1

    公开(公告)日:2019-02-07

    申请号:US15892510

    申请日:2018-02-09

    Abstract: Techniques are provided for reducing the latency of automatic speech recognition using hypothesis score trend analysis. A methodology implementing the techniques according to an embodiment includes generating complete-phrase hypotheses and partial-phrase hypotheses, along with associated likelihood scores, based on a segment of speech. The method also includes selecting the complete-phrase hypothesis associated with the highest of the complete-phrase hypotheses likelihood scores, and selecting the partial-phrase hypothesis associated with the highest of the partial-phrase hypotheses likelihood scores. The method further includes calculating a relative likelihood score based on a ratio of the likelihood score associated with the selected complete-phrase hypothesis to the likelihood score associated with the selected partial-phrase hypothesis. The method further includes calculating a trend of the relative likelihood score as a function of time and identifying an endpoint of the speech based on a determination that the trend does not decrease over a selected time period.

    AUTOMATIC SPEECH RECOGNITION WITH FILLER MODEL PROCESSING

    公开(公告)号:US20210304759A1

    公开(公告)日:2021-09-30

    申请号:US17344165

    申请日:2021-06-10

    Abstract: Methods, apparatus, systems and articles of manufacture for recognizing speech are disclosed. An example system includes one or more processors to execute instructions to: identify a plurality of phonemes in a speech signal; perform a comparison of a subset of the phonemes to a phonetic string, the phonetic string representative of at least a portion of a wake up phrase; determine if one or more of the phonemes of the subset correspond to the wake up phrase based on the comparison; and generate a hypothesis of a command included in the speech signal by excluding the wake up phrase when one or more of the phonemes of the subset correspond to the wake up phrase or a portion of the wake up phrase.

    AUTOMATIC SPEECH RECOGNITION WITH FILLER MODEL PROCESSING

    公开(公告)号:US20190043503A1

    公开(公告)日:2019-02-07

    申请号:US16106852

    申请日:2018-08-21

    Abstract: An automatic speech recognition (ASR) system includes a memory configured to store a filler model. The filler model includes one or more phonetic strings corresponding to one or more portions of a wake up phrase. The ASR system also includes one or more processors operatively coupled to the memory and configured to analyze a speech signal with the filler model to determine whether the speech signal includes the wake up phrase or any portion of the wake up phrase. The one or more processors are also configured to generate, based on the analysis, a hypothesis of underlying speech included in the speech signal. The hypothesis excludes the wake up phrase or any portion of the wake up phrase included in the speech signal.

    Speech Decoder and Language Interpreter With Asynchronous Pre-Processing

    公开(公告)号:US20180240466A1

    公开(公告)日:2018-08-23

    申请号:US15436171

    申请日:2017-02-17

    CPC classification number: G10L15/28 G10L15/08 G10L15/1822

    Abstract: An embodiment of a language interpreter apparatus may include a language analyzer to analyze an intermediate recognition result of an electronic speech signal, and a memory to store a language interpretation result of the analysis of the intermediate recognition result, wherein the language analyzer is further to receive a final recognition result of the electronic speech signal, compare the final recognition result to the intermediate recognition result, and retrieve the language interpretation result of the analysis corresponding to the intermediate recognition result if the final recognition result matches the intermediate recognition result. An embodiment of a speech decoder apparatus may include a speech analyzer to analyze an electronic speech signal to determine an intermediate recognition result of the electronic speech signal, and a language interpreter interface communicatively coupled to the speech analyzer to provide the intermediate recognition result to a language interpreter for language interpretation.

Patent Agency Ranking