-
Publication number: US10255911B2
Publication date: 2019-04-09
Application number: US15525893
Filing date: 2014-12-17
Applicant: Intel Corporation
Inventor: Lukasz M. Malinowski , Piotr Jerzy Majcher , Georg Stemmer , Piotr Rozen , Joachim Hofer , Josef G. Bauer
IPC: G10L15/00 , G10L15/193 , G10L15/08 , G10L15/34 , G10L15/22
Abstract: A computer-implemented method of speech recognition comprises forming a weighted finite state transducer (WFST) having nodes associated with states and interconnected by arcs, and to identify at least one word or word sequence hypothesis, identifying multiple sub-graphs on the WFST, each sub-graph having the same arrangement of multiple states and at least one arc, and propagating tokens in parallel through the sub-graphs, where each sub-graph is stored as a supertoken each having an array of tokens.
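The supertoken idea in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the three-state left-to-right layout in `ARCS`, the use of additive costs with a minimum (Viterbi-style) update, and the function name `propagate` are all assumptions made for illustration. The point it shows is that when every sub-graph has the same arrangement of states and arcs, each one can be stored as a fixed-size array of token scores and updated by the same routine.

```python
INF = float("inf")

# Hypothetical shared layout: every sub-graph has three states and these arcs.
# Each pair is (source state, destination state) inside the sub-graph.
ARCS = [(0, 0), (0, 1), (1, 1), (1, 2), (2, 2)]


def propagate(supertokens, arc_costs):
    """Advance every supertoken by one frame.

    supertokens: list of token-score arrays, one array per sub-graph.
    arc_costs:   list of per-arc costs, one list per sub-graph, ordered as ARCS.
    Returns a new list of token-score arrays (lower cost is better).
    """
    updated = []
    for tokens, costs in zip(supertokens, arc_costs):
        nxt = [INF] * len(tokens)
        # The same arc list is applied to every sub-graph, which is what
        # would let this loop be vectorized or run in parallel.
        for (src, dst), cost in zip(ARCS, costs):
            nxt[dst] = min(nxt[dst], tokens[src] + cost)
        updated.append(nxt)
    return updated


supertokens = [[0.0, INF, INF], [1.0, 2.0, INF]]
arc_costs = [[1.0] * len(ARCS), [1.0] * len(ARCS)]
result = propagate(supertokens, arc_costs)
```

Because the arrays all have the same shape, a production decoder could process them with SIMD or GPU kernels; the plain Python loop here only stands in for that.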
-
Publication number: US20190043476A1
Publication date: 2019-02-07
Application number: US15892510
Filing date: 2018-02-09
Applicant: INTEL CORPORATION
Inventor: Joachim Hofer , Georg Stemmer , Josef G. Bauer , Munir Nikolai Alexander Georges
Abstract: Techniques are provided for reducing the latency of automatic speech recognition using hypothesis score trend analysis. A methodology implementing the techniques according to an embodiment includes generating complete-phrase hypotheses and partial-phrase hypotheses, along with associated likelihood scores, based on a segment of speech. The method also includes selecting the complete-phrase hypothesis associated with the highest of the complete-phrase hypotheses likelihood scores, and selecting the partial-phrase hypothesis associated with the highest of the partial-phrase hypotheses likelihood scores. The method further includes calculating a relative likelihood score based on a ratio of the likelihood score associated with the selected complete-phrase hypothesis to the likelihood score associated with the selected partial-phrase hypothesis. The method further includes calculating a trend of the relative likelihood score as a function of time and identifying an endpoint of the speech based on a determination that the trend does not decrease over a selected time period.
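The endpointing rule described in this abstract can be sketched as follows. This is a simplified illustration under stated assumptions, not the method as claimed: likelihoods are treated as plain positive values (not log scores), the trend is approximated by frame-to-frame comparison, and the function name `find_endpoint` and the `window` parameter (the selected time period, counted in frames) are hypothetical.

```python
from typing import Optional, Sequence


def find_endpoint(
    complete_best: Sequence[float],
    partial_best: Sequence[float],
    window: int = 3,
) -> Optional[int]:
    """Return the first frame index at which the relative likelihood score
    has not decreased for `window` consecutive frames, or None.

    complete_best[t]: best complete-phrase likelihood at frame t.
    partial_best[t]:  best partial-phrase likelihood at frame t (must be > 0).
    """
    # Relative likelihood score: ratio of best complete to best partial.
    relative = [c / p for c, p in zip(complete_best, partial_best)]

    run = 0  # number of consecutive non-decreasing steps seen so far
    for t in range(1, len(relative)):
        if relative[t] >= relative[t - 1]:
            run += 1
        else:
            run = 0
        if run >= window:
            return t
    return None


rising = find_endpoint([0.1, 0.1, 0.2, 0.3, 0.4, 0.5], [0.5] * 6, window=3)
falling = find_endpoint([0.5, 0.4, 0.3, 0.2], [0.5] * 4, window=3)
mixed = find_endpoint([0.5, 0.4, 0.4, 0.5, 0.6], [1.0] * 5, window=3)
```

A real recognizer would more likely work with log-likelihood differences and a smoothed slope estimate; the counter above only shows the "trend does not decrease over a selected period" decision.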
-
Publication number: US20180090131A1
Publication date: 2018-03-29
Application number: US15274498
Filing date: 2016-09-23
Applicant: Intel Corporation
Inventor: Praful Mangalath , Josef G. Bauer , Georg Stemmer
CPC classification number: G10L15/142 , G06F17/2705 , G10L15/144 , G10L15/1815 , G10L15/22 , G10L2015/088 , G10L2015/228
Abstract: Technologies for improved keyword spotting are disclosed. A compute device may capture speech data from a user of the compute device, and perform automatic speech recognition on the captured speech data. The automatic speech recognition algorithm is configured to both spot keywords as well as provide a full transcription of the captured speech data. The automatic speech recognition algorithm may preferentially match the keywords compared to similar words. The recognized keywords may be used to improve parsing of the transcribed speech data or to improve an assistive agent in holding a dialog with a user of the compute device.
-
Publication number: US11308978B2
Publication date: 2022-04-19
Application number: US16531500
Filing date: 2019-08-05
Applicant: Intel Corporation
Inventor: Binuraj K. Ravindran , Francis M. Tharappel , Prabhakar R. Datta , Tobias Bocklet , Maciej Muchlinski , Tomasz Dorau , Josef G. Bauer , Saurin Shah , Georg Stemmer
Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.
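As a rough illustration of the quantizer stage, the sketch below reduces floating-point audio features to one byte each before transmission. The abstract does not specify the quantization scheme, so uniform 8-bit scalar quantization with min/max scaling is an assumption, and the names `quantize` and `dequantize` are hypothetical.

```python
def quantize(features):
    """Map float features to 8-bit codes (assumed uniform scalar quantization).

    Returns the packed codes plus the offset and step needed to decode them.
    """
    lo, hi = min(features), max(features)
    step = (hi - lo) / 255 if hi > lo else 1.0
    # Clamp to the byte range to guard against floating-point overshoot.
    codes = bytes(min(255, max(0, round((x - lo) / step))) for x in features)
    return codes, lo, step


def dequantize(codes, lo, step):
    """Approximate reconstruction on the receiving side."""
    return [lo + c * step for c in codes]


features = [-1.0, 0.0, 0.5, 1.0]
codes, lo, step = quantize(features)
restored = dequantize(codes, lo, step)
```

Each feature shrinks from a multi-byte float to a single byte, at the price of a reconstruction error of at most half a quantization step, which is the kind of trade-off a low-energy channel would motivate.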
-
Publication number: US20190355379A1
Publication date: 2019-11-21
Application number: US16531500
Filing date: 2019-08-05
Applicant: Intel Corporation
Inventor: Binuraj K. Ravindran , Francis M. Tharappel , Prabhakar R. Datta , Tobias Bocklet , Maciej Muchlinski , Tomasz Dorau , Josef G. Bauer , Saurin Shah , Georg Stemmer
Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.
-
Publication number: US10217458B2
Publication date: 2019-02-26
Application number: US15274498
Filing date: 2016-09-23
Applicant: Intel Corporation
Inventor: Praful Mangalath , Josef G. Bauer , Georg Stemmer
Abstract: Technologies for improved keyword spotting are disclosed. A compute device may capture speech data from a user of the compute device, and perform automatic speech recognition on the captured speech data. The automatic speech recognition algorithm is configured to both spot keywords as well as provide a full transcription of the captured speech data. The automatic speech recognition algorithm may preferentially match the keywords compared to similar words. The recognized keywords may be used to improve parsing of the transcribed speech data or to improve an assistive agent in holding a dialog with a user of the compute device.