-
公开(公告)号:US10650807B2
公开(公告)日:2020-05-12
申请号:US16134826
申请日:2018-09-18
Applicant: Intel Corporation
Inventor: Tobias Bocklet , Jacek Ossowski , Tomasz Dorau , Maciej Muchlinski , David Pearce , Piotr Rozen
IPC: G10L15/00 , G10L15/16 , G10L15/22 , G06N7/00 , G06N3/063 , G10L15/02 , G06N3/04 , G10L15/08 , G10L25/84 , G10L25/30
Abstract: A method and system are directed to autonomous neural network keyphrase detection and includes generating and using a multiple element state score vector by using neural network operations and without substantial use of a digital signal processor (DSP) to perform the keyphrase detection.
-
公开(公告)号:US10468032B2
公开(公告)日:2019-11-05
申请号:US15483246
申请日:2017-04-10
Applicant: Intel Corporation
Inventor: Jonathan J. Huang , Gokcen Cilingir , Tobias Bocklet
Abstract: Techniques related to speaker recognition are discussed. Such techniques include determining context aware confidence values formed of false accept and false reject rates determined by using adaptively updated acoustic environment score distributions matched to current score distributions.
-
公开(公告)号:US20190221205A1
公开(公告)日:2019-07-18
申请号:US16369504
申请日:2019-03-29
Applicant: Intel Corporation
Inventor: Sebastian Czyryba , Tobias Bocklet , Kuba Lopatka
CPC classification number: G10L15/16 , G06N3/08 , G10L15/02 , G10L15/063 , G10L2015/022 , G10L2015/088
Abstract: Techniques related to keyphrase detection for applications such as wake on voice are disclosed herein. Such techniques may have high accuracy by using scores of phone positions in triphones to select which triphones to use with a rejection model, using context-related phones for the rejection model, adding silence before keyphrase sounds for a keyphrase model, or any combination of these.
-
14.
公开(公告)号:US20190049989A1
公开(公告)日:2019-02-14
申请号:US15816835
申请日:2017-11-17
Applicant: Intel Corporation
Inventor: Sarang Akotkar , Mithil Ramteke , Tobias Bocklet , Sivasubramanian Sundaram
CPC classification number: G05D1/0255 , G05D1/0088 , G05D1/0246 , G06N3/0445 , G06N3/0454 , G06N3/084 , G10L21/038 , G10L25/24 , G10L25/30 , G10L25/51 , H04R1/406 , H04R3/005 , H04R2420/01 , H04R2499/13
Abstract: Embodiments include apparatuses, systems, and methods for a computer-aided or autonomous driving (CA/AD) system to identify and respond to an audio signal, e.g., an emergency alarm signal. In embodiments, the CA/AD driving system may include a plurality of microphones disposed to capture the audio signal included in surrounding sounds to a semi-autonomous or autonomous (SA/AD) vehicle. In embodiments, an audio analysis unit may receive the audio signal to extract audio features from the audio signal. In embodiments, a neural network such as a Deep Neural Network (DNN) may receive the extracted audio features from the audio analysis unit and to generate a probability score to allow identification of the audio signal. In embodiments, the CA/AD driving system may control driving elements of the SA/AD vehicle to autonomously or semi-autonomously drive the SA/AD vehicle in response to the identification. Other embodiments may also be described and claimed.
-
公开(公告)号:US20190043489A1
公开(公告)日:2019-02-07
申请号:US16146416
申请日:2018-09-28
Applicant: INTEL CORPORATION
Inventor: Kuba Lopatka , Mateusz Kotarski , Tobias Bocklet , Marek Zabkiewicz
CPC classification number: G10L15/16 , G06F9/542 , G06F17/147 , G06K9/00536 , G06K9/4628 , G06K9/627 , G06N3/04 , G06N3/0454 , G06N3/08 , G10L15/02 , G10L15/22 , G10L25/51
Abstract: Techniques are provided for efficient acoustic event detection with reduced resource consumption. A methodology implementing the techniques according to an embodiment includes calculating frames of power spectra based on segments of received acoustic signals. The method further includes two processes, one for detecting impulsive acoustic events and another for detecting continuous acoustic events. The first process includes generating impulsive acoustic event features associated with first and second power spectrum frames, applying a neural network classifier to the impulsive acoustic event features to generate event scores, and detecting an impulsive acoustic event based on those event scores. The second process includes generating reduced-dimension continuous acoustic event features associated with the first and second power spectrum frames, applying a neural network classifier to the reduced-dimension continuous acoustic event features to generate a second set of event scores, and detecting a continuous acoustic event based on the second set of event scores.
-
公开(公告)号:US20190043488A1
公开(公告)日:2019-02-07
申请号:US16134826
申请日:2018-09-18
Applicant: Intel Corporation
Inventor: Tobias Bocklet , Jacek Ossowski , Tomasz Dorau , Maciej Muchlinski , David Pearce , Piotr Rozen
Abstract: A method and system are directed to autonomous neural network keyphrase detection and includes generating and using a multiple element state score vector by using neural network operations and without substantial use of a digital signal processor (DSP) to perform the keyphrase detection.
-
公开(公告)号:US11817094B2
公开(公告)日:2023-11-14
申请号:US17344165
申请日:2021-06-10
Applicant: Intel Corporation
Inventor: Josef Bauer , Tobias Bocklet , Joachim Hofer , Munir Georges
CPC classification number: G10L15/22 , G10L15/02 , G10L25/54 , G10L17/04 , G10L2015/025 , G10L2015/223 , G10L2015/225
Abstract: Methods, apparatus, systems and articles of manufacture for recognizing speech are disclosed. An example system includes one or more processors to execute instructions to: identify a plurality of phonemes in a speech signal; perform a comparison of a subset of the phonemes to a phonetic string, the phonetic string representative of at least a portion of a wake up phrase; determine if one or more of the phonemes of the subset correspond to the wake up phrase based on the comparison; and generate a hypothesis of a command included in the speech signal by excluding the wake up phrase when one or more of the phonemes of the subset correspond to the wake up phrase or a portion of the wake up phrase.
-
公开(公告)号:US20210264898A1
公开(公告)日:2021-08-26
申请号:US17319607
申请日:2021-05-13
Applicant: Intel Corporation
Inventor: Tomasz Dorau , Tobias Bocklet , Przemyslaw Tomaszewski , Sebastian Czyryba , Juliusz Norman Chojecki
Abstract: Techniques are provided for segmentation of a key phrase. A methodology implementing the techniques according to an embodiment includes accumulating feature vectors extracted from time segments of an audio signal, and generating a set of acoustic scores based on those feature vectors. Each of the acoustic scores in the set represents a probability for a phonetic class associated with the time segments. The method further includes generating a progression of scored model state sequences, each of the scored model state sequences based on detection of phonetic units associated with a corresponding one of the sets of acoustic scores generated from the time segments of the audio signal. The method further includes analyzing the progression of scored state sequences to detect a pattern associated with the progression, and determining a starting and ending point for segmentation of the key phrase based on alignment of the detected pattern with an expected pattern.
-
19.
公开(公告)号:US10665222B2
公开(公告)日:2020-05-26
申请号:US16022376
申请日:2018-06-28
Applicant: Intel Corporation
Inventor: Suyoung Bang , Muhammad Khellah , Somnath Paul , Charles Augustine , Turbo Majumder , Wootaek Lim , Tobias Bocklet , David Pearce
Abstract: A system, article, and method provide temporal-domain feature extraction for automatic speech recognition.
-
公开(公告)号:US20200090657A1
公开(公告)日:2020-03-19
申请号:US16692150
申请日:2019-11-22
Applicant: INTEL CORPORATION
Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.
-
-
-
-
-
-
-
-
-