Invention Grant
- Patent Title: Multi-channel speech signal enhancement for robust voice trigger detection and automatic speech recognition
- Application No.: US15613127
- Application Date: 2017-06-02
- Publication No.: US10403299B2
- Publication Date: 2019-09-03
- Inventor: Jason Wung, Joshua D. Atkins, Ramin Pishehvar, Mehrez Souden
- Applicant: Apple Inc.
- Applicant Address: US CA Cupertino
- Assignee: Apple Inc.
- Current Assignee: Apple Inc.
- Current Assignee Address: US CA Cupertino
- Agency: Womble Bond Dickinson (US) LLP
- Main IPC: H04M9/08
- IPC: H04M9/08; G10L21/02; G10L21/0208; G10L21/0216; G10L21/0232; G10L21/0272; G10L21/038

Abstract:
A digital speech enhancement system performs a specific chain of digital signal processing operations upon multi-channel sound pickup to produce a single, enhanced speech signal. The operations are designed to be computationally less complex, yet as a whole they yield an enhanced speech signal that produces accurate voice trigger detection and low word error rates in an automatic speech recognizer (ASR). The constituent operations or components of the system have been chosen so that the overall system is robust to changing acoustic conditions and can deliver the enhanced speech signal with low enough latency that the system can be used online (enabling real-time voice trigger detection and streaming ASR). Other embodiments are also described and claimed.