Invention Grant
- Patent Title: Training acoustic models using connectionist temporal classification
- Application No.: US15397327
- Application Date: 2017-01-03
- Publication No.: US10229672B1
- Publication Date: 2019-03-12
- Inventor: Kanury Kanishka Rao, Andrew W. Senior, Hasim Sak
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman Miller Schwartz and Cohn LLP
- Main IPC: G10L15/00
- IPC: G10L15/00; G10L15/16; G10L15/30; G10L15/187; G10L15/02

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training acoustic models and using the trained acoustic models. A connectionist temporal classification (CTC) acoustic model is accessed, the CTC acoustic model having been trained using a context-dependent state inventory generated from approximate phonetic alignments determined by another CTC acoustic model trained without fixed alignment targets. Audio data for a portion of an utterance is received. Input data corresponding to the received audio data is provided to the accessed CTC acoustic model. Data indicating a transcription for the utterance is generated based on output that the accessed CTC acoustic model produced in response to the input data. The data indicating the transcription is provided as output of an automated speech recognition service.
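
The step in which a transcription is generated from the CTC acoustic model's output can be illustrated with a minimal sketch of greedy CTC decoding: choose the best-scoring label for each frame, collapse consecutive repeats, and drop the blank label. This is not the decoding procedure claimed in the patent, which the abstract does not specify. The function name `ctc_greedy_decode`, the blank index `0`, and the toy frame scores are all assumptions made for illustration.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank label (hypothetical choice)


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]], blank: int = BLANK
) -> List[int]:
    """Greedy CTC decoding: best label per frame, collapse repeats, drop blanks."""
    decoded: List[int] = []
    previous = None
    for scores in frame_scores:
        # Index of the highest-scoring label for this frame.
        best = max(range(len(scores)), key=scores.__getitem__)
        # Emit a label only when it differs from the previous frame's
        # label and is not the blank symbol.
        if best != previous and best != blank:
            decoded.append(best)
        previous = best
    return decoded


# Toy per-frame scores over three labels: 0 = blank, 1 and 2 = hypothetical
# context-dependent states. These values are made up for the example.
frames = [
    [0.10, 0.80, 0.10],  # label 1
    [0.10, 0.70, 0.20],  # label 1 again (repeat, collapsed)
    [0.90, 0.05, 0.05],  # blank
    [0.20, 0.70, 0.10],  # label 1 after a blank (counts as a new label)
    [0.10, 0.10, 0.80],  # label 2
]
labels = ctc_greedy_decode(frames)
print(labels)  # [1, 1, 2]
```

A blank between two identical labels is what lets CTC represent a genuinely repeated label, which is why the second `1` survives in the output. A production recognizer would typically replace this greedy pass with a beam search that also consults a lexicon and language model.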