Invention Grant
- Patent Title: Training an automatic speech recognition system using compressed word frequencies
-
Application No.: US13666223Application Date: 2012-11-01
-
Publication No.: US08543398B1Publication Date: 2013-09-24
- Inventor: Brian Strope , Mitchel Weintraub
- Applicant: Google Inc.
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: McDonnell Boehnen Hulbert & Berghoff LLP
- Main IPC: G10L15/06
- IPC: G10L15/06

Abstract:
Respective word frequencies may be determined from a corpus of utterance-to-text-string mappings that contain associations between audio utterances and a respective text string transcription of each audio utterance. Respective compressed word frequencies may be obtained based on the respective word frequencies such that the distribution of the respective compressed word frequencies has a lower variance than the distribution of the respective word frequencies. Sample utterance-to-text-string mappings may be selected from the corpus of utterance-to-text-string mappings based on the compressed word frequencies. An automatic speech recognition (ASR) system may be trained with the sample utterance-to-text-string mappings.
Information query