Invention Application
- Patent Title: CONTEXTUAL BIASING FOR SPEECH RECOGNITION
-
Application No.: US18782001Application Date: 2024-07-23
-
Publication No.: US20240379095A1Publication Date: 2024-11-14
- Inventor: Rohit Prakash Prabhavalkar , Golan Pundak , Tara N. Sainath
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Main IPC: G10L15/16
- IPC: G10L15/16 ; G10L15/26

Abstract:
A method includes receiving audio data encoding an utterance and obtaining a set of bias phrases corresponding to a context of the utterance. Each bias phrase includes one or more words. The method also includes processing, using a speech recognition model, acoustic features derived from the audio to generate an output from the speech recognition model. The speech recognition model includes a first encoder configured to receive the acoustic features, a bias encoder configured to receive data indicating the obtained set of bias phrases, a bias encoder, and a decoder configured to determine likelihoods of sequences of speech elements based on output of the first attention module and output of the bias attention module. The method also includes determining a transcript for the utterance based on the likelihoods of sequences of speech elements.
Information query