Streaming automatic speech recognition with non-streaming model distillation

Invention Grant

US11804212B2 Streaming automatic speech recognition with non-streaming model distillation 有权

Please log in to see more content

Patent Title: Streaming automatic speech recognition with non-streaming model distillation
Application No.: US17348118

Application Date: 2021-06-15
Publication No.: US11804212B2

Publication Date: 2023-10-31
Inventor: Thibault Doutre , Wei Han , Min Ma , Zhiyun Lu , Chung-Cheng Chiu , Ruoming Pang , Arun Narayanan , Ananya Misra , Yu Zhang , Liangliang Cao
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Honigman LLP
Agent Brett A. Krueger; Grant J. Griffith
Main IPC: G10L15/06
IPC: G10L15/06 ; G10L15/08 ; G10L15/18 ; G06N3/04 ; G06N3/045

Streaming automatic speech recognition with non-streaming model distillation

Abstract:

A method for training a streaming automatic speech recognition student model includes receiving a plurality of unlabeled student training utterances. The method also includes, for each unlabeled student training utterance, generating a transcription corresponding to the respective unlabeled student training utterance using a plurality of non-streaming automated speech recognition (ASR) teacher models. The method further includes distilling a streaming ASR student model from the plurality of non-streaming ASR teacher models by training the streaming ASR student model using the plurality of unlabeled student training utterances paired with the corresponding transcriptions generated by the plurality of non-streaming ASR teacher models.

Public/Granted literature

US20220343894A1 Streaming Automatic Speech Recognition With Non-Streaming Model Distillation Public/Granted day:2022-10-27

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）