ACCURACY OF STREAMING RNN TRANSDUCER

Patent Application

US20220093083A1 ACCURACY OF STREAMING RNN TRANSDUCER (In force)

Title: ACCURACY OF STREAMING RNN TRANSDUCER
Application No.: US17031345

Filing date: 2020-09-24
Publication No.: US20220093083A1

Publication date: 2022-03-24
Inventors: Gakuto Kurata, George Andrei Saon
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant address: Armonk, NY, US
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current assignee address: Armonk, NY, US
Main classification: G10L15/16
IPC classifications: G10L15/16; G10L25/30; G06N3/08; G06N3/04; G06K9/62; G06F17/18

Abstract:

A computer-implemented method is provided for model training. The method includes training a second end-to-end neural speech recognition model that has a bidirectional encoder to output the same symbols from an output probability lattice of the second end-to-end neural speech recognition model as from an output probability lattice of a trained first end-to-end neural speech recognition model having a unidirectional encoder. The method also includes building a third end-to-end neural speech recognition model that has a unidirectional encoder by training the third end-to-end neural speech recognition model as a student by using the trained second end-to-end neural speech recognition model as a teacher in a knowledge distillation method.
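
The knowledge distillation step named in the abstract can be illustrated with a minimal sketch. The abstract does not state which loss is used, so the following assumes a common choice: the mean KL divergence between teacher and student symbol posteriors at every node of the output probability lattice. The lattice layout (time frames by label positions by vocabulary) and the names `log_softmax` and `lattice_distillation_loss` are hypothetical and are not taken from the patent.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one lattice node's symbol scores."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def lattice_distillation_loss(teacher_logits, student_logits):
    """Mean KL(teacher || student) over all nodes of an output lattice.

    Both arguments are nested lists indexed as [t][u][k]: acoustic frame t,
    label position u, output symbol k (an assumed layout for illustration).
    """
    total, nodes = 0.0, 0
    for teacher_row, student_row in zip(teacher_logits, student_logits):
        for teacher_node, student_node in zip(teacher_row, student_row):
            log_p = log_softmax(teacher_node)  # teacher posterior (log domain)
            log_q = log_softmax(student_node)  # student posterior (log domain)
            total += sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
            nodes += 1
    return total / nodes


# Toy 2 x 2 lattice with a 3-symbol vocabulary (values are made up).
teacher = [
    [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0]],
    [[0.0, 0.0, 2.0], [2.0, 0.0, 0.0]],
]
# A student that predicts a uniform distribution at every node.
student = [[[0.0, 0.0, 0.0] for _ in row] for row in teacher]

loss = lattice_distillation_loss(teacher, student)
```

In this sketch the loss is zero when the student reproduces the teacher's posteriors at every node and positive otherwise. Per the abstract, the teacher is the bidirectional model that was first trained to emit the same symbols at the same lattice nodes as the unidirectional reference model, which is what makes a node-by-node comparison with a unidirectional student meaningful.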

Published/Granted Documents

US11783811B2 Accuracy of streaming RNN transducer. Publication/grant date: 2023-10-10

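IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/08	. Speech classification or search
G10L15/16	.. Using artificial neural networks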