- 专利标题: ACCURACY OF STREAMING RNN TRANSDUCER
-
申请号: US17031345申请日: 2020-09-24
-
公开(公告)号: US20220093083A1公开(公告)日: 2022-03-24
- 发明人: Gakuto Kurata , George Andrei Saon
- 申请人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 主分类号: G10L15/16
- IPC分类号: G10L15/16 ; G10L25/30 ; G06N3/08 ; G06N3/04 ; G06K9/62 ; G06F17/18
摘要:
A computer-implemented method is provided for model training. The method includes training a second end-to-end neural speech recognition model that has a bidirectional encoder to output same symbols from an output probability lattice of the second end-to-end neural speech recognition model as from an output probability lattice of a trained first end-to-end neural speech recognition model having a unidirectional encoder. The method also includes building a third end-to-end neural speech recognition model that has a unidirectional encoder by training the third end-to-end neural speech recognition model as a student by using the trained second end-to-end neural speech recognition model as a teacher in a knowledge distillation method.
公开/授权文献
- US11783811B2 Accuracy of streaming RNN transducer 公开/授权日:2023-10-10
信息查询