- 专利标题: Method and apparatus for open-vocabulary end-to-end speech recognition
-
申请号: US15843055申请日: 2017-12-15
-
公开(公告)号: US10672388B2公开(公告)日: 2020-06-02
- 发明人: Takaaki Hori , Shinji Watanabe , John Hershey
- 申请人: Mitsubishi Electric Research Laboratories, Inc.
- 申请人地址: US MA Cambridge
- 专利权人: Mitsubishi Electric Research Laboratories, Inc.
- 当前专利权人: Mitsubishi Electric Research Laboratories, Inc.
- 当前专利权人地址: US MA Cambridge
- 代理商 Gennadiy Vinokur; James McAleenan; Hironori Tsukamoto
- 主分类号: G10L15/16
- IPC分类号: G10L15/16 ; G10L15/19 ; G10L15/183 ; G10L15/02 ; G10L15/187 ; G10L15/22
摘要:
A speech recognition system includes an input device to receive voice sounds, one or more processors, and one or more storage devices storing parameters and program modules including instructions which cause the one or more processors to perform operations. The operations include extracting an acoustic feature sequence from audio waveform data converted from the voice sounds, encoding the acoustic feature sequence into a hidden vector sequence using an encoder network having encoder network parameters, predicting first output label sequence probabilities by feeding the hidden vector sequence to a decoder network having decoder network parameters, predicting second output level sequence probabilities by a hybrid network using character-base language models (LMs) and word-level LMs; and searching, using a label sequence search module, for an output label sequence having a highest sequence probability by combining the first and second output label sequence probabilities provided from the decoder network and the hybrid network.
公开/授权文献
信息查询