发明公开
- 专利标题: Reducing Insertion Errors in Neural Transducer-Based Automatic Speech Recognition
-
申请号: US18096308申请日: 2023-01-12
-
公开(公告)号: US20240242707A1公开(公告)日: 2024-07-18
- 发明人: Takashi Fukuda
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 主分类号: G10L15/06
- IPC分类号: G10L15/06 ; G10L25/78 ; G10L25/93
摘要:
Techniques for training a neural transducer-based automatic speech recognition model to be robust against background additive noise and thereby reducing insertion errors. In one aspect, a method of training an automatic speech recognition model includes: generating a modified training data set from an initial training dataset by concatenating one-word utterances with a preceding or a succeeding sentence in the initial training dataset based on a duration of silence between the one-word utterances and the preceding or the succeeding sentence; and training the automatic speech recognition model using the modified training data set.
信息查询