- Patent title: System and method for unified normalization in text-to-speech and automatic speech recognition
- Application number: US14461930
- Filing date: 2014-08-18
- Publication number: US10199034B2
- Publication date: 2019-02-05
- Inventors: Alistair D. Conkie, Ladan Golipour
- Applicant: AT&T Intellectual Property I, L.P.
- Applicant address: US GA Atlanta
- Patentee: AT&T INTELLECTUAL PROPERTY I, L.P.
- Current patentee: AT&T INTELLECTUAL PROPERTY I, L.P.
- Current patentee address: US GA Atlanta
- Main classification: G10L15/00
- IPC classification: G10L15/00; G10L13/08; G10L15/06; G10L15/02; G10L13/06; G10L15/183
Abstract:
A system, method, and computer-readable storage devices use a single set of normalization protocols and a single language lexicon (or dictionary) for both text-to-speech (TTS) and automatic speech recognition (ASR). The system receives input, which is either text to be converted to speech or ASR training text, and normalizes it. Using the normalized input and a dictionary configured for both ASR and TTS processing, the system produces output that is either phonemes corresponding to the input or text corresponding to the input for training the ASR system. When the output is phonemes, the system generates speech by performing prosody generation and unit selection synthesis using the phonemes. When the output is text, the system trains both an acoustic model and a language model for use in future speech recognition.
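
The shared-front-end idea in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the names `normalize`, `process`, `LEXICON`, `ABBREVIATIONS`, and `DIGITS` are hypothetical, the normalization rules are toy rules, and the phoneme symbols are illustrative. Prosody generation, unit selection synthesis, and acoustic and language model training are out of scope.

```python
import re

# Hypothetical toy lexicon shared by the TTS and ASR paths (illustrative entries only).
LEXICON = {
    "call": ["k", "ao", "l"],
    "doctor": ["d", "aa", "k", "t", "er"],
    "smith": ["s", "m", "ih", "th"],
    "at": ["ae", "t"],
    "five": ["f", "ay", "v"],
}

# Hypothetical normalization rules: abbreviation and single-digit expansion.
ABBREVIATIONS = {"dr.": "doctor"}
DIGITS = {"5": "five"}


def normalize(text):
    """Apply one set of normalization rules, whatever the downstream use."""
    tokens = []
    for raw in text.lower().split():
        raw = ABBREVIATIONS.get(raw, raw)
        raw = DIGITS.get(raw, raw)
        word = re.sub(r"[^a-z']", "", raw)  # drop leftover punctuation
        if word:
            tokens.append(word)
    return tokens


def process(text, mode):
    """Return phonemes for TTS or normalized text for ASR training."""
    tokens = normalize(text)
    if mode == "tts":
        # Phoneme lookup in the shared lexicon; out-of-vocabulary words
        # raise KeyError in this sketch.
        return [p for word in tokens for p in LEXICON[word]]
    if mode == "asr":
        # Normalized text that would feed ASR model training.
        return " ".join(tokens)
    raise ValueError(f"unknown mode: {mode}")


if __name__ == "__main__":
    print(process("Call Dr. Smith at 5", "tts"))
    print(process("Call Dr. Smith at 5", "asr"))
```

Because both modes pass through the same `normalize` function and the same `LEXICON`, the words the recognizer is trained on and the words the synthesizer pronounces stay consistent, which is the point of the unified design.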