- 专利标题: CONTEXTUAL TEXT-TO-SPEECH PROCESSING
-
申请号: US16665886申请日: 2019-10-28
-
公开(公告)号: US20200152169A1公开(公告)日: 2020-05-14
- 发明人: Roberto Barra Chicote , Javier Latorre , Adam Franciszek Nadolski , Viacheslav Klimkov , Thomas Edward Merritt
- 申请人: Amazon Technologies, Inc.
- 主分类号: G10L13/10
- IPC分类号: G10L13/10 ; G10L13/033 ; G10L13/047
摘要:
A text-to-speech (TTS) system that is capable of considering characteristics of various portions of text data in order to create continuity between segments of synthesized speech. The system can analyze text portions of a work and create feature vectors including data corresponding to characteristics of the individual portions and/or the overall work. A TTS processing component can then consider feature vector(s) from other portions when performing TTS processing on text of a first portion, thus giving the TTS component some intelligence regarding other portions of the work, which can then result in more continuity between synthesized speech segments.
公开/授权文献
- US11443733B2 Contextual text-to-speech processing 公开/授权日:2022-09-13
信息查询