Invention Grant
- Patent Title: Text-to-speech (TTS) processing
-
Application No.: US16908882Application Date: 2020-06-23
-
Publication No.: US11763797B2Publication Date: 2023-09-19
- Inventor: Roberto Barra Chicote , Adam Franciszek Nadolski , Thomas Edward Merritt , Bartosz Putrycz , Andrew Paul Breen
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: PIERCE ATWOOD LLP
- Main IPC: G10L13/10
- IPC: G10L13/10 ; G10L13/033 ; G10L13/00

Abstract:
A speech model includes a sub-model corresponding to a vocal attribute. The speech model generates an output waveform using a sample model, which receives text data, and a conditioning model, which receives text metadata and produces a prosody output for use by the sample model. If, during training or runtime, a different vocal attribute is desired or needed, the sub-model is re-trained or switched to a different sub-model corresponding to the different vocal attribute.
Information query