Invention Application
- Patent Title: TEXT-TO-SPEECH PROCESSING
-
Application No.: US16586007Application Date: 2019-09-27
-
Publication No.: US20210097976A1Publication Date: 2021-04-01
- Inventor: Roberto Barra Chicote , Vatsal Aggarwal , Andrew Paul Breen , Javier Gonzalez Hernandez , Nishant Prateek
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Main IPC: G10L13/10
- IPC: G10L13/10 ; G10L13/047 ; G06F17/27 ; G10L13/033

Abstract:
During text-to-speech processing, a speech model creates synthesized speech that corresponds to input data. The speech model may include an encoder for encoding the input data into a context vector and a decoder for decoding the context vector into spectrogram data. The speech model may further include a voice decoder that receives vocal characteristic data representing a desired vocal characteristic of synthesized speech. The voice decoder may process the vocal characteristic data to determine configuration data, such as weights, for use by the speech decoder.
Public/Granted literature
- US11373633B2 Text-to-speech processing using input voice characteristic data Public/Granted day:2022-06-28
Information query