Invention Grant
- Patent Title: End-to-end text-to-speech conversion
-
Application No.: US17391799Application Date: 2021-08-02
-
Publication No.: US11862142B2Publication Date: 2024-01-02
- Inventor: Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Priority: GR 170100126 2017.03.29
- Main IPC: G10L13/06
- IPC: G10L13/06 ; G10L13/08 ; G06N3/08 ; G10L25/18 ; G10L25/30 ; G10L13/04 ; G06N3/084 ; G10L15/16 ; G06N3/045

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
Public/Granted literature
- US20210366463A1 END-TO-END TEXT-TO-SPEECH CONVERSION Public/Granted day:2021-11-25
Information query