TEXT-TO-SPEECH PROCESSING

Invention Application

US20210097976A1 TEXT-TO-SPEECH PROCESSING 有权

Please log in to see more content

Patent Title: TEXT-TO-SPEECH PROCESSING
Application No.: US16586007

Application Date: 2019-09-27
Publication No.: US20210097976A1

Publication Date: 2021-04-01
Inventor: Roberto Barra Chicote , Vatsal Aggarwal , Andrew Paul Breen , Javier Gonzalez Hernandez , Nishant Prateek
Applicant: Amazon Technologies, Inc.
Applicant Address: US WA Seattle
Assignee: Amazon Technologies, Inc.
Current Assignee: Amazon Technologies, Inc.
Current Assignee Address: US WA Seattle
Main IPC: G10L13/10
IPC: G10L13/10 ; G10L13/047 ; G06F17/27 ; G10L13/033

Abstract:

During text-to-speech processing, a speech model creates synthesized speech that corresponds to input data. The speech model may include an encoder for encoding the input data into a context vector and a decoder for decoding the context vector into spectrogram data. The speech model may further include a voice decoder that receives vocal characteristic data representing a desired vocal characteristic of synthesized speech. The voice decoder may process the vocal characteristic data to determine configuration data, such as weights, for use by the speech decoder.

Public/Granted literature

US11373633B2 Text-to-speech processing using input voice characteristic data Public/Granted day:2022-06-28

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定
G10L13/10	..来自文本的韵律规则；重音或声调