CONTEXTUAL TEXT-TO-SPEECH PROCESSING

Invention Application

US20200152169A1 CONTEXTUAL TEXT-TO-SPEECH PROCESSING 审中-公开

Please log in to see more content

Patent Title: CONTEXTUAL TEXT-TO-SPEECH PROCESSING
Application No.: US16665886

Application Date: 2019-10-28
Publication No.: US20200152169A1

Publication Date: 2020-05-14
Inventor: Roberto Barra Chicote , Javier Latorre , Adam Franciszek Nadolski , Viacheslav Klimkov , Thomas Edward Merritt
Applicant: Amazon Technologies, Inc.
Main IPC: G10L13/10
IPC: G10L13/10 ; G10L13/033 ; G10L13/047

Abstract:

A text-to-speech (TTS) system that is capable of considering characteristics of various portions of text data in order to create continuity between segments of synthesized speech. The system can analyze text portions of a work and create feature vectors including data corresponding to characteristics of the individual portions and/or the overall work. A TTS processing component can then consider feature vector(s) from other portions when performing TTS processing on text of a first portion, thus giving the TTS component some intelligence regarding other portions of the work, which can then result in more continuity between synthesized speech segments.

Public/Granted literature

US11443733B2 Contextual text-to-speech processing Public/Granted day:2022-09-13

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定
G10L13/10	..来自文本的韵律规则；重音或声调