Unit-selection text-to-speech synthesis based on predicted concatenation parameters

Invention Grant

US09934775B2 Unit-selection text-to-speech synthesis based on predicted concatenation parameters (Active)

Patent Title: Unit-selection text-to-speech synthesis based on predicted concatenation parameters
Application No.: US15266930

Application Date: 2016-09-15
Publication No.: US09934775B2

Publication Date: 2018-04-03
Inventor: Tuomo J. Raitio, Kishore Sunkeswari Prahallad, Alistair D. Conkie, Ladan Golipour, David A. Winarsky
Applicant: Apple Inc.
Applicant Address: US CA Cupertino
Assignee: Apple Inc.
Current Assignee: Apple Inc.
Current Assignee Address: US CA Cupertino
Agency: Dentons US LLP
Main IPC: G10L13/10
IPC: G10L13/10; G10L13/033; G10L13/06

Abstract:

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.
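
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say which acoustic features, statistical parameters, or cost formulas are used, so everything below is an assumption made for illustration. Each candidate carries a single scalar feature at its left and right edges, the predicted statistical parameters are taken to be a mean and a variance, both costs are variance-normalised squared distances, and the names `Candidate`, `Predicted`, `target_cost`, `concat_cost`, and `select_units` are hypothetical. The combined cost is minimised with a standard Viterbi search.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    start: float  # assumed scalar acoustic feature at the segment's left edge
    end: float    # same feature at the segment's right edge


@dataclass
class Predicted:
    mean: float       # predicted mean of the feature for this target unit
    var: float        # predicted variance (must be > 0)
    join_mean: float  # predicted mean feature change across the join into this unit
    join_var: float   # predicted variance of that change (must be > 0)


def target_cost(cand: Candidate, pred: Predicted) -> float:
    # Assumed form: squared distance of the segment midpoint from the
    # predicted mean, scaled by the predicted variance.
    mid = (cand.start + cand.end) / 2.0
    return (mid - pred.mean) ** 2 / pred.var


def concat_cost(prev: Candidate, cur: Candidate, pred: Predicted) -> float:
    # Assumed form: how far the observed jump at the join is from the
    # predicted jump, scaled by the predicted variance.
    jump = cur.start - prev.end
    return (jump - pred.join_mean) ** 2 / pred.join_var


def select_units(predicted, candidates, w_target=1.0, w_concat=1.0):
    """Viterbi search for the candidate sequence with the lowest combined cost."""
    if not predicted:
        return [], 0.0
    # costs[i][j]: best cost of any path ending in candidate j of unit i.
    costs = [[w_target * target_cost(c, predicted[0]) for c in candidates[0]]]
    back = [[-1] * len(candidates[0])]
    for i in range(1, len(predicted)):
        row_cost, row_back = [], []
        for cur in candidates[i]:
            best_j, best = min(
                ((j, costs[i - 1][j] + w_concat * concat_cost(prev, cur, predicted[i]))
                 for j, prev in enumerate(candidates[i - 1])),
                key=lambda pair: pair[1],
            )
            row_cost.append(best + w_target * target_cost(cur, predicted[i]))
            row_back.append(best_j)
        costs.append(row_cost)
        back.append(row_back)
    # Trace back from the cheapest final candidate.
    j = min(range(len(costs[-1])), key=costs[-1].__getitem__)
    total = costs[-1][j]
    path = []
    for i in range(len(predicted) - 1, -1, -1):
        path.append(candidates[i][j])
        j = back[i][j]
    path.reverse()
    return path, total


# Toy data, invented for illustration only.
predicted = [Predicted(1.0, 1.0, 0.0, 1.0), Predicted(2.0, 1.0, 0.0, 1.0)]
candidates = [
    [Candidate("a1", 1.0, 1.0), Candidate("a2", 0.0, 4.0)],
    [Candidate("b1", 1.0, 3.0), Candidate("b2", 4.0, 4.0)],
]
path, total = select_units(predicted, candidates)
print([c.name for c in path], total)
```

In this toy data, `a1` followed by `b1` matches the predicted means exactly and joins without a jump, so the search returns that pair with a combined cost of zero. Setting `w_concat=0.0` reduces the search to picking the best target match for each unit independently.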

Published/Granted literature

US20170345411A1 Unit-selection text-to-speech synthesis based on predicted concatenation parameters. Publication date: 2017-11-30

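IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/08	.Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
G10L13/10	..Prosody rules derived from text; stress or intonation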