Unnatural prosody detection in speech synthesis

Invention Grant

US08583438B2 Unnatural prosody detection in speech synthesis (In force)

Patent Title: Unnatural prosody detection in speech synthesis
Application No.: US11903020

Application Date: 2007-09-20
Publication No.: US08583438B2

Publication Date: 2013-11-12
Inventor: Yong Zhao, Frank Kao-ping Soong, Min Chu, Lijuan Wang
Applicant: Yong Zhao, Frank Kao-ping Soong, Min Chu, Lijuan Wang
Applicant Address: US WA Redmond
Assignee: Microsoft Corporation
Current Assignee: Microsoft Corporation
Current Assignee Address: US WA Redmond
Agency: Collins & Collins Intellectual, LLC
Agent: L. Alan Collins
Main IPC: G10L13/00
IPC: G10L13/00

Abstract:

Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.
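The search, evaluate, and replace loop described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `Unit` fields, the toy `concat_cost`, and the pitch-deviation check `is_unnatural`, which merely stands in for the offline-trained prosody model. The sketch runs a Viterbi search over a lattice of candidate units, flags units on the best path whose prosody looks unnatural, prunes them from the lattice, and searches again until no prunable unit is flagged.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Unit:
    name: str
    pitch: float        # hypothetical prosodic feature (Hz)
    target_cost: float  # hypothetical cost of using this unit at its position


def concat_cost(a: Unit, b: Unit) -> float:
    # Toy join cost (assumption): penalize pitch jumps between adjacent units.
    return abs(a.pitch - b.pitch) / 100.0


def viterbi(lattice: List[List[Unit]]) -> List[Unit]:
    # best[i][j] = (cumulative cost, back-pointer) for candidate j at position i
    best = [[(u.target_cost, -1) for u in lattice[0]]]
    for i in range(1, len(lattice)):
        column = []
        for u in lattice[i]:
            cost, back = min(
                (best[i - 1][k][0] + concat_cost(p, u), k)
                for k, p in enumerate(lattice[i - 1])
            )
            column.append((cost + u.target_cost, back))
        best.append(column)
    # Trace back from the cheapest final candidate.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    path = []
    for i in range(len(lattice) - 1, -1, -1):
        path.append(lattice[i][j])
        j = best[i][j][1]
    return path[::-1]


def is_unnatural(unit: Unit, expected_pitch: float, threshold: float) -> bool:
    # Stand-in for the trained prosody model (assumption): flag a unit whose
    # pitch deviates from the expected pitch by more than the threshold.
    return abs(unit.pitch - expected_pitch) > threshold


def synthesize(lattice, expected, threshold=20.0, max_rounds=10):
    lattice = [list(col) for col in lattice]  # work on a copy
    for _ in range(max_rounds):
        path = viterbi(lattice)
        # Only flag units that still have an alternative at their position.
        flagged = [
            i for i, u in enumerate(path)
            if is_unnatural(u, expected[i], threshold) and len(lattice[i]) > 1
        ]
        if not flagged:
            return path
        for i in flagged:
            lattice[i].remove(path[i])  # prune the lattice, then search again
    return viterbi(lattice)
```

In this sketch, lowering `threshold` is one simple way to bias detection toward false alarms rather than misses: more units are flagged and replaced, at the price of extra search rounds. A unit that is the only remaining candidate at its position is kept even if flagged, so the loop always terminates.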

Public/Granted literature

US20090083036A1 Unnatural prosody detection in speech synthesis Public/Granted day: 2009-03-26

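IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems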