- 专利标题: Unnatural prosody detection in speech synthesis
-
申请号: US11903020申请日: 2007-09-20
-
公开(公告)号: US08583438B2公开(公告)日: 2013-11-12
- 发明人: Yong Zhao , Frank Kao-ping Soong , Min Chu , Lijuan Wang
- 申请人: Yong Zhao , Frank Kao-ping Soong , Min Chu , Lijuan Wang
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 代理机构: Collins & Collins Intellectual, LLC
- 代理商 L. Alan Collins
- 主分类号: G10L13/00
- IPC分类号: G10L13/00
摘要:
Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.
公开/授权文献
- US20090083036A1 Unnatural prosody detection in speech synthesis 公开/授权日:2009-03-26
信息查询