Invention Grant
US08019605B2 Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
Status: In force
- Patent Title: Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
- Application No.: US11748256
- Application Date: 2007-05-14
- Publication No.: US08019605B2
- Publication Date: 2011-09-13
- Inventor: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
- Applicant: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
- Applicant Address: US MA Burlington
- Assignee: Nuance Communications, Inc.
- Current Assignee: Nuance Communications, Inc.
- Current Assignee Address: US MA Burlington
- Agency: Wolf, Greenfield & Sacks, P.C.
- Main IPC: G10L13/08
- IPC: G10L13/08; G10L13/06

Abstract:
The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases which, when read by a voice talent, result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets results. The reduced set includes each of the unfulfilled speech assets. When this reduced set is combined with the existing speech assets, the result is a voice with a complete set of speech assets.
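The selection step described in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes speech assets can be represented as plain string labels, that a pool of candidate phrases with known asset coverage is available, and that a greedy set-cover heuristic is an acceptable way to pick phrases. The function names, the labels, and the sample phrases are all hypothetical.

```python
from typing import Dict, List, Set


def unfulfilled_assets(required: Set[str], existing: Set[str]) -> Set[str]:
    """Assets needed for full coverage that the pre-recorded audio lacks."""
    return required - existing


def build_reduced_script(candidates: Dict[str, Set[str]],
                         missing: Set[str]) -> List[str]:
    """Greedily pick phrases until every missing asset is covered.

    Assumption: a greedy set-cover heuristic; the source does not state
    how phrases are chosen.
    """
    remaining = set(missing)
    script: List[str] = []
    while remaining and candidates:
        # Pick the phrase covering the most still-missing assets;
        # sorting makes ties resolve deterministically.
        best = max(sorted(candidates),
                   key=lambda phrase: len(candidates[phrase] & remaining))
        gained = candidates[best] & remaining
        if not gained:
            break  # no candidate covers anything that is still missing
        script.append(best)
        remaining -= gained
    return script


# Hypothetical asset labels and phrases, for illustration only.
required = {"a-b", "b-c", "c-d", "d-e", "e-f"}
existing = {"a-b"}  # assets already derived from pre-recorded SUI prompts
candidates = {
    "phrase one": {"b-c", "c-d"},
    "phrase two": {"c-d"},
    "phrase three": {"d-e", "e-f", "a-b"},
}

missing = unfulfilled_assets(required, existing)
reduced_script = build_reduced_script(candidates, missing)
```

In this sketch, the voice talent would only need to read the phrases in `reduced_script`. The assets extracted from that reduced corpus, together with `existing`, would then cover `required`.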