Invention Grant
US08019605B2 Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets (Active)

Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
Abstract:
The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases which, when read by a voice talent, result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets results. The reduced set includes each of the unfulfilled speech assets. When the speech assets derived from this reduced corpus are combined with the existing speech assets, the result is a voice with a complete set of speech assets.
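The two steps named in the abstract (determining the unfulfilled speech assets, then constructing a reduced script that covers them) can be illustrated with a minimal sketch. The abstract does not say how phrases are selected, so the greedy set-cover heuristic below is an assumption made for illustration, not the patented method. The function names, the asset labels such as "b-c", and the candidate phrases are all hypothetical.

```python
from typing import Dict, List, Set


def unfulfilled_assets(required: Set[str], existing: Set[str]) -> Set[str]:
    """Assets needed for full phonetic coverage that the pre-recorded audio lacks."""
    return required - existing


def build_reduced_script(candidates: Dict[str, Set[str]], missing: Set[str]) -> List[str]:
    """Greedy set cover (an assumed heuristic): repeatedly pick the candidate
    phrase that supplies the most still-missing assets."""
    remaining = set(missing)
    script: List[str] = []
    while remaining:
        # Iterate in sorted order so that ties are broken deterministically.
        best = max(
            sorted(candidates),
            key=lambda phrase: len(candidates[phrase] & remaining),
            default=None,
        )
        gained = candidates[best] & remaining if best is not None else set()
        if not gained:
            raise ValueError(f"no candidate phrase covers: {sorted(remaining)}")
        script.append(best)
        remaining -= gained
    return script


# Hypothetical data: asset labels stand in for units such as diphones.
required = {"a-b", "b-c", "c-d", "d-e", "e-f"}
existing = {"a-b"}  # already derived from pre-recorded SUI phrases
candidates = {
    "phrase one": {"b-c", "c-d"},
    "phrase two": {"c-d", "d-e", "e-f"},
    "phrase three": {"b-c"},
}

missing = unfulfilled_assets(required, existing)
reduced_script = build_reduced_script(candidates, missing)
print(reduced_script)
```

In this sketch the voice talent would read only the phrases in `reduced_script`, and the assets derived from that recording would be merged with the existing ones to reach full coverage.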