-
公开(公告)号:US09679554B1
公开(公告)日:2017-06-13
申请号:US14311711
申请日:2014-06-23
Applicant: Amazon Technologies, Inc.
Inventor: Michal Czuczman , Michal Grzegorz Kurpanik , Remus Razvan Mois
Abstract: A system may determine text for inclusion in a voice corpus for use in text-to-speech (TTS) processing using an interface that allows multiple entities to connect and review potential text segments concurrently. The interface may allow networked communication with the system. Individual text segments may be approved or rejected by reviewing entities, such as proofreaders. The system may prioritize text segments to send to reviewers and may re-prioritize text segments based on a linguistic coverage of previously accepted text segments.
-
公开(公告)号:US12154544B1
公开(公告)日:2024-11-26
申请号:US17205493
申请日:2021-03-18
Applicant: Amazon Technologies, Inc.
Inventor: Michal Czuczman , You Wang , Masaki Noguchi , Viacheslav Klimkov
IPC: G10L13/08 , G10L15/05 , G10L15/16 , G10L15/187
Abstract: A speech-processing system receives input data representing text. An encoder processes segments of the text to determine embedding data, and a decoder processes the embedding data to determine one or more categories associated with each segment. Output data is determined by selecting words based on the segments and categories.
-
公开(公告)号:US09508338B1
公开(公告)日:2016-11-29
申请号:US14081233
申请日:2013-11-15
Applicant: Amazon Technologies, Inc.
Inventor: Michal Tadeusz Kaszczuk , Maciej Tegi , Michal Czuczman , Remus Razvan Mois
CPC classification number: G10L13/02 , G10L13/06 , G10L2013/083
Abstract: A text-to-speech (TTS) system may be configured to incorporate breath sounds in the output speech. By incorporating breath sounds into speech output from text a TTS system may be able to mimic more naturally sounding human speech, particularly for long-form narration of text longer than short phrases. The breath sounds may be stored as units for unit selection or may be generated during parametric synthesis. The acoustic features of the breath sounds and duration between breaths may depend upon the punctuation of text, the linguistic distance between breaths, the breaks between intonational phrases, the linguistic context of the breaths, and other factors.
Abstract translation: 文本到语音(TTS)系统可以被配置为在输出语音中包含呼吸音。 通过将呼吸音融合到文本的语音输出中,TTS系统可能能够模仿更自然地发出的人类语音,特别是对于长于短语的文本的长时间叙述。 呼吸音可以存储为用于单位选择的单位,或者可以在参数合成期间产生。 呼吸声的声学特征和呼吸之间的持续时间可能取决于文本的标点符号,呼吸之间的语言距离,语调之间的间隔,呼吸的语言背景以及其他因素。
-
-