Patent search: ap:("Amazon Technologies, Inc.") AND inv:"Michal Czuczman"

1.

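Granted invention patent
Text-to-speech corpus development system (in force)

Publication (announcement) number: US09679554B1

Publication (announcement) date: 2017-06-13

Application number: US14311711

Application date: 2014-06-23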

Applicant: Amazon Technologies, Inc.

Inventor: Michal Czuczman, Michal Grzegorz Kurpanik, Remus Razvan Mois

IPC: G06F17/27, G10L13/08, G06F17/28

CPC classification number: G10L13/08, G06F17/27, G06F17/28, G06Q10/06, G10L13/00

Abstract: A system may determine text for inclusion in a voice corpus for use in text-to-speech (TTS) processing using an interface that allows multiple entities to connect and review potential text segments concurrently. The interface may allow networked communication with the system. Individual text segments may be approved or rejected by reviewing entities, such as proofreaders. The system may prioritize text segments to send to reviewers and may re-prioritize text segments based on a linguistic coverage of previously accepted text segments.

2.

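Granted invention patent
Synthetic speech processing (in force)

Publication (announcement) number: US12154544B1

Publication (announcement) date: 2024-11-26

Application number: US17205493

Application date: 2021-03-18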

Applicant: Amazon Technologies, Inc.

Inventor: Michal Czuczman, You Wang, Masaki Noguchi, Viacheslav Klimkov

IPC: G10L13/08, G10L15/05, G10L15/16, G10L15/187

Abstract: A speech-processing system receives input data representing text. An encoder processes segments of the text to determine embedding data, and a decoder processes the embedding data to determine one or more categories associated with each segment. Output data is determined by selecting words based on the segments and categories.

3.

Granted invention patent
Inserting breath sounds into text-to-speech output (in force)

Publication (announcement) number: US09508338B1

Publication (announcement) date: 2016-11-29

Application number: US14081233

Application date: 2013-11-15

Applicant: Amazon Technologies, Inc.

Inventor: Michal Tadeusz Kaszczuk, Maciej Tegi, Michal Czuczman, Remus Razvan Mois

IPC: G10L13/08, G10L13/02

CPC classification number: G10L13/02, G10L13/06, G10L2013/083

Abstract: A text-to-speech (TTS) system may be configured to incorporate breath sounds in the output speech. By incorporating breath sounds into speech output from text a TTS system may be able to mimic more naturally sounding human speech, particularly for long-form narration of text longer than short phrases. The breath sounds may be stored as units for unit selection or may be generated during parametric synthesis. The acoustic features of the breath sounds and duration between breaths may depend upon the punctuation of text, the linguistic distance between breaths, the breaks between intonational phrases, the linguistic context of the breaths, and other factors.

