Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Rafal Kuklinski"

1.

发明授权
Controlling offensive content in output 有权
Title translation: 控制产出中的令人反感的内容

公开(公告)号：US09405741B1

公开(公告)日：2016-08-02

申请号：US14223648

申请日：2014-03-24

Applicant: Amazon Technologies, Inc.

Inventor： Thomas Schaaf , Sumedha Arvind Kshirsagar , Roger Alix-Gaudreau , Remus Razvan Mois , Rafal Kuklinski , Derek Christopher Murman

IPC: G06F17/27 , G10L15/08

CPC classification number: G06F17/27 , G06F17/274 , G10L13/00 , G10L15/08 , G10L15/22

Abstract: Features are disclosed for recognizing inappropriate content in an output. The offensive content may be generated as a result of a speech processing error. A system may identify the inappropriate elements of a generated output and select among different appropriate alternatives. The system may be adjusted based on certain user characteristics. The system may be localized based on language and cultural features. The system may modify the generated output based on characteristics such as the tolerance threshold of known persons in the proximity of the system. The tolerance threshold may further be used to personalize and modify available content. Models used by the system may be further trained using input from a user.

Abstract translation: 公开了用于识别输出中的不适当内容的特征。可能由于语音处理错误而产生令人反感的内容。系统可以识别生成的输出的不适当元素，并在不同的适当替代方案中进行选择。可以基于某些用户特征来调整系统。该系统可以基于语言和文化特征进行本地化。系统可以基于诸如在系统附近的已知人员的容许阈值的特性来修改生成的输出。公差阈值还可用于个性化和修改可用内容。可以使用来自用户的输入来进一步训练系统使用的模型。

2.

发明授权
Content output management based on speech quality 有权

公开(公告)号：US10600408B1

公开(公告)日：2020-03-24

申请号：US15933676

申请日：2018-03-23

Applicant: Amazon Technologies, Inc.

Inventor： Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski

IPC: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/18

Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.

3.

发明公开
CONTENT OUTPUT MANAGEMENT BASED ON SPEECH QUALITY 审中-公开

公开(公告)号：US20230290346A1

公开(公告)日：2023-09-14

申请号：US18098235

申请日：2023-01-18

Applicant: Amazon Technologies, Inc.

Inventor： Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski

IPC: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/18

CPC classification number: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/1807

Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.

4.

发明授权
Content output management based on speech quality 有权

公开(公告)号：US11562739B2

公开(公告)日：2023-01-24

申请号：US16786629

申请日：2020-02-10

Applicant: Amazon Technologies, Inc.

Inventor： Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski

IPC: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/18

Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.

5.

发明授权
Text-to-speech (TTS) processing 有权

公开(公告)号：US10699695B1

公开(公告)日：2020-06-30

申请号：US16023370

申请日：2018-06-29

Applicant: Amazon Technologies, Inc.

Inventor： Adam Franciszek Nadolski , Daniel Korzekwa , Thomas Edward Merritt , Marco Nicolis , Bartosz Putrycz , Roberto Barra Chicote , Rafal Kuklinski , Wiktor Dolecki

IPC: G10L13/10 , G10L13/06 , G10L13/047

Abstract: During text-to-speech processing, audio data corresponding to a word part, word, or group of words is generated using a trained model and used by a unit selection engine to create output audio. The audio data is generated at least when an input word is unrecognized or when a cost of a unit selection is too high.

6.

发明申请
CONTENT OUTPUT MANAGEMENT BASED ON SPEECH QUALITY 审中-公开

公开(公告)号：US20200251104A1

公开(公告)日：2020-08-06

申请号：US16786629

申请日：2020-02-10

Applicant: Amazon Technologies, Inc.

Inventor： Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski

IPC: G10L15/20 , G10L15/18 , G10L13/10 , G10L13/033

Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.

7.

发明授权
Text-to-speech processing using previously speech processed data 有权

公开(公告)号：US10140973B1

公开(公告)日：2018-11-27

申请号：US15266116

申请日：2016-09-15

Applicant: Amazon Technologies, Inc.

Inventor： Manish Kumar Dalmia , Rafal Kuklinski

IPC: G10L13/08 , G10L13/033 , G10L15/26 , G10L13/10 , G10L15/02 , G10L13/07 , G06F17/27

Abstract: Systems, methods, and devices for generating text-to-speech output using previously captured speech are described. Spoken audio is obtained and undergoes speech processing to create text. The resulting text is stored with the spoken audio, with both the text and the spoken audio being associated with the individual that spoke the audio. Various spoken audio and corresponding text are stored over time to create a library of speech units. When the individual sends a text message to a recipient, the text message is processed to determine portions of text, and the portions of text are compared to the library of text associated with the individual. When text in the library is identified, the system selects the spoken audio units associated with the identified stored text. The selected spoken audio units are then used to generate output audio data corresponding to the original text message, with the output audio data being sent to a device of the message recipient.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification