-
Publication No.: US09405741B1
Publication date: 2016-08-02
Application No.: US14223648
Filing date: 2014-03-24
Applicant: Amazon Technologies, Inc.
Inventor: Thomas Schaaf, Sumedha Arvind Kshirsagar, Roger Alix-Gaudreau, Remus Razvan Mois, Rafal Kuklinski, Derek Christopher Murman
CPC classification number: G06F17/27, G06F17/274, G10L13/00, G10L15/08, G10L15/22
Abstract: Features are disclosed for recognizing inappropriate content in an output. The offensive content may be generated as a result of a speech processing error. A system may identify the inappropriate elements of a generated output and select among different appropriate alternatives. The system may be adjusted based on certain user characteristics. The system may be localized based on language and cultural features. The system may modify the generated output based on characteristics such as the tolerance threshold of known persons in the proximity of the system. The tolerance threshold may further be used to personalize and modify available content. Models used by the system may be further trained using input from a user.
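A minimal sketch of the selection step this abstract describes, under stated assumptions: the word-level score table `OFFENSIVENESS`, the helper names, and the rule that the least tolerant nearby person sets the limit are all hypothetical illustrations, not details taken from the patent.

```python
from typing import Dict, List, Optional

# Hypothetical per-word offensiveness scores in [0, 1].
# A real system would more likely use a trained model than a lookup table.
OFFENSIVENESS: Dict[str, float] = {"darn": 0.3, "damn": 0.7}


def offensiveness(text: str) -> float:
    """Return the highest offensiveness score of any word in the text."""
    return max((OFFENSIVENESS.get(w, 0.0) for w in text.lower().split()), default=0.0)


def select_output(candidates: List[str], tolerances: List[float]) -> Optional[str]:
    """Pick the first candidate acceptable to every known person nearby.

    candidates: alternative outputs, best first (for example an n-best list).
    tolerances: tolerance threshold of each known person in proximity.
    """
    # Assumption: the least tolerant listener determines the limit.
    limit = min(tolerances, default=1.0)
    for candidate in candidates:
        if offensiveness(candidate) <= limit:
            return candidate
    return None  # nothing acceptable; the caller may mask or suppress the output
```

Calling `select_output(["damn it", "darn it"], [0.9, 0.5])` skips the first candidate because one listener's threshold is 0.5, and falls back to the milder alternative.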
-
Publication No.: US10600408B1
Publication date: 2020-03-24
Application No.: US15933676
Filing date: 2018-03-23
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith, Christopher Schindler, Karthik Ramakrishnan, Rohit Prasad, Michael George, Rafal Kuklinski
IPC: G10L15/20, G10L13/033, G10L13/10, G10L15/18
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
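The check-and-manipulate flow in this abstract can be sketched roughly as follows. This is an illustration only: the `Content` type, the `style` field, the in-memory `_indicators` store, and both function names are hypothetical, and "manipulating" the content is reduced here to restyling it.

```python
from dataclasses import dataclass, replace
from typing import Dict


@dataclass(frozen=True)
class Content:
    text: str
    style: str = "normal"  # e.g. "normal", "whispered", "shouted"


# Hypothetical in-memory store: session id -> persisted speech quality indicator.
_indicators: Dict[str, str] = {}


def on_speech(session_id: str, detected_quality: str) -> None:
    """Persist the quality indicator determined for the user's speech."""
    _indicators[session_id] = detected_quality


def on_skill_output(session_id: str, content: Content) -> Content:
    """Return content that conforms to the persisted indicator."""
    indicator = _indicators.get(session_id, "normal")
    if content.style == indicator:
        return content  # already conforms; output without further manipulation
    # Does not conform: produce a copy rendered in the indicated style.
    return replace(content, style=indicator)
```

Keeping the indicator outside the skill is the point of the design: a skill that ignores how the user spoke still has its output brought into conformity before it reaches the user.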
-
Publication No.: US20230290346A1
Publication date: 2023-09-14
Application No.: US18098235
Filing date: 2023-01-18
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith, Christopher Schindler, Karthik Ramakrishnan, Rohit Prasad, Michael George, Rafal Kuklinski
IPC: G10L15/20, G10L13/033, G10L13/10, G10L15/18
CPC classification number: G10L15/20, G10L13/033, G10L13/10, G10L15/1807
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
Publication No.: US11562739B2
Publication date: 2023-01-24
Application No.: US16786629
Filing date: 2020-02-10
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith, Christopher Schindler, Karthik Ramakrishnan, Rohit Prasad, Michael George, Rafal Kuklinski
IPC: G10L15/20, G10L13/033, G10L13/10, G10L15/18
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
Publication No.: US10699695B1
Publication date: 2020-06-30
Application No.: US16023370
Filing date: 2018-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Adam Franciszek Nadolski, Daniel Korzekwa, Thomas Edward Merritt, Marco Nicolis, Bartosz Putrycz, Roberto Barra Chicote, Rafal Kuklinski, Wiktor Dolecki
IPC: G10L13/10, G10L13/06, G10L13/047
Abstract: During text-to-speech processing, audio data corresponding to a word part, word, or group of words is generated using a trained model and used by a unit selection engine to create output audio. The audio data is generated at least when an input word is unrecognized or when a cost of a unit selection is too high.
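The fallback condition in this abstract (generate audio when a word is unrecognized or when the unit-selection cost is too high) might be sketched as below. The unit database layout, the `generate` callback standing in for the trained model, and the `max_cost` threshold are assumptions made for illustration; the sketch also works word by word, whereas the abstract covers word parts and groups of words as well.

```python
from typing import Callable, Dict, List, Tuple

Audio = bytes
# Hypothetical unit database: word -> (recorded audio, unit-selection cost).
UnitDB = Dict[str, Tuple[Audio, float]]


def synthesize(
    words: List[str],
    unit_db: UnitDB,
    generate: Callable[[str], Audio],
    max_cost: float = 1.0,
) -> List[Tuple[str, Audio]]:
    """Choose, per word, between a recorded unit and model-generated audio."""
    output: List[Tuple[str, Audio]] = []
    for word in words:
        entry = unit_db.get(word)
        if entry is None or entry[1] > max_cost:
            # Unrecognized word, or the best available unit is too costly:
            # fall back to audio produced by the (hypothetical) trained model.
            output.append(("generated", generate(word)))
        else:
            output.append(("recorded", entry[0]))
    return output
```

The source label attached to each segment exists only to make the decision visible in the example; a unit selection engine would simply concatenate the audio.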
-
Publication No.: US20200251104A1
Publication date: 2020-08-06
Application No.: US16786629
Filing date: 2020-02-10
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith, Christopher Schindler, Karthik Ramakrishnan, Rohit Prasad, Michael George, Rafal Kuklinski
IPC: G10L15/20, G10L15/18, G10L13/10, G10L13/033
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
Publication No.: US10140973B1
Publication date: 2018-11-27
Application No.: US15266116
Filing date: 2016-09-15
Applicant: Amazon Technologies, Inc.
Inventor: Manish Kumar Dalmia, Rafal Kuklinski
Abstract: Systems, methods, and devices for generating text-to-speech output using previously captured speech are described. Spoken audio is obtained and undergoes speech processing to create text. The resulting text is stored with the spoken audio, with both the text and the spoken audio being associated with the individual that spoke the audio. Various spoken audio and corresponding text are stored over time to create a library of speech units. When the individual sends a text message to a recipient, the text message is processed to determine portions of text, and the portions of text are compared to the library of text associated with the individual. When text in the library is identified, the system selects the spoken audio units associated with the identified stored text. The selected spoken audio units are then used to generate output audio data corresponding to the original text message, with the output audio data being sent to a device of the message recipient.
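One way to picture the lookup step in this abstract is a match of the message text against the sender's stored library. The sketch below assumes a dictionary keyed by lowercased text and a greedy longest-phrase-first strategy; both are hypothetical choices, since the abstract does not say how portions of text are determined or compared.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical per-speaker library: stored text -> previously captured spoken audio.
Library = Dict[str, bytes]


def select_units(message: str, library: Library) -> List[Tuple[str, Optional[bytes]]]:
    """Split a text message into portions and look each one up in the library.

    Longest matching phrase wins; a word with no stored audio is returned
    with None so the caller can decide how to synthesize it.
    """
    words = message.lower().split()
    units: List[Tuple[str, Optional[bytes]]] = []
    i = 0
    while i < len(words):
        # Try the longest remaining phrase first, then progressively shorter ones.
        for j in range(len(words), i, -1):
            phrase = " ".join(words[i:j])
            if phrase in library:
                units.append((phrase, library[phrase]))
                i = j
                break
        else:
            units.append((words[i], None))  # no stored audio for this word
            i += 1
    return units
```

Preferring longer phrases means fewer joins between audio units, which is why "see you" is chosen over "see" when both are stored.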