-
公开(公告)号:US10600408B1
公开(公告)日:2020-03-24
申请号:US15933676
申请日:2018-03-23
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski
IPC: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/18
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
公开(公告)号:US12211517B1
公开(公告)日:2025-01-28
申请号:US17475699
申请日:2021-09-15
Applicant: Amazon Technologies, Inc.
Inventor: Roland Maximilian Rolf Maas , Bjorn Hoffmeister , Ariya Rastrow , James Garnet Droppo , Veerdhawal Pande , Maarten Van Segbroeck , Gautam Tiwari , Andrew Smith , Eli Joshua Fidler
Abstract: A speech-processing system may determine potential endpoints in a user's speech. Such endpoint prediction may include determining a potential endpoint in a stream of audio data, and may additionally including determining an endpoint score representing a likelihood that the potential endpoint represents an end of speech representing a complete user input. When the potential endpoint has been determined, the system may publish a transcript of speech that preceded the potential endpoint, and send it to downstream components. The system may continue to transcribe audio data and determine additional potential endpoints while the downstream components process the transcript. The downstream components may determine whether the transcript is complete; e.g., represents the entirety of the user input. Final endpoint determinations may be made based on the results of the downstream processing including automatic speech recognition, natural language understanding, etc.
-
公开(公告)号:US20240105171A1
公开(公告)日:2024-03-28
申请号:US17952630
申请日:2022-09-26
Applicant: Amazon Technologies, Inc.
Inventor: Ramya Chaganti , Mark Lawrence , Ryan McCrate , Melanie C. B. Gens , Andrew Smith , Raja Bose , Zexiong Yan , Jyoti Chhabra
CPC classification number: G10L15/22 , G10L15/063 , G10L15/08 , G10L2015/088 , G10L2015/223
Abstract: Techniques for enabling access in a multi-assistant speech processing system are described, where a first assistant system may use components of a second assistant system as data processing components. Runtime operational data and user input data related to the first assistant may be kept separate from the processing data and input data related to the second assistant by propagating a first account ID, for user inputs directed to the first assistant, through the processing pipeline, and using a second account for user inputs directed to the second assistant. A mapping between the first account ID and the second account ID may be accessible to a select number of system components. Handoffs between the two assistants are handled in a manner where data related to one assistant is not accessible by the other assistant.
-
公开(公告)号:US20200251104A1
公开(公告)日:2020-08-06
申请号:US16786629
申请日:2020-02-10
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski
IPC: G10L15/20 , G10L15/18 , G10L13/10 , G10L13/033
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
公开(公告)号:US12254879B2
公开(公告)日:2025-03-18
申请号:US17952630
申请日:2022-09-26
Applicant: Amazon Technologies, Inc.
Inventor: Ramya Chaganti , Mark Lawrence , Ryan McCrate , Melanie C B Gens , Andrew Smith , Raja Bose , Zexiong Yan , Jyoti Chhabra
Abstract: Techniques for enabling access in a multi-assistant speech processing system are described, where a first assistant system may use components of a second assistant system as data processing components. Runtime operational data and user input data related to the first assistant may be kept separate from the processing data and input data related to the second assistant by propagating a first account ID, for user inputs directed to the first assistant, through the processing pipeline, and using a second account for user inputs directed to the second assistant. A mapping between the first account ID and the second account ID may be accessible to a select number of system components. Handoffs between the two assistants are handled in a manner where data related to one assistant is not accessible by the other assistant.
-
公开(公告)号:US20230290346A1
公开(公告)日:2023-09-14
申请号:US18098235
申请日:2023-01-18
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski
IPC: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/18
CPC classification number: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/1807
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
公开(公告)号:US11562739B2
公开(公告)日:2023-01-24
申请号:US16786629
申请日:2020-02-10
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Smith , Christopher Schindler , Karthik Ramakrishnan , Rohit Prasad , Michael George , Rafal Kuklinski
IPC: G10L15/20 , G10L13/033 , G10L13/10 , G10L15/18
Abstract: Techniques for ensuring content output to a user conforms to a quality of the user's speech, even when a speechlet or skill ignores the speech's quality, are described. When a system receives speech, the system determines an indicator of the speech's quality (e.g., whispered, shouted, fast, slow, etc.) and persists the indicator in memory. When the system receives output content from a speechlet or skill, the system checks whether the output content is in conformity with the speech quality indicator. If the content conforms to the speech quality indicator, the system may cause the content to be output to the user without further manipulation. But, if the content does not conform to the speech quality indicator, the system may manipulate the content to render it in conformity with the speech quality indicator and output the manipulated content to the user.
-
-
-
-
-
-