-
Publication number: US11705108B1
Publication date: 2023-07-18
Application number: US17547586
Filing date: 2021-12-10
Applicant: Amazon Technologies, Inc.
Inventor: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
IPC: G10L13/08, G10L15/183, G10L15/06, G10L15/08
CPC classification number: G10L13/08, G10L15/063, G10L15/083, G10L15/183
Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determine that a first skill component is to determine a response to the user input, and determine that a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the response. The system may also receive, from the second skill component, second image data corresponding to the supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.
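The flow described in this abstract can be illustrated with a minimal Python sketch. Every name in it (`Template`, `build_present_command`, the `"Present"` directive, the region names, the use of bytes for image data) is hypothetical and chosen only for illustration; the abstract does not specify data structures or a command format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Template:
    """Hypothetical layout: a name plus the regions it can display."""
    name: str
    regions: tuple


def build_present_command(template, image_data_by_region):
    """Assemble one display command from the image data each skill returned.

    image_data_by_region maps a region name to the image data (bytes here,
    as an assumption) supplied by the skill component filling that region.
    """
    missing = [r for r in template.regions if r not in image_data_by_region]
    if missing:
        raise ValueError(f"no image data for regions: {missing}")
    return {
        "directive": "Present",  # assumed command name
        "template": template.name,
        "regions": {r: image_data_by_region[r] for r in template.regions},
    }


# A template configured for both the response and the supplemental content.
template = Template(name="response_with_supplement",
                    regions=("response", "supplemental"))

command = build_present_command(template, {
    "response": b"<first image data>",       # from the first skill component
    "supplemental": b"<second image data>",  # from the second skill component
})
```

In this sketch the template decides which regions must be filled, so a missing skill output is rejected before any command is sent to the device.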
-
Publication number: US20240321261A1
Publication date: 2024-09-26
Application number: US18670819
Filing date: 2024-05-22
Applicant: Amazon Technologies, Inc.
Inventor: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
IPC: G10L13/08, G10L15/06, G10L15/08, G10L15/183
CPC classification number: G10L13/08, G10L15/063, G10L15/083, G10L15/183
Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determine that a first skill component is to determine a response to the user input, and determine that a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the response. The system may also receive, from the second skill component, second image data corresponding to the supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.
-
Publication number: US20210142794A1
Publication date: 2021-05-13
Application number: US17099875
Filing date: 2020-11-17
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Leo Mathias, Bala Murali Krishna Ummaneni, Ryan Scott Aldrich, Diamond Bishop, Ruhi Sarikaya, Chetan Nagaraj Naik
IPC: G10L15/18, G10L15/22, G06F40/30, G06F40/295, G06F16/9032
Abstract: A system for processing user utterances and/or text-based queries tracks entities and other context data of a current dialog between the system and the user, and can fill slots for new intents of the dialog by performing statistical processing on previously mentioned entities with respect to the current slots to be filled. The system may compare a previously mentioned entity to a current slot to be filled using vector representations, such as word embeddings, of the current utterance, dialog history, current intent, name of the entity under consideration, category of the current slot to be filled, distance between the current dialog turn and the dialog turn that mentioned the entity, and other considerations. The individual vectors may be weighted according to an attention operation and processed by a trained decoder to output a score indicating whether the entity under consideration is relevant to the particular slot. In this manner, slots may be filled using entities from previous dialog turns, thus performing statistical anaphora resolution and leading to improved system performance.
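The scoring step in this abstract (attention-weighted feature vectors passed to a decoder that outputs a relevance score) can be sketched in plain Python. This is an illustration under stated assumptions, not the patented method: the dot-product attention, the single logistic layer standing in for the trained decoder, the hand-set weights, and all names (`score_entity`, `query`, `decoder_weights`, the feature keys) are hypothetical.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def score_entity(features, query, decoder_weights, decoder_bias=0.0):
    """Return a score in (0, 1) for how well an entity fits a slot.

    features: dict of feature name -> vector (e.g. entity name, slot
              category, turn distance), all of the same dimension.
    query:    vector used to compute dot-product attention (an assumption).
    decoder_weights / decoder_bias: a logistic layer standing in for the
              trained decoder (an assumption).
    """
    names = sorted(features)
    attention = softmax([dot(features[n], query) for n in names])
    pooled = [
        sum(w * features[n][i] for w, n in zip(attention, names))
        for i in range(len(query))
    ]
    logit = dot(pooled, decoder_weights) + decoder_bias
    return 1.0 / (1.0 + math.exp(-logit))


query = [1.0, 0.0]
decoder_weights = [2.0, -1.0]  # hand-set for illustration, not trained

candidate_a = {
    "entity_name": [1.0, 0.0],
    "slot_category": [0.9, 0.1],
    "turn_distance": [0.8, 0.0],
}
candidate_b = {
    "entity_name": [-1.0, 0.5],
    "slot_category": [-0.8, 0.3],
    "turn_distance": [-0.2, 1.0],
}

score_a = score_entity(candidate_a, query, decoder_weights)
score_b = score_entity(candidate_b, query, decoder_weights)
best = "candidate_a" if score_a > score_b else "candidate_b"
```

A slot filler built this way would score each previously mentioned entity against the open slot and keep the highest-scoring one, optionally only when its score clears a threshold.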
-
Publication number: US20240029708A1
Publication date: 2024-01-25
Application number: US18324234
Filing date: 2023-05-26
Applicant: Amazon Technologies, Inc.
Inventor: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
IPC: G10L13/08, G10L15/183, G10L15/06, G10L15/08
CPC classification number: G10L13/08, G10L15/183, G10L15/063, G10L15/083
Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determine that a first skill component is to determine a response to the user input, and determine that a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the response. The system may also receive, from the second skill component, second image data corresponding to the supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.
-
Publication number: US11996081B2
Publication date: 2024-05-28
Application number: US18324234
Filing date: 2023-05-26
Applicant: Amazon Technologies, Inc.
Inventor: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
IPC: G10L13/08, G10L15/06, G10L15/08, G10L15/183
CPC classification number: G10L13/08, G10L15/063, G10L15/083, G10L15/183
Abstract: Techniques for generating a visual response to a user input are described. A system may receive a natural language input and use a machine learning model to determine that a first component is to determine a response to the natural language input and that a second component is to determine supplemental content related to the natural language input. The system may receive, from the first component, first image data corresponding to the response. The system may also receive, from the second component, second image data corresponding to the supplemental content. The system may send, to a display, a command to present the first image data and the second image data.