-
Publication No.: US12205577B1
Publication Date: 2025-01-21
Application No.: US17217031
Filing Date: 2021-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Taehwan Kim, Sanqiang Zhao, Robinson Piramuthu, Seokhwan Kim, Yang Liu, Gokhan Tur, Eshan Bhatnagar
Abstract: Techniques for rendering visual content, in response to one or more utterances, are described. A device receives one or more utterances that define a parameter(s) for desired output content. A system (or the device) identifies natural language data corresponding to the desired content, and uses natural language generation processes to update the natural language data based on the parameter(s). The system (or the device) then generates an image based on the updated natural language data. The system (or the device) also generates video data of an avatar. The device displays the image and the avatar, and synchronizes movements of the avatar with output of synthesized speech of the updated natural language data. The device may also display subtitles of the updated natural language data, and cause a word of the subtitles to be emphasized when synthesized speech of the word is being output.
-
Publication No.: US20250157463A1
Publication Date: 2025-05-15
Application No.: US19017979
Filing Date: 2025-01-13
Applicant: Amazon Technologies, Inc.
Inventor: Taehwan Kim, Sanqiang Zhao, Robinson Piramuthu, Seokhwan Kim, Yang Liu, Gokhan Tur, Eshan Bhatnagar
Abstract: Techniques for rendering visual content, in response to one or more utterances, are described. A device receives one or more utterances that define a parameter(s) for desired output content. A system (or the device) identifies natural language data corresponding to the desired content, and uses natural language generation processes to update the natural language data based on the parameter(s). The system (or the device) then generates an image based on the updated natural language data. The system (or the device) also generates video data of an avatar. The device displays the image and the avatar, and synchronizes movements of the avatar with output of synthesized speech of the updated natural language data. The device may also display subtitles of the updated natural language data, and cause a word of the subtitles to be emphasized when synthesized speech of the word is being output.
-
Publication No.: US12293758B1
Publication Date: 2025-05-06
Application No.: US18081929
Filing Date: 2022-12-15
Applicant: Amazon Technologies, Inc.
Inventor: Alexandros Papangelis, Behnam Hedayatnia, Chao Zhao, Devamanyu Hazarika, Di Jin, Dilek Hakkani-Tur, Mahdi Namazifar, Seokhwan Kim, Spandana Gella, Yang Liu
Abstract: Techniques for generating opinion-based content responsive to a user input are described. The system may receive a user input and determine dialog context data that corresponds to a dialog between a user and the system and that includes the user input. The system may determine that generation of content responsive to the user input requires opinion-based knowledge, and may extract entities from the dialog context data and determine natural language data of a knowledge base that includes entities similar to the extracted entities. The system may process the natural language data and the dialog context data to determine a subset of the natural language data that is responsive to the user input. The system may generate output data responsive to the user input using the responsive natural language data and the dialog context.
-