-
Publication number: US10818312B2
Publication date: 2020-10-27
Application number: US16226166
Filing date: 2018-12-19
Applicant: Disney Enterprises, Inc.
Inventor: Ashutosh Modi, Mubbasir Kapadia, Douglas A. Fidaleo, James R. Kennedy, Wojciech Witon, Pierre Colombo
Abstract: According to one implementation, an affect-driven dialog generation system includes a computing platform having a hardware processor and a system memory storing a software code including a sequence-to-sequence (seq2seq) architecture trained using a loss function having an affective regularizer term based on a difference in emotional content between a target dialog response and a dialog sequence determined by the seq2seq architecture during training. The hardware processor executes the software code to receive an input dialog sequence, and to use the seq2seq architecture to generate emotionally diverse dialog responses based on the input dialog sequence and a predetermined target emotion. The hardware processor further executes the software code to determine, using the seq2seq architecture, a final dialog sequence responsive to the input dialog sequence based on an emotional relevance of each of the emotionally diverse dialog responses, and to provide the final dialog sequence as an output.
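
The loss and the response selection described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how emotional content is measured, so the emotion vectors, the squared-distance measure, the `weight` parameter, and the function names (`regularized_loss`, `pick_response`) are all assumptions made for illustration.

```python
import math

def cross_entropy(pred_probs, target_index):
    """Negative log-likelihood of the target token under the predicted distribution."""
    return -math.log(pred_probs[target_index])

def affective_distance(emotion_a, emotion_b):
    """Squared Euclidean distance between two emotion vectors (an assumed measure)."""
    return sum((a - b) ** 2 for a, b in zip(emotion_a, emotion_b))

def regularized_loss(pred_probs, target_index, target_emotion, predicted_emotion, weight=0.5):
    """Task loss plus an affective regularizer term.

    The regularizer penalizes a difference in emotional content between the
    target response and the sequence produced during training. `weight` is a
    hypothetical hyperparameter.
    """
    return cross_entropy(pred_probs, target_index) + weight * affective_distance(
        target_emotion, predicted_emotion
    )

def pick_response(candidates, target_emotion):
    """Choose the candidate whose emotion vector is closest to the target emotion.

    `candidates` is a list of (text, emotion_vector) pairs. Closeness stands in
    for the "emotional relevance" named in the abstract.
    """
    return min(candidates, key=lambda c: affective_distance(c[1], target_emotion))[0]
```

With identical target and predicted emotion vectors the regularizer contributes nothing, so the loss reduces to the plain cross-entropy; the penalty grows as the two vectors diverge.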
-
Publication number: US11887600B2
Publication date: 2024-01-30
Application number: US16593938
Filing date: 2019-10-04
Applicant: DISNEY ENTERPRISES, INC.
Inventor: Erika Doggett, Nathan Nocon, Ashutosh Modi, Joseph Charles Sengir, Maxwell McCoy
CPC classification number: G10L15/25, G10L15/063, G10L15/22, G10L15/26, G10L2015/228
Abstract: In various embodiments, a communication fusion application enables other software application(s) to interpret spoken user input. In operation, a communication fusion application determines that a prediction is relevant to a text input derived from a spoken input received from a user. Subsequently, the communication fusion application generates a predicted context based on the prediction. The communication fusion application then transmits the predicted context and the text input to the other software application(s). The other software application(s) perform additional action(s) based on the text input and the predicted context. Advantageously, by providing additional, relevant information to the software application(s), the communication fusion application increases the level of understanding during interactions with the user and the overall user experience is improved.
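
The flow in this abstract (check relevance, build a predicted context, pass it along with the text input) might look roughly like the sketch below. The abstract does not state how relevance is decided, so the timestamp-overlap rule, the `slack` parameter, and the `Prediction`, `TextInput`, and `fuse` names are assumptions for illustration only.

```python
from dataclasses import dataclass

@dataclass
class Prediction:
    kind: str         # e.g. "emotion" or "gesture" (hypothetical categories)
    value: str
    timestamp: float  # seconds

@dataclass
class TextInput:
    text: str     # text derived from the spoken input
    start: float  # when the spoken input began, in seconds
    end: float    # when the spoken input ended, in seconds

def is_relevant(prediction, text_input, slack=1.0):
    """Assumed rule: a prediction is relevant if it falls within the utterance window."""
    return text_input.start - slack <= prediction.timestamp <= text_input.end + slack

def fuse(text_input, predictions, slack=1.0):
    """Bundle the text input with a predicted context built from relevant predictions."""
    context = {p.kind: p.value for p in predictions if is_relevant(p, text_input, slack)}
    return {"text": text_input.text, "predicted_context": context}
```

A downstream application would receive the returned dictionary and could, for example, respond differently to "turn it off" when the predicted context reports frustration.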
-
Publication number: US11749265B2
Publication date: 2023-09-05
Application number: US16593939
Filing date: 2019-10-04
Applicant: DISNEY ENTERPRISES, INC.
Inventor: Erika Varis Doggett, Ashutosh Modi, Nathan Nocon
IPC: G10L15/22, G10L15/18, G10L15/197, G10L15/24, G10L15/04
CPC classification number: G10L15/22, G10L15/04, G10L15/1815, G10L15/197, G10L15/24, G10L2015/223
Abstract: Various embodiments disclosed herein provide techniques for performing incremental natural language understanding on a natural language understanding (NLU) system. The NLU system acquires a first audio speech segment associated with a user utterance. The NLU system converts the first audio speech segment into a first text segment. The NLU system determines a first intent based on a text string associated with the first text segment, wherein the text string represents a portion of the user utterance. The NLU system generates a first response based on the first intent prior to when the user utterance completes.
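
The incremental idea in this abstract, acting on a partial text string before the utterance ends, can be sketched as follows. This is a toy illustration, not the patented method: it starts from text segments that have already been transcribed (the audio-to-text step is skipped), and the keyword table `INTENT_KEYWORDS` and the functions `detect_intent` and `incremental_nlu` are hypothetical.

```python
# Hypothetical keyword-to-intent table; a real system would use a trained model.
INTENT_KEYWORDS = {"weather": "get_weather", "timer": "set_timer"}

def detect_intent(text):
    """Return the first intent whose keyword appears in the text, or None."""
    words = text.lower().split()
    for keyword, intent in INTENT_KEYWORDS.items():
        if keyword in words:
            return intent
    return None

def incremental_nlu(segments):
    """Consume text segments in arrival order and stop at the first detected intent.

    Returns (intent, index of the segment at which it was detected, partial text).
    If no intent is found, intent is None and the full text is returned.
    """
    partial = ""
    for index, segment in enumerate(segments):
        partial = (partial + " " + segment).strip()
        intent = detect_intent(partial)
        if intent is not None:
            return intent, index, partial
    return None, len(segments) - 1, partial
```

For the segments `["what is the", "weather like", "in Paris today"]`, the intent is found at the second segment, so a response could be prepared while the third segment is still being spoken.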
-