Patent search: ap:("Disney Enterprises, Inc.") AND inv:"Ashutosh Modi"

1.

Granted invention patent
Affect-driven dialog generation (Active)

Publication No.: US10818312B2

Publication date: 2020-10-27

Application No.: US16226166

Filing date: 2018-12-19

Applicant: Disney Enterprises, Inc.

Inventor: Ashutosh Modi, Mubbasir Kapadia, Douglas A. Fidaleo, James R. Kennedy, Wojciech Witon, Pierre Colombo

IPC: G10L25/63, G10L15/22, G10L15/28, G06N3/04, G10L15/06

Abstract: According to one implementation, an affect-driven dialog generation system includes a computing platform having a hardware processor and a system memory storing a software code including a sequence-to-sequence (seq2seq) architecture trained using a loss function having an affective regularizer term based on a difference in emotional content between a target dialog response and a dialog sequence determined by the seq2seq architecture during training. The hardware processor executes the software code to receive an input dialog sequence, and to use the seq2seq architecture to generate emotionally diverse dialog responses based on the input dialog sequence and a predetermined target emotion. The hardware processor further executes the software code to determine, using the seq2seq architecture, a final dialog sequence responsive to the input dialog sequence based on an emotional relevance of each of the emotionally diverse dialog responses, and to provide the final dialog sequence as an output.

2.

Granted invention patent
Techniques for interpreting spoken input using non-verbal cues (Active)

Publication No.: US11887600B2

Publication date: 2024-01-30

Application No.: US16593938

Filing date: 2019-10-04

Applicant: DISNEY ENTERPRISES, INC.

Inventor: Erika Doggett, Nathan Nocon, Ashutosh Modi, Joseph Charles Sengir, Maxwell McCoy

IPC: G10L15/25, G10L15/22, G10L15/06, G10L15/26

CPC classification number: G10L15/25, G10L15/063, G10L15/22, G10L15/26, G10L2015/228

Abstract: In various embodiments, a communication fusion application enables other software application(s) to interpret spoken user input. In operation, a communication fusion application determines that a prediction is relevant to a text input derived from a spoken input received from a user. Subsequently, the communication fusion application generates a predicted context based on the prediction. The communication fusion application then transmits the predicted context and the text input to the other software application(s). The other software application(s) perform additional action(s) based on the text input and the predicted context. Advantageously, by providing additional, relevant information to the software application(s), the communication fusion application increases the level of understanding during interactions with the user and the overall user experience is improved.

3.

Granted invention patent
Techniques for incremental computer-based natural language understanding (Active)

Publication No.: US11749265B2

Publication date: 2023-09-05

Application No.: US16593939

Filing date: 2019-10-04

Applicant: DISNEY ENTERPRISES, INC.

Inventor: Erika Varis Doggett, Ashutosh Modi, Nathan Nocon

IPC: G10L15/22, G10L15/18, G10L15/197, G10L15/24, G10L15/04

CPC classification number: G10L15/22, G10L15/04, G10L15/1815, G10L15/197, G10L15/24, G10L2015/223

Abstract: Various embodiments disclosed herein provide techniques for performing incremental natural language understanding on a natural language understanding (NLU) system. The NLU system acquires a first audio speech segment associated with a user utterance. The NLU system converts the first audio speech segment into a first text segment. The NLU system determines a first intent based on a text string associated with the first text segment, wherein the text string represents a portion of the user utterance. The NLU system generates a first response based on the first intent prior to when the user utterance completes.
