Patent search ap:("NVIDIA Corporation") AND inv:"Niral Lalit Pathak" Page 1

1.

发明公开
DIALOGUE SYSTEMS USING KNOWLEDGE BASES AND LANGUAGE MODELS FOR AUTOMOTIVE SYSTEMS AND APPLICATIONS 审中-公开

公开(公告)号：US20240095460A1

公开(公告)日：2024-03-21

申请号：US17947491

申请日：2022-09-19

Applicant: NVIDIA Corporation

Inventor： Peng Xu , Mostofa Patwary , Rajath Shetty , Niral Lalit Pathak , Ratin Kumar , Bryan Catanzaro , Mohammad Shoeybi

IPC: G06F40/35

CPC classification number: G06F40/35

Abstract: In various examples, systems and methods that use dialogue systems associated with various machine systems and applications are described. For instance, the systems and methods may receive text data representing speech, such as a question associated with a vehicle or other machine type. The systems and methods then use a retrieval system(s) to retrieve a question/answer pair(s) associated with the text data and/or contextual information associated with the text data. In some examples, the contextual information is associated with a knowledge base associated with or corresponding to the vehicle. The systems and methods then generate a prompt using the text data, the question/answer pair(s), and/or the contextual information. Additionally, the systems and methods determine, using a language model(s) and based at least on the prompt, an output associated with the text data. For instance, the output may include information that answers the question associated with the vehicle.

2.

发明公开
USING SCENE-AWARE CONTEXT FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS 审中-公开

公开(公告)号：US20240087561A1

公开(公告)日：2024-03-14

申请号：US17942950

申请日：2022-09-12

Applicant: NVIDIA Corporation

Inventor： Niral Lalit Pathak , Rajath Shetty , Ratin Kumar

IPC: G10L15/18 , G06F3/01 , G06T7/73 , G10L15/16 , G10L15/183

CPC classification number: G10L15/1822 , G06F3/013 , G06F3/017 , G06T7/73 , G10L15/16 , G10L15/183 , G06T2207/20081

Abstract: In various examples, techniques for using scene-aware context for dialogue systems and applications are described herein. For instance, systems and methods are disclosed that process audio data representing speech in order to determine an intent associated with the speech. Systems and methods are also disclosed that process sensor data representing at least a user in order to determine a point of interest associated with the user. In some examples, the point of interest may include a landmark, a person, and/or any other object within an environment. The systems and methods may then generate a context associated with the point of interest. Additionally, the systems and methods may process the intent and the context using one or more language models. Based on the processing, the language model(s) may output data associated with the speech.

Patent Agency Ranking