-
1.
公开(公告)号:US20240095460A1
公开(公告)日:2024-03-21
申请号:US17947491
申请日:2022-09-19
Applicant: NVIDIA Corporation
Inventor: Peng Xu , Mostofa Patwary , Rajath Shetty , Niral Lalit Pathak , Ratin Kumar , Bryan Catanzaro , Mohammad Shoeybi
IPC: G06F40/35
CPC classification number: G06F40/35
Abstract: In various examples, systems and methods that use dialogue systems associated with various machine systems and applications are described. For instance, the systems and methods may receive text data representing speech, such as a question associated with a vehicle or other machine type. The systems and methods then use a retrieval system(s) to retrieve a question/answer pair(s) associated with the text data and/or contextual information associated with the text data. In some examples, the contextual information is associated with a knowledge base associated with or corresponding to the vehicle. The systems and methods then generate a prompt using the text data, the question/answer pair(s), and/or the contextual information. Additionally, the systems and methods determine, using a language model(s) and based at least on the prompt, an output associated with the text data. For instance, the output may include information that answers the question associated with the vehicle.
-
公开(公告)号:US20240087561A1
公开(公告)日:2024-03-14
申请号:US17942950
申请日:2022-09-12
Applicant: NVIDIA Corporation
Inventor: Niral Lalit Pathak , Rajath Shetty , Ratin Kumar
IPC: G10L15/18 , G06F3/01 , G06T7/73 , G10L15/16 , G10L15/183
CPC classification number: G10L15/1822 , G06F3/013 , G06F3/017 , G06T7/73 , G10L15/16 , G10L15/183 , G06T2207/20081
Abstract: In various examples, techniques for using scene-aware context for dialogue systems and applications are described herein. For instance, systems and methods are disclosed that process audio data representing speech in order to determine an intent associated with the speech. Systems and methods are also disclosed that process sensor data representing at least a user in order to determine a point of interest associated with the user. In some examples, the point of interest may include a landmark, a person, and/or any other object within an environment. The systems and methods may then generate a context associated with the point of interest. Additionally, the systems and methods may process the intent and the context using one or more language models. Based on the processing, the language model(s) may output data associated with the speech.
-