DOMAIN-CUSTOMIZABLE MODELS FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS

Invention Publication

US20240193445A1 DOMAIN-CUSTOMIZABLE MODELS FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS 审中-公开

Please log in to see more content

Patent Title: DOMAIN-CUSTOMIZABLE MODELS FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS
Application No.: US18064125

Application Date: 2022-12-09
Publication No.: US20240193445A1

Publication Date: 2024-06-13
Inventor: Yi Dong , Xianchao Wu
Applicant: NVIDIA Corporation
Applicant Address: US CA Santa Clara
Assignee: NVIDIA Corporation
Current Assignee: NVIDIA Corporation
Current Assignee Address: US CA Santa Clara
Main IPC: G06N5/043
IPC: G06N5/043 ; G06F40/40

DOMAIN-CUSTOMIZABLE MODELS FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS

Abstract:

In various examples, systems and methods are disclosed that train a machine learning model(s)—such as a large language model (LLM)—for one or more specific domains. In some embodiments, the machine learning model(s) may include at least a base model(s) as well as additional parts, such as additional layers, associated with the domains for which the machine learning model(s) is being trained. As such, the parts of the machine learning model(s) may be trained separately, such that training data associated with a domain is used to train a part of the machine learning model(s) that is associated with the domain without training the other part(s) of the machine learning model(s). The systems and methods may then use these parts when deploying the machine learning model(s), such as by activating and/or deactivating parts based on the input data being processed.

Information query

Global Dossier Espacenet