Invention Application
- Patent Title: Partitioned Inference And Training Of Large Models
-
Application No.: US18727800Application Date: 2022-02-03
-
Publication No.: US20250094798A1Publication Date: 2025-03-20
- Inventor: Li Zhang , Matthew Sharifi , David Petrou , Blaise Aguera y Arcas
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- International Application: PCT/US22/15090 WO 20220203
- Main IPC: G06N3/08
- IPC: G06N3/08

Abstract:
Systems and methods for partitioning a large model that has been configured to use a model-synthesis approach in which multiple basis models are combined to generate a final output. The present technology provides systems and methods for identifying a device-specific or subject-specific subset of those basis models to be used on a given device, such that it need not store the weight matrices for the entire set of basis models, and may perform inference using only the weight matrices of the identified subset of basis models. In some examples, the subset of basis models used by a given device may be updated based on actual usage and feedback. Likewise, in some examples, the model may be trained in a federated setting in which multiple devices each utilize different subsets of the basis models, and share training signals with a full copy of the model.
Information query