Partitioned Inference And Training Of Large Models

Invention Application

US20250094798A1 Partitioned Inference And Training Of Large Models 有权

Please log in to see more content

Patent Title: Partitioned Inference And Training Of Large Models
Application No.: US18727800

Application Date: 2022-02-03
Publication No.: US20250094798A1

Publication Date: 2025-03-20
Inventor: Li Zhang , Matthew Sharifi , David Petrou , Blaise Aguera y Arcas
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
International Application: PCT/US22/15090 WO 20220203
Main IPC: G06N3/08
IPC: G06N3/08

Partitioned Inference And Training Of Large Models

Abstract:

Systems and methods for partitioning a large model that has been configured to use a model-synthesis approach in which multiple basis models are combined to generate a final output. The present technology provides systems and methods for identifying a device-specific or subject-specific subset of those basis models to be used on a given device, such that it need not store the weight matrices for the entire set of basis models, and may perform inference using only the weight matrices of the identified subset of basis models. In some examples, the subset of basis models used by a given device may be updated based on actual usage and feedback. Likewise, in some examples, the model may be trained in a federated setting in which multiple devices each utilize different subsets of the basis models, and share training signals with a full copy of the model.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法