Publication No.: US20250077895A1
Publication Date: 2025-03-06
Application No.: US18826005
Filing Date: 2024-09-05
Applicant: Google LLC
Inventor: Xinyi Wang, John Frederick Wieting, Jonathan Hudson Clark
IPC: G06N3/0985, G06N3/045
Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for configuring a set of language model neural networks, e.g., a first large language model and a second smaller-sized language model, and performing a machine learning task on new inputs using the set of language model neural networks. Configuring the language model neural networks and performing a machine learning task can include leveraging the ability of a first large language model to follow prompt-engineered instructions and perform chain-of-thought reasoning, while also fine-tuning a second, smaller language model neural network to optimize the machine learning task performance.