FUSING IN-CONTEXT LEARNING AND FINE-TUNING FOR LANGUAGE MODEL NEURAL NETWORKS

    公开(公告)号:US20250077895A1

    公开(公告)日:2025-03-06

    申请号:US18826005

    申请日:2024-09-05

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for configuring a set of language model neural networks, e.g., a first large language model and a second smaller-sized language model, and performing a machine learning task on new inputs using the set of language model neural networks. Configuring the language model neural networks and performing a machine learning task can include leveraging the ability of a first large language model to follow prompt-engineered instructions and perform chain-of-thought reasoning, while also fine-tuning a second, smaller language model neural network to optimize the machine learning task performance.

Patent Agency Ranking