Instruction Fine-Tuning Machine-Learned Models Using Intermediate Reasoning Steps

Invention Publication

US20240256965A1 Instruction Fine-Tuning Machine-Learned Models Using Intermediate Reasoning Steps 审中-公开

Please log in to see more content

Patent Title: Instruction Fine-Tuning Machine-Learned Models Using Intermediate Reasoning Steps
Application No.: US18424624

Application Date: 2024-01-26
Publication No.: US20240256965A1

Publication Date: 2024-08-01
Inventor: Hyung Won Chung , Barret Zoph , Dengyong Zhou , Liam Fedus , Shayne Longpre , Le Hou , Yi Tay , Jason Weng Wei , Siddhartha Brahma , Quoc V. Le
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Priority: SG 202300219X 2023.01.27
Main IPC: G06N20/00
IPC: G06N20/00

Instruction Fine-Tuning Machine-Learned Models Using Intermediate Reasoning Steps

Abstract:

An example method for training a machine-learned sequence processing model includes obtaining a plurality of training examples for training the machine-learned sequence processing model. For each respective training example of the plurality of training examples, the example method includes: obtaining a respective query associated with the respective training example; inputting the respective query to the machine-learned sequence processing model; obtaining, from the machine-learned sequence processing model a response to the respective query and a trace of intermediate states from the respective query to the response; evaluating the response using a ground truth response associated with the respective training example; evaluating the trace using a ground truth trace associated with the respective training example; and updating one or more parameters of the machine-learned sequence processing model based on the evaluation of the response and based on the evaluation of the trace.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习