Systems And Methods For Parameter Sharing To Reduce Computational Costs Of Training Machine-Learned Models

    公开(公告)号:US20220108221A1

    公开(公告)日:2022-04-07

    申请号:US17493442

    申请日:2021-10-04

    Applicant: Google LLC

    Abstract: Systems and methods of the present disclosure are directed to a computer-implemented method. The method can include obtaining a machine-learned model comprising a plurality of model units, wherein each model unit comprises a plurality of parameters that are tied to a shared plurality of parameters. The method can include performing a first plurality of training iterations with the machine-learned model to adjust parameters of the shared plurality of parameters. The method can include detecting, based on the first plurality of training iterations, an occurrence of an untying condition. The method can include untying the parameters of one or more model units from the shared plurality of parameters. The method can include performing a second plurality of training iterations with the machine-learned model to adjust parameters of the one or more model units independent of the shared plurality of parameters.

    Instruction Fine-Tuning Machine-Learned Models Using Intermediate Reasoning Steps

    公开(公告)号:US20240256965A1

    公开(公告)日:2024-08-01

    申请号:US18424624

    申请日:2024-01-26

    Applicant: Google LLC

    CPC classification number: G06N20/00

    Abstract: An example method for training a machine-learned sequence processing model includes obtaining a plurality of training examples for training the machine-learned sequence processing model. For each respective training example of the plurality of training examples, the example method includes: obtaining a respective query associated with the respective training example; inputting the respective query to the machine-learned sequence processing model; obtaining, from the machine-learned sequence processing model a response to the respective query and a trace of intermediate states from the respective query to the response; evaluating the response using a ground truth response associated with the respective training example; evaluating the trace using a ground truth trace associated with the respective training example; and updating one or more parameters of the machine-learned sequence processing model based on the evaluation of the response and based on the evaluation of the trace.

Patent Agency Ranking