Framework for Learning to Transfer Learn
    1.
    发明公开

    公开(公告)号:US20240054345A1

    公开(公告)日:2024-02-15

    申请号:US18455182

    申请日:2023-08-24

    Applicant: Google LLC

    CPC classification number: G06N3/08 G06N3/04

    Abstract: A method includes receiving a source data set and a target data set and identifying a loss function for a deep learning model based on the source data set and the target data set. The loss function includes encoder weights, source classifier layer weights, target classifier layer weights, coefficients, and a policy weight. During a first phase of each of a plurality of learning iterations for a learning to transfer learn (L2TL) architecture, the method also includes: applying gradient decent-based optimization to learn the encoder weights, the source classifier layer weights, and the target classifier weights that minimize the loss function; and determining the coefficients by sampling actions of a policy model. During a second phase of each of the plurality of learning iterations, determining the policy weight that maximizes an evaluation metric.

    Framework for Learning to Transfer Learn

    公开(公告)号:US20210034976A1

    公开(公告)日:2021-02-04

    申请号:US16945880

    申请日:2020-08-02

    Applicant: Google LLC

    Abstract: A method includes receiving a source data set and a target data set and identifying a loss function for a deep learning model based on the source data set and the target data set. The loss function includes encoder weights, source classifier layer weights, target classifier layer weights, coefficients, and a policy weight. During a first phase of each of a plurality of learning iterations for a learning to transfer learn (L2TL) architecture, the method also includes: applying gradient decent-based optimization to learn the encoder weights, the source classifier layer weights, and the target classifier weights that minimize the loss function; and determining the coefficients by sampling actions of a policy model. During a second phase of each of the plurality of learning iterations, determining the policy weight that maximizes an evaluation metric.

Patent Agency Ranking