MODEL TRAINING OF ACTOR MODEL AND CRITIC MODEL

    公开(公告)号:US20250061339A1

    公开(公告)日:2025-02-20

    申请号:US18936628

    申请日:2024-11-04

    Applicant: Lemon Inc.

    Abstract: Embodiments of the present disclosure provide a solution for model training. A method comprises: performing training of a critic model and training of an actor model according to an alternating scheme. The actor model is configured to generate a response for an input question based on a feedback generated by the critic model, and the critic model is configured to generate a feedback to a response generated by the actor mode.

    MACHINE LEARNING MODEL EVALUATION

    公开(公告)号:US20250005459A1

    公开(公告)日:2025-01-02

    申请号:US18885135

    申请日:2024-09-13

    Applicant: Lemon Inc.

    Abstract: Embodiments of the disclosure provide a solution for machine learning model evaluation. The solution includes: obtaining a target answer to a test question generated by a target machine learning (ML) model; obtaining a plurality of reference answers to the test question generated respectively by a plurality of reference ML models; determining respective professional levels of the plurality of reference ML models in answering the test question; and generating an evaluation result on correctness of the target ML model in question answering based on the target answer, the plurality of reference answers and the respective professional levels of the plurality of reference ML models.

Patent Agency Ranking