MODEL TRAINING METHOD AND APPARATUS
    1.
    发明公开

    公开(公告)号:US20240020541A1

    公开(公告)日:2024-01-18

    申请号:US18476830

    申请日:2023-09-28

    CPC classification number: G06N3/08

    Abstract: This application describes a model training method, applied to the field of artificial intelligence. The method includes a computing core of a first processor obtains an embedding used for model training, and writes an updated embedding to a first memory of the first processor instead of transferring the updated embedding to a second processor after model training is completed. In this application, after updating an embedding, the first processor saves the updated embedding to the first memory of the first processor. Without needing to wait for the second processor to complete a process of transferring a second target embedding to a GPU, the first processor may directly obtain the updated embedding and perform model training of a next round based on the updated embedding, provided that the first processor may obtain a latest updated embedding.

Patent Agency Ranking