AI model optimization method and apparatus

    公开(公告)号:US12032571B2

    公开(公告)日:2024-07-09

    申请号:US17694970

    申请日:2022-03-15

    CPC分类号: G06F16/2453 G06N20/00

    摘要: In a method for AI model optimization, an optimization device receives an original AI model and search configuration information that comprises a plurality of search items each indicating its search categories for performing optimization information search on the original AI model. The device obtains a plurality of search operators corresponding to the plurality of search items, and arranges the search operators in an operation sequence based on the search configuration information. The device then executes the search operators in the arranged operation sequence on the original AI model to obtain an optimized AI model. In the execution of the operation sequence, each search operator, except for the first search operator in the operation sequence, is executed utilizing operation results of a preceding search operator in the operation sequence, the operation results including generated network structures and search space information.

    AI MODEL OPTIMIZATION METHOD AND APPARATUS

    公开(公告)号:US20220197901A1

    公开(公告)日:2022-06-23

    申请号:US17694970

    申请日:2022-03-15

    IPC分类号: G06F16/2453 G06N20/00

    摘要: In a method for AI model optimization, an optimization device receives an original AI model and search configuration information that comprises a plurality of search items each indicating its search categories for performing optimization information search on the original AI model. The device obtains a plurality of search operators corresponding to the plurality of search items, and arranges the search operators in an operation sequence based on the search configuration information. The device then executes the search operators in the arranged operation sequence on the original AI model to obtain an optimized AI model. In the execution of the operation sequence, each search operator, except for the first search operator in the operation sequence, is executed utilizing operation results of a preceding search operator in the operation sequence, the operation results including generated network structures and search space information.