CODE-LEVEL NEURAL ARCHITECTURE SEARCH USING LANGUAGE MODELS

    公开(公告)号:US20240273371A1

    公开(公告)日:2024-08-15

    申请号:US18431804

    申请日:2024-02-02

    Applicant: Google LLC

    CPC classification number: G06N3/086

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining an architecture for a neural network configured to perform a machine learning task. In one aspect, a method comprises: receiving training data; searching for a final architecture of the neural network, wherein the searching comprises: maintaining current population data; and repeatedly performing evolutionary architecture search steps comprising: selecting one or more candidate architectures from the current population of candidate architectures defined by the source code included in the current population data; generating an input prompt; processing the input prompt using the language model neural network to generate output source code that defines a plurality of new candidate architectures; and using the plurality of new candidate architectures defined by the output source code to update the current population data.

Patent Agency Ranking