Search method, device and storage medium for neural network model structure

    公开(公告)号:US11580408B2

    公开(公告)日:2023-02-14

    申请号:US16830566

    申请日:2020-03-26

    摘要: A search method for a neural network model structure, includes: generating an initial generation population of network model structure based on multi-objective optimization hyper parameters, as a current generation population of network model structure; performing selection and crossover on the current generation population of network model structure; generating a part of network model structure based on reinforcement learning mutation, and generating a remaining part of network model structure based on random mutation on the selected and crossed network model structure; generating a new population of network model structure based on the part of network model structure generated by reinforcement learning mutation and the remaining part of network model structure generated by random mutation; and searching a next generation population of network model structure based on the current generation population of network model structure and the new population of network model structure.

    HYPERNETWORK TRAINING METHOD AND DEVICE, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20210142166A1

    公开(公告)日:2021-05-13

    申请号:US16828427

    申请日:2020-03-24

    IPC分类号: G06N3/08 G06N3/04

    摘要: A hypernetwork training method includes: acquiring a multipath neural subnetwork based on a preconstructed initial hypernetwork; training the multipath neural subnetwork to update a weight parameter of each substructure in the multipath neural subnetwork; synchronizing the weight parameter of each substructure in the multipath neural subnetwork to the preconstructed initial hypernetwork; and determining whether the preconstructed initial hypernetwork converges, and if it is determined that the preconstructed initial hypernetwork does not converge, re-executing the acquiring, the training, the synchronizing, and the determining, to obtain a target hypernetwork.

    Hypernetwork training method and device, electronic device and storage medium

    公开(公告)号:US11443189B2

    公开(公告)日:2022-09-13

    申请号:US16828427

    申请日:2020-03-24

    IPC分类号: G06N3/08 G06N3/04

    摘要: A hypernetwork training method includes: acquiring a multipath neural subnetwork based on a preconstructed initial hypernetwork; training the multipath neural subnetwork to update a weight parameter of each substructure in the multipath neural subnetwork; synchronizing the weight parameter of each substructure in the multipath neural subnetwork to the preconstructed initial hypernetwork; and determining whether the preconstructed initial hypernetwork converges, and if it is determined that the preconstructed initial hypernetwork does not converge, re-executing the acquiring, the training, the synchronizing, and the determining, to obtain a target hypernetwork.

    METHOD AND APPARATUS FOR TRAINING NEURAL NETWORK, AND STORAGE MEDIUM

    公开(公告)号:US20210133563A1

    公开(公告)日:2021-05-06

    申请号:US16744674

    申请日:2020-01-16

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method for training a neural network, includes: training a super network to obtain a network parameter of the super network, wherein each network layer of the super network includes multiple candidate network sub-structures in parallel; for each network layer of the super network, selecting, from the multiple candidate network sub-structures, a candidate network sub-structure to be a target network sub-structure; constructing a sub-network based on target network sub-structures each selected in a respective network layer of the super network; and training the sub-network, by taking the network parameter inherited from the super network as an initial parameter of the sub-network, to obtain a network parameter of the sub-network.