Neural Network Model Training Method, Electronic Device, Cloud, Cluster, and Medium

    Publication No.: US20250165782A1

    Publication Date: 2025-05-22

    Application No.: US19027111

    Filing Date: 2025-01-17

    Abstract: A neural network model training method includes: constructing a first neural network architecture, where the first neural network architecture includes M basic unit layers, each of the M basic unit layers includes a plurality of basic units, and the plurality of basic units includes at least a first-type basic unit and a second-type basic unit; and obtaining a target model through training based on datasets respectively corresponding to a plurality of tasks and the first neural network architecture, where the target model includes a plurality of task paths, at least some of the plurality of task paths include N basic units selected from some of the M basic unit layers, and N
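
    The structure described in the abstract (M basic unit layers, each holding basic units of at least two types, and per-task paths made of N units drawn from some of those layers) can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: the `BasicUnit` class, the `first_type`/`second_type` labels, the helper functions, and the task names. Two assumptions are made purely for illustration: a path takes at most one unit per layer (so N cannot exceed M in this sketch), and units are picked at random, whereas the abstract states that the target model and its task paths are obtained through training.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class BasicUnit:
    layer: int  # index of the basic unit layer, 0..M-1
    index: int  # position of the unit inside its layer
    kind: str   # hypothetical label: "first_type" or "second_type"


def build_architecture(m_layers, units_per_layer):
    """Build M layers; each layer alternates between the two unit types."""
    return [
        [
            BasicUnit(layer, i, "first_type" if i % 2 == 0 else "second_type")
            for i in range(units_per_layer)
        ]
        for layer in range(m_layers)
    ]


def sample_task_path(layers, n_units, rng):
    """Pick N units from N distinct layers (illustrative assumption).

    Random choice stands in for the training-based selection that the
    abstract refers to; it is not the patented procedure.
    """
    if n_units > len(layers):
        raise ValueError("this sketch needs n_units <= number of layers")
    chosen_layers = sorted(rng.sample(range(len(layers)), n_units))
    return [rng.choice(layers[layer]) for layer in chosen_layers]


rng = random.Random(0)
layers = build_architecture(m_layers=4, units_per_layer=4)

# One path per task; paths may share units or skip layers entirely.
task_paths = {
    task: sample_task_path(layers, n_units=3, rng=rng)
    for task in ("task_a", "task_b")
}

for task, path in task_paths.items():
    print(task, [(u.layer, u.index, u.kind) for u in path])
```

    In this sketch each task path skips one of the four layers, which mirrors the abstract's statement that a path uses units selected from only some of the M basic unit layers.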
