EFFICIENT SCALING OF PARTITIONED NEURAL NETWORK INFERENCE

    公开(公告)号:US20250094823A1

    公开(公告)日:2025-03-20

    申请号:US18368801

    申请日:2023-09-15

    Abstract: In one implementation, a controller determines performance of a partitioned neural network. The controller identifies, based on the performance, a particular partition of the partitioned neural network as a bottleneck. The controller configures a first device to execute a replica of the particular partition. The controller configures a multiplexer that provides an output of the particular partition or the replica of the particular partition as input to a downstream partition of the partitioned neural network.

    RECONCILING COMPUTING INFRASTRUCTURE AND DATA IN FEDERATED LEARNING

    公开(公告)号:US20230385708A1

    公开(公告)日:2023-11-30

    申请号:US17828582

    申请日:2022-05-31

    CPC classification number: G06N20/20 G06F16/9027

    Abstract: In one embodiment, a controller for a federated learning system represents computing infrastructure for the federated learning system as a tree structure. The controller forms associations between datasets available to the federated learning system and nodes in the tree structure. The controller receives one or more instructions to perform model training in the federated learning system with datasets specified using their associations. The controller configures, in response to the one or more instructions, the federated learning system to perform the model training using the datasets specified by the one or more instructions using the tree structure.

Patent Agency Ranking