FEDERATED DISTRIBUTION OF COMPUTATION AND OPERATIONS USING NETWORKED PROCESSING UNITS

    公开(公告)号:US20230136048A1

    公开(公告)日:2023-05-04

    申请号:US18090686

    申请日:2022-12-29

    IPC分类号: G06F9/48 G06F9/30

    摘要: Various approaches for deploying and controlling distributed compute operations with the use of infrastructure processing units (IPUs) and similar network-addressable processing units are disclosed. A device for orchestrating functions in a network compute mesh is configured to receive, at a network-addressable processing unit of a network-addressable processing unit mesh from a requestor device, a computation request to execute a workflow with a set of objectives; query at least one other network-addressable processing units of the network-addressable processing unit mesh using the set of objectives, to determine aspects of available resources and data in the network-addressable processing unit mesh to apply to the workflow; transmit a list of recommended resources available to execute the workflow to the requestor device, the list of recommended resources being ranked based on at least one dimension of the resources; obtain a compute chain from the requestor device, the compute chain describing resource control transitions and data flow provided from the recommended resources and data in the network-addressable processing unit mesh; and schedule the execution of the workflow at one or more network-addressable processing units in the network-addressable processing unit mesh in accordance with the compute chain.

    INFRASTRUCTURE-DELEGATED ORCHESTRATION BACKUP USING NETWORKED PROCESSING UNITS

    公开(公告)号:US20230132992A1

    公开(公告)日:2023-05-04

    申请号:US18090786

    申请日:2022-12-29

    IPC分类号: H04L67/10 G06F11/07

    摘要: Various approaches for monitoring and responding to orchestration or service failures with the use of infrastructure processing units (IPUs) and similar networked processing units are disclosed. A method performed by a computing device for deploying remedial actions in failure scenarios of an orchestrated edge computing environment may include: identifying an orchestration configuration of a controller entity (responsible for orchestration) and a worker entity (subject to the orchestration to provide at least one service); determining a failure scenario of the orchestration of the worker entity, such as at a networked processing unit implemented at a network interface located between the controller entity and the worker entity; and causing a remedial action to resolve the failure scenario and modify the orchestration configuration, such as replacing functionality of the controller entity or the worker entity with functionality at a replacement entity.