Warp execution method and associated GPU

    Publication No.: US12100064B2

    Publication Date: 2024-09-24

    Application No.: US18046097

    Filing Date: 2022-10-12

    IPC Classification: G06T1/60 G06T1/20

    CPC Classification: G06T1/60 G06T1/20

    Abstract: The present application discloses a warp execution method for the streaming processors (SPs) of a streaming multiprocessor (SM) of a GPU, and an associated GPU. The SPs share a scratchpad memory, and the warp execution method includes: when the predetermined time point for warp-loading is reached, checking a first indicator to obtain the size of the space with the status of blank in the scratchpad memory, in order to determine whether to load the warp, wherein the first indicator indicates a starting position of the space with the status of data-in-use and an ending position of the space with the status of blank; and when the predetermined time point for computing is reached, checking a second indicator and a third indicator to obtain the size of the space with the status of data-not-in-use in the scratchpad memory, in order to determine whether to compute the warp.
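The indicator checks described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract only defines the first indicator, so the meanings given here to the second and third indicators (the boundaries of the data-not-in-use region) are assumptions, as are the class name `Scratchpad`, the method names, and the use of ever-increasing counters in place of wrapped positions. The sketch also reads the third indicator when computing the blank size, which the abstract does not state.

```python
class Scratchpad:
    """Toy model of a shared scratchpad tracked by three indicators.

    Assumption: indicators are monotonically increasing counters; a real
    design would more likely store wrapped positions (counter % capacity).
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.first = 0   # start of data-in-use, which is also the end of blank
        self.second = 0  # assumed: end of data-in-use / start of data-not-in-use
        self.third = 0   # assumed: end of data-not-in-use / start of blank

    def blank_size(self):
        # Everything not occupied by data-in-use or data-not-in-use is blank.
        return self.capacity - (self.third - self.first)

    def not_in_use_size(self):
        # Data that has been loaded but is not yet being computed on.
        return self.third - self.second

    def try_load(self, size):
        """Warp-loading time point: load only if enough blank space exists."""
        if size > self.blank_size():
            return False
        self.third += size  # blank -> data-not-in-use
        return True

    def try_compute(self, size):
        """Computing time point: compute only if enough loaded data is waiting."""
        if size > self.not_in_use_size():
            return False
        self.second += size  # data-not-in-use -> data-in-use
        return True

    def release(self, size):
        """After computing finishes, the consumed space becomes blank again."""
        if size > self.second - self.first:
            raise ValueError("cannot release more than is in use")
        self.first += size  # data-in-use -> blank


sp = Scratchpad(capacity=8)
loaded = sp.try_load(5)       # fits: 8 units are blank
rejected = sp.try_load(4)     # does not fit: only 3 units remain blank
computing = sp.try_compute(3)
sp.release(3)
```

In this model the three regions always follow one another around the buffer in the order data-in-use, data-not-in-use, blank, so each state change only advances one counter.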

METHOD AND DEVICE FOR GENERATING DATA FLOW POLICY

    Publication No.: US20240311196A1

    Publication Date: 2024-09-19

    Application No.: US18606882

    Filing Date: 2024-03-15

    IPC Classification: G06F9/50

    Abstract: Some embodiments of this disclosure provide a method and an apparatus for generating a data flow policy. For example, a method for generating a data flow policy includes: obtaining a computational graph corresponding to a data processing task; generating an inter-stage data flow policy based on the computational graph and an execution cost; generating a plurality of intra-stage data flow policies corresponding to a plurality of pipeline stages based on the inter-stage data flow policy; and updating the execution cost based on the plurality of intra-stage data flow policies.
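The four steps in the abstract can be read as a feedback loop, sketched below under strong assumptions. The abstract does not say how either kind of policy is computed or whether the steps repeat, so everything concrete here is hypothetical: the computational graph is reduced to a linear chain of operators, the inter-stage policy is a greedy split into contiguous pipeline stages, the intra-stage policy is a toy choice between two buffering modes, and the cost update simply halves the cost of operators in double-buffered stages.

```python
def plan_inter(ops, cost, num_stages):
    """Hypothetical inter-stage policy: greedy split into contiguous stages."""
    target = sum(cost[op] for op in ops) / num_stages
    stages, current, acc = [], [], 0.0
    for i, op in enumerate(ops):
        current.append(op)
        acc += cost[op]
        ops_left = len(ops) - i - 1
        stages_left = num_stages - len(stages) - 1
        # Cut when the stage is heavy enough, or when every remaining
        # operator is needed to give each remaining stage one operator.
        if stages_left > 0 and (acc >= target or ops_left == stages_left):
            stages.append(current)
            current, acc = [], 0.0
    stages.append(current)
    return stages


def plan_intra(stage):
    """Hypothetical intra-stage policy: a toy buffering choice per stage."""
    return "double_buffer" if len(stage) > 1 else "single_buffer"


def update_cost(base_cost, stages, intra):
    """Hypothetical cost model: double buffering halves an operator's cost."""
    cost = {}
    for stage, policy in zip(stages, intra):
        scale = 0.5 if policy == "double_buffer" else 1.0
        for op in stage:
            cost[op] = base_cost[op] * scale
    return cost


def generate_policy(ops, base_cost, num_stages, max_rounds=5):
    """Repeat inter-stage planning, intra-stage planning, and cost update."""
    cost = dict(base_cost)
    stages = None
    for _ in range(max_rounds):
        new_stages = plan_inter(ops, cost, num_stages)
        intra = [plan_intra(stage) for stage in new_stages]
        cost = update_cost(base_cost, new_stages, intra)
        if new_stages == stages:  # the stage split stopped changing
            break
        stages = new_stages
    return stages, intra, cost


ops = ["a", "b", "c", "d"]
base_cost = {"a": 4.0, "b": 1.0, "c": 1.0, "d": 2.0}
stages, intra, cost = generate_policy(ops, base_cost, num_stages=2)
```

Feeding the updated cost back into the inter-stage step is the design choice this sketch is meant to show: the stage split is made with estimates that reflect what each stage can actually achieve internally.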