-
公开(公告)号:US20240311196A1
公开(公告)日:2024-09-19
申请号:US18606882
申请日:2024-03-15
发明人: Haoran Li , Fei Xue , Ruiguang Zhong , Yuan Gao , Yuanwei Fang , Peiye Liu
IPC分类号: G06F9/50
CPC分类号: G06F9/5038 , G06F9/5044 , G06F2209/501
摘要: Some embodiments of this disclosure provide a method and an apparatus for generating a data flow policy. For example, a method for generating a data flow policy includes: obtaining a computational graph corresponding to a data processing task; generating an inter-stage data flow policy based on the computational graph and an execution cost; generating a plurality of intra-stage data flow policies corresponding to the plurality of pipeline stages based on the inter-stage data flow policy; and updating the execution cost based on the plurality of intra-stage data flow policies.
-
公开(公告)号:US12100064B2
公开(公告)日:2024-09-24
申请号:US18046097
申请日:2022-10-12
发明人: Yuan Gao , Fei Sun , Haoran Li , Guyue Huang , Chen Zhang , Ruiguang Zhong
摘要: The present application discloses a warp execution method used for SPs of an SM of a GPU and an associated GPU. The SPs share a scratchpad memory, and the warp execution method includes: when the predetermined time point for warp-loading is reached, checking a first indicator to obtain a size of a space with the status of blank in the scratchpad memory, to determining whether to load the warp, wherein the first indicator is used to indicate a starting position of a space with the status of data-in-use and an ending position of the space with the status of blank; and when the predetermined time point for computing is reached, checking a second indicator and a third indicator to obtain a size of a space with the status of data-not-in-use in the scratchpad memory, to determining whether to compute the warp.
-