-
Publication No.: US20230394617A1
Publication date: 2023-12-07
Application No.: US18046097
Filing date: 2022-10-12
Inventors: YUAN GAO, FEI SUN, HAORAN LI, GUYUE HUANG, CHEN ZHANG, RUIGUANG ZHONG
Abstract: The present application discloses a warp execution method used for the streaming processors (SPs) of a streaming multiprocessor (SM) of a GPU, and an associated GPU. The SPs share a scratchpad memory. The warp execution method includes: when a predetermined time point for warp loading is reached, checking a first indicator to obtain the size of the space with the status of blank in the scratchpad memory, in order to determine whether to load the warp, wherein the first indicator indicates the starting position of the space with the status of data-in-use and the ending position of the space with the status of blank; and when a predetermined time point for computing is reached, checking a second indicator and a third indicator to obtain the size of the space with the status of data-not-in-use in the scratchpad memory, in order to determine whether to compute the warp.
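The indicator checks described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes the scratchpad is managed as a circular buffer split into three regions (data-in-use, data-not-in-use, blank), that the second and third indicators mark the start and end of the data-not-in-use region, and that indicators are kept as monotonically increasing counters so that an empty buffer and a full buffer stay distinguishable. The class name `Scratchpad` and the methods `try_load`, `try_compute`, and `retire` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Scratchpad:
    """Hypothetical model of a shared scratchpad with three indicators.

    Assumed layout (not stated in the abstract): the regions follow each
    other in a ring as data-in-use, data-not-in-use, blank.
    """
    size: int
    first: int = 0   # start of data-in-use, which is also the end of blank
    second: int = 0  # assumed: start of data-not-in-use
    third: int = 0   # assumed: end of data-not-in-use (start of blank)

    def blank(self) -> int:
        # Everything between first and third is occupied; the rest is blank.
        return self.size - (self.third - self.first)

    def not_in_use(self) -> int:
        # Data that has been loaded but not yet picked up for computing.
        return self.third - self.second

    def try_load(self, n: int) -> bool:
        """At the warp-loading time point: load only if n units are blank."""
        if n > self.blank():
            return False
        self.third += n  # blank space becomes data-not-in-use
        return True

    def try_compute(self, n: int) -> bool:
        """At the computing time point: compute only if n units are ready."""
        if n > self.not_in_use():
            return False
        self.second += n  # data-not-in-use becomes data-in-use
        return True

    def retire(self, n: int) -> None:
        """Release up to n units of data-in-use back to blank."""
        self.first += min(n, self.second - self.first)
```

With an 8-unit scratchpad, loading 6 units succeeds, a further 4-unit load is refused until a computing warp retires its data, and a compute request larger than the loaded-but-unused data is likewise refused.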
-
Publication No.: US20230367741A1
Publication date: 2023-11-16
Application No.: US17937973
Filing date: 2022-10-04
Inventors: HAORAN LI, FEI SUN, YUAN GAO, GUYUE HUANG, RUIGUANG ZHONG, CHEN ZHANG
CPC classification: G06F15/82, G06F9/3013
Abstract: The present application discloses a GPU and a method of the same. The GPU includes: a plurality of streaming multiprocessors (SMs), each including a plurality of streaming processors (SPs), each SP including a register, wherein each SP has a predetermined upper bound on warp number and the register has a predetermined upper bound on register capacity; and a global dispatcher, including: a register occupancy status table, for recording the warp number and the register occupancy status of each SP of each SM; a thread block (TB) dispatch module, for dispatching a TB to a first SM of the SMs according to a warp type classification table and the register occupancy status table; and a warp dispatch module, for dispatching a plurality of warps to the plurality of SPs of the first SM according to the warp type classification table and the register occupancy status table.
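The table-driven dispatch described in the abstract can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed design: the warp type classification table is assumed to map each warp type to a register demand, a TB is assumed to be a list of warp types, the TB dispatch step is assumed to pick the first SM on which every warp fits, and the warp dispatch step is assumed to prefer the SP with the fewest registers in use. The names `GlobalDispatcher`, `SPStatus`, and `dispatch_tb` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SPStatus:
    """One row of the register occupancy status table."""
    warps: int = 0      # warps currently resident on this SP
    regs_used: int = 0  # registers currently occupied on this SP


class GlobalDispatcher:
    def __init__(self, num_sms, sps_per_sm, max_warps, max_regs, warp_regs):
        self.max_warps = max_warps  # upper bound of warp number per SP
        self.max_regs = max_regs    # upper bound of register capacity per SP
        # Assumed form of the warp type classification table:
        # warp type -> registers that a warp of this type needs.
        self.warp_regs = warp_regs
        self.table = [[SPStatus() for _ in range(sps_per_sm)]
                      for _ in range(num_sms)]

    def _place(self, sps, tb):
        """Warp dispatch: try to place every warp of tb on the given SPs.

        Works on a copy, so a failed attempt leaves the table unchanged.
        """
        trial = [SPStatus(s.warps, s.regs_used) for s in sps]
        placement = []
        for wtype in tb:
            need = self.warp_regs[wtype]
            fits = [i for i, s in enumerate(trial)
                    if s.warps < self.max_warps
                    and s.regs_used + need <= self.max_regs]
            if not fits:
                return None
            # Assumed policy: pick the SP with the fewest registers in use.
            best = min(fits, key=lambda i: trial[i].regs_used)
            trial[best].warps += 1
            trial[best].regs_used += need
            placement.append(best)
        return trial, placement

    def dispatch_tb(self, tb):
        """TB dispatch: return (sm_index, sp_index_per_warp), or None."""
        for sm, sps in enumerate(self.table):
            result = self._place(sps, tb)
            if result is not None:
                self.table[sm], placement = result
                return sm, placement
        return None
```

Trying the placement on a copy before committing keeps the occupancy table consistent when a TB only partly fits on an SM, which is one simple way to make the TB-level and warp-level decisions agree.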
-