发明申请
- 专利标题: UNIFORM LOAD PROCESSING FOR PARALLEL THREAD SUB-SETS
- 专利标题(中): 用于并联螺纹子组的均匀加载处理
-
申请号: US13412438申请日: 2012-03-05
-
公开(公告)号: US20130232322A1公开(公告)日: 2013-09-05
- 发明人: Michael FETTERMAN , Stewart Glenn Carlton , Douglas J. Hahn , Rajeshwaran Selvanesan , Shirish Gadre , Steven James Heinrich
- 申请人: Michael FETTERMAN , Stewart Glenn Carlton , Douglas J. Hahn , Rajeshwaran Selvanesan , Shirish Gadre , Steven James Heinrich
- 主分类号: G06F9/312
- IPC分类号: G06F9/312 ; G06F9/38
摘要:
One embodiment of the present invention sets forth a technique for processing load instructions for parallel threads of a thread group when a sub-set of the parallel threads request the same memory address. The load/store unit determines if the memory addresses for each sub-set of parallel threads match based on one or more uniform patterns. When a match is achieved for at least one of the uniform patterns, the load/store unit transmits a read request to retrieve data for the sub-set of parallel threads. The number of read requests transmitted is reduced compared with performing a separate read request for each thread in the sub-set. A variety of uniform patterns may be defined based on common access patterns present in program instructions. A variety of uniform patterns may also be defined based on interconnect constraints between the load/store unit and the memory when a full crossbar interconnect is not available.
公开/授权文献
- US10007527B2 Uniform load processing for parallel thread sub-sets 公开/授权日:2018-06-26
信息查询