Invention patent application
US20130232322A1 UNIFORM LOAD PROCESSING FOR PARALLEL THREAD SUB-SETS (in force)

Abstract:
One embodiment of the present invention sets forth a technique for processing load instructions for parallel threads of a thread group when a sub-set of the parallel threads requests the same memory address. The load/store unit determines whether the memory addresses for each sub-set of parallel threads match based on one or more uniform patterns. When a match is achieved for at least one of the uniform patterns, the load/store unit transmits a read request to retrieve data for the sub-set of parallel threads. The number of read requests transmitted is reduced compared with performing a separate read request for each thread in the sub-set. A variety of uniform patterns may be defined based on common access patterns present in program instructions. A variety of uniform patterns may also be defined based on interconnect constraints between the load/store unit and the memory when a full crossbar interconnect is not available.
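
The matching step described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the claimed hardware: the 8-thread group size, the pattern names (`all_threads`, `halves`, `pairs`), and the functions `pattern_matches` and `plan_reads` are all hypothetical, and the abstract does not say which patterns are actually defined or in what order they are tried.

```python
from typing import Dict, List, Sequence, Tuple

# Assumption: a "uniform pattern" is modeled as a partition of the thread
# indices of one thread group into sub-sets.
Pattern = Sequence[Sequence[int]]
ReadRequest = Tuple[int, Tuple[int, ...]]  # (address, threads served)


def pattern_matches(addresses: Sequence[int], pattern: Pattern) -> bool:
    """True if, in every sub-set, all threads request the same address."""
    return all(len({addresses[t] for t in subset}) == 1 for subset in pattern)


def plan_reads(
    addresses: Sequence[int], patterns: Dict[str, Pattern]
) -> Tuple[str, List[ReadRequest]]:
    """Return the first matching pattern name and the read requests to issue."""
    for name, pattern in patterns.items():
        if pattern_matches(addresses, pattern):
            # One read request per sub-set instead of one per thread.
            return name, [(addresses[s[0]], tuple(s)) for s in pattern]
    # No pattern matched: fall back to a separate read for each thread.
    return "none", [(addr, (t,)) for t, addr in enumerate(addresses)]


# Hypothetical patterns for an 8-thread group, tried coarsest first.
PATTERNS: Dict[str, Pattern] = {
    "all_threads": [list(range(8))],
    "halves": [[0, 1, 2, 3], [4, 5, 6, 7]],
    "pairs": [[0, 1], [2, 3], [4, 5], [6, 7]],
}

name, reads = plan_reads([0x100] * 4 + [0x200] * 4, PATTERNS)
print(name, len(reads))  # prints: halves 2
```

In this model, eight threads whose addresses are uniform within each half of the group are served by two read requests rather than eight. Restricting the candidate patterns to a fixed list is one way to picture the interconnect constraint mentioned in the abstract, where not every thread-to-data routing is available.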