Publication number: US20240220314A1
Publication date: 2024-07-04
Application number: US18091441
Application date: 2022-12-30
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Harris Gasparakis
CPC classification number: G06F9/4881, G06F9/522
Abstract: A processing system flexibly schedules workgroups across kernels based on data dependencies between workgroups to enhance processing efficiency. The workgroups are partitioned into subsets based on the data dependencies, and workgroups of a first subset that produces data are scheduled to execute immediately before workgroups of a second subset that consumes the data generated by the first subset. Thus, the processing system does not execute one kernel at a time, but instead schedules workgroups across kernels based on data dependencies across kernels. By limiting the sizes of the subsets to the amount of data that can be stored at local caches, the processing system increases the probability that data to be consumed by workgroups of a subset will be resident in a local cache and will not require a memory access.
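
Illustrative sketch (not part of the patent record): the scheduling idea in the abstract can be approximated in a few lines of Python. This is a simplified illustration under stated assumptions, not the claimed method. It assumes exactly two kernels, a producer and a consumer, where consumer workgroup i reads only the output of producer workgroup i, and where the output size of each producer workgroup is known in advance. The function names `partition_by_cache` and `interleaved_schedule`, the kernel labels, and the byte sizes are all hypothetical.

```python
from typing import List, Tuple


def partition_by_cache(output_bytes: List[int], cache_bytes: int) -> List[List[int]]:
    """Group consecutive producer workgroup IDs into subsets whose
    combined output fits in a cache of cache_bytes (hypothetical model)."""
    subsets: List[List[int]] = []
    current: List[int] = []
    used = 0
    for wg_id, size in enumerate(output_bytes):
        if size > cache_bytes:
            raise ValueError(f"workgroup {wg_id} output exceeds cache size")
        # Close the current subset when adding this workgroup would overflow the cache.
        if current and used + size > cache_bytes:
            subsets.append(current)
            current, used = [], 0
        current.append(wg_id)
        used += size
    if current:
        subsets.append(current)
    return subsets


def interleaved_schedule(output_bytes: List[int], cache_bytes: int) -> List[Tuple[str, int]]:
    """Emit (kernel, workgroup_id) pairs so that each producer subset is
    followed immediately by the consumer workgroups that read its output."""
    schedule: List[Tuple[str, int]] = []
    for subset in partition_by_cache(output_bytes, cache_bytes):
        schedule.extend(("producer", wg) for wg in subset)
        schedule.extend(("consumer", wg) for wg in subset)
    return schedule


if __name__ == "__main__":
    # Three producer workgroups of 4 bytes each and an 8-byte cache (made-up numbers).
    for kernel, wg in interleaved_schedule([4, 4, 4], cache_bytes=8):
        print(kernel, wg)
```

A kernel-at-a-time schedule would run all producer workgroups and then all consumer workgroups, so early producer output could be evicted before it is read. In the interleaved order above, each consumer subset runs while the data it needs is still likely to be cache-resident, which is the effect the abstract describes.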