-
公开(公告)号:US20220413945A1
公开(公告)日:2022-12-29
申请号:US17366770
申请日:2021-07-02
Applicant: NVIDIA Corporation
Inventor: Piotr Ciolkosz , Kyrylo Perelygin , Harold Carter Edwards , Wesley Maxey
Abstract: Apparatuses, systems, and techniques to implement a barrier operation. In at least one embodiment, a memory barrier operation causes accesses to memory by a plurality of groups of threads to occur in an order indicated by the memory barrier operation.
-
公开(公告)号:US20230111125A1
公开(公告)日:2023-04-13
申请号:US17497731
申请日:2021-10-08
Applicant: NVIDIA Corporation
Inventor: Piotr Ciolkosz , Kyrylo Perelygin , Harold Carter Edwards , Wesley Maxey
Abstract: Apparatuses, systems, and techniques to perform parallel processing. In at least one embodiment, a parallel processing algorithm for performing an additive prefix scan is selected from a plurality of alternatives based on an arrangement of a group of threads provided to perform the scan.
-
公开(公告)号:US20230305853A1
公开(公告)日:2023-09-28
申请号:US17705154
申请日:2022-03-25
Applicant: NVIDIA Corporation
Inventor: Piotr Ciolkosz , Kyrylo Perelygin , Harold Carter Edwards , Wesley Maxey
CPC classification number: G06F9/3851 , G06T15/005
Abstract: Apparatuses, systems, and techniques to perform collective operations using parallel processing. In at least one embodiment, a non-blocking application programming interface allow programs to improve performance of one or more collective operations on a GPU.
-
公开(公告)号:US20230086989A1
公开(公告)日:2023-03-23
申请号:US17478079
申请日:2021-09-17
Applicant: NVIDIA Corporation
Inventor: Piotr Ciolkosz , Kyrylo Perelygin , Harold Carter Edwards , Wesley Maxey
Abstract: Apparatuses, systems, and techniques to facilitate parallel processing. In at least one embodiment, an application programming interface allows a user to define a plurality of cooperative thread groups, and launch multiple cooperative thread groups in parallel provided sufficient processing resources are available.
-
-
-