-
公开(公告)号:US20230100552A1
公开(公告)日:2023-03-30
申请号:US17929672
申请日:2022-09-02
Applicant: NVIDIA Corporation
Inventor: Albert Davies , Akash Bellubbi , Ashutosh Tadkase , Bruce Holmer , Suraj Das , Vishanth Iyer , Sever Topan , Ian Tramble , Linda Xiong , Sharat Janapareddy , Ranvijay Singh , John Lore
Abstract: One or more embodiments of the present disclosure relate to identifying, based on application data associated with a computing application that includes a set of runnables, a plurality of scheduling branches associated with scheduling execution of at least a subset of runnables of the set of runnables. Further, one or more embodiments relate to selecting a scheduling branch from the plurality of scheduling branches based at least on a coupling constraint that is applied to related runnables of at least the subset of runnables. The related runnables may include a first runnable that is designated for execution on a first compute engine and that triggers execution of a second runnable on a second compute engine. In addition, one or more embodiments may relate to determining an execution schedule of the set of runnables based at least on the scheduling branch.
-
公开(公告)号:US20230096502A1
公开(公告)日:2023-03-30
申请号:US17929674
申请日:2022-09-02
Applicant: NVIDIA Corporation
Inventor: Ashutosh Tadkase , Akash Bellubbi , Ian Tramble , Peter Boonstoppel , Suraj Das , Ranvijay Singh , Sever Topan , Albert Davies , Linda Xiong , Sharat Janapareddy , Ashkan Vafaee , Sai Gurrappadi , Bruce Holmer , Vishanth Iyer , John Lore , Ian Howson , Pulkit Desai , Michael Cox
Abstract: One or more embodiments of the present disclosure relate to executing, by a plurality of compute engines, a plurality of runnables of a computing application based at least on an execution schedule and a set of commands associated with the execution schedule. The execution schedule may be generated using a compiling system to include the set of commands. The set of commands may include one or more individual commands corresponding to one or more timing fences dictating a timing and order of execution of one or more individual runnables of the plurality of runnables.
-