-
公开(公告)号:US12056534B2
公开(公告)日:2024-08-06
申请号:US18091951
申请日:2022-12-30
Applicant: Google LLC
Inventor: Sheng Li , Brian Zhang , Liqun Cheng , Norman Paul Jouppi , Yun Ni
CPC classification number: G06F9/5044 , G06F9/4881 , G06F9/5011 , G06F9/5066 , G06F9/545 , G06F11/3409 , G06F11/3612 , G06F18/214 , G06N3/08 , G06F2209/501
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for scheduling operations represented as a computational graph on a distributed computing network. A method includes: receiving data representing operations to be executed in order to perform a job on a plurality of hardware accelerators of a plurality of different accelerator types; generating, for the job and from at least the data representing the operations, features that represent a predicted performance for the job on hardware accelerators of the plurality of different accelerator types; generating, from the features, a respective predicted performance metric for the job for each of the plurality of different accelerator types according to a performance objective function; and providing, to a scheduling system, one or more recommendations for scheduling the job on one or more recommended types of hardware accelerators.
-
公开(公告)号:US20210073028A1
公开(公告)日:2021-03-11
申请号:US16600437
申请日:2019-10-11
Applicant: Google LLC
Inventor: Sheng Li , Brian Zhang , Liqun Cheng , Norman Paul Jouppi , Yun Ni
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for scheduling operations represented as a computational graph on a distributed computing network. A method includes: receiving data representing operations to be executed in order to perform a job on a plurality of hardware accelerators of a plurality of different accelerator types; generating, for the job and from at least the data representing the operations, features that represent a predicted performance for the job on hardware accelerators of the plurality of different accelerator types; generating, from the features, a respective predicted performance metric for the job for each of the plurality of different accelerator types according to a performance objective function; and providing, to a scheduling system, one or more recommendations for scheduling the job on one or more recommended types of hardware accelerators.
-
公开(公告)号:US11544105B2
公开(公告)日:2023-01-03
申请号:US16600437
申请日:2019-10-11
Applicant: Google LLC
Inventor: Sheng Li , Brian Zhang , Liqun Cheng , Norman Paul Jouppi , Yun Ni
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for scheduling operations represented as a computational graph on a distributed computing network. A method includes: receiving data representing operations to be executed in order to perform a job on a plurality of hardware accelerators of a plurality of different accelerator types; generating, for the job and from at least the data representing the operations, features that represent a predicted performance for the job on hardware accelerators of the plurality of different accelerator types; generating, from the features, a respective predicted performance metric for the job for each of the plurality of different accelerator types according to a performance objective function; and providing, to a scheduling system, one or more recommendations for scheduling the job on one or more recommended types of hardware accelerators.
-
公开(公告)号:US20230222000A1
公开(公告)日:2023-07-13
申请号:US18091951
申请日:2022-12-30
Applicant: Google LLC
Inventor: Sheng Li , Brian Zhang , Liqun Cheng , Norman Paul Jouppi , Yun Ni
CPC classification number: G06F9/5044 , G06F9/545 , G06F11/3612 , G06F9/5066 , G06F18/214 , G06N3/08 , G06F9/5011 , G06F11/3409 , G06F9/4881 , G06F2209/501
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for scheduling operations represented as a computational graph on a distributed computing network. A method includes: receiving data representing operations to be executed in order to perform a job on a plurality of hardware accelerators of a plurality of different accelerator types; generating, for the job and from at least the data representing the operations, features that represent a predicted performance for the job on hardware accelerators of the plurality of different accelerator types; generating, from the features, a respective predicted performance metric for the job for each of the plurality of different accelerator types according to a performance objective function; and providing, to a scheduling system, one or more recommendations for scheduling the job on one or more recommended types of hardware accelerators.
-
-
-