Publication number: US20230409889A1
Publication date: 2023-12-21
Application number: US17842910
Filing date: 2022-06-17
Applicant: Google LLC
Inventor: Salem Elie Haykal, Arvind Krishnamurthy, Chang Lan, Soroush Radpour
CPC classification number: G06N3/063, G06N3/08, G06N3/0472
Abstract: Aspects of the disclosure are directed to performing disaggregation-aware model graph partitioning, which can include provisioning and load balancing disaggregated resource pools, such as general purpose processors, accelerators, general purpose memory, and high bandwidth memory. Across these disaggregated resource pools, machine learning model operations can be packed and/or batched. The partitioning can further include automatically tuning runtime parameters.