-
公开(公告)号:US11966766B2
公开(公告)日:2024-04-23
申请号:US17076393
申请日:2020-10-21
Applicant: Google LLC
Inventor: Chang Lan , Soroush Radpour
CPC classification number: G06F9/45558 , G06N3/08 , G06N3/084 , G06F2009/45562 , G06F2009/4557 , G06N3/098
Abstract: A data processing system, that includes: one or more host processing devices, the one or more host processing devices may be configured to support instantiation of a plurality of virtual machines such that a first set of virtual machines run one or more worker processes, each worker process operating on a respective data set to produce a respective gradient. The host processing devices may be configured to support instantiation of a second set of virtual machines running one or more reducer processes that operate on each respective gradient produced by each worker process to produce an aggregated gradient. The one or more reducer processes may cause the aggregated gradient to be broadcasted to each worker process.
-
公开(公告)号:US20230409889A1
公开(公告)日:2023-12-21
申请号:US17842910
申请日:2022-06-17
Applicant: Google LLC
Inventor: Salem Elie Haykal , Arvind Krishnamurthy , Chang Lan , Soroush Radpour
CPC classification number: G06N3/063 , G06N3/08 , G06N3/0472
Abstract: Aspects of the disclosure are directed to performing disaggregation-aware model graph partitioning, which can include provisioning and load balancing disaggregated resource pools, such as general purpose processors, accelerators, general purpose memory, and high bandwidth memory. Across these disaggregated resource pools, machine learning model operations can be packed and/or batched. The partitioning can further include automatically tuning runtime parameters.
-
公开(公告)号:US20220121465A1
公开(公告)日:2022-04-21
申请号:US17076393
申请日:2020-10-21
Applicant: Google LLC
Inventor: Chang Lan , Soroush Radpour
Abstract: A data processing system, that includes: one or more host processing devices, the one or more host processing devices may be configured to support instantiation of a plurality of virtual machines such that a first set of virtual machines run one or more worker processes, each worker process operating on a respective data set to produce a respective gradient. The host processing devices may be configured to support instantiation of a second set of virtual machines running one or more reducer processes that operate on each respective gradient produced by each worker process to produce an aggregated gradient. The one or more reducer processes may cause the aggregated gradient to be broadcasted to each worker process.
-
-