-
公开(公告)号:US20230024350A1
公开(公告)日:2023-01-26
申请号:US17953991
申请日:2022-09-27
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Chong LI , Thibaut TACHON , Hongxing WANG , Kelun CHAI , Chang LIU
IPC: G06N3/06
Abstract: A device receives a computation graph and transforms the computation graph into a dataflow graph comprising recursive subgraphs. Each recursive subgraph comprises a tuple of another recursive subgraph and an operator node, or an empty graph. The device determines a number of partitioning recursions based on a number of parallel computing devices. For each partitioning recursion, the device determines costs corresponding to operator nodes, determines a processing order of the recursive subgraphs, and processes the recursive subgraphs. To process a recursive subgraph, the device selects a partitioning axis for tensors associated with an operator node of the recursive subgraph. The device outputs a partitioning scheme comprising partitioning axes for each tensor associated with the operator nodes.