-
Publication No.: US20230315414A1
Publication Date: 2023-10-05
Application No.: US18128104
Application Date: 2023-03-29
Applicant: SambaNova Systems, Inc.
Inventor: Bowen YANG , Fei WANG , Shengyue HUO
CPC classification number: G06F8/45 , G06F16/2237
Abstract: A method comprises a compiler determining operators and matrices of an application model. The compiler generates a dimension-based search space (DBSS) comprising Named Nodes corresponding to the operators. The Named Nodes comprise a Named DIM corresponding to a matrix associated with an operator. The Named DIM comprises a DIM Name corresponding to a dimension of a row or column of the matrix. The DBSS comprises an application programming interface (API) to determine operators, matrices, and/or attributes of operators/matrices of the application model using the DIM Names. The method includes the compiler determining the operator, the matrix, and the Named DIM and generating an entry in the DBSS that includes a Named Node corresponding to the operator, a Named DIM corresponding to the matrix and including the DIM Name. A computing system and/or a computer program product can implement the method.
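The abstract above describes the DBSS only at a high level. As a rough illustration, and not the patented implementation, the entry structure and a name-based lookup might be sketched as follows. The class names, field names, and the two query methods (`operators_with_dim`, `matrices_with_dim`) are hypothetical; the abstract says only that the API resolves operators and matrices through DIM Names.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class NamedDim:
    """Hypothetical Named DIM: one matrix plus DIM Names for its row and column."""
    matrix: str
    row_dim: str
    col_dim: str


@dataclass
class NamedNode:
    """Hypothetical Named Node: one operator and the Named DIMs of its matrices."""
    operator: str
    dims: List[NamedDim] = field(default_factory=list)


class DBSS:
    """Toy dimension-based search space keyed by operator name (an assumption)."""

    def __init__(self) -> None:
        self.nodes: Dict[str, NamedNode] = {}

    def add_entry(self, operator: str, matrix: str, row_dim: str, col_dim: str) -> None:
        # Create the Named Node on first use, then attach the Named DIM to it.
        node = self.nodes.setdefault(operator, NamedNode(operator))
        node.dims.append(NamedDim(matrix, row_dim, col_dim))

    def operators_with_dim(self, dim_name: str) -> List[str]:
        # Operators that have at least one matrix carrying the given DIM Name.
        return sorted(
            op for op, node in self.nodes.items()
            if any(dim_name in (d.row_dim, d.col_dim) for d in node.dims)
        )

    def matrices_with_dim(self, dim_name: str) -> List[Tuple[str, str]]:
        # (operator, matrix) pairs whose row or column carries the DIM Name.
        return sorted(
            (op, d.matrix)
            for op, node in self.nodes.items()
            for d in node.dims
            if dim_name in (d.row_dim, d.col_dim)
        )


# Illustrative model: C = A @ B followed by an element-wise add on C.
dbss = DBSS()
dbss.add_entry("matmul0", "A", "M", "K")
dbss.add_entry("matmul0", "B", "K", "N")
dbss.add_entry("add0", "C", "M", "N")
print(dbss.operators_with_dim("M"))
print(dbss.matrices_with_dim("N"))
```

Keying lookups on DIM Names rather than positional indices is the point of the sketch: a query for "K" finds every matrix that shares the contraction dimension, whichever axis it sits on.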
-
Publication No.: US20230315407A1
Publication Date: 2023-10-05
Application No.: US18129722
Application Date: 2023-03-31
Applicant: SambaNova Systems, Inc.
Inventor: Bowen YANG , Zhuo CHEN , Fei WANG , Venkat Krishna SRINIVASAN , Chen LIU , Junjue WANG , Arvind Krishna SUJEETH , Sumti JAIRATH
IPC: G06F8/41
CPC classification number: G06F8/433
Abstract: According to a computing method, a compiler determines a recompute node included in a dataflow application and a checkpoint tensor produced by the recompute node. The compiler determines a recompute cost to recompute the checkpoint tensor, and a memory cost to checkpoint the checkpoint tensor in a memory. Based on the recompute cost and/or the memory cost, the compiler determines a solution cost and compares the solution cost to a solution threshold. Based on comparing the solution cost to the solution threshold, the compiler determines a checkpoint solution to execute the dataflow application. The checkpoint solution can comprise recomputing or checkpointing the checkpoint tensor. In some implementations, the compiler can determine a recompute ratio of the recompute cost to the memory cost and can compare the recompute ratio to the solution threshold. A computer program product and a computing system can implement aspects of the method.
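The ratio test in the abstract above can be illustrated with a minimal sketch. The abstract does not say how the costs are measured or which side of the threshold selects which solution, so the following are assumptions: both costs are normalized to comparable units, and a ratio below the threshold favors recomputing. The names `CheckpointCandidate` and `choose_solution` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CheckpointCandidate:
    name: str
    recompute_cost: float  # assumed: normalized cost to recompute the tensor
    memory_cost: float     # assumed: normalized cost to keep the tensor in memory


def choose_solution(c: CheckpointCandidate, threshold: float) -> str:
    """Return 'recompute' or 'checkpoint' for one checkpoint tensor."""
    if c.memory_cost == 0:
        # Storing the tensor costs nothing, so recomputing cannot pay off.
        return "checkpoint"
    recompute_ratio = c.recompute_cost / c.memory_cost
    # Assumed direction: a small ratio means recomputing is cheap relative
    # to the memory it frees.
    return "recompute" if recompute_ratio < threshold else "checkpoint"


cheap = CheckpointCandidate("act0", recompute_cost=2.0, memory_cost=8.0)
costly = CheckpointCandidate("act1", recompute_cost=12.0, memory_cost=4.0)
print(choose_solution(cheap, threshold=1.0))
print(choose_solution(costly, threshold=1.0))
```

With a threshold of 1.0, `act0` (ratio 0.25) is recomputed and `act1` (ratio 3.0) is checkpointed.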
-
Publication No.: US20240069880A1
Publication Date: 2024-02-29
Application No.: US18387906
Application Date: 2023-11-08
Applicant: SambaNova Systems, Inc.
Inventor: Blaine RISTER , Qingjian LI , Bowen YANG , Junjue WANG , Chen LIU , Zhuo CHEN , Arvind SUJEETH , Sumti JAIRATH
IPC: G06F8/41
CPC classification number: G06F8/433
Abstract: In a method, a computer-implemented efficiency analyzer selects operators from an intermediate representation of a dataflow program. The operators are included in a mapping of the operators to hardware of a computing system to execute the dataflow program. Based on the mapping and a description of the hardware, the efficiency analyzer computes an execution metric associated with executing the operators on the hardware. Based on the execution metric and hardware description, the efficiency analyzer determines an inefficiency metric, and based on the inefficiency metric, the efficiency analyzer determines an inefficiency associated with the dataflow program. The computing system to execute the dataflow program can comprise a coarse grain computing system and the hardware can include a reconfigurable processor of the computing system. A computer program product and a computing system to execute the dataflow program can implement the method.
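The abstract above names three stages (execution metric, inefficiency metric, inefficiency) without defining them. One possible reading, offered purely as an assumption, treats achieved throughput as the execution metric and unused peak throughput as the inefficiency metric. Every name, field, and formula below is hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class HardwareDescription:
    peak_flops_per_unit: float  # assumed: peak FLOP/s of one compute unit


@dataclass
class MappedOperator:
    name: str
    flops: float      # work performed by the operator
    units: int        # compute units the mapping assigns to it
    latency_s: float  # estimated execution time under the mapping


def execution_metric(op: MappedOperator) -> float:
    """Assumed execution metric: achieved FLOP/s under the mapping."""
    return op.flops / op.latency_s


def inefficiency_metric(op: MappedOperator, hw: HardwareDescription) -> float:
    """Assumed inefficiency metric: fraction of assigned peak left unused."""
    peak = hw.peak_flops_per_unit * op.units
    return 1.0 - execution_metric(op) / peak


def find_inefficiencies(
    ops: List[MappedOperator], hw: HardwareDescription, threshold: float
) -> List[Tuple[str, float]]:
    """Report operators whose inefficiency metric exceeds the threshold."""
    scored = ((op.name, inefficiency_metric(op, hw)) for op in ops)
    return [(name, m) for name, m in scored if m > threshold]


hw = HardwareDescription(peak_flops_per_unit=1e9)
ops = [
    MappedOperator("gemm0", flops=4e9, units=4, latency_s=1.0),  # fully used
    MappedOperator("relu0", flops=1e9, units=4, latency_s=1.0),  # mostly idle
]
print(find_inefficiencies(ops, hw, threshold=0.5))
```

Here only `relu0` is reported, because it uses a quarter of the throughput of the four units assigned to it.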
-
Publication No.: US20240078098A1
Publication Date: 2024-03-07
Application No.: US18387912
Application Date: 2023-11-08
Applicant: SambaNova Systems, Inc.
Inventor: Blaine RISTER , Qingjian LI , Bowen YANG , Junjue WANG , Chen LIU , Zhuo CHEN , Arvind SUJEETH , Sumti JAIRATH
IPC: G06F8/41
CPC classification number: G06F8/433
Abstract: In a method, in response to an interface a computer-implemented analysis assistant initiates a presentation of inefficiency results, determined by an efficiency analyzer based on a mapping of a dataflow program to execute on hardware of a computing system. The assistant receives an inefficiency included among the inefficiency results and composes formatted inefficiency results comprising a presentation format of the inefficiency to assist a developer of the dataflow program to interpret the inefficiency. The analysis assistant outputs the formatted inefficiency results to an interface, which can comprise an interface to output the formatted inefficiency results for use by the developer to improve the dataflow program in association with the inefficiency. In implementations, the presentation can comprise an interactive presentation with a developer of the dataflow program. A computer program product and a computing system can implement the method. The computing system can execute the assistant, and can include the interfaces.
-
Publication No.: US20230315410A1
Publication Date: 2023-10-05
Application No.: US18129714
Application Date: 2023-03-31
Applicant: SambaNova Systems, Inc.
Inventor: Bowen YANG , Zhuo CHEN , Chen LIU , Fei WANG , Ruobing WANG , Qinghua LI , Weiwei CHEN , Junjue WANG , Sumti JAIRATH
IPC: G06F8/41
CPC classification number: G06F8/443
Abstract: A method comprises a compiler analyzing a graph to determine a pipeline of operators based on a shared dimension of input and output tensors among the operators. The operators are included in the graph and the graph corresponds to a dataflow application. The compiler determines a tiling decision associated with the pipeline and a tiling cost associated with the tiling decision. The tiling decision can comprise a tile shape to slice tensors of operators of the pipeline. Based on the tiling cost, the compiler determines that the tiling decision improves an optimization objective and includes the pipeline and tiling decision in mapping decisions associated with executing the application on a computing system. The compiler can apply a tiling cost model to determine the tiling costs. A computer program product and a computing system can implement the method.
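The abstract above does not disclose the tiling cost model, so the following is only a toy stand-in. It assumes a pipeline whose tensors share one dimension, candidate tile sizes along that dimension, a hypothetical on-chip capacity limit, and a cost that charges a fixed per-tile overhead plus work proportional to the (possibly padded) tile size. A tiling decision is accepted only if it beats a given baseline cost, which mirrors the "improves an optimization objective" step.

```python
import math
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass
class Pipeline:
    shared_dim: int       # extent of the dimension shared by the pipeline's tensors
    bytes_per_slice: int  # assumed: bytes across all tensors per unit of that dimension


def tiling_cost(p: Pipeline, tile: int, on_chip_bytes: int,
                per_tile_overhead: float, cost_per_slice: float) -> float:
    """Hypothetical cost model, not the one in the patent application."""
    if tile * p.bytes_per_slice > on_chip_bytes:
        return math.inf  # the tile does not fit on chip
    n_tiles = math.ceil(p.shared_dim / tile)
    # Each tile pays a fixed overhead plus work for a full (padded) tile.
    return n_tiles * (per_tile_overhead + tile * cost_per_slice)


def decide_tiling(p: Pipeline, candidates: Iterable[int], on_chip_bytes: int,
                  per_tile_overhead: float, cost_per_slice: float,
                  baseline_cost: float) -> Optional[Tuple[int, float]]:
    """Return (tile, cost) if the best candidate beats the baseline, else None."""
    best = min(
        ((t, tiling_cost(p, t, on_chip_bytes, per_tile_overhead, cost_per_slice))
         for t in candidates),
        key=lambda tc: tc[1],
    )
    return best if best[1] < baseline_cost else None


pipe = Pipeline(shared_dim=100, bytes_per_slice=1024)
decision = decide_tiling(pipe, [8, 16, 32, 64, 128], on_chip_bytes=64 * 1024,
                         per_tile_overhead=10, cost_per_slice=1, baseline_cost=200)
print(decision)
```

With these illustrative numbers, the 128-wide tile is rejected for exceeding on-chip capacity, and the 64-wide tile is chosen because it needs the fewest tiles among those that fit.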
-
Publication No.: US20230259823A1
Publication Date: 2023-08-17
Application No.: US18109080
Application Date: 2023-02-13
Applicant: SambaNova Systems, Inc.
Inventor: Greg DYKEMA , Fansheng CHENG , Kuan ZHOU , Arnav GOEL , Subhra MAZUMDAR , Milad SHARIF , Po-Yu WU , Bowen YANG , Qi ZHENG
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: In a method, an orchestrator of a computing system determines that results of Machine Learning model computations are available and dispatches a worker to perform model computations that include computing gradients of the results. The orchestrator determines that a set of gradients of the results is available and dispatches a gradient worker to compute a sum of the gradients. The orchestrator determines that a second set of gradients of the results is available and dispatches a second gradient worker to compute a sum of the second set of gradients. The orchestrator determines that the sums of the first and second gradients are available and dispatches a third gradient worker to compute synchronized gradients. The gradient workers compute the sums and synchronized gradients concurrently with training workers computing additional model computation results and/or gradients. A computer program product can include the method and a computing system can include the orchestrator.
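The dispatch pattern in the abstract above (two workers summing separate gradient sets, then a third combining the sums) can be sketched with Python's standard thread pool. This is an illustration under assumptions, not the described system: gradients are plain lists of floats, "synchronized gradients" is taken to mean the average over all contributing gradients, and the function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List

Gradient = List[float]


def sum_gradients(grads: List[Gradient]) -> Gradient:
    """Gradient worker: element-wise sum of one set of gradients."""
    return [sum(column) for column in zip(*grads)]


def synchronize(sum_a: Gradient, sum_b: Gradient, count: int) -> Gradient:
    """Third gradient worker: combine both partial sums (assumed: average)."""
    return [(a + b) / count for a, b in zip(sum_a, sum_b)]


def orchestrate(set_a: List[Gradient], set_b: List[Gradient]) -> Gradient:
    """Toy orchestrator: dispatch workers as their inputs become available."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        # The two partial sums are independent, so they can run concurrently.
        fut_a = pool.submit(sum_gradients, set_a)
        fut_b = pool.submit(sum_gradients, set_b)
        # The third worker is dispatched only once both sums are available.
        fut_sync = pool.submit(
            synchronize, fut_a.result(), fut_b.result(), len(set_a) + len(set_b)
        )
        return fut_sync.result()


set_a = [[1.0, 2.0], [3.0, 4.0]]
set_b = [[5.0, 6.0], [7.0, 8.0]]
print(orchestrate(set_a, set_b))
```

In the described method, training workers would keep producing results while these sums run; the sketch omits that overlap and shows only the dependency order among the three gradient workers.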
-