TENSOR CHECKPOINT OPTIMIZATION IN DATAFLOW COMPUTING APPLICATIONS

    公开(公告)号:US20230315407A1

    公开(公告)日:2023-10-05

    申请号:US18129722

    申请日:2023-03-31

    CPC classification number: G06F8/433

    Abstract: According to a computing method a compiler determines a recompute node included in a dataflow application and a checkpoint tensor produced by the recompute node. The compiler determines a recompute cost to recompute the checkpoint tensor, and a memory cost to checkpoint the checkpoint tensor in a memory. Based on the recompute cost and/or the memory cost, the compiler determines a solution cost and compares the solution cost to a solution threshold. Based on comparing the solution cost to the solution threshold, the compiler determines a checkpoint solution to execute the dataflow application. The checkpoint solution can comprise recomputing or checkpointing the checkpoint tensor. In some implementations, the compiler can determine a recompute ratio of the recompute cost to the memory cost and can compare the recompute ratio to the solution threshold. A computer program product and a computing system can implement aspects of the method.

    METHOD AND SYSTEM TO DETERMINE EXECUTION INEFFICIENCIES IN DATAFLOW PROGRAMS

    公开(公告)号:US20240069880A1

    公开(公告)日:2024-02-29

    申请号:US18387906

    申请日:2023-11-08

    CPC classification number: G06F8/433

    Abstract: In a method a computer-implemented efficiency analyzer selects operators from an intermediate representation of a dataflow program. The operators are included in a mapping of the operators to hardware of a computing system to execute the dataflow program. Based on the mapping and a description of the hardware, the efficiency analyzer computes an execution metric associated with executing the operators on the hardware. Based on the execution metric and hardware description, the efficiency analyzer determines an inefficiency metric, and based on the inefficiency metric, the efficiency analyzer determines an inefficiency associated with the dataflow program. The computing system to execute the dataflow program can comprise a coarse grain computing system and the hardware can include a reconfigurable processor of the computing system. A computer program product and a computing system to a the dataflow program can implement the method.

    COMPILER OPTIMIZATION OF DATAFLOW APPLICATIONS USING MIXED INTEGER EQUATIONS

    公开(公告)号:US20230315802A1

    公开(公告)日:2023-10-05

    申请号:US18128076

    申请日:2023-03-29

    CPC classification number: G06F17/11

    Abstract: A method comprises a compiler generating a MI (mixed integer) model to determine mapping decisions to map a dataflow application to hardware of a computing system to execute the application. The MI model comprises MI equations to solve by an MI solver. The MI equations include equations of an objective function corresponding to an optimization objective. The MI equations can comprise decision variables and equations and constraint variables and equations. The compiler outputs the MI model to the MI solver and invokes the MI solver to compute an MI solution comprising solutions to equations among the equations included in the MI model. The compiler receives the MI solution and generates a globally optimized mapping decision based on the MI solution. The MI solver can comprise a commercial program to solve MI linear equations. A computer program product and a computing system can implement the method.

    ANALYSIS ASSISTANT FOR DETERMINING EXECUTION INEFFICIENCIES IN DATAFLOW PROGRAMS

    公开(公告)号:US20240078098A1

    公开(公告)日:2024-03-07

    申请号:US18387912

    申请日:2023-11-08

    CPC classification number: G06F8/433

    Abstract: In a method, in response to an interface a computer-implemented analysis assistant initiates a presentation of inefficiency results, determined an efficiency analyzer based on a mapping of a dataflow program to execute on hardware of a computing system. The assistant receives an inefficiency included among the inefficiency results and composes formatted inefficiency results comprising a presentation format of the inefficiency to assist a developer of the dataflow program to interpret the inefficiency. The analysis assistant outputs the formatted inefficiency results to an interface, which can comprise an interface to output the formatted inefficiency results for use by the developer to improve the dataflow program in association with the inefficiency. In implementations the presentation can comprise an interactive presentation with a developer of the dataflow program. A computer program product and a computing system can implement the method. The computing system can execute the assistant, and can include the interfaces.

    OPTIMIZING TENSOR TILING IN NEURAL NETWORKS BASED ON A TILING COST MODEL

    公开(公告)号:US20230315410A1

    公开(公告)日:2023-10-05

    申请号:US18129714

    申请日:2023-03-31

    CPC classification number: G06F8/443

    Abstract: A method comprises a compiler analyzing a graph to determine a pipeline of operators based on a shared dimension of input and output tensors among the operators. The operators are included in the graph and the graph corresponds to a dataflow application. The compiler determines a tiling decision associated with the pipeline and a tiling cost associated with the tiling decision. The tiling decision can comprise a tile shape to slice tensors of operators of the pipeline. Based on the tiling cost, the compiler determines that the tiling decision improves an optimization objective and includes the pipeline and tiling decision in mapping decisions associated with executing the application on a computing system. The compiler can apply a tiling cost model to determine the tiling costs. A computer program product and a computing system can implement the method.

    ITERATIVE COMPILATION TO OPTIMIZE TRANSLATION IN RECONFIGURABLE DATAFLOW ARCHITECTURES

    公开(公告)号:US20230315406A1

    公开(公告)日:2023-10-05

    申请号:US18129718

    申请日:2023-03-31

    CPC classification number: G06F8/433

    Abstract: In a method a compiler performs a trial compilation to a low level (LL) intermediate representation (IR) of a high level (HL) decision to execute a dataflow application on a computing system. The LLIR comprises hardware resources to execute the application based on the HL decision and the compiler determines a trial result based on LL execution metrics associated with the trail compilation. The compiler performs a trial compilation of a second HL decision to a second LLIR and determines a trial result based on LL execution metrics associated with the second trail compilation. The compiler evaluates the trial results and, based on the evaluations, selects one or both of the HL decisions for executing the dataflow application. A computer program product and a computing system can implement the method.

Patent Agency Ranking