Operation Fusion in Nested Meta-pipeline Loops

    公开(公告)号:US20230315411A1

    公开(公告)日:2023-10-05

    申请号:US18130642

    申请日:2023-04-04

    IPC分类号: G06F8/41

    摘要: A method for improving throughput in a reconfigurable computing system includes detecting, in an algebraic representation of a computing task for a reconfigurable dataflow processor, an outer meta-pipeline loop, detecting an inner meta-pipeline loop nested within the outer meta-pipeline loop, and determining that the inner meta-pipeline loop and the outer meta-pipeline loop each conduct a common operation. The method also includes fusing the common operation for the inner meta-pipeline loop and the outer meta-pipeline loop into a single operation within the inner meta-pipeline loop. The instances of the common operation may be fused if the output of a first instance of the common operation is the source for a second instance of the common operation. Examples of the common operation include an accumulator operation, a re-read operation, and a temporal (chip buffer synchronized) operation such as a temporal concatenation operation and a temporal slicing operation.

    SLOPPY FEEDBACK LOOP COMPILATION
    6.
    发明申请

    公开(公告)号:US20170185387A1

    公开(公告)日:2017-06-29

    申请号:US15325332

    申请日:2014-08-28

    IPC分类号: G06F9/45 G06F9/38

    摘要: A system, method, and a computer-readable storage device for sloppy feedback loop compilation are described herein. For example, a dataflow application definition can be obtained. The dataflow application definition may include a feedback loop bounding a compute block. It may then be determined that the feedback loop is to be compiled as a sloppy feedback loop. The dataflow application can then be compiled as an executable object that pipelines the compute block of the dataflow application.

    Code profiling of executable library for pipeline parallelization

    公开(公告)号:US09619360B2

    公开(公告)日:2017-04-11

    申请号:US14479455

    申请日:2014-09-08

    IPC分类号: G06F9/44 G06F11/34 G06F9/45

    摘要: A method and system for creating a library method stub in source code form corresponding to an original library call in machine-executable form. The library method stub is created in a predefined programming language by use of a library method signature associated with the original library call, at least one idiom sentence, and a call invoking the original library call. Creating the library method stub includes composing source code of the library method stub by matching the at least one idiom sentence with idiom-stub mappings predefined for each basic idiom of at least one basic idiom. The original library call appears in sequential code. The library method signature specifies formal arguments of the original library call. The at least one idiom sentence summarizes memory operations performed by the original library call on the formal arguments. The created library method stub is stored in a database.

    Confluence analysis and loop fast-forwarding for improving SIMD execution efficiency

    公开(公告)号:US09612811B2

    公开(公告)日:2017-04-04

    申请号:US14160426

    申请日:2014-01-21

    IPC分类号: G06F9/45

    摘要: One embodiment of the present invention sets forth a method for causing thread convergence. The method includes determining that a control flow graph representing a first section of a program includes at least two non-overlapping paths that extend from a first divergent node to a candidate node. The method also includes determining that the first divergent node is not a dominator of the candidate node or that the candidate node is not a post-dominator of the first divergent node. The method further includes identifying an external node and inserting a first instruction configured to cause a predicate variable to be set to true for a first set of threads that is to execute the external node. The method additionally includes inserting into the program a second divergent node configured to cause various threads to execute or not execute a first control flow path associated with the external node.

    BUILD AND COMPILE STATISTICS FOR DISTRIBUTED ENGINEERS
    10.
    发明申请
    BUILD AND COMPILE STATISTICS FOR DISTRIBUTED ENGINEERS 有权
    分布式工程师的建立和编制统计

    公开(公告)号:US20160124721A1

    公开(公告)日:2016-05-05

    申请号:US14530440

    申请日:2014-10-31

    申请人: AppDynamics, Inc.

    IPC分类号: G06F9/44 H04L29/08

    摘要: The present technology adds code to a top level build configuration file of a configuration program that will gather metrics for each invocation of a build. These metrics are sent to a commonly accessible metric server for future analysis. The metrics are collected for a distributed engineering team over several machines. Compilation time metrics may then be collected for each compilation event and those metrics are analyzed by a common aggregator.

    摘要翻译: 本技术将代码添加到配置程序的顶级构建配置文件中,该配置程序将收集每次调用构建的度量标准。 这些指标被发送到一个常用的度量标准服务器,以便将来进行分析。 通过几台机器为分布式工程团队收集这些指标。 然后可以针对每个编译事件收集编译时间度量,并且这些度量由公共聚合器分析。