Mechanism to restrict parallelization of loops
    1.
    发明授权
    Mechanism to restrict parallelization of loops 失效
    限制环路并行化的机制

    公开(公告)号:US08104030B2

    公开(公告)日:2012-01-24

    申请号:US11314456

    申请日:2005-12-21

    IPC分类号: G06F9/44 G06F9/45

    CPC分类号: G06F8/4452

    摘要: A computer implemented method, computer usable program code, and a system for parallelizing a loop. A parameter that will be used to limit parallelization of the loop is identified to limit parallelization of the loop. The parameter specifies a minimum number of loop iterations that a thread should execute. The parameter can be adjusted based on a parallel performance factor. A parallel performance factor is a factor that influences the performance of parallel code. A number of threads from a plurality of threads is selected for processing iterations of the loop based on the parameter. The number of threads is selected prior to execution of the first iteration of the loop.

    摘要翻译: 计算机实现的方法,计算机可用程序代码和用于并行化循环的系统。 确定用于限制环路并联的参数,以限制环路的并行化。 该参数指定线程应执行的最小循环迭代次数。 该参数可以根据并行性能因素进行调整。 并行性能因素是影响并行代码性能的因素。 选择来自多个线程的多个线程用于基于该参数来处理该循环的迭代。 在执行循环的第一次迭代之前选择线程数。

    Pipelined parallelization of multi-dimensional loops with multiple data dependencies
    2.
    发明授权
    Pipelined parallelization of multi-dimensional loops with multiple data dependencies 失效
    多维循环与多个数据依赖关系的流水线并行化

    公开(公告)号:US08146071B2

    公开(公告)日:2012-03-27

    申请号:US11857211

    申请日:2007-09-18

    IPC分类号: G06F9/45

    CPC分类号: G06F8/4452 G06F8/443

    摘要: A mechanism for folding all the data dependencies in a loop into a single, conservative dependence. This mechanism leads to one pair of synchronization primitives per loop. This mechanism does not require complicated, multi-stage compile time analysis. This mechanism considers only the data dependence information in the loop. The low synchronization cost balances the loss in parallelism due to the reduced overlap between iterations. Additionally, a novel scheme is presented to implement required synchronization to enforce data dependences in a DOACROSS loop. The synchronization is based on an iteration vector, which identifies a spatial position in the iteration space of the loop. Multiple iterations executing in parallel have their own iteration vector for synchronization where they update their position in the iteration space. As no sequential updates to the synchronization variable exist, this method exploits a greater degree of parallelism.

    摘要翻译: 将循环中的所有数据依赖关系折叠成单个,保守依赖的机制。 这种机制导致每个循环一对同步原语。 该机制不需要复杂的多阶段编译时分析。 该机制只考虑循环中的数据依赖信息。 由于迭代之间的重叠减少,低同步成本平衡了并行性的损失。 另外,提出了一种新颖的方案来实现所需的同步以在DOACROSS循环中实现数据依赖。 同步基于迭代向量,该向量标识循环的迭代空间中的空间位置。 并行执行的多个迭代具有自己的迭代向量,用于同步,它们更新其在迭代空间中的位置。 由于不存在对同步变量的顺序更新,所以该方法利用更大程度的并行性。

    PIPELINED PARALLELIZATION OF MULTI-DIMENSIONAL LOOPS WITH MULTIPLE DATA DEPENDENCIES
    3.
    发明申请
    PIPELINED PARALLELIZATION OF MULTI-DIMENSIONAL LOOPS WITH MULTIPLE DATA DEPENDENCIES 失效
    具有多个数据依赖关系的多维数据管道并行化

    公开(公告)号:US20090077545A1

    公开(公告)日:2009-03-19

    申请号:US11857211

    申请日:2007-09-18

    IPC分类号: G06F9/45

    CPC分类号: G06F8/4452 G06F8/443

    摘要: A mechanism for folding all the data dependencies in a loop into a single, conservative dependence. This mechanism leads to one pair of synchronization primitives per loop. This mechanism does not require complicated, multi-stage compile time analysis. This mechanism considers only the data dependence information in the loop. The low synchronization cost balances the loss in parallelism due to the reduced overlap between iterations. Additionally, a novel scheme is presented to implement required synchronization to enforce data dependences in a DOACROSS loop. The synchronization is based on an iteration vector, which identifies a spatial position in the iteration space of the loop. Multiple iterations executing in parallel have their own iteration vector for synchronization where they update their position in the iteration space. As no sequential updates to the synchronization variable exist, this method exploits a greater degree of parallelism.

    摘要翻译: 将循环中的所有数据依赖关系折叠成单个,保守依赖的机制。 这种机制导致每个循环一对同步原语。 该机制不需要复杂的多阶段编译时分析。 该机制只考虑循环中的数据依赖信息。 由于迭代之间的重叠减少,低同步成本平衡了并行性的损失。 另外,提出了一种新颖的方案来实现所需的同步以在DOACROSS循环中实现数据依赖。 同步基于迭代向量,该向量标识循环的迭代空间中的空间位置。 并行执行的多个迭代具有自己的迭代向量,用于同步,它们更新其在迭代空间中的位置。 由于不存在对同步变量的顺序更新,所以该方法利用更大程度的并行性。

    Mechanism to restrict parallelization of loops
    4.
    发明申请
    Mechanism to restrict parallelization of loops 失效
    限制环路并行化的机制

    公开(公告)号:US20070169057A1

    公开(公告)日:2007-07-19

    申请号:US11314456

    申请日:2005-12-21

    IPC分类号: G06F9/45

    CPC分类号: G06F8/4452

    摘要: A computer implemented method, computer usable program code, and a system for parallelizing a loop. A parameter that will be used to limit parallelization of the loop is identified to limit parallelization of the loop. The parameter specifies a minimum number of loop iterations that a thread should execute. The parameter can be adjusted based on a parallel performance factor. A parallel performance factor is a factor that influences the performance of parallel code. A number of threads from a plurality of threads is selected for processing iterations of the loop based on the parameter. The number of threads is selected prior to execution of the first iteration of the loop.

    摘要翻译: 计算机实现的方法,计算机可用程序代码和用于并行化循环的系统。 确定用于限制环路并联的参数,以限制环路的并行化。 该参数指定线程应执行的最小循环迭代次数。 该参数可以根据并行性能因素进行调整。 并行性能因素是影响并行代码性能的因素。 选择来自多个线程的多个线程用于基于该参数来处理该循环的迭代。 在执行循环的第一次迭代之前选择线程数。