PIPELINING PROTOCOLS IN MISALIGNED BUFFER CASES
    1.
    Patent application
    PIPELINING PROTOCOLS IN MISALIGNED BUFFER CASES (in force)

    Publication No.: US20110271006A1

    Publication date: 2011-11-03

    Application No.: US12769972

    Filing date: 2010-04-29

    IPC classification: G06F15/16

    CPC classification: G06F15/17318

    Abstract: Systems, methods and articles of manufacture are disclosed for effecting a desired collective operation on a parallel computing system that includes multiple compute nodes. The compute nodes may pipeline multiple collective operations to effect the desired collective operation. To select protocols suitable for the multiple collective operations, the compute nodes may also perform additional collective operations. The compute nodes may pipeline the multiple collective operations and/or the additional collective operations to effect the desired collective operation more efficiently.

    Pipelining protocols in misaligned buffer cases
    2.
    Granted patent
    Pipelining protocols in misaligned buffer cases (in force)

    Publication No.: US08572276B2

    Publication date: 2013-10-29

    Application No.: US12769972

    Filing date: 2010-04-29

    IPC classification: G06F15/16

    CPC classification: G06F15/17318

    Abstract: Systems, methods and articles of manufacture are disclosed for effecting a desired collective operation on a parallel computing system that includes multiple compute nodes. The compute nodes may pipeline multiple collective operations to effect the desired collective operation. To select protocols suitable for the multiple collective operations, the compute nodes may also perform additional collective operations. The compute nodes may pipeline the multiple collective operations and/or the additional collective operations to effect the desired collective operation more efficiently.

    MECHANISM OF SUPPORTING SUB-COMMUNICATOR COLLECTIVES WITH O(64) COUNTERS AS OPPOSED TO ONE COUNTER FOR EACH SUB-COMMUNICATOR
    3.
    Patent application
    MECHANISM OF SUPPORTING SUB-COMMUNICATOR COLLECTIVES WITH O(64) COUNTERS AS OPPOSED TO ONE COUNTER FOR EACH SUB-COMMUNICATOR (in force)

    Publication No.: US20110119468A1

    Publication date: 2011-05-19

    Application No.: US12697164

    Filing date: 2010-01-29

    IPC classification: G06F9/30

    CPC classification: G06F9/50, G06F9/522

    Abstract: A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a barrier algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal to the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.
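
The table lookup described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `CounterTable`, the methods `acquire` and `release`, and the default table size of 64 (read off the O(64) in the title) are all assumptions made for illustration.

```python
MAX_THREADS = 64  # assumption: table size taken from the O(64) in the title


class CounterTable:
    """Hypothetical fixed-size table mapping communicator IDs to counters."""

    def __init__(self, size=MAX_THREADS):
        self.entries = [None] * size  # entries[i] holds the communicator ID using counter i
        self.counters = [0] * size    # one counter per table entry

    def acquire(self, comm_id):
        """Return the counter index for comm_id, claiming a free entry if needed."""
        free = None
        for i, owner in enumerate(self.entries):
            if owner == comm_id:
                return i              # this communicator already has a counter
            if owner is None and free is None:
                free = i              # remember the first free entry
        if free is None:
            raise RuntimeError("no free counter in table")
        self.entries[free] = comm_id  # set the entry when a collective starts
        return free

    def release(self, comm_id):
        """Free the entry once the collective on comm_id has finished."""
        for i, owner in enumerate(self.entries):
            if owner == comm_id:
                self.entries[i] = None
                self.counters[i] = 0
                return
```

Because the table is bounded by the number of threads rather than the number of sub-communicators, the linear search stays small however many sub-communicators the application creates.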

    Mechanism of supporting sub-communicator collectives with O(64) counters as opposed to one counter for each sub-communicator
    4.
    Granted patent
    Mechanism of supporting sub-communicator collectives with O(64) counters as opposed to one counter for each sub-communicator (in force)

    Publication No.: US08527740B2

    Publication date: 2013-09-03

    Application No.: US12697164

    Filing date: 2010-01-29

    IPC classification: G06F9/30

    CPC classification: G06F9/50, G06F9/522

    Abstract: A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a barrier algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal to the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.

    Message passing with a limited number of DMA byte counters
    5.
    Granted patent
    Message passing with a limited number of DMA byte counters (lapsed)

    Publication No.: US08032892B2

    Publication date: 2011-10-04

    Application No.: US11768813

    Filing date: 2007-06-26

    CPC classification: G06F15/17356, G06F9/546

    Abstract: A method for passing messages in a parallel computer system constructed as a plurality of compute nodes interconnected as a network where each compute node includes a DMA engine but includes only a limited number of byte counters for tracking a number of bytes that are sent or received by the DMA engine, where the byte counters may be used in shared counter or exclusive counter modes of operation. The method includes using rendezvous protocol, a source compute node deterministically sending a request to send (RTS) message with a single RTS descriptor using an exclusive injection counter to track both the RTS message and message data to be sent in association with the RTS message, to a destination compute node such that the RTS descriptor indicates to the destination compute node that the message data will be adaptively routed to the destination node. Using one DMA FIFO at the source compute node, the RTS descriptors are maintained for rendezvous messages destined for the destination compute node to ensure proper message data ordering thereat. Using a reception counter at a DMA engine, the destination compute node tracks reception of the RTS and associated message data and sends a clear to send (CTS) message to the source node in a rendezvous protocol form of a remote get to accept the RTS message and message data and processing the remote get (CTS) by the source compute node DMA engine to provide the message data to be sent.
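
The RTS/CTS exchange in this abstract can be sketched as a single-process toy model. Everything here is an assumption for illustration: the `Node` class, its field names, and the `progress` function are hypothetical, and real DMA hardware, adaptive routing, and shared-counter mode are not modeled. The sketch only shows the order of events: RTS descriptors queue in one FIFO, the destination arms a reception counter, and the CTS takes the form of a remote get that pulls the data.

```python
from collections import deque


class Node:
    """Toy compute node; all names here are hypothetical."""

    def __init__(self, name):
        self.name = name
        self.rts_fifo = deque()      # single FIFO keeps RTS descriptors in order
        self.send_buffers = {}       # msg_id -> payload waiting for a remote get
        self.injection_counter = {}  # msg_id -> bytes not yet sent
        self.reception_counter = {}  # msg_id -> bytes not yet received
        self.delivered = []          # payloads in the order they completed

    def post_send(self, msg_id, payload):
        """Source side: queue an RTS and arm an injection counter for the data."""
        self.send_buffers[msg_id] = payload
        self.injection_counter[msg_id] = len(payload)
        self.rts_fifo.append((msg_id, len(payload)))

    def remote_get(self, msg_id):
        """Source side: serve the CTS (a remote get) by handing over the data."""
        payload = self.send_buffers.pop(msg_id)
        self.injection_counter[msg_id] -= len(payload)
        return payload


def progress(source, destination):
    """Drain the source FIFO, completing one rendezvous per RTS."""
    while source.rts_fifo:
        msg_id, nbytes = source.rts_fifo.popleft()
        destination.reception_counter[msg_id] = nbytes  # track expected bytes
        payload = source.remote_get(msg_id)             # CTS as a remote get
        destination.reception_counter[msg_id] -= len(payload)
        destination.delivered.append(payload)
```

A counter that has dropped to zero marks a completed transfer on that side, which is the role the abstract assigns to the injection and reception counters.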

    Data driven dynamic workflow
    6.
    Patent application
    Data driven dynamic workflow (in force)

    Publication No.: US20060229924A1

    Publication date: 2006-10-12

    Application No.: US11100963

    Filing date: 2005-04-07

    IPC classification: G06F9/46

    Abstract: A method, system and article of manufacture for workflow processing and, more particularly, for managing creation and execution of data driven dynamic workflows. One embodiment provides a computer-implemented method for managing execution of workflow instances. The method comprises providing a parent process template and providing a child process template. The child process template is configured to implement an arbitrary number of workflow operations for a given workflow instance, and the parent process template is configured to instantiate child processes on the basis of the child process template to implement a desired workflow. The method further comprises receiving a workflow configuration and instantiating an instance of the workflow on the basis of the workflow configuration. The instantiating comprises instantiating a parent process on the basis of the parent process template and instantiating, by the parent process template, one or more child processes on the basis of the child process template.
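
The parent/child template arrangement in this abstract can be sketched as follows. This is an illustrative reading, not the patented method: `ChildProcess`, `ParentProcess`, `instantiate_workflow`, and the shape of the configuration (a list of lists of callables) are assumptions introduced here.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List


@dataclass
class ChildProcess:
    """Hypothetical child template: runs an arbitrary number of operations."""
    operations: List[Callable[[Any], Any]]

    def run(self, data):
        for op in self.operations:
            data = op(data)
        return data


@dataclass
class ParentProcess:
    """Hypothetical parent template: instantiates and drives child processes."""
    children: List[ChildProcess] = field(default_factory=list)

    def spawn(self, operations):
        child = ChildProcess(list(operations))
        self.children.append(child)
        return child

    def run(self, data):
        for child in self.children:
            data = child.run(data)
        return data


def instantiate_workflow(config):
    """Build a workflow instance from a configuration (assumed: list of op lists)."""
    parent = ParentProcess()
    for operations in config:
        parent.spawn(operations)
    return parent
```

For example, `instantiate_workflow([[f, g], [h]])` yields a parent with two children, and `run(x)` evaluates `h(g(f(x)))`. The workflow's shape comes entirely from the configuration data, which is the sense in which it is data driven.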

    MESSAGE PASSING WITH A LIMITED NUMBER OF DMA BYTE COUNTERS
    7.
    Patent application
    MESSAGE PASSING WITH A LIMITED NUMBER OF DMA BYTE COUNTERS (lapsed)

    Publication No.: US20090007141A1

    Publication date: 2009-01-01

    Application No.: US11768813

    Filing date: 2007-06-26

    IPC classification: G06F9/44

    CPC classification: G06F15/17356, G06F9/546

    Abstract: A method for passing messages in a parallel computer system constructed as a plurality of compute nodes interconnected as a network where each compute node includes a DMA engine but includes only a limited number of byte counters for tracking a number of bytes that are sent or received by the DMA engine, where the byte counters may be used in shared counter or exclusive counter modes of operation. The method includes using rendezvous protocol, a source compute node deterministically sending a request to send (RTS) message with a single RTS descriptor using an exclusive injection counter to track both the RTS message and message data to be sent in association with the RTS message, to a destination compute node such that the RTS descriptor indicates to the destination compute node that the message data will be adaptively routed to the destination node. Using one DMA FIFO at the source compute node, the RTS descriptors are maintained for rendezvous messages destined for the destination compute node to ensure proper message data ordering thereat. Using a reception counter at a DMA engine, the destination compute node tracks reception of the RTS and associated message data and sends a clear to send (CTS) message to the source node in a rendezvous protocol form of a remote get to accept the RTS message and message data and processing the remote get (CTS) by the source compute node DMA engine to provide the message data to be sent.

    Shared address collectives using counter mechanisms
    8.
    Granted patent
    Shared address collectives using counter mechanisms (lapsed)

    Publication No.: US08655962B2

    Publication date: 2014-02-18

    Application No.: US12568115

    Filing date: 2009-09-28

    IPC classification: G06F15/16, G06F15/167

    CPC classification: G06F9/544

    Abstract: A shared address space on a compute node stores data received from a network and data to transmit to the network. The shared address space includes an application buffer that can be directly operated upon by a plurality of processes, for instance, running on different cores on the compute node. A shared counter is used for one or more of signaling arrival of the data across the plurality of processes running on the compute node, signaling completion of an operation performed by one or more of the plurality of processes, obtaining reservation slots by one or more of the plurality of processes, or combinations thereof.
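
Two of the counter uses named in this abstract, obtaining reservation slots and signaling completion, can be sketched with a fetch-and-add counter. This is an illustration under stated assumptions: Python threads stand in for the processes on different cores, a lock stands in for a hardware atomic, and `SharedCounter`, `fetch_add`, and `fill_shared_buffer` are hypothetical names.

```python
import threading


class SharedCounter:
    """Hypothetical atomic counter; the lock stands in for a hardware atomic."""

    def __init__(self):
        self._value = 0
        self._lock = threading.Lock()

    def fetch_add(self, n=1):
        """Add n and return the value held before the addition."""
        with self._lock:
            old = self._value
            self._value += n
            return old

    def value(self):
        with self._lock:
            return self._value


def fill_shared_buffer(num_workers=4, items_per_worker=3):
    """Workers reserve distinct slots in one buffer, then signal completion."""
    buffer = [None] * (num_workers * items_per_worker)  # shared application buffer
    reserve = SharedCounter()  # hands out reservation slots
    done = SharedCounter()     # counts workers that have finished

    def worker(worker_id):
        for k in range(items_per_worker):
            slot = reserve.fetch_add()     # each call yields a unique slot index
            buffer[slot] = (worker_id, k)  # write directly into the shared buffer
        done.fetch_add()                   # signal completion of this worker

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return buffer, done.value()
```

Since every `fetch_add` returns a distinct value, no two workers write the same slot, and a reader can tell the whole operation is finished once the completion counter equals the number of workers.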

    SHARED ADDRESS COLLECTIVES USING COUNTER MECHANISMS
    9.
    Patent application
    SHARED ADDRESS COLLECTIVES USING COUNTER MECHANISMS (lapsed)

    Publication No.: US20110078249A1

    Publication date: 2011-03-31

    Application No.: US12568115

    Filing date: 2009-09-28

    IPC classification: G06F15/16

    CPC classification: G06F9/544

    Abstract: A shared address space on a compute node stores data received from a network and data to transmit to the network. The shared address space includes an application buffer that can be directly operated upon by a plurality of processes, for instance, running on different cores on the compute node. A shared counter is used for one or more of signaling arrival of the data across the plurality of processes running on the compute node, signaling completion of an operation performed by one or more of the plurality of processes, obtaining reservation slots by one or more of the plurality of processes, or combinations thereof.
