Patent search: ap:"Michael Blocksome", page 1

1.

Granted patent
Shared address collectives using counter mechanisms (status: lapsed)

Publication number: US08655962B2

Publication date: 2014-02-18

Application number: US12568115

Filing date: 2009-09-28

Applicants: Michael Blocksome , Gabor Dozsa , Thomas M. Gooding , Philip Heidelberger , Sameer Kumar , Amith R. Mamidala , Douglas Miller

Inventors: Michael Blocksome , Gabor Dozsa , Thomas M. Gooding , Philip Heidelberger , Sameer Kumar , Amith R. Mamidala , Douglas Miller

IPC classification: G06F15/16 , G06F15/167

CPC classification: G06F9/544

Abstract: A shared address space on a compute node stores data received from a network and data to transmit to the network. The shared address space includes an application buffer that can be directly operated upon by a plurality of processes, for instance, running on different cores on the compute node. A shared counter is used for one or more of signaling arrival of the data across the plurality of processes running on the compute node, signaling completion of an operation performed by one or more of the plurality of processes, obtaining reservation slots by one or more of the plurality of processes, or combinations thereof.
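The three counter roles named in this abstract (arrival signaling, completion signaling, slot reservation) can be illustrated with a small sketch. It is not the patented implementation: Python threads stand in for processes on different cores, a lock-protected integer stands in for a shared hardware or shared-memory counter, and the names `SharedCounter`, `fetch_add`, `wait_at_least`, and `run_demo` are hypothetical.

```python
import threading


class SharedCounter:
    """Lock-protected integer standing in for a shared atomic counter (assumption)."""

    def __init__(self, value=0):
        self._value = value
        self._cond = threading.Condition()

    def fetch_add(self, n=1):
        """Add n and return the previous value, waking any waiters."""
        with self._cond:
            old = self._value
            self._value += n
            self._cond.notify_all()
            return old

    def wait_at_least(self, target):
        """Block until the counter has reached target."""
        with self._cond:
            self._cond.wait_for(lambda: self._value >= target)
            return self._value


def run_demo(num_workers=4, chunk=8):
    buffer = [0] * (num_workers * chunk)  # shared application buffer
    arrived = SharedCounter()  # counts items delivered by the "network"
    slots = SharedCounter()    # hands out reservation slots
    done = SharedCounter()     # counts workers that have finished

    def worker():
        slot = slots.fetch_add(1)            # reserve a unique slot
        lo, hi = slot * chunk, (slot + 1) * chunk
        arrived.wait_at_least(hi)            # wait until this slot's data has arrived
        for i in range(lo, hi):
            buffer[i] *= 2                   # operate directly on the shared buffer
        done.fetch_add(1)                    # signal completion

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for i in range(len(buffer)):             # simulated in-order network delivery
        buffer[i] = i + 1
        arrived.fetch_add(1)                 # signal arrival after the write
    done.wait_at_least(num_workers)
    for t in threads:
        t.join()
    return buffer
```

Each worker touches only its reserved range, and only after the arrival counter shows that range has been written, so no further locking of the buffer is needed in this sketch.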

2.

Granted patent
Mechanism of supporting sub-communicator collectives with O(64) counters as opposed to one counter for each sub-communicator (status: in force)

Publication number: US08527740B2

Publication date: 2013-09-03

Application number: US12697164

Filing date: 2010-01-29

Applicants: Sameer Kumar , Amith R. Mamidala , Joseph D. Ratterman , Michael Blocksome , Douglas Miller

Inventors: Sameer Kumar , Amith R. Mamidala , Joseph D. Ratterman , Michael Blocksome , Douglas Miller

IPC classification: G06F9/30

CPC classification: G06F9/50 , G06F9/522

Abstract: A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a barrier algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal to the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.
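The table lookup described in this abstract can be sketched as follows. This is only the bookkeeping, run single-threaded, under the assumption that a counter is released once every member of a communicator has arrived; it does not block or wake threads as a real barrier would, and `CounterTable`, `acquire`, and `arrive` are hypothetical names.

```python
class CounterTable:
    """Fixed pool of counters shared by many communicators (illustrative only)."""

    def __init__(self, max_threads=64):
        # One table entry and one counter per possible thread.
        self.owner = [None] * max_threads  # communicator ID using each entry
        self.count = [0] * max_threads     # arrivals recorded for that entry

    def acquire(self, comm_id):
        """Find the entry tagged with comm_id, or tag the first free entry."""
        for i, owner in enumerate(self.owner):
            if owner == comm_id:
                return i
        for i, owner in enumerate(self.owner):
            if owner is None:
                self.owner[i] = comm_id
                return i
        raise RuntimeError("no free counter")

    def arrive(self, comm_id, group_size):
        """Record one arrival; return True when the whole group has arrived."""
        i = self.acquire(comm_id)
        self.count[i] += 1
        if self.count[i] == group_size:
            # Barrier complete: free the entry for reuse by another communicator.
            self.owner[i] = None
            self.count[i] = 0
            return True
        return False
```

Because at most one collective per thread can be in flight, a table sized to the maximum thread count is enough in this sketch, however many communicators exist.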

3.

Patent application
SHARED ADDRESS COLLECTIVES USING COUNTER MECHANISMS (status: lapsed)

Publication number: US20110078249A1

Publication date: 2011-03-31

Application number: US12568115

Filing date: 2009-09-28

Applicants: Michael Blocksome , Gabor Dozsa , Thomas M. Gooding , Philip Heidelberger , Sameer Kumar , Amith R. Mamidala , Douglas Miller

Inventors: Michael Blocksome , Gabor Dozsa , Thomas M. Gooding , Philip Heidelberger , Sameer Kumar , Amith R. Mamidala , Douglas Miller

IPC classification: G06F15/16

CPC classification: G06F9/544

Abstract: A shared address space on a compute node stores data received from a network and data to transmit to the network. The shared address space includes an application buffer that can be directly operated upon by a plurality of processes, for instance, running on different cores on the compute node. A shared counter is used for one or more of signaling arrival of the data across the plurality of processes running on the compute node, signaling completion of an operation performed by one or more of the plurality of processes, obtaining reservation slots by one or more of the plurality of processes, or combinations thereof.

4.

Granted patent
Message passing with a limited number of DMA byte counters (status: lapsed)

Publication number: US08032892B2

Publication date: 2011-10-04

Application number: US11768813

Filing date: 2007-06-26

Applicants: Michael Blocksome , Dong Chen , Mark E. Giampapa , Philip Heidelberger , Sameer Kumar , Jeffrey J. Parker

Inventors: Michael Blocksome , Dong Chen , Mark E. Giampapa , Philip Heidelberger , Sameer Kumar , Jeffrey J. Parker

IPC classification: G06F9/44 , G06F9/46 , G06F13/00 , G06F15/167

CPC classification: G06F15/17356 , G06F9/546

Abstract: A method for passing messages in a parallel computer system constructed as a plurality of compute nodes interconnected as a network where each compute node includes a DMA engine but includes only a limited number of byte counters for tracking a number of bytes that are sent or received by the DMA engine, where the byte counters may be used in shared counter or exclusive counter modes of operation. The method includes using rendezvous protocol, a source compute node deterministically sending a request to send (RTS) message with a single RTS descriptor using an exclusive injection counter to track both the RTS message and message data to be sent in association with the RTS message, to a destination compute node such that the RTS descriptor indicates to the destination compute node that the message data will be adaptively routed to the destination node. Using one DMA FIFO at the source compute node, the RTS descriptors are maintained for rendezvous messages destined for the destination compute node to ensure proper message data ordering thereat. Using a reception counter at a DMA engine, the destination compute node tracks reception of the RTS and associated message data and sends a clear to send (CTS) message to the source node in a rendezvous protocol form of a remote get to accept the RTS message and message data and processing the remote get (CTS) by the source compute node DMA engine to provide the message data to be sent.
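The role of the reception byte counter in this rendezvous exchange can be shown with a simplified, single-process simulation. It assumes the counter is initialized to the message length announced in the RTS and decremented per packet; the DMA engine, FIFOs, and network are not modeled, shuffling stands in for adaptive routing, and `rendezvous_transfer` and its parameters are hypothetical.

```python
import random


def rendezvous_transfer(payload: bytes, packet_size: int = 4, seed: int = 0) -> bytes:
    # Source -> destination: RTS descriptor announcing the message length.
    rts = {"length": len(payload)}

    # Destination: arm a reception byte counter with the expected byte count.
    reception_counter = rts["length"]
    recv = bytearray(rts["length"])

    # Destination -> source: CTS in the form of a remote get. The source
    # answers by splitting the payload into (offset, data) packets.
    packets = [
        (off, payload[off:off + packet_size])
        for off in range(0, len(payload), packet_size)
    ]

    # Adaptive routing may deliver packets out of order; simulate with a shuffle.
    random.Random(seed).shuffle(packets)

    for off, data in packets:
        recv[off:off + len(data)] = data   # place data at its offset
        reception_counter -= len(data)     # count down the received bytes

    # A counter of zero means every byte arrived, whatever the arrival order.
    if reception_counter != 0:
        raise RuntimeError("transfer incomplete")
    return bytes(recv)
```

The point of the sketch is that completion is detected from the byte count alone, so the destination does not need packets to arrive in order.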

5.

Patent application
MECHANISM OF SUPPORTING SUB-COMMUNICATOR COLLECTIVES WITH O(64) COUNTERS AS OPPOSED TO ONE COUNTER FOR EACH SUB-COMMUNICATOR (status: in force)

Publication number: US20110119468A1

Publication date: 2011-05-19

Application number: US12697164

Filing date: 2010-01-29

Applicants: Sameer Kumar , Amith R. Mamidala , Joseph D. Ratterman , Michael Blocksome , Douglas Miller

Inventors: Sameer Kumar , Amith R. Mamidala , Joseph D. Ratterman , Michael Blocksome , Douglas Miller

IPC classification: G06F9/30

CPC classification: G06F9/50 , G06F9/522

Abstract: A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a barrier algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal to the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.

6.

Patent application
PIPELINING PROTOCOLS IN MISALIGNED BUFFER CASES (status: in force)

Publication number: US20110271006A1

Publication date: 2011-11-03

Application number: US12769972

Filing date: 2010-04-29

Applicants: Charles J. Archer , Michael Blocksome , Bob R. Cernohous , Joseph D. Ratterman , Brian E. Smith

Inventors: Charles J. Archer , Michael Blocksome , Bob R. Cernohous , Joseph D. Ratterman , Brian E. Smith

IPC classification: G06F15/16

CPC classification: G06F15/17318

Abstract: Systems, methods and articles of manufacture are disclosed for effecting a desired collective operation on a parallel computing system that includes multiple compute nodes. The compute nodes may pipeline multiple collective operations to effect the desired collective operation. To select protocols suitable for the multiple collective operations, the compute nodes may also perform additional collective operations. The compute nodes may pipeline the multiple collective operations and/or the additional collective operations to effect the desired collective operation more efficiently.

7.

Granted patent
Pipelining protocols in misaligned buffer cases (status: in force)

Publication number: US08572276B2

Publication date: 2013-10-29

Application number: US12769972

Filing date: 2010-04-29

Applicants: Charles J. Archer , Michael Blocksome , Bob R. Cernohous , Joseph D. Ratterman , Brian E. Smith

Inventors: Charles J. Archer , Michael Blocksome , Bob R. Cernohous , Joseph D. Ratterman , Brian E. Smith

IPC classification: G06F15/16

CPC classification: G06F15/17318

Abstract: Systems, methods and articles of manufacture are disclosed for effecting a desired collective operation on a parallel computing system that includes multiple compute nodes. The compute nodes may pipeline multiple collective operations to effect the desired collective operation. To select protocols suitable for the multiple collective operations, the compute nodes may also perform additional collective operations. The compute nodes may pipeline the multiple collective operations and/or the additional collective operations to effect the desired collective operation more efficiently.

8.

Patent application
Data driven dynamic workflow (status: in force)

Publication number: US20060229924A1

Publication date: 2006-10-12

Application number: US11100963

Filing date: 2005-04-07

Applicants: Melissa Aron , Michael Blocksome , David Herbeck , Todd Johnson

Inventors: Melissa Aron , Michael Blocksome , David Herbeck , Todd Johnson

IPC classification: G06F9/46

CPC classification: G06Q10/10 , G06Q10/06316 , G06Q10/0633

Abstract: A method, system and article of manufacture for workflow processing and, more particularly, for managing creation and execution of data driven dynamic workflows. One embodiment provides a computer-implemented method for managing execution of workflow instances. The method comprises providing a parent process template and providing a child process template. The child process template is configured to implement an arbitrary number of workflow operations for a given workflow instance, and the parent process template is configured to instantiate child processes on the basis of the child process template to implement a desired workflow. The method further comprises receiving a workflow configuration and instantiating an instance of the workflow on the basis of the workflow configuration. The instantiating comprises instantiating a parent process on the basis of the parent process template and instantiating, by the parent process template, one or more child processes on the basis of the child process template.

9.

Patent application
PASSING NON-ARCHITECTED REGISTERS VIA A CALLBACK/ADVANCE MECHANISM IN A SIMULATOR ENVIRONMENT (status: in force)

Publication number: US20110288848A1

Publication date: 2011-11-24

Application number: US12785141

Filing date: 2010-05-21

Applicants: Charles J. ARCHER , Michael BLOCKSOME , Joseph D. RATTERMAN , Brian E. SMITH

Inventors: Charles J. ARCHER , Michael BLOCKSOME , Joseph D. RATTERMAN , Brian E. SMITH

IPC classification: G06F9/44

CPC classification: G06F11/3409 , G06F11/3457 , G06F2201/88

Abstract: Embodiments of the invention provide a method of calculating performance counter data for a computer simulator, while minimizing the performance costs associated with cycle-accurate simulation. A callback may be associated with the instructions of a user program and, when the instructions are executed, the associated callbacks may be executed as well. Upon execution, the callbacks may calculate performance counter data related to the associated instruction.
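The callback idea in this abstract can be illustrated with a toy functional simulator. It is a sketch under assumptions, not the patented design: instructions are plain `(opcode, operand)` tuples, functional execution is omitted, and `Simulator`, `register`, `run`, and `count_load` are hypothetical names.

```python
from collections import Counter


class Simulator:
    """Toy functional simulator that updates performance counters via callbacks."""

    def __init__(self):
        self.callbacks = {}    # opcode -> list of callbacks
        self.perf = Counter()  # performance counter data

    def register(self, opcode, callback):
        """Associate a callback with every instruction that has this opcode."""
        self.callbacks.setdefault(opcode, []).append(callback)

    def run(self, program):
        for opcode, operand in program:
            # Functional execution of the instruction would happen here.
            for callback in self.callbacks.get(opcode, []):
                callback(self.perf, operand)  # callback updates the counters
        return self.perf


def count_load(perf, nbytes):
    """Example callback: count load instructions and the bytes they move."""
    perf["loads"] += 1
    perf["bytes_loaded"] += nbytes
```

Only instructions with a registered callback pay any bookkeeping cost, which is the contrast with cycle-accurate simulation that the abstract draws.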

10.

Patent application
COMBINING MULTIPLE HARDWARE NETWORKS TO ACHIEVE LOW-LATENCY HIGH-BANDWIDTH POINT-TO-POINT COMMUNICATION (status: lapsed)

Publication number: US20110270942A1

Publication date: 2011-11-03

Application number: US12768804

Filing date: 2010-04-28

Applicants: CHARLES J. ARCHER , MICHAEL BLOCKSOME , JOSEPH D. RATTERMAN , BRIAN E. SMITH

Inventors: CHARLES J. ARCHER , MICHAEL BLOCKSOME , JOSEPH D. RATTERMAN , BRIAN E. SMITH

IPC classification: G06F15/167

CPC classification: G06F15/17318

Abstract: Systems, methods and articles of manufacture are disclosed for performing a collective operation on a parallel computing system that includes multiple compute nodes and multiple networks connecting the compute nodes. Each of the networks may have different characteristics. A source node may broadcast a DMA descriptor over a first network to a target node, to initialize the collective operation. The target node may perform the collective operation over a second network and using the broadcast DMA descriptor.
