Patent search: ap:("Michael A. Blocksome" OR "Daniel A. Faraj") AND inv:"Michael A. Blocksome" (page 1)

1.

Patent application
Performing A Local Reduction Operation On A Parallel Computer [Lapsed]

Publication number: US20120317399A1

Publication date: 2012-12-13

Application number: US13585993

Filing date: 2012-08-15

Applicants: Michael A. Blocksome, Daniel A. Faraj

Inventors: Michael A. Blocksome, Daniel A. Faraj

IPC classification: G06F15/76, G06F15/16, G06F9/02, G06F12/00

CPC classification: G06F15/17387, G06F15/17318

Abstract: A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.
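The abstract above is terse, so a small sketch may help picture the data movement. The following is only one possible reading, not the patented method: the function names, the chunk size, the use of addition as the reduction operator, and the rule that core 0 handles even-numbered chunks and core 1 odd-numbered chunks are all assumptions made for illustration. The two reduction cores are simulated by a sequential loop.

```python
from operator import add


def interleave(buf0, buf1, chunk):
    """Copy two equal-length buffers into one buffer, alternating chunks."""
    out = []
    for start in range(0, len(buf0), chunk):
        out.extend(buf0[start:start + chunk])  # chunk written by core 0
        out.extend(buf1[start:start + chunk])  # chunk written by core 1
    return out


def local_reduce(red0, red1, net_write, net_read, chunk=2, op=add):
    """Element-wise reduction of four per-core input buffers (hypothetical)."""
    n = len(red0)
    if not (len(red1) == len(net_write) == len(net_read) == n):
        raise ValueError("all input buffers must have the same length")

    # "Shared memory": the interleaved buffer plus copies of the two
    # network cores' input buffers.
    interleaved = interleave(red0, red1, chunk)
    shared_write = list(net_write)  # assumed to be copied by core 0
    shared_read = list(net_read)    # assumed to be copied by core 1

    own = (red0, red1)
    result = [None] * n
    for core in (0, 1):  # on real hardware these would run concurrently
        other = 1 - core
        for k, start in enumerate(range(0, n, chunk)):
            if k % 2 != core:
                continue  # this chunk belongs to the other core
            size = min(chunk, n - start)
            # Offset of the other core's copy of chunk k in the interleaved buffer.
            base = 2 * start + (size if other == 1 else 0)
            for j in range(size):
                value = op(own[core][start + j], interleaved[base + j])
                value = op(value, shared_write[start + j])
                value = op(value, shared_read[start + j])
                result[start + j] = value
    return result
```

Under these assumptions each reduction core reads its own input buffer directly and fetches only the other core's chunks from the interleaved buffer, which is one way to read "every other interleaved chunk".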

2.

Patent application
Performing A Local Reduction Operation On A Parallel Computer [Lapsed]

Publication number: US20110258245A1

Publication date: 2011-10-20

Application number: US12760020

Filing date: 2010-04-14

Applicants: Michael A. Blocksome, Daniel A. Faraj

Inventors: Michael A. Blocksome, Daniel A. Faraj

IPC classification: G06F15/76, G06F15/16, G06F9/02, G06F12/06

CPC classification: G06F15/17387, G06F15/17318

Abstract: A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.

3.

Granted patent
Performing a local reduction operation on a parallel computer [Lapsed]

Publication number: US08332460B2

Publication date: 2012-12-11

Application number: US12760020

Filing date: 2010-04-14

Applicants: Michael A. Blocksome, Daniel A. Faraj

Inventors: Michael A. Blocksome, Daniel A. Faraj

IPC classification: G06F15/76, G06F15/16, G06F9/02, G06F12/06

CPC classification: G06F15/17387, G06F15/17318

Abstract: A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.

4.

Granted patent
Performing a local reduction operation on a parallel computer [Lapsed]

Publication number: US08458244B2

Publication date: 2013-06-04

Application number: US13585993

Filing date: 2012-08-15

Applicants: Michael A. Blocksome, Daniel A. Faraj

Inventors: Michael A. Blocksome, Daniel A. Faraj

IPC classification: G06F15/76, G06F15/16, G06F9/02, G06F12/00

CPC classification: G06F15/17387, G06F15/17318

Abstract: A parallel computer including compute nodes, each including two reduction processing cores, a network write processing core, and a network read processing core, each processing core assigned an input buffer. Copying, in interleaved chunks by the reduction processing cores, contents of the reduction processing cores' input buffers to an interleaved buffer in shared memory; copying, by one of the reduction processing cores, contents of the network write processing core's input buffer to shared memory; copying, by another of the reduction processing cores, contents of the network read processing core's input buffer to shared memory; and locally reducing in parallel by the reduction processing cores: the contents of the reduction processing core's input buffer; every other interleaved chunk of the interleaved buffer; the copied contents of the network write processing core's input buffer; and the copied contents of the network read processing core's input buffer.

5.

Granted patent
Parallel, low-latency method for high-performance speculative globally-large element extraction from distributed, sorted arrays [In force]

Publication number: US09811316B2

Publication date: 2017-11-07

Application number: US11758703

Filing date: 2007-06-06

Applicants: Charles J. Archer, Michael A. Blocksome, Joseph D. Ratterman, Brian Smith

Inventors: Charles J. Archer, Michael A. Blocksome, Joseph D. Ratterman, Brian Smith

IPC classification: G06F7/36, G06F9/38, G06F15/80, G06F19/16, G06F9/30, G06F19/28

CPC classification: G06F7/36, G06F9/30021, G06F9/30032, G06F9/3885, G06F15/80, G06F19/16, G06F19/28

Abstract: The present invention provides a system and method for extracting elements from distributed arrays on a parallel processing system. The system includes a module that populates a result array with globally largest elements from input arrays, a module that generates a partition element, a module that counts the number of local elements greater than the partition element, and a module that determines the globally largest elements. The method for extracting elements from distributed arrays on a parallel processing system includes populating a result array with globally largest elements from input arrays, generating a partition element, counting the number of local elements greater than the partition and determining the globally largest elements.
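The partition-and-count idea in this abstract can be illustrated with a small sketch. It is not the patented method (the speculative and low-latency aspects are not modeled); it assumes integer elements, each node holding an ascending sorted list, and a plain Python `sum` standing in for a global sum across nodes. The names `count_greater` and `k_largest` are hypothetical.

```python
from bisect import bisect_right


def count_greater(local_sorted, pivot):
    """Count elements of one node's ascending array that exceed pivot."""
    return len(local_sorted) - bisect_right(local_sorted, pivot)


def k_largest(node_arrays, k):
    """Return the k globally largest integers, in descending order."""
    arrays = [a for a in node_arrays if a]
    total = sum(len(a) for a in arrays)
    k = min(k, total)
    if k <= 0:
        return []

    lo = min(a[0] for a in arrays)
    hi = max(a[-1] for a in arrays)
    # Binary search for the smallest partition element that has fewer
    # than k elements above it; that value is the k-th largest element.
    while lo < hi:
        pivot = (lo + hi) // 2
        above = sum(count_greater(a, pivot) for a in arrays)  # "global" count
        if above < k:
            hi = pivot
        else:
            lo = pivot + 1
    pivot = lo

    # Each node contributes its elements above the partition element;
    # ties at the partition value fill the remaining slots.
    result = []
    for a in arrays:
        n = count_greater(a, pivot)
        if n:
            result.extend(a[len(a) - n:])
    result.extend([pivot] * (k - len(result)))
    return sorted(result, reverse=True)
```

In this sketch every round exchanges only one count per node, never the arrays themselves, which is the property the counting step is meant to show.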

6.

Granted patent
Routing data communications packets in a parallel computer [In force]

Publication number: US09569398B2

Publication date: 2017-02-14

Application number: US12892192

Filing date: 2010-09-28

Applicants: Charles J. Archer, Michael A. Blocksome, Todd A. Inglett, Joseph D. Ratterman, Brian E. Smith

Inventors: Charles J. Archer, Michael A. Blocksome, Todd A. Inglett, Joseph D. Ratterman, Brian E. Smith

IPC classification: H04L12/701, H04L29/08, G06F9/455, G06F15/173, H04L12/781

CPC classification: G06F15/17312, G06F9/45558, H04L45/52, H04L67/327

Abstract: Routing data communications packets in a parallel computer that includes compute nodes organized for collective operations. Each compute node including an operating system kernel and a system-level messaging module that is a module of automated computing machinery that exposes a messaging interface to applications. Each compute node including a routing table that specifies, for each of a multiplicity of route identifiers, a data communications path through the compute node. Including to carry out the steps of: receiving in a compute node a data communications packet that includes a route identifier value; retrieving from the routing table a specification of a data communications path through the compute node; and routing, by the compute node, the data communications packet according to the data communications path identified by the compute node's routing table entry for the data communications packet's route identifier value.
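The three steps named in this abstract (receive a packet carrying a route identifier, look up the path in the node's routing table, forward along that path) can be pictured with a minimal sketch. The `Packet` class, the link names, and the table layout are assumptions for illustration only and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Packet:
    route_id: int    # route identifier value carried by the packet
    payload: bytes


def route(packet, routing_table):
    """Return (outbound link, payload) pairs for one packet at one node."""
    try:
        out_links = routing_table[packet.route_id]
    except KeyError:
        raise ValueError(f"no route for identifier {packet.route_id}") from None
    return [(link, packet.payload) for link in out_links]


# Hypothetical per-node table: route identifier -> outbound links.
table = {
    0: ("parent",),                     # e.g. traffic flowing toward a root
    1: ("child_left", "child_right"),   # e.g. traffic fanning out to children
}
forwarded = route(Packet(route_id=1, payload=b"data"), table)
```

Because each node consults only its own table, the same route identifier can map to different links on different nodes.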

7.

Granted patent
Fencing network direct memory access data transfers in a parallel active messaging interface of a parallel computer [In force]

Publication number: US09075759B2

Publication date: 2015-07-07

Application number: US12940282

Filing date: 2010-11-05

Applicants: Michael A. Blocksome, Amith R. Mamidala

Inventors: Michael A. Blocksome, Amith R. Mamidala

IPC classification: G06F15/16, G06F15/173, G06F7/38, H04L29/06, G06F9/54

CPC classification: G06F15/16, G06F9/544, G06F9/546, G06F15/17331, H04L67/40

Abstract: Fencing direct memory access (‘DMA’) data transfers in a parallel active messaging interface (‘PAMI’) of a parallel computer, the PAMI including data communications endpoints, each endpoint including specifications of a client, a context, and a task, the endpoints coupled for data communications through the PAMI and through DMA controllers operatively coupled to a deterministic data communications network through which the DMA controllers deliver data communications deterministically, including initiating execution through the PAMI of an ordered sequence of active DMA instructions for DMA data transfers between two endpoints, effecting deterministic DMA data transfers through a DMA controller and the deterministic data communications network; and executing through the PAMI, with no FENCE accounting for DMA data transfers, an active FENCE instruction, the FENCE instruction completing execution only after completion of all DMA instructions initiated prior to execution of the FENCE instruction for DMA data transfers between the two endpoints.
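A toy model may make the "no FENCE accounting" point easier to see: if instructions between two endpoints are delivered strictly in issue order, a FENCE can simply travel in the same queue and needs no per-transfer counters. The class below is a hypothetical illustration of that ordering argument only; it does not use or describe the PAMI API.

```python
from collections import deque


class OrderedChannel:
    """Toy in-order channel between two endpoints (hypothetical model)."""

    def __init__(self):
        self._queue = deque()
        self.completed = []  # names of transfers finished so far

    def put(self, name):
        """Issue a data transfer instruction."""
        self._queue.append(("PUT", name, None))

    def fence(self, on_done):
        """Issue a FENCE; on_done is called when the FENCE completes."""
        self._queue.append(("FENCE", None, on_done))

    def advance(self):
        """Complete exactly one queued instruction, in issue order."""
        kind, name, callback = self._queue.popleft()
        if kind == "PUT":
            self.completed.append(name)
        else:
            # Everything issued before the FENCE has already left the
            # queue, so no counting of outstanding transfers is needed.
            callback(list(self.completed))

    def drain(self):
        """Complete every queued instruction."""
        while self._queue:
            self.advance()
```

In this model the callback passed to `fence` always observes every transfer issued before it and none issued after it.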

8.

Granted patent
Managing internode data communications for an uninitialized process in a parallel computer [In force]

Publication number: US08732725B2

Publication date: 2014-05-20

Application number: US13292293

Filing date: 2011-11-09

Applicants: Charles J. Archer, Michael A. Blocksome, Douglas R. Miller, Jeffrey J. Parker, Joseph D. Ratterman, Brian E. Smith

Inventors: Charles J. Archer, Michael A. Blocksome, Douglas R. Miller, Jeffrey J. Parker, Joseph D. Ratterman, Brian E. Smith

IPC classification: G06F3/00, G06F9/44, G06F9/46, G06F13/00

CPC classification: G06F15/167, G06F9/544, G06F15/17318

Abstract: A parallel computer includes nodes, each having main memory and a messaging unit (MU). Each MU includes computer memory, which in turn includes, MU message buffers. Each MU message buffer is associated with an uninitialized process on the compute node. In the parallel computer, managing internode data communications for an uninitialized process includes: receiving, by an MU of a compute node, one or more data communications messages in an MU message buffer associated with an uninitialized process on the compute node; determining, by an application agent, that the MU message buffer associated with the uninitialized process is full prior to initialization of the uninitialized process; establishing, by the application agent, a temporary message buffer for the uninitialized process in main computer memory; and moving, by the application agent, data communications messages from the MU message buffer associated with the uninitialized process to the temporary message buffer in main computer memory.
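The buffer hand-off described in this abstract can be sketched as follows. This is a simplified model under stated assumptions: the class names `MessagingUnitBuffer` and `ApplicationAgent`, the fixed slot capacity, and the process identifier used as a dictionary key are all hypothetical, and a Python list stands in for the temporary buffer in main memory.

```python
from collections import deque


class MessagingUnitBuffer:
    """Fixed-capacity message buffer inside the messaging unit (toy model)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.slots = deque()

    def is_full(self):
        return len(self.slots) >= self.capacity

    def receive(self, message):
        if self.is_full():
            raise RuntimeError("MU message buffer is full")
        self.slots.append(message)


class ApplicationAgent:
    """Moves messages to main memory while the target process is uninitialized."""

    def __init__(self):
        self.temp_buffers = {}  # process id -> temporary buffer in main memory

    def relieve(self, process_id, mu_buffer):
        """If the MU buffer is full, move its messages to a temporary buffer."""
        if not mu_buffer.is_full():
            return False
        temp = self.temp_buffers.setdefault(process_id, [])
        while mu_buffer.slots:
            temp.append(mu_buffer.slots.popleft())  # preserves arrival order
        return True
```

After `relieve` returns `True`, the MU buffer is empty again and can keep receiving messages until the process initializes and collects the saved ones.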

9.

Patent application
Utilizing A Kernel Administration Hardware Thread Of A Multi-Threaded, Multi-Core Compute Node Of A Parallel Computer [In force]

Publication number: US20140047450A1

Publication date: 2014-02-13

Application number: US13569275

Filing date: 2012-08-08

Applicants: Michael A. Blocksome, Todd A. Inglett, Patrick J. McCarthy, Joseph D. Ratterman, Brian E. Smith

Inventors: Michael A. Blocksome, Todd A. Inglett, Patrick J. McCarthy, Joseph D. Ratterman, Brian E. Smith

IPC classification: G06F9/46

CPC classification: G06F9/544

Abstract: Methods, apparatuses, and computer program products for utilizing a kernel administration hardware thread of a multi-threaded, multi-core compute node of a parallel computer are provided. Embodiments include a kernel assigning a memory space of a hardware thread of an application processing core to a kernel administration hardware thread of a kernel processing core. A kernel administration hardware thread is configured to advance the hardware thread to a next memory space associated with the hardware thread in response to the assignment of the kernel administration hardware thread to the memory space of the hardware thread. Embodiments also include the kernel administration hardware thread executing an instruction within the assigned memory space.

10.

Granted patent
Compiling software for a hierarchical distributed processing system [Lapsed]

Publication number: US08621446B2

Publication date: 2013-12-31

Application number: US12770353

Filing date: 2010-04-29

Applicants: Charles J. Archer, Michael A. Blocksome, Joseph D. Ratterman, Brian E. Smith

Inventors: Charles J. Archer, Michael A. Blocksome, Joseph D. Ratterman, Brian E. Smith

IPC classification: G06F9/45

CPC classification: G06F8/45

Abstract: Compiling software for a hierarchical distributed processing system including providing to one or more compiling nodes software to be compiled, wherein at least a portion of the software to be compiled is to be executed by one or more other nodes; compiling, by the compiling node, the software; maintaining, by the compiling node, any compiled software to be executed on the compiling node; selecting, by the compiling node, one or more nodes in a next tier of the hierarchy of the distributed processing system in dependence upon whether any compiled software is for the selected node or the selected node's descendants; sending to the selected node only the compiled software to be executed by the selected node or selected node's descendant.
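The keep-and-forward rule in this abstract can be sketched as a recursive walk over a tree of nodes. The compilation step itself is omitted; the sketch assumes the compiled artifacts already exist as a mapping from target node to artifact, and the names `descendants`, `distribute`, and the example tree are hypothetical.

```python
def descendants(tree, node):
    """Return the set containing node and every node below it."""
    found = {node}
    for child in tree.get(node, []):
        found |= descendants(tree, child)
    return found


def distribute(tree, node, compiled, kept=None, sent=None):
    """Keep artifacts for this node; forward the rest down the hierarchy.

    tree:     dict mapping a node to the list of its children
    compiled: dict mapping a target node to its compiled artifact
    Returns (kept, sent), where sent logs (receiver, targets) per message.
    """
    if kept is None:
        kept = {}
    if sent is None:
        sent = []
    if node in compiled:
        kept[node] = compiled[node]  # maintained on the node that runs it
    for child in tree.get(node, []):
        subtree = descendants(tree, child)
        bundle = {t: a for t, a in compiled.items() if t in subtree}
        if bundle:  # select only children that need something
            sent.append((child, sorted(bundle)))
            distribute(tree, child, bundle, kept, sent)
    return kept, sent


# Hypothetical three-tier hierarchy.
tree = {"root": ["a", "b"], "a": ["a1"], "b": []}
compiled = {"root": "root.bin", "a1": "a1.bin"}
kept, sent = distribute(tree, "root", compiled)
```

Here node `b` is never selected because nothing is compiled for it or its descendants, while node `a` receives only the artifact destined for `a1`.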
