Optimizing collective operations
    3.
    发明授权
    Optimizing collective operations 有权
    优化集体经营

    公开(公告)号:US09424087B2

    公开(公告)日:2016-08-23

    申请号:US12770286

    申请日:2010-04-29

    IPC分类号: G06F15/173 G06F9/50 H04L29/08

    CPC分类号: G06F9/5011 H04L67/10

    摘要: Optimizing collective operations including receiving an instruction to perform a collective operation type; selecting an optimized collective operation for the collective operation type; performing the selected optimized collective operation; determining whether a resource needed by the one or more nodes to perform the collective operation is not available; if a resource needed by the one or more nodes to perform the collective operation is not available: notifying the other nodes that the resource is not available; selecting a next optimized collective operation; and performing the next optimized collective operation.

    摘要翻译: 优化集体经营,包括收取执行集体经营类型的指示; 为集体经营类型选择优化集体经营; 执行选定的优化集体操作; 确定一个或多个节点需要的资源来执行集体操作是否不可用; 如果一个或多个节点执行集体操作所需的资源不可用:通知其他节点资源不可用; 选择下一个优化的集体操作; 并执行下一个优化的集体操作。

    Administering registered virtual addresses in a hybrid computing environment including maintaining a cache of ranges of currently registered virtual addresses
    4.
    发明授权
    Administering registered virtual addresses in a hybrid computing environment including maintaining a cache of ranges of currently registered virtual addresses 有权
    在混合计算环境中管理注册的虚拟地址,包括维护当前注册的虚拟地址的范围的高速缓存

    公开(公告)号:US09286232B2

    公开(公告)日:2016-03-15

    申请号:US12359383

    申请日:2009-01-26

    IPC分类号: G06F12/00 G06F12/10

    摘要: Administering registered virtual addresses in a hybrid computing environment that includes a host computer, an accelerator, the accelerator architecture optimized, with respect to the host computer architecture, for speed of execution of a particular class of computing functions, the host computer and the accelerator adapted to one another for data communications by a system level message passing module, where administering registered virtual addresses includes maintaining a cache of ranges of currently registered virtual addresses, the cache including entries associating a range of currently registered virtual addresses, a handle representing physical addresses mapped to the range of currently registered virtual addresses, and a counter; determining whether to register ranges of virtual addresses in dependence upon the cache of ranges of currently registered virtual addresses; and determining whether to deregister ranges of virtual addresses in dependence upon the cache of ranges of currently registered virtual addresses.

    摘要翻译: 在混合计算环境中管理注册的虚拟地址,该混合计算环境包括主机计算机,加速器,针对主计算机架构优化的加速器架构,用于执行特定类别的计算功能的速度,主计算机和加速器 用于由系统级消息传递模块进行数据通信,其中管理注册的虚拟地址包括维护当前注册的虚拟地址的范围的高速缓存,所述高速缓存包括关联当前注册的虚拟地址的范围的条目,表示物理地址映射的句柄 到当前注册的虚拟地址的范围,以及计数器; 根据当前注册的虚拟地址的范围的高速缓存来确定是否注册虚拟地址的范围; 以及根据当前注册的虚拟地址的范围的高速缓存来确定是否取消注册虚拟地址的范围。

    Providing point to point communications among compute nodes in a global combining network of a parallel computer
    5.
    发明授权
    Providing point to point communications among compute nodes in a global combining network of a parallel computer 有权
    在并行计算机的全局组合网络中的计算节点之间提供点对点通信

    公开(公告)号:US09246792B2

    公开(公告)日:2016-01-26

    申请号:US13440252

    申请日:2012-04-05

    摘要: Methods, apparatus, and products are disclosed for providing point to point data communications among compute nodes in a global combining network of a parallel computer that include: determining a class route identifier available for all of the nodes along a communications path from an origin node to a target node; configuring network hardware of each node along the communications path with routing instructions in dependence upon the available class route identifier and the network's topology; transmitting, by the origin node along the communications path, a network packet to the target node, including encoding the available class route identifier in the network packet; and routing, by the network hardware of each node along the communications path, the network packet to the target node in dependence upon the routing instructions for each node and the available class route identifier.

    摘要翻译: 公开了用于在并行计算机的全局组合网络中的计算节点之间提供点对点数据通信的方法,装置和产品,其包括:确定沿着从原始节点到通信路径的通信路径的所有节点可用的类路由标识符 目标节点; 根据可用的类路由标识符和网络的拓扑,沿着通信路径配置具有路由指令的每个节点的网络硬件; 由原始节点沿着通信路径将网络分组发送到目标节点,包括对网络分组中的可用分类路由标识符进行编码; 以及通过沿着通信路径的每个节点的网络硬件,将网络分组路由到目标节点,其依赖于每个节点的路由指令和可用的类路由标识符。

    Determining a path for network traffic between nodes in a parallel computer
    6.
    发明授权
    Determining a path for network traffic between nodes in a parallel computer 有权
    确定并行计算机中节点之间网络流量的路径

    公开(公告)号:US09225545B2

    公开(公告)日:2015-12-29

    申请号:US12060508

    申请日:2008-04-01

    IPC分类号: G06F15/173 H04L12/413

    CPC分类号: H04L12/413

    摘要: Determining a path for network traffic between a source compute node and a destination compute node in a parallel computer including identifying a group of compute nodes, the group of compute nodes having topological network locations included in a predefined topological shape; selecting, from the predefined topological shape, in dependence upon a global contention counter stored on the source compute node, a path on which to send a data communications message from the source compute node to the destination compute node; and sending, by the messaging module of the source compute node, the data communications message along the selected path for network traffic between the source and destination compute nodes.

    摘要翻译: 确定并行计算机中的源计算节点和目的地计算节点之间的网络流量的路径,包括识别一组计算节点,具有包含在预定拓扑形状中的拓扑网络位置的计算节点组; 根据存储在源计算节点上的全局争用计数器从预定义的拓扑形状中选择从源计算节点向目的地计算节点发送数据通信消息的路径; 以及由所述源计算节点的所述消息传递模块沿所述选择的路径发送所述源计算节点与所述目的地计算节点之间的网络流量的所述数据通信消息。

    Performing collective operations in a distributed processing system
    7.
    发明授权
    Performing collective operations in a distributed processing system 有权
    在分布式处理系统中执行集体操作

    公开(公告)号:US08949328B2

    公开(公告)日:2015-02-03

    申请号:US13181601

    申请日:2011-07-13

    摘要: Methods, apparatuses, and computer program products for performing collective operations on a hybrid distributed processing system that includes a plurality of compute nodes and a plurality of tasks, each task is assigned a unique rank, and each compute node is coupled for data communications by at least two different networking topologies. At least one of the two networking topologies is a tiered tree topology having a root task and at least two child tasks and the at least two child tasks are peers of one another in the same tier. Embodiments include for each task, sending at least a portion of data corresponding to the task to all child tasks of the task through the tree topology; and sending at least a portion of the data corresponding to the task to all peers of the task at the same tier in the tree topology through the second topology.

    摘要翻译: 用于在包括多个计算节点和多个任务的混合分布式处理系统上进行集体操作的方法,装置和计算机程序产品,每个任务被分配唯一的等级,并且每个计算节点被耦合用于数据通信 最少两种不同的网络拓扑。 两个网络拓扑中的至少一个是具有根任务和至少两个子任务的分层树状拓扑,并且所述至少两个子任务是同一层中彼此的对等体。 实施例包括对于每个任务,通过树状拓扑将与任务对应的至少一部分数据发送到任务的所有子任务; 以及通过所述第二拓扑将与所述任务相对应的所述数据的至少一部分发送到所述树形拓扑中的同一层的任务的所有对等体。

    Processing unexpected messages at a compute node of a parallel computer
    8.
    发明授权
    Processing unexpected messages at a compute node of a parallel computer 有权
    在并行计算机的计算节点处理意外的消息

    公开(公告)号:US08930962B2

    公开(公告)日:2015-01-06

    申请号:US13401975

    申请日:2012-02-22

    IPC分类号: G06F9/54 G06F9/44

    CPC分类号: G06F15/17306 G06F9/546

    摘要: Methods, apparatuses, and computer program products for processing unexpected messages at a compute node of a parallel computer are provided. Embodiments include receiving, by the compute node, a portion of a message from another compute node of the parallel computer, the message comprising a plurality of separate portions; in response to receiving the portion of the message, determining, by the compute node, whether one of the applications executing on the compute node, has indicated that the message is expected; if one of the applications executing on the compute node has not indicated that the message is expected, storing, by the compute node, the portion of the message in an unexpected message buffer within the compute node; and if one of the applications executing on the compute node has indicated that the message is expected, storing the portion of the message at a storage destination indicated by the message.

    摘要翻译: 提供了用于在并行计算机的计算节点处理意外消息的方法,装置和计算机程序产品。 实施例包括由计算节点从并行计算机的另一计算节点接收消息的一部分,该消息包括多个分离的部分; 响应于接收到所述消息的部分,由所述计算节点确定在所述计算节点上执行的所述应用中的一个是否已经指示所述消息是预期的; 如果在计算节点上执行的应用程序中的一个尚未指示消息是预期的,则由计算节点将该消息的部分存储在计算节点内的意外消息缓冲器中; 并且如果在计算节点上执行的应用程序中的一个已经指示该消息是预期的,则将消息的该部分存储在该消息指示的存储目的地。

    Managing internode data communications for an uninitialized process in a parallel computer
    9.
    发明授权
    Managing internode data communications for an uninitialized process in a parallel computer 有权
    管理并行计算机中未初始化进程的节点间数据通信

    公开(公告)号:US08732725B2

    公开(公告)日:2014-05-20

    申请号:US13292293

    申请日:2011-11-09

    摘要: A parallel computer includes nodes, each having main memory and a messaging unit (MU). Each MU includes computer memory, which in turn includes, MU message buffers. Each MU message buffer is associated with an uninitialized process on the compute node. In the parallel computer, managing internode data communications for an uninitialized process includes: receiving, by an MU of a compute node, one or more data communications messages in an MU message buffer associated with an uninitialized process on the compute node; determining, by an application agent, that the MU message buffer associated with the uninitialized process is full prior to initialization of the uninitialized process; establishing, by the application agent, a temporary message buffer for the uninitialized process in main computer memory; and moving, by the application agent, data communications messages from the MU message buffer associated with the uninitialized process to the temporary message buffer in main computer memory.

    摘要翻译: 并行计算机包括各自具有主存储器和消息传送单元(MU)的节点。 每个MU包括计算机存储器,其又包括MU消息缓冲器。 每个MU消息缓冲区与计算节点上的未初始化进程相关联。 在并行计算机中,管理未初始化过程的节点间数据通信包括:由计算节点的MU接收与计算节点上的未初始化过程相关联的MU消息缓冲器中的一个或多个数据通信消息; 由应用代理确定与未初始化过程相关联的MU消息缓冲器在未初始化过程的初始化之前已满; 由应用代理建立用于主计算机存储器中未初始化过程的临时消息缓冲器; 并且由应用代理将与未初始化过程相关联的MU消息缓冲器的数据通信消息移动到主计算机存储器中的临时消息缓冲器。

    Compiling software for a hierarchical distributed processing system
    10.
    发明授权
    Compiling software for a hierarchical distributed processing system 失效
    为分层分布式处理系统编写软件

    公开(公告)号:US08621446B2

    公开(公告)日:2013-12-31

    申请号:US12770353

    申请日:2010-04-29

    IPC分类号: G06F9/45

    CPC分类号: G06F8/45

    摘要: Compiling software for a hierarchical distributed processing system including providing to one or more compiling nodes software to be compiled, wherein at least a portion of the software to be compiled is to be executed by one or more other nodes; compiling, by the compiling node, the software; maintaining, by the compiling node, any compiled software to be executed on the compiling node; selecting, by the compiling node, one or more nodes in a next tier of the hierarchy of the distributed processing system in dependence upon whether any compiled software is for the selected node or the selected node's descendants; sending to the selected node only the compiled software to be executed by the selected node or selected node's descendant.

    摘要翻译: 编译用于分级分布式处理系统的软件,包括向一个或多个编译节点提供待编译的软件,其中待编译的软件的至少一部分将被一个或多个其他节点执行; 由编译节点编译软件; 由编译节点维护要在编译节点上执行的任何编译软件; 根据编辑的软件是否针对所选择的节点或所选择的节点的后代,由编译节点选择分布式处理系统的层级的下一层中的一个或多个节点; 将所选择的节点或所选节点的后代发送到所选择的节点仅由编译的软件执行。