Performing a global barrier operation in a parallel computer
    21.
    发明授权
    Performing a global barrier operation in a parallel computer 有权
    在并行计算机中执行全局屏障操作

    公开(公告)号:US08910178B2

    公开(公告)日:2014-12-09

    申请号:US13206581

    申请日:2011-08-10

    IPC分类号: G06F9/46 G06F9/52

    摘要: Executing computing tasks on a parallel computer that includes compute nodes coupled for data communications, where each compute node executes tasks, with one task on each compute node designated as a master task, including: for each task on each compute node until all master tasks have joined a global barrier: determining whether the task is a master task; if the task is not a master task, joining a single local barrier; if the task is a master task, joining the global barrier and the single local barrier only after all other tasks on the compute node have joined the single local barrier.

    摘要翻译: 在并行计算机上执行计算任务,其中包括耦合用于数据通信的计算节点,其中每个计算节点执行任务,每个计算节点上指定为主任务的一个任务包括:对于每个计算节点上的每个任务,直到所有主任务具有 加入了全球壁垒:确定任务是否是主要任务; 如果任务不是主要任务,加入一个单一的局部障碍; 如果任务是主任务,则仅在计算节点上的所有其他任务已加入单个本地障碍之后才加入全局障碍和单个本地屏障。

    Administering truncated receive functions in a parallel messaging interface
    22.
    发明授权
    Administering truncated receive functions in a parallel messaging interface 有权
    在并行消息接口中管理截断的接收功能

    公开(公告)号:US08909716B2

    公开(公告)日:2014-12-09

    申请号:US12892153

    申请日:2010-09-28

    IPC分类号: G06F15/16

    CPC分类号: G06F15/16

    摘要: Administering truncated receive functions in a parallel messaging interface (‘PMI’) of a parallel computer comprising a plurality of compute nodes coupled for data communications through the PMI and through a data communications network, including: sending, through the PMI on a source compute node, a quantity of data from the source compute node to a destination compute node; specifying, by an application on the destination compute node, a portion of the quantity of data to be received by the application on the destination compute node and a portion of the quantity of data to be discarded; receiving, by the PMI on the destination compute node, all of the quantity of data; providing, by the PMI on the destination compute node to the application on the destination compute node, only the portion of the quantity of data to be received by the application; and discarding, by the PMI on the destination compute node, the portion of the quantity of data to be discarded.

    摘要翻译: 在并行计算机的并行消息接口(“PMI”)中管理截断的接收功能,所述并行计算机包括多个计算节点,所述计算节点被耦合用于通过所述PMI并通过数据通信网络进行数据通信,包括:通过源计算节点 ,从源计算节点到目的地计算节点的数据量; 由目的地计算节点上的应用指定要由目标计算节点上的应用接收的数据量的一部分和待丢弃的数据量的一部分; 由目标计算节点上的PMI接收所有数据量; 由目的地计算节点上的PMI向目标计算节点上的应用提供应用程序要接收的数据量的部分; 并且由目的地计算节点上的PMI舍弃待丢弃的数据量的部分。

    Providing policy-based application services to an application running on a computing system
    23.
    发明授权
    Providing policy-based application services to an application running on a computing system 失效
    为运行在计算系统上的应用程序提供基于策略的应用程序服务

    公开(公告)号:US08656448B2

    公开(公告)日:2014-02-18

    申请号:US11553101

    申请日:2006-10-26

    IPC分类号: H04L29/06

    CPC分类号: G06F11/3447 G06F11/3409

    摘要: Methods, apparatus, products are disclosed for providing policy-based application services to an application running on a computing system. The computing system includes at least one compute node. The compute node includes an application and a plurality of application services of a service type. Providing policy-based application services to an application running on a computing system includes establishing, on the compute node, a service policy specifying one of the application services of the service type for use by the application, establishing a performance ruleset, the performance ruleset specifying rules for analyzing the performance of the computing system, measuring performance of the computing system, and adjusting the service policy in dependence upon the performance ruleset and the measured performance. Providing policy-based application services to an application running on a computing system may also include accessing, by the application, the specified application service.

    摘要翻译: 公开了用于向在计算系统上运行的应用程序提供基于策略的应用服务的方法,装置,产品。 计算系统包括至少一个计算节点。 计算节点包括服务类型的应用和多个应用服务。 向在计算系统上运行的应用提供基于策略的应用服务包括在计算节点上建立指定服务类型的应用服务之一以供应用使用的服务策略,建立性能规则集,指定性能规则集 用于分析计算系统的性能,衡量计算系统的性能以及根据性能规则集和测量性能调整服务策略的规则。 向运行在计算系统上的应用提供基于策略的应用服务也可以包括由应用访问指定的应用服务。

    Scheduling Synchronization In Association With Collective Operations In A Parallel Computer
    24.
    发明申请
    Scheduling Synchronization In Association With Collective Operations In A Parallel Computer 有权
    在并行计算机中与集体操作相关联的调度同步

    公开(公告)号:US20130304995A1

    公开(公告)日:2013-11-14

    申请号:US13470932

    申请日:2012-05-14

    IPC分类号: G06F12/00

    CPC分类号: G06F15/17325

    摘要: Methods, apparatuses, and computer program products for scheduling synchronization in association with collective operations in a parallel computer that includes a shared memory and a plurality of compute nodes that execute a parallel application utilizing the shared memory are provided. Embodiments include acquiring an available channel of the shared memory; posting to the acquired channel of the shared memory one or more collective operations and a synchronization point; determining that processing within the acquired channel has reached the synchronization point; and posting to the acquired channel, in response to determining that processing within the acquired channel has reached the synchronization point, a background synchronization operation corresponding to the one or more collective operations.

    摘要翻译: 提供了用于在并行计算机中与集合操作相关联的用于调度同步的方法,装置和计算机程序产品,所述并行计算机包括共享存储器和使用所述共享存储器执行并行应用的多个计算节点。 实施例包括获取共享存储器的可用信道; 向共享存储器的获取通道发布一个或多个集合操作和同步点; 确定所获取的信道内的处理已经达到同步点; 并且响应于确定所获取的信道中的处理已经到达同步点,发布到获取的信道,对应于一个或多个集合操作的后台同步操作。

    Data communications in a parallel active messaging interface of a parallel computer
    25.
    发明授权
    Data communications in a parallel active messaging interface of a parallel computer 失效
    并行计算机的并行活动消息接口中的数据通信

    公开(公告)号:US08572629B2

    公开(公告)日:2013-10-29

    申请号:US12963694

    申请日:2010-12-09

    IPC分类号: G06F9/46

    CPC分类号: G06F9/546

    摘要: Data communications in a parallel active messaging interface (‘PAMI’) of a parallel computer, the parallel computer including a plurality of compute nodes that execute a parallel application, the PAMI composed of data communications endpoints, each endpoint including a specification of data communications parameters for a thread of execution on a compute node, including specifications of a client, a context, and a task, the compute nodes and the endpoints coupled for data communications through the PAMI and through data communications resources, including receiving in an origin endpoint of the PAMI a data communications instruction, the instruction characterized by an instruction type, the instruction specifying a transmission of transfer data from the origin endpoint to a target endpoint and transmitting, in accordance with the instruction type, the transfer data from the origin endpoint to the target endpoint.

    摘要翻译: 并行计算机的并行活动消息接口(“PAMI”)中的数据通信,并行计算机包括执行并行应用的多个计算节点,由数据通信端点组成的PAMI,每个端点包括数据通信参数的规范 对于在计算节点上执行的线程,包括客户端,上下文和任务的规范,所述计算节点和端点被耦合用于通过PAMI进行数据通信,并且通过数据通信资源,包括在源终端中接收 PAMI数据通信指令,以指令类型为特征的指令,指示传输数据从原始端点传输到目标端点的指令,并根据指令类型将传输数据从原点终端发送到目标 端点。

    Processing communications events in parallel active messaging interface by awakening thread from wait state
    26.
    发明授权
    Processing communications events in parallel active messaging interface by awakening thread from wait state 失效
    通过唤醒线程等待状态来处理通信事件并行活动消息接口

    公开(公告)号:US08566841B2

    公开(公告)日:2013-10-22

    申请号:US12943105

    申请日:2010-11-10

    IPC分类号: G06F15/163

    摘要: Processing data communications events in a parallel active messaging interface (‘PAMI’) of a parallel computer that includes compute nodes that execute a parallel application, with the PAMI including data communications endpoints, and the endpoints are coupled for data communications through the PAMI and through other data communications resources, including determining by an advance function that there are no actionable data communications events pending for its context, placing by the advance function its thread of execution into a wait state, waiting for a subsequent data communications event for the context; responsive to occurrence of a subsequent data communications event for the context, awakening by the thread from the wait state; and processing by the advance function the subsequent data communications event now pending for the context.

    摘要翻译: 在并行计算机的并行活动消息接口(“PAMI”)中处理数据通信事件,其包括执行并行应用的计算节点,PAMI包括数据通信端点和端点,用于通过PAMI进行数据通信,并通过 其他数据通信资源,包括通过提前功能确定不存在针对其上下文等待的可操作的数据通信事件,通过提前功能将其执行线程置于等待状态,等待上下文的后续数据通信事件; 响应于上下文的后续数据通信事件的发生,线程从等待状态唤醒; 以及通过提前功能处理现在正在等待上下文的后续数据通信事件。

    Performing a scatterv operation on a hierarchical tree network optimized for collective operations
    27.
    发明授权
    Performing a scatterv operation on a hierarchical tree network optimized for collective operations 失效
    在为集体操作优化的分层树网络上执行分散操作

    公开(公告)号:US08565089B2

    公开(公告)日:2013-10-22

    申请号:US12748594

    申请日:2010-03-29

    CPC分类号: G06F15/17318

    摘要: Performing a scatterv operation on a hierarchical tree network optimized for collective operations including receiving, by the scatterv module installed on the node, from a nearest neighbor parent above the node a chunk of data having at least a portion of data for the node; maintaining, by the scatterv module installed on the node, the portion of the data for the node; determining, by the scatterv module installed on the node, whether any portions of the data are for a particular nearest neighbor child below the node or one or more other nodes below the particular nearest neighbor child; and sending, by the scatterv module installed on the node, those portions of data to the nearest neighbor child if any portions of the data are for a particular nearest neighbor child below the node or one or more other nodes below the particular nearest neighbor child.

    摘要翻译: 在对集体操作进行了优化的分层树网络上执行分散操作,包括由节点上安装的scatterv模块从节点上方的最邻近父节点接收具有该节点的至少一部分数据的数据块; 通过安装在节点上的scatterv模块维护节点的数据部分; 通过安装在节点上的scatterv模块来确定数据的任何部分是否用于节点下方的特定最近邻孩子或特定最近邻孩子下面的一个或多个其他节点; 并且如果数据的任何部分用于节点下方的特定最近邻居子节点或特定最邻近子节点下方的一个或多个其他节点,则将由节点上安装的scatterv模块发送到最近邻居子节点的那些部分数据。

    Profiling an application for power consumption during execution on a compute node
    28.
    发明授权
    Profiling an application for power consumption during execution on a compute node 有权
    在计算节点上分析执行期间的功耗应用程序

    公开(公告)号:US08539270B2

    公开(公告)日:2013-09-17

    申请号:US13447501

    申请日:2012-04-16

    IPC分类号: G06F1/26

    摘要: Methods, apparatus, and products are disclosed for profiling an application for power consumption during execution on a compute node that include: receiving an application for execution on a compute node; identifying a hardware power consumption profile for the compute node, the hardware power consumption profile specifying power consumption for compute node hardware during performance of various processing operations; determining a power consumption profile for the application in dependence upon the application and the hardware power consumption profile for the compute node; and reporting the power consumption profile for the application.

    摘要翻译: 公开了方法,装置和产品,用于在计算节点执行期间对用于功耗的应用进行分析,所述计算节点包括:在计算节点上接收用于执行的应用; 识别所述计算节点的硬件功耗简档,所述硬件功耗简档在执行各种处理操作期间指定计算节点硬件的功耗; 根据应用和计算节点的硬件功耗特性,确定应用的功耗曲线; 并报告应用程序的功耗曲线。

    Performing a deterministic reduction operation in a compute node organized into a branched tree topology
    30.
    发明授权
    Performing a deterministic reduction operation in a compute node organized into a branched tree topology 失效
    在组织成分支树拓扑的计算节点中执行确定性简化操作

    公开(公告)号:US08489859B2

    公开(公告)日:2013-07-16

    申请号:US12790037

    申请日:2010-05-28

    IPC分类号: G06F9/00

    CPC分类号: G06F15/76 G06F15/17318

    摘要: Performing a deterministic reduction operation in a parallel computer that includes compute nodes, each of which includes computer processors and a CAU (Collectives Acceleration Unit) that couples computer processors to one another for data communications, including organizing processors and a CAU into a branched tree topology in which the CAU is a root and the processors are children; receiving, from each of the processors in any order, dummy contribution data, where each processor is restricted from sending any other data to the root CAU prior to receiving an acknowledgement of receipt from the root CAU; sending, by the root CAU to the processors in the branched tree topology, in a predefined order, acknowledgements of receipt of the dummy contribution data; receiving, by the root CAU from the processors in the predefined order, the processors' contribution data to the reduction operation; and reducing, by the root CAU, the processors' contribution data.

    摘要翻译: 在包括计算节点的并行计算机中执行确定性简化操作,每个节点包括计算机处理器和将计算机处理器彼此耦合以用于数据通信的CAU(集体加速单元),包括将处理器和CAU组织成分支树形拓扑 其中CAU是根,处理器是孩子; 从每个处理器以任何顺序接收虚拟贡献数据,其中每个处理器在从根CAU接收到接收确认之前被限制不发送任何其他数据到根CAU; 由根CAU以分支树拓扑结构向处理器发送预定义的顺序,接收虚拟贡献数据的确认; 根据CAU从预定义的顺序从处理器接收处理器对减少操作的贡献数据; 并由根CAU减少处理器的贡献数据。