Remote asynchronous data mover
    1.
    发明授权
    Remote asynchronous data mover 失效
    远程异步数据移动器

    公开(公告)号:US07996564B2

    公开(公告)日:2011-08-09

    申请号:US12425093

    申请日:2009-04-16

    IPC分类号: G06F12/00

    摘要: A distributed data processing system executes multiple tasks within a parallel job, including a first local task on a local node and at least one task executing on a remote node, with a remote memory having real address (RA) locations mapped to one or more of the source effective addresses (EA) and destination EA of a data move operation initiated by a task executing on the local node. On initiation of the data move operation, remote asynchronous data move (RADM) logic identifies that the operation moves data to/from a first EA that is memory mapped to an RA of the remote memory. The local processor/RADM logic initiates a RADM operation that moves a copy of the data directly from/to the first remote memory by completing the RADM operation using the network interface cards (NICs) of the source and destination processing nodes, determined by accessing a data center for the node IDs of remote memory.

    摘要翻译: 分布式数据处理系统在并行作业中执行多个任务,包括本地节点上的第一本地任务和在远程节点上执行的至少一个任务,具有映射到以下的一个或多个的实地址(RA)位置的远程存储器 由本地节点上执行的任务启动的数据移动操作的源有效地址(EA)和目标EA。 在启动数据移动操作时,远程异步数据移动(RADM)逻辑识别该操作将数据移动到/从第一个EA,该第一个EA是映射到远程存储器的RA的存储器。 本地处理器/ RADM逻辑启动RADM操作,其通过使用源和目的地处理节点的网络接口卡(NIC)完成RADM操作,直接从/向第一远程存储器移动数据的副本,其通过访问 数据中心为远程存储器的节点ID。

    Remote Asynchronous Data Mover
    2.
    发明申请
    Remote Asynchronous Data Mover 失效
    远程异步数据移动器

    公开(公告)号:US20100268788A1

    公开(公告)日:2010-10-21

    申请号:US12425093

    申请日:2009-04-16

    摘要: A distributed data processing system executes multiple tasks within a parallel job, including a first local task on a local node and at least one task executing on a remote node, with a remote memory having real address (RA) locations mapped to one or more of the source effective addresses (EA) and destination EA of a data move operation initiated by a task executing on the local node. On initiation of the data move operation, remote asynchronous data move (RADM) logic identifies that the operation moves data to/from a first EA that is memory mapped to an RA of the remote memory. The local processor/RADM logic initiates a RADM operation that moves a copy of the data directly from/to the first remote memory by completing the RADM operation using the network interface cards (NICs) of the source and destination processing nodes, determined by accessing a data center for the node IDs of remote memory.

    摘要翻译: 分布式数据处理系统在并行作业中执行多个任务,包括本地节点上的第一本地任务和在远程节点上执行的至少一个任务,具有映射到以下的一个或多个的实地址(RA)位置的远程存储器 由本地节点上执行的任务启动的数据移动操作的源有效地址(EA)和目标EA。 在启动数据移动操作时,远程异步数据移动(RADM)逻辑识别该操作将数据移动到/从第一个EA,该第一个EA是映射到远程存储器的RA的存储器。 本地处理器/ RADM逻辑启动RADM操作,其通过使用源和目的地处理节点的网络接口卡(NIC)完成RADM操作,直接从/向第一远程存储器移动数据的副本,其通过访问 数据中心为远程存储器的节点ID。

    Performing Setup Operations for Receiving Different Amounts of Data While Processors are Performing Message Passing Interface Tasks
    3.
    发明申请
    Performing Setup Operations for Receiving Different Amounts of Data While Processors are Performing Message Passing Interface Tasks 审中-公开
    在处理器执行消息传递接口任务时,执行接收不同数据量的设置操作

    公开(公告)号:US20120266180A1

    公开(公告)日:2012-10-18

    申请号:US13524585

    申请日:2012-06-15

    IPC分类号: G06F9/52 G06F15/173

    CPC分类号: G06F9/522 G06F9/5083

    摘要: A system and method are provided for performing setup operations for receiving a different amount of data while processors are performing message passing interface (MPI) tasks. Mechanisms for adjusting the balance of processing workloads of the processors are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. An MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, setup operations may be performed while processors are performing MPI tasks to prepare for receiving different sized portions of data in a subsequent computation cycle based on the history.

    摘要翻译: 提供了一种系统和方法,用于在处理器执行消息传递接口(MPI)任务时执行用于接收不同数量的数据的建立操作。 提供了用于调整处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 MPI负载平衡控制器维护一个历史记录,提供关于其对同步操作的调用的任务简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 结果,可以在处理器正在执行MPI任务以准备在基于历史的后续计算周期中接收不同大小的数据部分时执行设置操作。

    Hardware based dynamic load balancing of message passing interface tasks
    4.
    发明授权
    Hardware based dynamic load balancing of message passing interface tasks 失效
    基于硬件的动态负载平衡消息传递接口任务

    公开(公告)号:US08127300B2

    公开(公告)日:2012-02-28

    申请号:US11846119

    申请日:2007-08-28

    IPC分类号: G06F9/46 G06F15/173

    CPC分类号: G06F9/522 G06F9/5083

    摘要: Mechanisms for providing hardware based dynamic load balancing of message passing interface (MPI) tasks are provided. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了提供消息传递接口(MPI)任务的基于硬件的动态负载平衡的机制。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢的处理器转移到一个或多个更快的处理器。

    System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks By Modifying Tasks
    5.
    发明申请
    System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks By Modifying Tasks 失效
    基于硬件的动态负载平衡的系统和方法消息传递接口任务通过修改任务

    公开(公告)号:US20090064168A1

    公开(公告)日:2009-03-05

    申请号:US11846168

    申请日:2007-08-28

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5083 G06F9/522

    摘要: A system and method are provided for providing hardware based dynamic load balancing of message passing interface (MPI) tasks by modifying tasks. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. Thus, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了一种系统和方法,用于通过修改任务来提供消息传递接口(MPI)任务的基于硬件的动态负载平衡。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢处理器转移到一个或多个较快处理器。

    System and Computer Program Product for Modifying an Operation of One or More Processors Executing Message Passing Interface Tasks
    6.
    发明申请
    System and Computer Program Product for Modifying an Operation of One or More Processors Executing Message Passing Interface Tasks 失效
    系统和计算机程序产品,用于修改执行消息传递接口任务的一个或多个处理器的操作

    公开(公告)号:US20090063885A1

    公开(公告)日:2009-03-05

    申请号:US11846101

    申请日:2007-08-28

    IPC分类号: G06F9/30 G06F15/163

    摘要: A system and computer program product for modifying an operation of one or more processors executing message passing interface (MPI) tasks are provided. Mechanisms for adjusting the balance of processing workloads of the processors are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了一种用于修改执行消息传递接口(MPI)任务的一个或多个处理器的操作的系统和计算机程序产品。 提供了用于调整处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢的处理器转移到一个或多个更快的处理器。

    Hardware based dynamic load balancing of message passing interface tasks by modifying tasks
    7.
    发明授权
    Hardware based dynamic load balancing of message passing interface tasks by modifying tasks 失效
    基于硬件的动态负载平衡消息传递接口任务通过修改任务

    公开(公告)号:US08312464B2

    公开(公告)日:2012-11-13

    申请号:US11846168

    申请日:2007-08-28

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5083 G06F9/522

    摘要: Mechanisms are provided for providing hardware based dynamic load balancing of message passing interface (MPI) tasks by modifying tasks. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. Thus, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了通过修改任务来提供消息传递接口(MPI)任务的基于硬件的动态负载平衡的机制。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢处理器转移到一个或多个较快处理器。

    Providing full hardware support of collective operations in a multi-tiered full-graph interconnect architecture
    8.
    发明授权
    Providing full hardware support of collective operations in a multi-tiered full-graph interconnect architecture 失效
    在多层全图互连架构中提供集体操作的完整硬件支持

    公开(公告)号:US07958182B2

    公开(公告)日:2011-06-07

    申请号:US11845223

    申请日:2007-08-27

    IPC分类号: G06F15/16

    CPC分类号: G06F15/17381

    摘要: A mechanism is provided for performing collective operations. In hardware of a parent processor in a first processor book, a number of other processors are determined in a same or different processor book of the data processing system that is needed to execute the collective operation, thereby establishing a plurality of processors comprising the parent processor and the other processors. In hardware of the parent processor, the plurality of processors are logically arranged as a plurality of nodes in a hierarchical structure. The collective operation is transmitted to the plurality of processors based on the hierarchical structure. In hardware of the parent processor, results are received from the execution of the collective operation from the other processors, a final result is generated of the collective operation based on the received results, and the final result is output.

    摘要翻译: 提供了一种用于执行集体操作的机制。 在第一处理器书中的母处理器的硬件中,在执行集体操作所需的数据处理系统的相同或不同的处理器簿中确定多个其他处理器,由此建立多个处理器,其包括母处理器 和其他处理器。 在母处理器的硬件中,多个处理器在逻辑上被布置为分层结构中的多个节点。 基于层次结构将集体操作发送到多个处理器。 在母处理器的硬件中,从其他处理器的集体操作的执行中接收到结果,基于接收到的结果生成集合操作的最终结果,并输出最终结果。

    Routing information through a data processing system implementing a multi-tiered full-graph interconnect architecture
    9.
    发明授权
    Routing information through a data processing system implementing a multi-tiered full-graph interconnect architecture 失效
    通过实现多层全图互连架构的数据处理系统路由信息

    公开(公告)号:US07904590B2

    公开(公告)日:2011-03-08

    申请号:US11845215

    申请日:2007-08-27

    IPC分类号: G06F15/173

    CPC分类号: G06F15/17381

    摘要: A mechanism is provided for routing information through the data processing system. Data is received at a source processor within a set of processors that is to be transmitted to a destination processor, where the data includes address information. A first determination is performed as to whether the destination processor is within a same processor book as the source processor based on the address information. A second determination is performed as to whether the destination processor is within a same supernode as the source processor based on the address information if the destination processor is not within the same processor book. A routing path is identified for the data based on results of the first determination, the second determination, and one or more routing table data structures. The data is then transmitted from the source processor along the identified routing path toward the destination processor.

    摘要翻译: 提供了一种通过数据处理系统路由信息的机制。 在要发送到目标处理器的一组处理器内的源处理器处接收数据,其中数据包括地址信息。 基于地址信息,执行目的地处理器是否在与处理器相同的处理器簿内的第一确定。 如果目的地处理器不在相同的处理器书中,则基于地址信息来执行关于目的地处理器是否在与源处理器相同的超级节点内的第二确定。 基于第一确定,第二确定和一个或多个路由表数据结构的结果,为数据识别路由路径。 然后将数据从源处理器沿着识别的路由路径发送到目的地处理器。

    Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks
    10.
    发明申请
    Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks 失效
    基于硬件的消息传递接口任务的动态负载平衡方法

    公开(公告)号:US20090064165A1

    公开(公告)日:2009-03-05

    申请号:US11846119

    申请日:2007-08-28

    IPC分类号: G06F9/46

    CPC分类号: G06F9/522 G06F9/5083

    摘要: A method for providing hardware based dynamic load balancing of message passing interface (MPI) tasks are provided. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了一种用于提供消息传递接口(MPI)任务的基于硬件的动态负载平衡的方法。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢的处理器转移到一个或多个更快的处理器。