System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks By Modifying Tasks
    31.
    发明申请
    System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks By Modifying Tasks 失效
    基于硬件的动态负载平衡的系统和方法消息传递接口任务通过修改任务

    公开(公告)号:US20090064168A1

    公开(公告)日:2009-03-05

    申请号:US11846168

    申请日:2007-08-28

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5083 G06F9/522

    摘要: A system and method are provided for providing hardware based dynamic load balancing of message passing interface (MPI) tasks by modifying tasks. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. Thus, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了一种系统和方法,用于通过修改任务来提供消息传递接口(MPI)任务的基于硬件的动态负载平衡。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢处理器转移到一个或多个较快处理器。

    System and Computer Program Product for Modifying an Operation of One or More Processors Executing Message Passing Interface Tasks
    32.
    发明申请
    System and Computer Program Product for Modifying an Operation of One or More Processors Executing Message Passing Interface Tasks 失效
    系统和计算机程序产品,用于修改执行消息传递接口任务的一个或多个处理器的操作

    公开(公告)号:US20090063885A1

    公开(公告)日:2009-03-05

    申请号:US11846101

    申请日:2007-08-28

    IPC分类号: G06F9/30 G06F15/163

    摘要: A system and computer program product for modifying an operation of one or more processors executing message passing interface (MPI) tasks are provided. Mechanisms for adjusting the balance of processing workloads of the processors are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了一种用于修改执行消息传递接口(MPI)任务的一个或多个处理器的操作的系统和计算机程序产品。 提供了用于调整处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢的处理器转移到一个或多个更快的处理器。

    Method for data processing using a multi-tiered full-graph interconnect architecture
    33.
    发明授权
    Method for data processing using a multi-tiered full-graph interconnect architecture 失效
    使用多层全图互连架构进行数据处理的方法

    公开(公告)号:US08185896B2

    公开(公告)日:2012-05-22

    申请号:US11845207

    申请日:2007-08-27

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5061 G06F2209/5012

    摘要: A method is provided for implementing a multi-tiered full-graph interconnect architecture. In order to implement a multi-tiered full-graph interconnect architecture, a plurality of processors are coupled to one another to create a plurality of processor books. The plurality of processor books are coupled together to create a plurality of supernodes. Then, the plurality of supernodes are coupled together to create the multi-tiered full-graph interconnect architecture. Data is then transmitted from one processor to another within the multi-tiered full-graph interconnect architecture based on an addressing scheme that specifies at least a supernode and a processor book associated with a target processor to which the data is to be transmitted.

    摘要翻译: 提供了一种实现多层全图互连架构的方法。 为了实现多层全图互连架构,多个处理器彼此耦合以创建多个处理器书籍。 多个处理器书联接在一起以创建多个超节点。 然后,将多个超节点耦合在一起以创建多层全图互连体系结构。 然后,数据在多层全图互连体系结构中从一个处理器传输到另一个处理器,这是基于一个寻址方案,该寻址方案至少指定了一个与要发送数据的目标处理器相关联的超级节点和一个处理器。

    System for data processing using a multi-tiered full-graph interconnect architecture
    34.
    发明授权
    System for data processing using a multi-tiered full-graph interconnect architecture 失效
    使用多层全图互连架构进行数据处理的系统

    公开(公告)号:US08140731B2

    公开(公告)日:2012-03-20

    申请号:US11845206

    申请日:2007-08-27

    IPC分类号: G06F13/14

    CPC分类号: G06F15/16

    摘要: A system is provided for implementing a multi-tiered full-graph interconnect architecture. In order to implement a multi-tiered full-graph interconnect architecture, a plurality of processors are coupled to one another to create a plurality of processor books. The plurality of processor books are coupled together to create a plurality of supernodes. Then, the plurality of supernodes are coupled together to create the multi-tiered full-graph interconnect architecture. Data is then transmitted from one processor to another within the multi-tiered full-graph interconnect architecture based on an addressing scheme that specifies at least a supernode and a processor book associated with a target processor to which the data is to be transmitted.

    摘要翻译: 提供了一种用于实现多层全图互连架构的系统。 为了实现多层全图互连架构,多个处理器彼此耦合以创建多个处理器书籍。 多个处理器书联接在一起以创建多个超节点。 然后,将多个超节点耦合在一起以创建多层全图互连体系结构。 然后,数据在多层全图互连体系结构中从一个处理器传输到另一个处理器,这是基于一个寻址方案,该寻址方案至少指定了一个与要发送数据的目标处理器相关联的超级节点和一个处理器。

    System and Method for Routing Information Through a Data Processing System Implementing a Multi-Tiered Full-Graph Interconnect Architecture
    35.
    发明申请
    System and Method for Routing Information Through a Data Processing System Implementing a Multi-Tiered Full-Graph Interconnect Architecture 失效
    通过数据处理系统路由信息的系统和方法实现多层全图互连架构

    公开(公告)号:US20090063814A1

    公开(公告)日:2009-03-05

    申请号:US11845215

    申请日:2007-08-27

    IPC分类号: G06F15/76 G06F9/00

    CPC分类号: G06F15/17381

    摘要: A method, computer program product, and system are provided for routing information through the data processing system. Data is received at a source processor within a set of processors that is to be transmitted to a destination processor, where the data includes address information. A first determination is performed as to whether the destination processor is within a same processor book as the source processor based on the address information. A second determination is performed as to whether the destination processor is within a same supernode as the source processor based on the address information if the destination processor is not within the same processor book. A routing path is identified for the data based on results of the first determination, the second determination, and one or more routing table data structures. The data is then transmitted from the source processor along the identified routing path toward the destination processor.

    摘要翻译: 提供了一种通过数据处理系统路由信息的方法,计算机程序产品和系统。 在要发送到目标处理器的一组处理器内的源处理器处接收数据,其中数据包括地址信息。 基于地址信息,执行目的地处理器是否在与处理器相同的处理器簿内的第一确定。 如果目的地处理器不在相同的处理器书中,则基于地址信息来执行关于目的地处理器是否在与源处理器相同的超级节点内的第二确定。 基于第一确定,第二确定和一个或多个路由表数据结构的结果,为数据识别路由路径。 然后将数据从源处理器沿着识别的路由路径发送到目的地处理器。

    Performing setup operations for receiving different amounts of data while processors are performing message passing interface tasks
    36.
    发明授权
    Performing setup operations for receiving different amounts of data while processors are performing message passing interface tasks 有权
    执行设置操作以在处理器执行消息传递接口任务时接收不同数量的数据

    公开(公告)号:US08234652B2

    公开(公告)日:2012-07-31

    申请号:US11846154

    申请日:2007-08-28

    IPC分类号: G06F9/46 G06F15/173

    CPC分类号: G06F9/522 G06F9/5083

    摘要: Mechanisms are provided for performing setup operations for receiving a different amount of data while processors are performing message passing interface (MPI) tasks. Mechanisms for adjusting the balance of processing workloads of the processors are provided so us to minimize wait periods for waiting for all of the processors to call a synchronization operation. An MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, setup operations may be performed while processors are performing MPI tasks to prepare for receiving different sized portions of data in a subsequent computation cycle based on the history.

    摘要翻译: 提供了用于在处理器执行消息传递接口(MPI)任务时执行用于接收不同数据量的设置操作的机制。 提供了用于调整处理器的处理工作负载的平衡的机制,使得我们最小化等待所有处理器调用同步操作的等待周期。 MPI负载平衡控制器维护一个历史记录,提供关于其对同步操作的调用的任务简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 结果,可以在处理器正在执行MPI任务以准备在基于历史的后续计算周期中接收不同大小的数据部分时执行设置操作。

    Remote asynchronous data mover
    37.
    发明授权
    Remote asynchronous data mover 失效
    远程异步数据移动器

    公开(公告)号:US07996564B2

    公开(公告)日:2011-08-09

    申请号:US12425093

    申请日:2009-04-16

    IPC分类号: G06F12/00

    摘要: A distributed data processing system executes multiple tasks within a parallel job, including a first local task on a local node and at least one task executing on a remote node, with a remote memory having real address (RA) locations mapped to one or more of the source effective addresses (EA) and destination EA of a data move operation initiated by a task executing on the local node. On initiation of the data move operation, remote asynchronous data move (RADM) logic identifies that the operation moves data to/from a first EA that is memory mapped to an RA of the remote memory. The local processor/RADM logic initiates a RADM operation that moves a copy of the data directly from/to the first remote memory by completing the RADM operation using the network interface cards (NICs) of the source and destination processing nodes, determined by accessing a data center for the node IDs of remote memory.

    摘要翻译: 分布式数据处理系统在并行作业中执行多个任务,包括本地节点上的第一本地任务和在远程节点上执行的至少一个任务,具有映射到以下的一个或多个的实地址(RA)位置的远程存储器 由本地节点上执行的任务启动的数据移动操作的源有效地址(EA)和目标EA。 在启动数据移动操作时,远程异步数据移动(RADM)逻辑识别该操作将数据移动到/从第一个EA,该第一个EA是映射到远程存储器的RA的存储器。 本地处理器/ RADM逻辑启动RADM操作,其通过使用源和目的地处理节点的网络接口卡(NIC)完成RADM操作,直接从/向第一远程存储器移动数据的副本,其通过访问 数据中心为远程存储器的节点ID。

    Performing collective operations using software setup and partial software execution at leaf nodes in a multi-tiered full-graph interconnect architecture
    38.
    发明授权
    Performing collective operations using software setup and partial software execution at leaf nodes in a multi-tiered full-graph interconnect architecture 失效
    使用多层全图互连体系结构中叶节点的软件设置和部分软件执行来执行集体操作

    公开(公告)号:US07958183B2

    公开(公告)日:2011-06-07

    申请号:US11845224

    申请日:2007-08-27

    IPC分类号: G06F15/16

    CPC分类号: G06F15/17381

    摘要: A mechanism for performing collective operations. In software executing on a parent processor in a first processor book, a number of other processors are determined in a same or different processor book of the data processing system that is needed to execute the collective operation, thereby establishing a plurality of processors comprising the parent processor and the other processors. In software executing on the parent processor, the plurality of processors are logically arranged as a plurality of nodes in a hierarchical structure. The collective operation is transmitted to the plurality of processors based on the hierarchical structure. In hardware of the parent processor, results are received from the execution of the collective operation from the other processors, a final result is generated of the collective operation based on the received results, and the final result is output.

    摘要翻译: 执行集体行动的机制。 在第一处理器书中在母处理器上执行的软件中,在执行集体操作所需的数据处理系统的相同或不同的处理器簿中确定多个其他处理器,由此建立多个处理器,其包含该父 处理器和其他处理器。 在在母处理器上执行的软件中,多个处理器在逻辑上被布置为分层结构中的多个节点。 基于层次结构将集体操作发送到多个处理器。 在母处理器的硬件中,从其他处理器的集体操作的执行中接收到结果,基于接收到的结果生成集合操作的最终结果,并输出最终结果。

    Remote Asynchronous Data Mover
    39.
    发明申请
    Remote Asynchronous Data Mover 失效
    远程异步数据移动器

    公开(公告)号:US20100268788A1

    公开(公告)日:2010-10-21

    申请号:US12425093

    申请日:2009-04-16

    摘要: A distributed data processing system executes multiple tasks within a parallel job, including a first local task on a local node and at least one task executing on a remote node, with a remote memory having real address (RA) locations mapped to one or more of the source effective addresses (EA) and destination EA of a data move operation initiated by a task executing on the local node. On initiation of the data move operation, remote asynchronous data move (RADM) logic identifies that the operation moves data to/from a first EA that is memory mapped to an RA of the remote memory. The local processor/RADM logic initiates a RADM operation that moves a copy of the data directly from/to the first remote memory by completing the RADM operation using the network interface cards (NICs) of the source and destination processing nodes, determined by accessing a data center for the node IDs of remote memory.

    摘要翻译: 分布式数据处理系统在并行作业中执行多个任务,包括本地节点上的第一本地任务和在远程节点上执行的至少一个任务,具有映射到以下的一个或多个的实地址(RA)位置的远程存储器 由本地节点上执行的任务启动的数据移动操作的源有效地址(EA)和目标EA。 在启动数据移动操作时,远程异步数据移动(RADM)逻辑识别该操作将数据移动到/从第一个EA,该第一个EA是映射到远程存储器的RA的存储器。 本地处理器/ RADM逻辑启动RADM操作,其通过使用源和目的地处理节点的网络接口卡(NIC)完成RADM操作,直接从/向第一远程存储器移动数据的副本,其通过访问 数据中心为远程存储器的节点ID。

    System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks
    40.
    发明申请
    System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks 审中-公开
    基于硬件的消息传递接口任务的动态负载平衡的系统和方法

    公开(公告)号:US20090064166A1

    公开(公告)日:2009-03-05

    申请号:US11846141

    申请日:2007-08-28

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5083 G06F9/522

    摘要: A system and method for providing hardware based dynamic load balancing of message passing interface (MPI) tasks are provided. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. As a result, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了一种用于提供消息传递接口(MPI)任务的基于硬件的动态负载平衡的系统和方法。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢的处理器转移到一个或多个更快的处理器。