Interconnect operation indicating acceptability of partial data delivery
    61.
    发明授权
    Interconnect operation indicating acceptability of partial data delivery 有权
    互连操作指示部分数据传送的可接受性

    公开(公告)号:US08117401B2

    公开(公告)日:2012-02-14

    申请号:US12024467

    申请日:2008-02-01

    IPC分类号: G06F12/08

    摘要: According to at least one embodiment, a method of data processing in a multiprocessor data processing system includes a requesting processing unit initiating an interconnect operation including a memory access request that indicates an acceptability of a variable amount of data to service the interconnect request for data. In response to snooping the memory access request on an interconnect, a snooper selects an amount of data to supply to the requesting processing unit and transmits the selected amount of data to the requesting processing unit. The requesting processing unit receives the selected amount of data and utilizes at least some of the selected amount of data to service a processor request.

    摘要翻译: 根据至少一个实施例,多处理器数据处理系统中的数据处理方法包括请求处理单元发起包括存储器访问请求的互连操作,所述存储器访问请求指示可变数量的数据的可接受性来维护数据的互连请求。 响应于在互连上窥探存储器访问请求,窥探者选择要提供给请求处理单元的数据量,并将所选择的数据量发送到请求处理单元。 请求处理单元接收所选择的数据量并利用所选择的数量的数据中的至少一些来处理处理器请求。

    Packet coalescing in virtual channels of a data processing system in a multi-tiered full-graph interconnect architecture
    62.
    发明授权
    Packet coalescing in virtual channels of a data processing system in a multi-tiered full-graph interconnect architecture 失效
    分组聚合在多层全图互连架构中的数据处理系统的虚拟通道中

    公开(公告)号:US08108545B2

    公开(公告)日:2012-01-31

    申请号:US11845227

    申请日:2007-08-27

    CPC分类号: H04L67/10 H04L69/40

    摘要: A mechanism is provided for packet coalescing in virtual channels of a data processing system. A first processor bundles original data into a data packet to be transmitted to a destination processor, the original data comprising payload data and overhead data. The first processor transmits the data packet to a second processor along a path to the destination processor. The second processor determines if the second processor has additional payload data destined for the same destination processor. Responsive to the second processor having the additional payload data, the second processor unbundles the data packet, adds the additional payload data to the payload data, and rebundles the payload data along with the additional payload data and the overhead data into a rebundled data packet. Then the second processor transmits the rebundled data packet to at least one other processor along the path to the destination processor.

    摘要翻译: 提供了一种用于数据处理系统的虚拟通道中的分组聚合的机制。 第一处理器将原始数据捆绑到要发送到目的处理器的数据分组中,原始数据包括有效载荷数据和开销数据。 第一处理器沿着到目的地处理器的路径向第二处理器发送数据分组。 第二处理器确定第二处理器是否具有去往相同目的地处理器的附加有效载荷数据。 响应于具有附加有效载荷数据的第二处理器,第二处理器解除数据分组的捆绑,将附加的有效载荷数据添加到有效载荷数据,并将有效载荷数据连同额外的有效载荷数据和开销数据重新分组成重新绑定的数据分组。 然后,第二处理器将重新发送的数据分组传送到至少一个其他处理器,沿着到达目的地处理器的路径。

    Performing dynamic request routing based on broadcast queue depths
    63.
    发明授权
    Performing dynamic request routing based on broadcast queue depths 失效
    基于广播队列深度执行动态请求路由

    公开(公告)号:US08077602B2

    公开(公告)日:2011-12-13

    申请号:US12024514

    申请日:2008-02-01

    CPC分类号: G06F15/17

    摘要: Mechanisms for performing dynamic request routing based on broadcast depth queue information are provided. Each processor chip in the system may use a synchronized heartbeat signal it generates to provide queue depth information to each of the other processor chips in the system. The queue depth information identifies a number of requests or amount of data in each of the queues of a processor chip that originated the heartbeat signal. The queue depth information from each of the processor chips in the system may be used by the processor chips in determining optimal routing paths for data from a source processor chip to a destination processor chip. As a result, the congestion of data for processing at each of the processor chips along each possible routing path may be taken into account when selecting to which processor chip to forward data.

    摘要翻译: 提供了基于广播深度队列信息执行动态请求路由的机制。 系统中的每个处理器芯片可以使用其产生的同步心跳信号来向系统中的每个其他处理器芯片提供队列深度信息。 队列深度信息识别发起心跳信号的处理器芯片的每个队列中的数量的请求或数据量。 系统中每个处理器芯片的队列深度信息可被处理器芯片用于确定用于从源处理器芯片到目的地处理器芯片的数据的最佳路由路径。 结果,当选择哪个处理器芯片来转发数据时,可以考虑在每个可能的路由路径处的每个处理器芯片处理数据的拥塞。

    Collective Acceleration Unit Tree Flow Control and Retransmit
    64.
    发明申请
    Collective Acceleration Unit Tree Flow Control and Retransmit 失效
    集体加速单位树流量控制和重新发布

    公开(公告)号:US20110173258A1

    公开(公告)日:2011-07-14

    申请号:US12640208

    申请日:2009-12-17

    IPC分类号: G06F15/16

    CPC分类号: G06F15/16

    摘要: A mechanism is provided for collective acceleration unit tree flow control forms a logical tree (sub-network) among those processors and transfers “collective” packets on this tree. The system supports many collective trees, and each collective acceleration unit (CAU) includes resources to support a subset of the trees. Each CAU has limited buffer space, and the connection between two CAUs is not completely reliable. Therefore, to address the challenge of collective packets traversing on the tree without colliding with each other for buffer space and guaranteeing the end-to-end packet delivery, each CAU in the system effectively flow controls the packets, detects packet loss, and retransmits lost packets.

    摘要翻译: 提供了一种用于集体加速单元树流控制的机制,形成这些处理器之间的逻辑树(子网),并在该树上传输“集合”分组。 系统支持许多集体树,每个集体加速单元(CAU)包括支持一部分树的资源。 每个CAU具有有限的缓冲区空间,两个CAU之间的连接不是完全可靠的。 因此,为了解决在树上遍历的集合分组的挑战,不会相互冲突,保证端到端的分组传递,系统中的每个CAU都有效地流量控制分组,检测分组丢失,重传丢失 数据包

    Dynamic selection of a memory access size
    65.
    发明授权
    Dynamic selection of a memory access size 有权
    动态选择内存访问大小

    公开(公告)号:US07958309B2

    公开(公告)日:2011-06-07

    申请号:US12024476

    申请日:2008-02-01

    IPC分类号: G06F12/04

    摘要: A method of data processing in a processing unit supported by a memory hierarchy includes the processing unit performing a plurality of memory accesses to the memory hierarchy. The plurality of memory accesses includes one or more memory accesses targeting a full cache line of data. The processing unit monitors utilization of data accessed by the plurality of memory accesses, and based upon the utilization of the data, dynamically alters a memory access mode of operation so that a subsequent storage-modifying memory access targets less than a full cache line of data.

    摘要翻译: 由存储器层级支持的处理单元中的数据处理方法包括执行对存储器层次的多个存储器访问的处理单元。 多个存储器访问包括针对全缓存数据行的一个或多个存储器访问。 处理单元监视由多个存储器访问访问的数据的利用,并且基于数据的利用,动态地改变存储器访问操作模式,使得后续存储修改存储器访问目标小于完整高速缓存行数据 。

    System and method for dynamically supporting indirect routing within a multi-tiered full-graph interconnect architecture
    66.
    发明授权
    System and method for dynamically supporting indirect routing within a multi-tiered full-graph interconnect architecture 有权
    在多层全图互连体系结构中动态支持间接路由的系统和方法

    公开(公告)号:US07840703B2

    公开(公告)日:2010-11-23

    申请号:US11845213

    申请日:2007-08-27

    IPC分类号: G06F15/16

    摘要: A method, computer program product, and system are provided for dynamically routing data through the data processing system. Data is received at a first processor that is to be transmitted to a destination processor. The data that is received includes address information. A lookup is performed in routing table data structures based on the address information to identify candidate paths through which the data is routed to the destination processor. A determination is made as to whether any of the candidate paths are not able to be used to route the data to the destination processor based on a setting of at least one identifier. A path is selected from the identified candidate paths for routing of the data based on a setting of the at least one identifier. Then, the data is transmitted from the first processor along the selected path toward the destination processor.

    摘要翻译: 提供了一种通过数据处理系统动态路由数据的方法,计算机程序产品和系统。 在要发送到目的处理器的第一处理器处接收数据。 接收的数据包括地址信息。 基于地址信息在路由表数据结构中执行查找,以识别数据被路由到目的地处理器的候选路径。 基于至少一个标识符的设置,确定是否有任何候选路径不能用于将数据路由到目的地处理器。 基于所述至少一个标识符的设置,从所识别的候选路径中选择路径以路由数据。 然后,数据从所选择的路径从第一处理器发送到目的处理器。

    USER LEVEL MESSAGE BROADCAST MECHANISM IN DISTRIBUTED COMPUTING ENVIRONMENT
    67.
    发明申请
    USER LEVEL MESSAGE BROADCAST MECHANISM IN DISTRIBUTED COMPUTING ENVIRONMENT 失效
    分布式计算环境中的用户级信息广播机制

    公开(公告)号:US20100269027A1

    公开(公告)日:2010-10-21

    申请号:US12424837

    申请日:2009-04-16

    摘要: A data processing system is programmed to provide a method for enabling user-level one-to-all message/messaging (OTAM) broadcast within a distributed parallel computing environment in which multiple threads of a single job execute on different processing nodes across a network. The method comprises: generating one or more messages for transmission to at least one other processing node accessible via a network, where the messages are generated by/for a first thread executing at the data processing system (first processing node) and the other processing node executes one or more second threads of a same parallel job as the first thread. An OTAM broadcast is transmitting via a host fabric interface (HFI) of the data processing system as a one-to-all broadcast on the network, whereby the messages are transmitted to a cluster of processing nodes across the network that execute threads of the same parallel job as the first thread.

    摘要翻译: 数据处理系统被编程为提供一种在分布式并行计算环境中实现用户级一对一消息/消息传递(OTAM)广播的方法,其中单个作业的多个线程在跨越网络的不同处理节点上执行。 该方法包括:生成一个或多个消息以便传输到经由网络可访问的至少一个其他处理节点,其中消息由数据处理系统(第一处理节点)执行的第一个线程生成,另一个处理节点 执行与第一线程相同的并行作业的一个或多个第二线程。 OTAM广播通过数据处理系统的主机结构接口(HFI)作为网络上的一对一广播进行发送,由此将消息传送到跨网络的处理节点群集,该群集执行相同的线程 并行作为第一个线程。

    System and method for providing a high-speed message passing interface for barrier operations in a multi-tiered full-graph interconnect architecture
    68.
    发明授权
    System and method for providing a high-speed message passing interface for barrier operations in a multi-tiered full-graph interconnect architecture 有权
    用于在多层全图互连架构中提供用于屏障操作的高速消息传递接口的系统和方法

    公开(公告)号:US07809970B2

    公开(公告)日:2010-10-05

    申请号:US11845225

    申请日:2007-08-27

    IPC分类号: G06F1/12 G06F1/32

    CPC分类号: G06F9/5088

    摘要: A method, computer program product, and system are provided performing a Message Passing Interface (MPI) job. A first processor chip receives a set of arrival signals from a set of processor chips executing tasks of the MPI job in the data processing system. The arrival signals identify when a processor chip executes a synchronization operation for synchronizing the tasks for the MPI job. Responsive to receiving the set of arrival signals from the set of processor chips, the first processor chip identifies a fastest processor chip of the set of processor chips whose arrival signal arrived first. An operation of the fastest processor chip is modified based on the identification of the fastest processor chip. The set of processor chips comprises processor chips that are in one of a same processor book or a different processor book of the data processing system.

    摘要翻译: 提供了一种执行消息传递接口(MPI)作业的方法,计算机程序产品和系统。 第一处理器芯片从数据处理系统中执行MPI作业的任务的一组处理器芯片接收一组到达信号。 到达信号确定处理器芯片何时执行用于同步MPI作业任务的同步操作。 响应于从该组处理器芯片接收到一组到达信号,第一处理器芯片标识出到达信号到达第一个处理器芯片组中最快的处理器芯片。 基于最快处理器芯片的识别,修改最快处理器芯片的操作。 该组处理器芯片包括位于数据处理系统的相同处理器簿或不同处理器簿之一中的处理器芯片。

    Dynamic routing based on information of not responded active source requests quantity received in broadcast heartbeat signal and stored in local data structure for other processor chips
    69.
    发明授权
    Dynamic routing based on information of not responded active source requests quantity received in broadcast heartbeat signal and stored in local data structure for other processor chips 有权
    基于在广播心跳信号中接收到的未响应的活动源请求数量的信息进行动态路由并存储在其他处理器芯片的本地数据结构中

    公开(公告)号:US07779148B2

    公开(公告)日:2010-08-17

    申请号:US12024553

    申请日:2008-02-01

    IPC分类号: G06F15/163

    CPC分类号: H04L45/122 H04L45/06

    摘要: A mechanism for performing dynamic request routing based on broadcast source request information is provided. Each processor chip in the system may use a synchronized heartbeat signal it generates to provide source request information to each of the other processor chips in the system. The source request information identifies the number of active source requests sent by the processor chip that originated the heartbeat signal. The source request information from each of the processor chips in the system may be used by the processor chips in determining optimal routing paths for data from a source processor chip to a destination processor chip. As a result, the congestion of data for processing at each of the processor chips along each possible routing path may be taken into account when selecting to which processor chip to forward data.

    摘要翻译: 提供了一种基于广播源请求信息进行动态请求路由的机制。 系统中的每个处理器芯片可以使用其产生的同步心跳信号来向系统中的每个其他处理器芯片提供源请求信息。 源请求信息标识由发起心跳信号的处理器芯片发送的活动源请求的数量。 来自系统中的每个处理器芯片的源请求信息可被处理器芯片用于确定用于从源处理器芯片到目的地处理器芯片的数据的最佳路由路径。 结果,当选择哪个处理器芯片来转发数据时,可以考虑在每个可能的路由路径处的每个处理器芯片处理数据的拥塞。