Efficient protocol for retransmit logic in reliable zero copy message transport
    1.
    发明授权
    Efficient protocol for retransmit logic in reliable zero copy message transport 失效
    可靠的零复制消息传输中的重传逻辑的高效协议

    公开(公告)号:US06735620B1

    公开(公告)日:2004-05-11

    申请号:US09619054

    申请日:2000-07-18

    IPC分类号: G06F15167

    CPC分类号: G06F15/17

    摘要: In a transmission protocol in which a user running an application in an address space in one data processing system wishes to transmit a data packet to another address space in another data processing system by means of direct memory access directly from a sending buffer to a receiving buffer with no copy, a mechanism is provided for minimizing the need for retransmission and for insuring proper entry into the target data processing system address space. In particular, when the first system does not receive an acknowledgment from the receiver, a special data packet with a retransmit flag bit set is sent to the second system. When this system receives the data packet with the retransmit flag bit set the second system responds either by sending a new acknowledgment or by sending a request for retransmission. No transmission back to the first system occurs, however before such a request is made and in fact the receiving system does not send this retransmission request without insuring that its receipt would be appropriate. In particular, the second system, before requesting retransmission, checks to assure that tag association is still valid so that an adapter at the second system is still capable of matching tags in data packet headers with appropriate real address memory locations within address spaces belonging to the second receiving data processing system. In this manner needless retransmission of packets does not occur and retransmission occurs only when receipt of the data packet is appropriate.

    摘要翻译: 在一种传输协议中,其中在一个数据处理系统中的地址空间中运行应用的用户希望通过直接从发送缓冲器到接收缓冲器的直接存储器访问将数据分组发送到另一个数据处理系统中的另一个地址空间 没有复制,提供了用于最小化对重传的需要并确保正确进入目标数据处理系统地址空间的机制。 特别地,当第一系统没有从接收机接收到确认时,将具有重传标志位的特殊数据分组发送到第二系统。 当该系统接收到重传标志位设置的数据分组时,第二系统通过发送新的确认或通过发送重传请求来进行响应。 然而,在进行这种请求之前,不会发送回到第一系统,并且实际上接收系统不发送该重传请求而不确保其接收是合适的。 特别地,第二系统在请求重传之前检查以确保标签关联仍然有效,使得第二系统处的适配器仍然能够将属于数据包头部的地址空间内的适当的真实地址存储器位置的数据包头中的标签进行匹配 第二接收数据处理系统。 以这种方式,不会发生分组的不必要的重传,只有当数据分组的接收是适当的时才重发。

    Mechanisms for efficient message passing with copy avoidance in a distributed system using advanced network devices
    2.
    发明授权
    Mechanisms for efficient message passing with copy avoidance in a distributed system using advanced network devices 有权
    使用高级网络设备的分布式系统中有效的消息传递与避免复制的机制

    公开(公告)号:US07089289B1

    公开(公告)日:2006-08-08

    申请号:US09619051

    申请日:2000-07-18

    CPC分类号: G06F13/28

    摘要: An efficient mechanism for sending messages without the use of intermediate copies (i.e. without the staging of data) is provided. In particular an interface specification which allows use users of a transport protocol is defined so as to lend itself to efficient implementations. The interface specification is a complete and robust set of user functions usable within systems desiring reliable and efficient zero copy transport protocols. Two methods are provided to accomplish the implementation of an efficient zero copy protocol. The first method is especially useful in systems where the network device has limited capabilities in terms of hardware, message fragmentation and message reassembly. An additional RDRAM memory allows data to reside in an adapter while handshake operations take place between an adapter and a node so as to specify the final destination of the data. The second method takes advantage of network devices with advanced features which are exploited for maximum efficiency.

    摘要翻译: 提供了一种用于在不使用中间副本(即,不进行数据分段)的情况下发送消息的有效机制。 特别地,允许使用传输协议的用户的接口规范被定义为使其能够有效地实现。 接口规范是一个完整和强大的用户功能集合,可在系统中使用,可靠和高效的零拷贝传输协议。 提供了两种方法来实现有效的零拷贝协议。 第一种方法在系统中特别有用,其中网络设备在硬件,消息分段和消息重组方面具有有限的能力。 另外的RDRAM内存允许数据驻留在适配器中,而握手操作会在适配器和节点之间进行,以便指定数据的最终目的地。 第二种方法利用具有高效能的网络设备,以最大限度地提高效率。

    Mechanisms for efficient message passing with copy avoidance in a distributed system
    3.
    发明授权
    Mechanisms for efficient message passing with copy avoidance in a distributed system 有权
    在分布式系统中有效的消息传递与避免复制的机制

    公开(公告)号:US06799200B1

    公开(公告)日:2004-09-28

    申请号:US09619053

    申请日:2000-07-18

    IPC分类号: G06F15167

    CPC分类号: G06F12/1081

    摘要: An efficient mechanism for sending messages without the use of intermediate copies (i.e. without the staging of data) is provided. In particular an interface specification which allows use users of a transport protocol is defined so as to lend itself to efficient implementations. The interface specification is a complete and robust set of user functions usable within systems desiring reliable and efficient zero copy transport protocols. Two methods are provided to accomplish the implementation of an efficient zero copy protocol. The first method is especially useful in systems where the network device has limited capabilities in terms of hardware, message fragmentation and message reassembly. An additional RDRAM memory allows data to reside in an adapter while handshake operations take place between an adapter and a node so as to specify the final destination of the data. The second method takes advantage of network devices with advanced features which are exploited for maximum efficiency.

    摘要翻译: 提供了一种用于在不使用中间副本(即,不进行数据分段)的情况下发送消息的有效机制。 特别地,允许使用传输协议的用户的接口规范被定义为使其能够有效地实现。 接口规范是一个完整和强大的用户功能集合,可在系统中使用,可靠和高效的零拷贝传输协议。 提供了两种方法来实现有效的零拷贝协议。 第一种方法在系统中特别有用,其中网络设备在硬件,消息分段和消息重组方面具有有限的能力。 另外的RDRAM内存允许数据驻留在适配器中,而握手操作会在适配器和节点之间进行,以便指定数据的最终目的地。 第二种方法利用具有高效能的网络设备,以最大限度地提高效率。

    Hardware interface between a switch adapter and a communications
subsystem in a data processing system
    4.
    发明授权
    Hardware interface between a switch adapter and a communications subsystem in a data processing system 失效
    交换适配器与数据处理系统中的通信子系统之间的硬件接口

    公开(公告)号:US06111894A

    公开(公告)日:2000-08-29

    申请号:US920084

    申请日:1997-08-26

    IPC分类号: H04L29/06 G06F3/00

    摘要: Method, apparatus and program product for communicating from a node to a communications device. A Hardware Abstraction Layer (HAL) provides functions which can be called from user space in a node to access the communications device. An instance of HAL is created in the node. Device specific characteristics from the communications device and a pointer pointing to HAL functions for accessing the communications device are obtained by HAL. HAL then opens multiple ports on the communications device using the functions pointed to by the pointer, and messages are sent between the node and the communications device. The messages thus sent are optimized with respect to the communications device as determined by the obtained device specific characteristics. Multiple processes and protocol stacks may be associated with each port in a single instance of HAL. A further embodiment provides that multiple virtual ports may be associated with a port, with a multiple protocol stacks associated with each virtual port. A further embodiment provides that multiple communications devices may be associated with a single instance of HAL.

    摘要翻译: 用于从节点到通信设备进行通信的方法,装置和程序产品。 硬件抽象层(HAL)提供可从节点中的用户空间调用以访问通信设备的功能。 在节点中创建HAL的一个实例。 来自通信设备的设备特定特征和指向HAL功能的指针用于访问通信设备,由HAL获得。 然后,HAL使用指针指向的功能在通信设备上打开多个端口,并且在节点和通信设备之间发送消息。 如此发送的消息相对于通过所获得的设备特定特性确定的通信设备进行了优化。 多个进程和协议栈可能与HAL的单个实例中的每个端口相关联。 另一实施例提供了多个虚拟端口可以与端口相关联,其中多个协议栈与每个虚拟端口相关联。 另一实施例提供多个通信设备可以与HAL的单个实例相关联。

    Signaling communication events in a computer network
    5.
    发明授权
    Signaling communication events in a computer network 失效
    在计算机网络中进行信令通信事件

    公开(公告)号:US6070189A

    公开(公告)日:2000-05-30

    申请号:US921757

    申请日:1997-08-26

    IPC分类号: G06F9/46 G06F15/173 G06F13/00

    CPC分类号: G06F9/542 G06F15/17375

    摘要: A method, apparatus and program product for detecting a communication event in a distributed parallel data processing system in which a message is sent from an origin to a target. A low-level application programming interface (LAPI) is provided which has an operation for associating a counter with a communication event to be detected. The LAPI increments the counter upon the occurrence of the communication event. The number in the counter is monitored, and when the number increases, the event is detected. A completion counter in the origin is associated with the completion of a message being sent from the origin to the target. When the message is completed, LAPI increments the completion counter such that monitoring the completion counter detects the completion of the message. The completion counter may be used to insure that a first message has been sent from the origin to the target and completed before a second message is sent.

    摘要翻译: 一种用于检测分布式并行数据处理系统中的通信事件的方法,装置和程序产品,其中消息从原点发送到目标。 提供了一种低级应用编程接口(LAPI),其具有将计数器与要检测的通信事件相关联的操作。 LAPI在通信事件发生时增加计数器。 监视计数器中的数字,当数量增加时,检测到事件。 原点的完成计数器与从原点发送到目标的消息的完成相关联。 当消息完成时,LAPI会增加完成计数器,以便监视完成计数器检测到消息的完成。 完成计数器可用于确保第一消息已经从原点发送到目标并且在发送第二消息之前完成。

    Method and apparatus for efficient communications using active messages
    6.
    发明授权
    Method and apparatus for efficient communications using active messages 失效
    用于使用活动消息进行高效通信的方法和装置

    公开(公告)号:US6038604A

    公开(公告)日:2000-03-14

    申请号:US918816

    申请日:1997-08-26

    摘要: A method, apparatus and program product for message communication in a distributed parallel data processing system. A user message is sent from a sender to a receiver. The user message contains user data and a pointer to a header handler routine. The header handler routine includes a first pointer to a target user buffer and a second pointer to a completion routine. When the user message is received, a low level application program interface (LAPI) is informed which invokes the header handler routines which returns the first and second pointers. LAPI then transfers the user data to the user buffer indicated by the header handler routine, and invokes the completion routine indicated by the header handler routine to complete the transfer of the user message to the receiver.

    摘要翻译: 一种用于分布式并行数据处理系统中消息通信的方法,装置和程序产品。 用户消息从发送方发送到接收方。 用户消息包含用户数据和指向头处理程序例程的指针。 报头处理程序例程包括指向目标用户缓冲区的第一指针和指向完成例程的第二指针。 当接收到用户消息时,通知低级应用程序接口(LAPI),调用返回第一和第二指针的报头处理程序例程。 然后,LAPI将用户数据传送到由报头处理程序指示的用户缓冲器,并调用由报头处理程序指示的完成例程,以完成将用户消息传送到接收器。

    Interrupt handling using simultaneous multi-threading
    7.
    发明授权
    Interrupt handling using simultaneous multi-threading 失效
    中断处理使用同时多线程

    公开(公告)号:US07996593B2

    公开(公告)日:2011-08-09

    申请号:US12371865

    申请日:2009-02-16

    IPC分类号: G06F9/48

    摘要: Disclosed are a method, information processing system, and computer readable medium for managing interrupts. The method includes placing at least one physical processor of an information processing system in a simultaneous multi-threading mode. At least a first logical processor and a second logical processor associated with the at least one physical processor are partitioned. The first logical processor is assigned to manage interrupts and the second logical processor is assigned to dispatch runnable user threads.

    摘要翻译: 公开了一种用于管理中断的方法,信息处理系统和计算机可读介质。 该方法包括将信息处理系统的至少一个物理处理器置于同时多线程模式中。 至少与至少一个物理处理器相关联的第一逻辑处理器和第二逻辑处理器被分区。 分配第一个逻辑处理器来管理中断,第二个逻辑处理器被分配给调度可运行的用户线程。

    EFFICIENT PIPELINING OF RDMA FOR COMMUNICATIONS
    8.
    发明申请
    EFFICIENT PIPELINING OF RDMA FOR COMMUNICATIONS 审中-公开
    RDMA通信的有效管道

    公开(公告)号:US20110078410A1

    公开(公告)日:2011-03-31

    申请号:US11457921

    申请日:2006-07-17

    IPC分类号: G06F12/00 G06F15/76 G06F9/02

    CPC分类号: G06F15/17375

    摘要: Disclosed are a method of and system for multiple party communications in a processing system including multiple processing subsystems. Each of the processing subsystems includes a central processing unit and one or more network adapters for connecting said each processing subsystem to the other processing subsystems. A multitude of nodes are established or created, and each of these nodes is associated with one of the processing subsystems. A first aspect of the invention involves pipelined communication using RDMA among three nodes, where the first node breaks up a large communication into multiple parts and sends these parts one after the other to the second node using RDMA, and the second node in turn absorbs and forwards each of these parts to a third node before all parts of the communication arrive from the first node.

    摘要翻译: 公开了一种包括多个处理子系统的处理系统中的多方通信的方法和系统。 每个处理子系统包括中央处理单元和用于将所述每个处理子系统连接到其他处理子系统的一个或多个网络适配器。 建立或创建多个节点,并且这些节点中的每一个都与处理子系统之一相关联。 本发明的第一方面涉及在三个节点之间使用RDMA的流水线通信,其中第一节点将大型通信分解成多个部分,并且使用RDMA将这些部分一个接一个地发送到第二节点,并且第二节点依次吸收和 在通信的所有部分从第一节点到达之前,将这些部分中的每一个转发到第三节点。

    INTERRUPT HANDLING USING SIMULTANEOUS MULTI-THREADING
    9.
    发明申请
    INTERRUPT HANDLING USING SIMULTANEOUS MULTI-THREADING 失效
    使用同时多线程的中断处理

    公开(公告)号:US20080104296A1

    公开(公告)日:2008-05-01

    申请号:US11553229

    申请日:2006-10-26

    IPC分类号: G06F9/46 G06F13/24

    摘要: Disclosed are a method, information processing system, and computer readable medium for managing interrupts. The method includes placing at least one physical processor of an information processing system in a simultaneous multi-threading mode. At least a first logical processor and a second logical processor associated with the at least one physical processor are partitioned. The first logical processor is assigned to manage interrupts and the second logical processor is assigned to dispatch runnable user threads.

    摘要翻译: 公开了一种用于管理中断的方法,信息处理系统和计算机可读介质。 该方法包括将信息处理系统的至少一个物理处理器置于同时多线程模式中。 至少与至少一个物理处理器相关联的第一逻辑处理器和第二逻辑处理器被分区。 分配第一个逻辑处理器来管理中断,第二个逻辑处理器被分配给调度可运行的用户线程。