Method to optimize network data flows within a constrained system
    1.
    Granted patent
    Method to optimize network data flows within a constrained system (in force)

    Publication No.: US09558148B2

    Publication date: 2017-01-31

    Application No.: US14266241

    Filing date: 2014-04-30

    Abstract: Methods, apparatus, and software for optimizing network data flows within constrained systems. The methods enable data to be transferred between PCIe cards in multi-socket server platforms, each platform including a local socket having an InfiniBand (IB) HCA and a remote socket. Data to be transmitted outbound from a platform is transferred from a PCIe card to the platform's IB HCA via a proxied datapath. Data received at a platform may be delivered by a direct PCIe peer-to-peer (P2P) transfer if the destination PCIe card is installed in the local socket, or via a proxied datapath if the destination PCIe card is installed in a remote socket. Outbound transfers from a PCIe card in a local socket to the platform's IB HCA may selectively use either a proxied datapath for larger data transfers or a direct P2P datapath for smaller data transfers. The software is configured to support each of local-local, remote-local, local-remote, and remote-remote data transfers in a manner that is transparent to the software applications generating and receiving the data.
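
The path selection described in this abstract can be sketched as a small decision function. This is only an illustration under stated assumptions: the names `Socket`, `Path`, `choose_datapath` and the cutoff `P2P_MAX_BYTES` are hypothetical, and the abstract gives no numeric threshold separating "larger" from "smaller" transfers.

```python
from enum import Enum


class Socket(Enum):
    LOCAL = "local"    # socket that hosts the IB HCA
    REMOTE = "remote"  # the other socket of the platform


class Path(Enum):
    DIRECT_P2P = "direct PCIe P2P"
    PROXIED = "proxied"


# Hypothetical cutoff: the abstract only distinguishes "larger" and "smaller".
P2P_MAX_BYTES = 256 * 1024


def choose_datapath(outbound: bool, card_socket: Socket, size_bytes: int,
                    p2p_max_bytes: int = P2P_MAX_BYTES) -> Path:
    """Pick a datapath between a PCIe card and the platform's IB HCA."""
    if outbound:
        # Card -> HCA: proxied by default; a card in the local socket may
        # use direct P2P for small transfers.
        if card_socket is Socket.LOCAL and size_bytes <= p2p_max_bytes:
            return Path.DIRECT_P2P
        return Path.PROXIED
    # HCA -> card: direct P2P only when the destination card is local.
    if card_socket is Socket.LOCAL:
        return Path.DIRECT_P2P
    return Path.PROXIED
```

For example, `choose_datapath(False, Socket.REMOTE, 4096)` selects the proxied path, because the destination card does not share a socket with the HCA.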

    METHOD TO OPTIMIZE NETWORK DATA FLOWS WITHIN A CONSTRAINED SYSTEM
    2.
    Patent application
    METHOD TO OPTIMIZE NETWORK DATA FLOWS WITHIN A CONSTRAINED SYSTEM (in force)

    Publication No.: US20150317280A1

    Publication date: 2015-11-05

    Application No.: US14266241

    Filing date: 2014-04-30

    Abstract: Methods, apparatus, and software for optimizing network data flows within constrained systems. The methods enable data to be transferred between PCIe cards in multi-socket server platforms, each platform including a local socket having an InfiniBand (IB) HCA and a remote socket. Data to be transmitted outbound from a platform is transferred from a PCIe card to the platform's IB HCA via a proxied datapath. Data received at a platform may be delivered by a direct PCIe peer-to-peer (P2P) transfer if the destination PCIe card is installed in the local socket, or via a proxied datapath if the destination PCIe card is installed in a remote socket. Outbound transfers from a PCIe card in a local socket to the platform's IB HCA may selectively use either a proxied datapath for larger data transfers or a direct P2P datapath for smaller data transfers. The software is configured to support each of local-local, remote-local, local-remote, and remote-remote data transfers in a manner that is transparent to the software applications generating and receiving the data.

    Direct I/O access for system co-processors
    3.
    Granted patent
    Direct I/O access for system co-processors (in force)

    Publication No.: US08914556B2

    Publication date: 2014-12-16

    Application No.: US13997601

    Filing date: 2011-09-30

    IPC classification: G06F13/28 G06F13/14 G06F13/16

    CPC classification: G06F13/16 G06F13/14 G06F13/28

    Abstract: Embodiments of the invention describe systems, apparatuses and methods that enable sharing Remote Direct Memory Access (RDMA) device hardware between a host and a peripheral device including a CPU and memory complex (alternatively referred to herein as a processor add-in card). Embodiments of the invention utilize interconnect hardware such as Peripheral Component Interconnect Express (PCIe) hardware for peer-to-peer data transfers between processor add-in cards and RDMA devices. A host system may include modules or logic to map memory and registers to and/or from the RDMA device, thereby enabling I/O to be performed directly to and from user-mode applications on the processor add-in card, concurrently with host system I/O operations.

    DIRECT I/O ACCESS FOR SYSTEM CO-PROCESSORS
    4.
    Patent application
    DIRECT I/O ACCESS FOR SYSTEM CO-PROCESSORS (in force)

    Publication No.: US20130275631A1

    Publication date: 2013-10-17

    Application No.: US13997601

    Filing date: 2011-09-30

    IPC classification: G06F13/16

    CPC classification: G06F13/16 G06F13/14 G06F13/28

    Abstract: Embodiments of the invention describe systems, apparatuses and methods that enable sharing Remote Direct Memory Access (RDMA) device hardware between a host and a peripheral device including a CPU and memory complex (alternatively referred to herein as a processor add-in card). Embodiments of the invention utilize interconnect hardware such as Peripheral Component Interconnect Express (PCIe) hardware for peer-to-peer data transfers between processor add-in cards and RDMA devices. A host system may include modules or logic to map memory and registers to and/or from the RDMA device, thereby enabling I/O to be performed directly to and from user-mode applications on the processor add-in card, concurrently with host system I/O operations.

    Each of a plurality of descriptors having a completion indicator and being stored in a cache memory of an input/output processor
    6.
    Granted patent
    Each of a plurality of descriptors having a completion indicator and being stored in a cache memory of an input/output processor (no longer in force)

    Publication No.: US06675238B1

    Publication date: 2004-01-06

    Application No.: US09390041

    Filing date: 1999-09-03

    IPC classification: G06F3/00

    CPC classification: G06F9/52 G06F12/0866

    Abstract: An apparatus and method for efficient input/output processing without the use of interrupts is described. The apparatus includes a plurality of descriptors, where each descriptor includes a completion indicator and data associated with an input/output request. The plurality of descriptors includes a head descriptor and a tail descriptor. The apparatus further includes a plurality of address holders associated with an input/output processor, and each of the plurality of address holders is uniquely affiliated with one of the plurality of descriptors. The apparatus further includes a polling mechanism for evaluating the completion indicator of the head descriptor and a completion processor for interfacing with the head descriptor. Finally, the apparatus includes connectors between the tail descriptor and address holder and between the input/output processor and the head descriptor.
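
The interrupt-free scheme in this abstract (requests posted at the tail, a poller checking the completion indicator of the head) might look roughly like the sketch below. The class and method names are hypothetical, the I/O processor is simulated by `device_complete_next`, and the address holders and cache placement named in the patent are not modeled.

```python
from collections import deque
from dataclasses import dataclass
from typing import Optional


@dataclass
class Descriptor:
    request: str                  # data associated with the I/O request
    done: bool = False            # completion indicator
    result: Optional[str] = None


class DescriptorQueue:
    """FIFO of descriptors: new requests join the tail, polling reads the head."""

    def __init__(self) -> None:
        self._queue: deque = deque()

    def post(self, request: str) -> Descriptor:
        descriptor = Descriptor(request)
        self._queue.append(descriptor)  # becomes the new tail descriptor
        return descriptor

    def device_complete_next(self) -> bool:
        """Simulated I/O processor: finish the oldest pending descriptor."""
        for descriptor in self._queue:
            if not descriptor.done:
                descriptor.done = True
                descriptor.result = "ok:" + descriptor.request
                return True
        return False

    def poll(self) -> Optional[Descriptor]:
        """Check only the head descriptor; no interrupt is involved."""
        if self._queue and self._queue[0].done:
            return self._queue.popleft()
        return None
```

A caller would invoke `poll()` from its own loop and hand any returned descriptor to completion processing; a `None` result simply means the head request is still in flight.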

    PEER-TO-PEER INTERRUPT SIGNALING BETWEEN DEVICES COUPLED VIA INTERCONNECTS
    7.
    Patent application
    PEER-TO-PEER INTERRUPT SIGNALING BETWEEN DEVICES COUPLED VIA INTERCONNECTS (in force)

    Publication No.: US20140250202A1

    Publication date: 2014-09-04

    Application No.: US13997250

    Filing date: 2012-05-29

    IPC classification: H04L29/08

    Abstract: Methods and apparatus to provide peer-to-peer interrupt signaling between devices coupled via one or more interconnects are described. In one embodiment, a NIC (network interface card, such as a Remote Direct Memory Access (RDMA) capable NIC) transfers data directly into or out of the memory of a peer device that is coupled to the NIC via one or more interconnects, bypassing a host computing/processing unit and/or main system memory. Other embodiments are also disclosed.

    Method and system for communication between memory regions
    8.
    Granted patent
    Method and system for communication between memory regions (in force)

    Publication No.: US07624156B1

    Publication date: 2009-11-24

    Application No.: US09576038

    Filing date: 2000-05-23

    IPC classification: G06F15/167

    CPC classification: H04L49/10 H04L49/358

    Abstract: A method and system are provided for transferring data in a networked system between a local memory in a local system and a remote memory in a remote system. An RDMA request is received and a first buffer region is associated with a first transfer operation. The system determines whether a size of the first buffer region exceeds a maximum transfer size of the networked system. Portions of a second buffer region may be associated with the first transfer operation based on the determination of the size of the first buffer region. The system subsequently performs the first transfer operation.
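
One possible reading of this abstract is a planner that splits buffer regions larger than the maximum transfer size and, when a region leaves room in a transfer operation, tops the operation up with a portion of the next region. The sketch below follows that reading only; `plan_transfers`, the `(name, size)` region format and the `(name, offset, length)` segment format are assumptions, not the patent's claimed procedure.

```python
def plan_transfers(regions, max_transfer):
    """Group buffer regions into transfer operations of at most max_transfer bytes.

    regions: list of (name, size_in_bytes) tuples, in request order.
    Returns a list of operations; each operation is a list of
    (name, offset, length) segments.
    """
    if max_transfer <= 0:
        raise ValueError("max_transfer must be positive")
    operations = []
    current = []           # segments of the operation being filled
    room = max_transfer    # bytes still available in the current operation
    for name, size in regions:
        offset = 0
        while offset < size:
            take = min(size - offset, room)
            current.append((name, offset, take))
            offset += take
            room -= take
            if room == 0:  # operation is full: close it and start a new one
                operations.append(current)
                current, room = [], max_transfer
    if current:
        operations.append(current)
    return operations
```

With a 6-byte first region, a 10-byte second region and an 8-byte limit, the first operation carries all of the first region plus the first 2 bytes of the second, and the second operation carries the remaining 8 bytes.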

    Method and systems for flow control of transmissions over channel-based switched fabric connections
    9.
    Granted patent
    Method and systems for flow control of transmissions over channel-based switched fabric connections (in force)

    Publication No.: US06735174B1

    Publication date: 2004-05-11

    Application No.: US09537396

    Filing date: 2000-03-29

    IPC classification: H04J1/16

    Abstract: Methods and systems for flow control over channel-based switched fabric connections between a first side and a second side. At least one posted receive buffer is stored in a receive buffer queue at the first side. A number of credits is incremented based on the at least one posted receive buffer. The second side is notified of the number of credits. A number of send credits is incremented at the second side based on the number of credits. A message is sent from the second side to the first side if the number of send credits is greater than or equal to two, or if the number of send credits is equal to one and a second number of credits is greater than or equal to one. The second number of credits is based on at least one second posted receive buffer at the second side. This prevents communication of messages between the first side and the second side from deadlocking.
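
The send condition stated in this abstract can be written out directly. In the sketch below, `Endpoint` and its fields are hypothetical names, and attaching the sender's own pending credits to each outgoing message is an assumption made to show why the last send credit is held back; the abstract does not say how credits are carried.

```python
from dataclasses import dataclass


@dataclass
class Endpoint:
    send_credits: int = 0         # receive buffers the peer has advertised to us
    unadvertised_recv: int = 0    # our posted receive buffers not yet advertised

    def post_receive(self, count: int = 1) -> None:
        self.unadvertised_recv += count

    def grant_from_peer(self, credits: int) -> None:
        self.send_credits += credits

    def can_send(self) -> bool:
        # Rule from the abstract: send with two or more send credits, or with
        # exactly one send credit if we also have credits of our own to report.
        return self.send_credits >= 2 or (
            self.send_credits == 1 and self.unadvertised_recv >= 1
        )

    def send(self) -> int:
        """Consume one send credit; return the credits attached to the message."""
        if not self.can_send():
            raise RuntimeError("send would risk deadlock: no usable credit")
        self.send_credits -= 1
        attached, self.unadvertised_recv = self.unadvertised_recv, 0
        return attached
```

Under this assumption, the final credit is only ever spent on a message that also returns credits to the other side, so neither side can exhaust its credits while holding receive buffers the peer has not been told about.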

    Method and system for communicating between memory regions
    10.
    Granted patent
    Method and system for communicating between memory regions (in force)

    Publication No.: US08099471B2

    Publication date: 2012-01-17

    Application No.: US12542546

    Filing date: 2009-08-17

    IPC classification: G06F15/167

    CPC classification: H04L49/10 H04L49/358

    Abstract: A method and system are provided for transferring data in a networked system between a local memory in a local system and a remote memory in a remote system. An RDMA request is received and a first buffer region is associated with a first transfer operation. The system determines whether a size of the first buffer region exceeds a maximum transfer size of the networked system. Portions of a second buffer region may be associated with the first transfer operation based on the determination of the size of the first buffer region. The system subsequently performs the first transfer operation.
