Cache collaboration in tiled processor systems
    3.
    Granted patent (status: in force)

    Publication No.: US08838915B2

    Publication date: 2014-09-16

    Application No.: US13537417

    Filing date: 2012-06-29

    Abstract: The present invention may provide a computer system including a plurality of tiles divided into multiple virtual domains. Each tile may include a router to communicate with others of said tiles, a private cache to store data, and a spill table to record pointers for data evicted from the private cache to a remote host, wherein the remote host and the respective tile are provided in the same virtual domain. The spill tables may allow for faster retrieval of previously evicted data because the home registry does not need to be referenced if requested data is listed in the spill table. Therefore, embodiments of the present invention may provide a distance-aware cache collaboration architecture without incurring extraneous overhead expenses.

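The spill-table lookup described in the abstract can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent: the `Tile` and `TiledSystem` names, the FIFO eviction policy, the "first other tile in the same domain" host choice, and the counter that stands in for a home-registry reference.

```python
from dataclasses import dataclass, field


@dataclass
class Tile:
    tile_id: int
    domain: int                      # virtual domain this tile belongs to
    capacity: int = 2                # private cache size (hypothetical)
    cache: dict = field(default_factory=dict)        # address -> data
    guest: dict = field(default_factory=dict)        # lines hosted for other tiles
    spill_table: dict = field(default_factory=dict)  # address -> host tile id


class TiledSystem:
    def __init__(self, tiles):
        self.tiles = {t.tile_id: t for t in tiles}
        self.home_registry_lookups = 0  # counts the slow path

    def _pick_host(self, tile):
        # Hypothetical policy: first other tile in the same virtual domain.
        for other in self.tiles.values():
            if other.tile_id != tile.tile_id and other.domain == tile.domain:
                return other
        return None

    def write(self, tile_id, addr, data):
        tile = self.tiles[tile_id]
        # Drop any stale spilled copy of this address.
        stale_host = tile.spill_table.pop(addr, None)
        if stale_host is not None:
            self.tiles[stale_host].guest.pop(addr, None)
        if addr not in tile.cache and len(tile.cache) >= tile.capacity:
            victim_addr = next(iter(tile.cache))      # oldest entry (FIFO)
            victim_data = tile.cache.pop(victim_addr)
            host = self._pick_host(tile)
            if host is not None:
                host.guest[victim_addr] = victim_data
                tile.spill_table[victim_addr] = host.tile_id  # record pointer
        tile.cache[addr] = data

    def read(self, tile_id, addr):
        tile = self.tiles[tile_id]
        if addr in tile.cache:
            return tile.cache[addr], "private-cache"
        host_id = tile.spill_table.get(addr)
        if host_id is not None:
            # Spill-table hit: fetch from the remote host directly.
            return self.tiles[host_id].guest[addr], "spill-table"
        self.home_registry_lookups += 1               # fall back to the registry
        return None, "home-registry"
```

In this sketch, a read that hits the spill table never increments `home_registry_lookups`, which mirrors the abstract's claim that the home registry need not be referenced for previously evicted data.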

    ROUTER PARKING IN POWER-EFFICIENT INTERCONNECT ARCHITECTURES
    4.
    Patent application (status: in force)

    Publication No.: US20140149766A1

    Publication date: 2014-05-29

    Application No.: US13994780

    Filing date: 2012-01-10

    IPC classification: G06F1/32

    Abstract: A method and apparatus for selectively parking routers used for routing traffic in mesh interconnects. Various router parking (RP) algorithms are disclosed, including an aggressive RP algorithm where a minimum number of routers are kept active to ensure adequate network connectivity between active nodes and/or intercommunicating nodes, leading to a maximum reduction in static power consumption, and a conservative RP algorithm that favors network latency considerations over static power consumption while also reducing power. An adaptive RP algorithm is also disclosed that implements aspects of the aggressive and conservative RP algorithms to balance power consumption and latency considerations in response to ongoing node utilization and associated traffic. The techniques may be implemented in internal network structures, such as for single chip computers, as well as external network structures, such as computing clusters and massively parallel computer architectures. Performance modeling has demonstrated substantial power reduction may be obtained using the router parking techniques while maintaining Quality of Service performance objectives.

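As a rough illustration of the aggressive router parking idea (keep only as many routers powered as are needed to connect the active nodes), the Python sketch below uses a simple greedy heuristic on a 2D mesh. It is not the algorithm disclosed in the application: the function names, the scan order, and the assumption that every active node sits on its own router are all hypothetical, and a greedy pass is not guaranteed to find the true minimum in general.

```python
from collections import deque


def mesh_neighbors(node, width, height):
    """Yield the routers adjacent to `node` in a width x height mesh."""
    x, y = node
    for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nx, ny = x + dx, y + dy
        if 0 <= nx < width and 0 <= ny < height:
            yield (nx, ny)


def connected(required, powered, width, height):
    """True if all `required` routers can reach each other via `powered` ones."""
    required = set(required)
    if not required:
        return True
    start = next(iter(required))
    seen = {start}
    queue = deque([start])
    while queue:  # breadth-first search over powered routers only
        for nb in mesh_neighbors(queue.popleft(), width, height):
            if nb in powered and nb not in seen:
                seen.add(nb)
                queue.append(nb)
    return required <= seen


def park_routers(width, height, active_nodes):
    """Greedily park routers while the active nodes stay connected."""
    powered = {(x, y) for x in range(width) for y in range(height)}
    parked = set()
    for router in sorted(powered - set(active_nodes)):
        trial = powered - {router}
        if connected(active_nodes, trial, width, height):
            powered = trial          # safe to park this router
            parked.add(router)
    return powered, parked
```

For example, `park_routers(3, 3, [(0, 0), (2, 2)])` leaves a single powered path between the two corner nodes and parks the remaining routers. A conservative variant in the spirit of the abstract would refuse to park a router when doing so lengthens paths beyond some latency budget.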

    MECHANISM FOR EMPLOYING AND FACILITATING DYNAMIC AND REMOTE MEMORY COLLABORATION AT COMPUTING DEVICES
    5.
    Patent application (status: under examination, published)

    Publication No.: US20130290546A1

    Publication date: 2013-10-31

    Application No.: US13977692

    Filing date: 2011-10-07

    IPC classification: H04L12/70

    Abstract: A mechanism is described for facilitating dynamic and remote memory collaboration at computing devices according to one embodiment of the invention. A method of embodiments of the invention includes dynamically classifying a computing device of a plurality of computing devices as a memory server, where the plurality of computing devices are coupled to each other over a network. The method may further include offering, by the memory server, of memory to be used by one or more of the plurality of computing devices classified as one or more memory clients, and remotely granting, by the memory server, of the memory to the one or more memory clients.

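The classify, offer, and grant steps in the abstract can be sketched in a few lines of Python. The abstract does not say how a device is classified, so the free-memory thresholds, the `Device` fields, and the `classify` and `grant` helpers below are assumptions chosen only to make the flow concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Device:
    name: str
    total_mb: int
    used_mb: int
    role: str = "unclassified"
    grants: dict = field(default_factory=dict)  # client name -> MB granted

    @property
    def free_mb(self):
        # Memory still on offer: total minus local use minus existing grants.
        return self.total_mb - self.used_mb - sum(self.grants.values())


def classify(devices, server_free_fraction=0.5, client_free_fraction=0.1):
    """Re-run periodically so roles follow current load (hypothetical policy)."""
    for dev in devices:
        free_fraction = dev.free_mb / dev.total_mb
        if free_fraction >= server_free_fraction:
            dev.role = "server"
        elif free_fraction <= client_free_fraction:
            dev.role = "client"
        else:
            dev.role = "neutral"


def grant(server, client, request_mb):
    """Grant up to `request_mb` of the server's spare memory to the client."""
    if server.role != "server":
        raise ValueError(f"{server.name} is not classified as a memory server")
    granted = min(request_mb, server.free_mb)
    if granted > 0:
        server.grants[client.name] = server.grants.get(client.name, 0) + granted
    return granted
```

Because `free_mb` accounts for outstanding grants, calling `classify` again after a large grant can move a device out of the server role, which is one simple way to read "dynamically classifying".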

    Router parking in power-efficient interconnect architectures

    Publication No.: US09619006B2

    Publication date: 2017-04-11

    Application No.: US13994780

    Filing date: 2012-01-10

    Abstract: A method and apparatus for selectively parking routers used for routing traffic in mesh interconnects. Various router parking (RP) algorithms are disclosed, including an aggressive RP algorithm where a minimum number of routers are kept active to ensure adequate network connectivity between active nodes and/or intercommunicating nodes, leading to a maximum reduction in static power consumption, and a conservative RP algorithm that favors network latency considerations over static power consumption while also reducing power. An adaptive RP algorithm is also disclosed that implements aspects of the aggressive and conservative RP algorithms to balance power consumption and latency considerations in response to ongoing node utilization and associated traffic. The techniques may be implemented in internal network structures, such as for single chip computers, as well as external network structures, such as computing clusters and massively parallel computer architectures. Performance modeling has demonstrated substantial power reduction may be obtained using the router parking techniques while maintaining Quality of Service performance objectives.

    Cache Collaboration in Tiled Processor Systems
    7.
    Patent application (status: in force)

    Publication No.: US20140006713A1

    Publication date: 2014-01-02

    Application No.: US13537417

    Filing date: 2012-06-29

    IPC classification: G06F12/08

    Abstract: The present invention may provide a computer system including a plurality of tiles divided into multiple virtual domains. Each tile may include a router to communicate with others of said tiles, a private cache to store data, and a spill table to record pointers for data evicted from the private cache to a remote host, wherein the remote host and the respective tile are provided in the same virtual domain. The spill tables may allow for faster retrieval of previously evicted data because the home registry does not need to be referenced if requested data is listed in the spill table. Therefore, embodiments of the present invention may provide a distance-aware cache collaboration architecture without incurring extraneous overhead expenses.


    PACKET PROCESSING APPROACH TO IMPROVE PERFORMANCE AND ENERGY EFFICIENCY FOR SOFTWARE ROUTERS
    8.
    Patent application (status: in force)

    Publication No.: US20140029617A1

    Publication date: 2014-01-30

    Application No.: US13559992

    Filing date: 2012-07-27

    IPC classification: H04L12/56

    Abstract: Methods, apparatus and systems for improved performance and energy efficiency of software-based routers. A software router running on a host computer system employing multiple Network Interface Controllers (NICs) maintains a routing table wherein packet flows are classified as managed flows (MFs) under which packets are received at and forwarded from the same NIC and unmanaged flows UFs under which packets are received at and forwarded from different NICs. Forwarding table data is employed by a NIC to facilitate packet identification and flow classification operations under which the NIC determines whether a received packet is an MF, UF, or an unclassified flow. Under various schemes, packet forwarding for MFs is handled by the software router architecture such that either only the packet header is copied into memory in the host or the entire packet forwarding is handled by the NIC.

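The managed/unmanaged flow distinction in the abstract can be sketched as follows. This is an illustrative Python model only: the `Flow`, `SoftwareRouter`, and `Nic` names, the longest-prefix-match routing table, and the "first packet goes to the host, later packets hit the NIC forwarding table" behaviour are assumptions, not details taken from the application.

```python
import ipaddress
from typing import NamedTuple


class Flow(NamedTuple):
    src: str
    dst: str
    proto: str
    sport: int
    dport: int


class SoftwareRouter:
    def __init__(self, routes):
        # routes: {"prefix/len": egress NIC name}; longest prefix is tried first.
        self.routes = sorted(
            ((ipaddress.ip_network(p), nic) for p, nic in routes.items()),
            key=lambda r: r[0].prefixlen,
            reverse=True,
        )

    def egress_nic(self, dst):
        addr = ipaddress.ip_address(dst)
        for net, nic in self.routes:
            if addr in net:
                return nic
        return None

    def classify(self, ingress_nic, flow):
        egress = self.egress_nic(flow.dst)
        if egress is None:
            return None                      # no route, cannot classify
        # Managed flow: received at and forwarded from the same NIC.
        return "MF" if egress == ingress_nic else "UF"


class Nic:
    def __init__(self, name, router):
        self.name = name
        self.router = router
        self.forwarding_table = {}           # Flow -> "MF" or "UF"

    def receive(self, flow):
        known = self.forwarding_table.get(flow)
        if known is not None:
            return known                     # classified on the NIC itself
        # Unclassified: hand to the software router, then cache its decision.
        decision = self.router.classify(self.name, flow)
        if decision is not None:
            self.forwarding_table[flow] = decision
        return "unclassified"
```

In this model the first packet of a flow is reported as unclassified and later packets are identified on the NIC as MF or UF, which is the point at which an MF could be forwarded without copying the full packet to host memory.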

    Packet processing approach to improve performance and energy efficiency for software routers
    9.
    Granted patent (status: in force)

    Publication No.: US09450780B2

    Publication date: 2016-09-20

    Application No.: US13559992

    Filing date: 2012-07-27

    Abstract: Methods, apparatus and systems for improved performance and energy efficiency of software-based routers. A software router running on a host computer system employing multiple Network Interface Controllers (NICs) maintains a routing table wherein packet flows are classified as managed flows (MFs) under which packets are received at and forwarded from the same NIC and unmanaged flows UFs under which packets are received at and forwarded from different NICs. Forwarding table data is employed by a NIC to facilitate packet identification and flow classification operations under which the NIC determines whether a received packet is an MF, UF, or an unclassified flow. Under various schemes, packet forwarding for MFs is handled by the software router architecture such that either only the packet header is copied into memory in the host or the entire packet forwarding is handled by the NIC.


    TECHNOLOGIES FOR NETWORK DEVICE FLOW LOOKUP MANAGEMENT
    10.
    Patent application (status: under examination, published)

    Publication No.: US20160182373A1

    Publication date: 2016-06-23

    Application No.: US14580801

    Filing date: 2014-12-23

    Abstract: Technologies for managing network flow lookups of a network device include a network controller and a target device, each communicatively coupled to the network device. The network device includes a cache for a processor of the network device and a main memory. The network device additionally includes a multi-level hash table having a first-level hash table stored in the cache of the network device and a second-level hash table stored in the main memory of the network device. The network device is configured to determine whether to store a network flow hash corresponding to a network flow indicating the target device in the first-level or second-level hash table based on a priority of the network flow provided to the network device by the network controller.

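The priority-based placement described in the abstract can be modelled with two ordinary dictionaries standing in for the cache-resident and main-memory hash tables. This is a sketch under stated assumptions: the `TwoLevelFlowTable` class, the fixed first-level capacity, the numeric priority threshold, and the `flow_hash` helper are hypothetical, and real placement in a processor cache cannot be expressed from Python.

```python
import hashlib


def flow_hash(five_tuple):
    """Stable 64-bit hex digest of a flow's identifying tuple (illustrative)."""
    return hashlib.blake2b(repr(five_tuple).encode(), digest_size=8).hexdigest()


class TwoLevelFlowTable:
    def __init__(self, first_level_capacity, priority_threshold):
        self.capacity = first_level_capacity
        self.threshold = priority_threshold
        self.first = {}    # stands in for the table kept in the processor cache
        self.second = {}   # stands in for the table kept in main memory

    def insert(self, key, target, priority):
        """Store a flow; `priority` is assumed to come from the controller."""
        self.first.pop(key, None)
        self.second.pop(key, None)
        if priority >= self.threshold and len(self.first) < self.capacity:
            self.first[key] = target
            return "first"
        self.second[key] = target
        return "second"

    def lookup(self, key):
        """Check the small first-level table before the larger second level."""
        if key in self.first:
            return self.first[key], "first"
        if key in self.second:
            return self.second[key], "second"
        return None, "miss"
```

A high-priority flow that arrives when the first level is full falls through to the second level in this sketch. An eviction policy that demotes a lower-priority entry instead would be an equally plausible design, and the abstract does not say which is used.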