ISOLATING A PCI HOST BRIDGE IN RESPONSE TO AN ERROR EVENT
    61.
    Invention application (legal status: in force)

    Publication No.: US20150019903A1

    Publication date: 2015-01-15

    Application No.: US13940757

    Filing date: 2013-07-12

    Abstract: Instead of disabling PCI communication between system resources in a host computing device and I/O devices when a PCI Host Bridge (PHB) is unable to function, the host computing device may include a redundant PCI communication path for maintaining communication between the system resources and the I/O devices after a first PHB experiences an unrecoverable error. In one embodiment, the redundant PCI communication path includes a second PHB that is maintained in a standby state so long as the first PHB is functioning normally. However, once the first PHB experiences an unrecoverable error, the second PHB is changed to the master state and assumes the responsibility for maintaining communication between the system resources and the I/O devices.
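The standby-to-master handover described in this abstract can be sketched as a small state machine. This is a minimal illustration only: the names `RedundantPath`, `phb0`, `phb1`, and the `ISOLATED` state are assumptions made for the example and do not come from the patent.

```python
from enum import Enum


class State(Enum):
    MASTER = "master"
    STANDBY = "standby"
    ISOLATED = "isolated"  # assumed label for a bridge taken out of service


class RedundantPath:
    """Two host bridges; at most one carries traffic at a time."""

    def __init__(self):
        # Hypothetical bridge names; the first starts as master.
        self.states = {"phb0": State.MASTER, "phb1": State.STANDBY}

    def active(self):
        # Name of the bridge currently in the master state.
        return next(n for n, s in self.states.items() if s is State.MASTER)

    def report_unrecoverable_error(self, name):
        # Isolate the failed bridge, then promote a standby if the master failed.
        was_master = self.states[name] is State.MASTER
        self.states[name] = State.ISOLATED
        if was_master:
            for other, state in self.states.items():
                if state is State.STANDBY:
                    self.states[other] = State.MASTER
                    break


path = RedundantPath()
path.report_unrecoverable_error("phb0")
print(path.active())  # phb1
```

An error reported on the standby bridge only isolates it and leaves the master untouched, which keeps the promotion logic in one place.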


    Performing Failover in a Redundancy Group

    Publication No.: US20140369183A1

    Publication date: 2014-12-18

    Application No.: US14472420

    Filing date: 2014-08-29

    CPC classification number: H04L41/0654 G06F11/00 G06F11/2002

    Abstract: A method, system, and computer program product for performing failover in a redundancy group, where the redundancy group comprises a plurality of routers including an active router and a standby router, the failover being characterized by zero black hole or significantly reduced black hole conditions versus a conventional failover system. The method comprises the steps of: receiving an incoming message at a switch; sending a request of identification to the plurality of routers to identify a current active router, where the current active router represents a virtual router of the redundancy group; and in response to receiving a reply containing an identification from the current active router within a predetermined time, forwarding the incoming message to the current active router.
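The query-then-forward step in this abstract might look like the sketch below. It simulates the identification request with a hypothetical probe callback per router that returns whether the router is active and how long its reply took; the function name `forward` and the probe shape are assumptions for illustration.

```python
def forward(message, routers, timeout):
    """Pick the router that should receive `message`, or None.

    `routers` maps a router name to a hypothetical probe function returning
    (is_active, reply_delay_seconds). A reply that arrives after `timeout`
    is treated as if it never arrived.
    """
    for name, probe in routers.items():
        is_active, delay = probe()
        if is_active and delay <= timeout:
            return name  # the current active router answered in time
    return None  # nobody identified itself in time; do not forward blindly


routers = {
    "router-a": lambda: (False, 0.01),  # standby
    "router-b": lambda: (True, 0.02),   # current active router
}
print(forward("packet", routers, timeout=0.05))  # router-b
```

Returning `None` instead of a stale router name is the point of the sketch: the switch does not send traffic to a router that has stopped answering.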

    Disable Restart Setting for AMF Configuration Components
    63.
    Invention application (legal status: in force)

    Publication No.: US20140331079A1

    Publication date: 2014-11-06

    Application No.: US13875114

    Filing date: 2013-05-01

    Inventor: Ali Kanso

    CPC classification number: G06F11/2002 G06F11/008

    Abstract: A method and a system are provided for determining an AMF configuration of a highly available system with respect to whether to failover or restart a component when the component fails. The AMF configuration specifies at least two service-units containing components that represent resources, and a set of service-instances representing workload incurred by provision of services using the resources. The method identifies a failover duration and a restart duration for each component in a service-unit; and determines a failover outage and a restart outage for each service-instance impacted by a failure of a given component, based on the failover duration and the restart duration of each component in the service-unit. The method further determines whether to failover or to restart the given component if the given component fails, based on the failover outage and the restart outage of each service-instance impacted by the failure of the given component.
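A rough sketch of the failover-versus-restart decision follows. The abstract does not say how per-component durations are combined into per-service-instance outages, so the two aggregation rules below are assumptions chosen for illustration, as are the function and parameter names.

```python
def choose_recovery(failed, components, impacted):
    """Return "restart" or "failover" for the failed component.

    components: name -> (failover_seconds, restart_seconds)
    impacted:   service-instance -> names of the components in the service
                unit that serve it
    """
    failover_outage = {}
    restart_outage = {}
    for instance, names in impacted.items():
        # Assumption: failing over moves every component serving the
        # instance, so the outage is bounded by the slowest failover.
        failover_outage[instance] = max(components[n][0] for n in names)
        # Assumption: a restart only touches the failed component.
        restart_outage[instance] = components[failed][1]

    total_failover = sum(failover_outage.values())
    total_restart = sum(restart_outage.values())
    return "restart" if total_restart <= total_failover else "failover"


components = {"db": (2.0, 10.0), "web": (1.0, 0.5)}
impacted = {"si-1": ["db", "web"]}
print(choose_recovery("web", components, impacted))  # restart
print(choose_recovery("db", components, impacted))   # failover
```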


    PHYSICAL DOMAIN ERROR ISOLATION AND RECOVERY IN A MULTI-DOMAIN SYSTEM
    64.
    Invention application (legal status: in force)

    Publication No.: US20140310555A1

    Publication date: 2014-10-16

    Application No.: US13861917

    Filing date: 2013-04-12

    Abstract: The disclosed embodiments disclose techniques for performing physical domain error isolation and recovery in a multi-domain system, where the multi-domain system includes two or more processor chips and one or more switch chips that provide connectivity and cache-coherency support for the processor chips, and the processor chips are divided into two or more distinct domains. During operation, one of the switch chips determines a fault in the multi-domain system. The switch chip determines an originating domain that is associated with the fault, and then signals the fault and an identifier for the originating domain to its internal units, some of which perform clearing operations that clear out all traffic for the originating domain without affecting the other domains of the multi-domain system.
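The clearing operation, in which an internal unit drops traffic for the faulting domain and leaves other domains alone, can be modelled in a few lines. `SwitchUnit` and its queue of `(domain, payload)` pairs are hypothetical stand-ins for a hardware unit, not structures named in the patent.

```python
from collections import deque


class SwitchUnit:
    """Toy model of one internal unit holding in-flight traffic."""

    def __init__(self):
        self.queue = deque()

    def enqueue(self, domain, payload):
        self.queue.append((domain, payload))

    def on_fault(self, origin_domain):
        # Drop every entry belonging to the originating domain; entries of
        # other domains keep their relative order.
        self.queue = deque(e for e in self.queue if e[0] != origin_domain)


unit = SwitchUnit()
unit.enqueue(0, "read A")
unit.enqueue(1, "write B")
unit.enqueue(0, "read C")
unit.on_fault(0)
print(list(unit.queue))  # [(1, 'write B')]
```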


    Localized Fast Bulk Storage in a Multi-Node Computer System
    65.
    Invention application (legal status: in force)

    Publication No.: US20140298079A1

    Publication date: 2014-10-02

    Application No.: US13931861

    Filing date: 2013-06-29

    Inventor: Steven Dean

    Abstract: A high performance computing (HPC) system includes computing blades having a first region that includes computing circuit boards having processors for performing a computation, and a second region that includes non-volatile memory for use in performing the computation. The regions are connected by a plurality of power connectors that convey power from the computing circuit boards to the memory, and a plurality of data connectors that convey data between the first and second regions. The power and data connectors are configured redundantly so that failure of a computing circuit board, a power connector, or a data connector does not interrupt the computation. A method of performing such a computation, and a computer program product implementing the method, are also disclosed.


    Load balancing and failover of gateway devices
    66.
    Granted invention patent (legal status: in force)

    Publication No.: US08832819B2

    Publication date: 2014-09-09

    Application No.: US13689877

    Filing date: 2012-11-30

    Abstract: Methods and systems for load balancing and failover among gateway devices are disclosed. One method provides for assigning communication transaction handling to a gateway. The method includes receiving a request for a license from a computing device at a control gateway within a group of gateway devices including a plurality of gateway devices configured to support communication of cryptographically split data. The method also includes assigning communications from the computing device to one of the plurality of gateway devices based on a load balancing algorithm, and routing the communication request to the assigned gateway device.
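The abstract does not name the load balancing algorithm, so the sketch below swaps in a plain least-loaded rule as one possible choice. The function `assign_gateway` and the gateway names are hypothetical.

```python
def assign_gateway(loads):
    """Assign a new client to the gateway with the fewest assignments.

    `loads` maps a gateway name to its current number of assigned clients
    and is updated in place. Least-loaded selection is an assumption here;
    the source only says "a load balancing algorithm".
    """
    gateway = min(loads, key=loads.get)
    loads[gateway] += 1
    return gateway


loads = {"gw-a": 2, "gw-b": 0, "gw-c": 1}
print(assign_gateway(loads))  # gw-b
```

A failed gateway can be handled in the same model by removing its entry from `loads` before the next assignment.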


    STORAGE DEVICE, ERROR PROCESSING METHOD, AND COMMUNICATION SYSTEM
    67.
    Invention application (legal status: in force)

    Publication No.: US20140173364A1

    Publication date: 2014-06-19

    Application No.: US14063524

    Filing date: 2013-10-25

    Inventor: Satoru NISHITA

    Abstract: A storage device, includes: a plurality of controller modules; a bus disposed among the plurality of controller modules, the bus including a plurality of transmission paths; a detector configured to detect an error in data communication through the bus; and a connection controller configured to carry out partial fallback processing of the bus if the number of the errors has exceeded a given number.
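Threshold-triggered partial fallback can be sketched as follows. Counting errors per transmission path and never disabling the last remaining path are both assumptions made for this example; the abstract only states that fallback happens once the error count exceeds a given number.

```python
class Bus:
    """Bus with several transmission paths and per-path error counters."""

    def __init__(self, paths, threshold):
        self.active = set(paths)
        self.errors = {p: 0 for p in paths}
        self.threshold = threshold

    def record_error(self, path):
        if path not in self.active:
            return
        self.errors[path] += 1
        # Partial fallback: drop only the offending path and keep the bus
        # running on the rest (assumed: the last path is never dropped).
        if self.errors[path] > self.threshold and len(self.active) > 1:
            self.active.discard(path)


bus = Bus(["path0", "path1"], threshold=2)
for _ in range(3):
    bus.record_error("path0")
print(sorted(bus.active))  # ['path1']
```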


    LOAD BALANCING AND FAILOVER OF GATEWAY DEVICES
    68.
    Invention application (legal status: in force)

    Publication No.: US20140157042A1

    Publication date: 2014-06-05

    Application No.: US13689877

    Filing date: 2012-11-30

    Abstract: Methods and systems for load balancing and failover among gateway devices are disclosed. One method provides for assigning communication transaction handling to a gateway. The method includes receiving a request for a license from a computing device at a control gateway within a group of gateway devices including a plurality of gateway devices configured to support communication of cryptographically split data. The method also includes assigning communications from the computing device to one of the plurality of gateway devices based on a load balancing algorithm, and routing the communication request to the assigned gateway device.


    MANAGING FATE-SHARING IN SHARED-MEDIA COMMUNICATION NETWORKS
    69.
    Invention application (legal status: in force)

    Publication No.: US20140136881A1

    Publication date: 2014-05-15

    Application No.: US14154875

    Filing date: 2014-01-14

    Abstract: In one embodiment, a management device receives one or more fate-sharing reports locally generated by one or more corresponding reporting nodes in a shared-media communication network, the fate-sharing reports indicating a degree of localized fate-sharing between one or more pairs of nodes local to the corresponding reporting nodes. The management device may then determine, globally from aggregating the fate-sharing reports, one or more fate-sharing groups indicating sets of nodes having a global degree of fate-sharing within the communication network. As such, the management device may then advertise the fate-sharing groups within the communication network, wherein nodes of the communication network are configured to select a plurality of next-hops that minimizes fate-sharing between the plurality of next-hops.
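One way to picture the aggregation of local reports into global fate-sharing groups is a union-find pass over node pairs. The report format `(node_a, node_b, degree)`, the averaging of repeated reports, and the `threshold` cut-off are all assumptions for illustration; the abstract does not specify them. The sketch also assumes each report names two distinct nodes.

```python
from collections import defaultdict


def fate_sharing_groups(reports, threshold):
    """Merge nodes whose averaged pairwise fate-sharing degree is high."""
    degrees = defaultdict(list)
    for a, b, degree in reports:
        degrees[frozenset((a, b))].append(degree)

    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for pair, values in degrees.items():
        a, b = sorted(pair)
        find(a)  # register both nodes even if they are never merged
        find(b)
        if sum(values) / len(values) >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for node in parent:
        groups[find(node)].add(node)
    return sorted(sorted(g) for g in groups.values())


reports = [("n1", "n2", 0.9), ("n2", "n1", 0.7), ("n2", "n3", 0.1)]
print(fate_sharing_groups(reports, threshold=0.5))  # [['n1', 'n2'], ['n3']]
```

A node choosing several next-hops could then prefer candidates from different groups, which is the selection behaviour the abstract describes.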


    METHODS AND DEVICES FOR DETECTING SERVICE FAILURES AND MAINTAINING COMPUTING SERVICES USING A RESILIENT INTELLIGENT CLIENT COMPUTER
    70.
    Invention application (legal status: in force)

    Publication No.: US20140129873A1

    Publication date: 2014-05-08

    Application No.: US13854581

    Filing date: 2013-04-01

    Abstract: Intelligent client computing devices track and record the changes they make to data, applications, and services. Systems, devices, and computer readable media for detecting service tier failures and maintaining application services provide a resilient client architecture that allows a client application on an intelligent client to automatically detect the unavailability of server tiers or sites and re-route requests and updates to secondary sites to maintain application services at the client tier in a manner that is transparent to a user. The resilient client architecture understands the level of currentness of secondary sites in order to select the best secondary site and to automatically and transparently bring this secondary site up to date to ensure no data updates are missing from the secondary site.
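Selecting the most current secondary site and working out which client updates it still lacks can be sketched like this. Modelling "currentness" as an integer update sequence number is an assumption, as are the function name `pick_secondary` and the site names.

```python
def pick_secondary(sites, client_version):
    """Choose the reachable site with the newest data.

    sites: name -> (is_reachable, last_applied_update_number)
    client_version: number of the latest update recorded by the client
    Returns (site_name, updates_to_replay), or (None, []) if no site is up.
    """
    reachable = {name: ver for name, (up, ver) in sites.items() if up}
    if not reachable:
        return None, []
    best = max(reachable, key=reachable.get)
    # Updates the client tracked that the chosen site has not applied yet.
    missing = list(range(reachable[best] + 1, client_version + 1))
    return best, missing


sites = {
    "primary": (False, 10),  # unavailable
    "dr-east": (True, 7),
    "dr-west": (True, 9),
}
print(pick_secondary(sites, client_version=10))  # ('dr-west', [10])
```

Replaying the `missing` updates before re-routing requests is what keeps the switch transparent to the user in this model.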

