Techniques for maintaining high availability of networked systems
    1.
    Patent application
    Techniques for maintaining high availability of networked systems (status: in force)

    Publication No.: US20050071470A1

    Publication date: 2005-03-31

    Application No.: US10399286

    Filing date: 2001-10-15

    Abstract: This computer-implemented software invention supervises networked system resources with the goal of maximizing service availability, providing on-demand and uninterrupted access to service, and minimizing the downtime due to failures. It is a cluster-wide solution that coordinates the states and activities of resources, assigns availability roles, implements recovery from failures, and implements overall system policy. To do this, it maintains a system model of the system's physical and logical configuration and models the resources using managed objects that provide an extensive representation of the states, roles, and relationships of the system's resources.

    MINIMIZING DATA LOSS IN ASYNCHRONOUS REPLICATION SOLUTION USING DISTRIBUTED REDUNDANCY
    2.
    Patent application
    MINIMIZING DATA LOSS IN ASYNCHRONOUS REPLICATION SOLUTION USING DISTRIBUTED REDUNDANCY (status: in force)

    Publication No.: US20090327805A1

    Publication date: 2009-12-31

    Application No.: US12146602

    Filing date: 2008-06-26

    IPC classification: G06F17/30 G06F11/07

    Abstract: Architecture that reduces data loss resulting from failover in an asynchronous log shipping deployment by leveraging mid-tier and frontend servers to fill in lost data. In an asynchronous log shipping operation, a replication component asynchronously replicates messaging data to a backend server in accordance with one or more replication operations, which can be updates to databases on the backend server. These databases can include messaging data, such as email address books, mailboxes, etc. A history component maintains a history of replication operations on a frontend server. In the event of a lossy failover, a replay component is used for replaying the replication operations from the history to the backend server.
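The history and replay components described in this abstract can be illustrated with a minimal sketch. It assumes each replication operation carries a monotonically increasing sequence number and that the backend can report the last sequence number it applied; neither detail is stated in the abstract. The names `ReplicationHistory`, `record`, and `replay_after` are hypothetical.

```python
from collections import deque
from typing import Deque, List, Tuple

# Hypothetical representation: (sequence number, payload).
Operation = Tuple[int, str]


class ReplicationHistory:
    """Bounded history of replication operations kept on a frontend server."""

    def __init__(self, capacity: int = 1000) -> None:
        # Oldest entries are dropped once the capacity is reached.
        self._ops: Deque[Operation] = deque(maxlen=capacity)

    def record(self, seq: int, payload: str) -> None:
        """Remember an operation that was sent to the backend."""
        self._ops.append((seq, payload))

    def replay_after(self, last_applied_seq: int) -> List[Operation]:
        """Return the operations the new backend has not applied yet."""
        return [op for op in self._ops if op[0] > last_applied_seq]


history = ReplicationHistory()
for i in range(1, 6):
    history.record(i, f"op{i}")

# After a lossy failover, assume the new backend only has operations 1..3.
to_replay = history.replay_after(3)
```

Here `to_replay` holds operations 4 and 5, which the frontend would resubmit to the backend that took over.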

    Techniques for leveraging replication to provide rolling point in time backup with simplified restoration through distributed transactional re-creation
    3.
    Granted patent
    Techniques for leveraging replication to provide rolling point in time backup with simplified restoration through distributed transactional re-creation (status: in force)

    Publication No.: US08903774B2

    Publication date: 2014-12-02

    Application No.: US13425876

    Filing date: 2012-03-21

    IPC classification: G06F17/30 G06F11/14

    Abstract: Techniques to leverage replication to provide rolling point in time backup are described. Some embodiments are directed to techniques to provide rolling point in time backup with simplified restoration through distributed transactional re-creation. In one embodiment, for example, a technique may comprise creating a plurality of availability copies of a primary set of data; designating at least one of the plurality of availability copies as a backup copy; creating a log file that indicates changes to the primary set of data; updating the plurality of availability copies from the log file in near real time, without updating the backup copy; and restoring at least one of: the primary set of data and an availability copy using the backup copy and content resubmitted from a content contributor when an error occurs in at least one of: the primary set of data and an availability copy. Other embodiments are described and claimed.

    LOST WRITE PROTECTION VIA STREAM-BASED REPLICATION
    5.
    Patent application
    LOST WRITE PROTECTION VIA STREAM-BASED REPLICATION (status: under examination, published)

    Publication No.: US20090319525A1

    Publication date: 2009-12-24

    Application No.: US12144613

    Filing date: 2008-06-23

    IPC classification: G06F7/02

    CPC classification: G06F11/0751 G06F11/0727

    Abstract: Architecture for detecting lost writes using timestamps. During a replication process, lost writes in data replicated from a stream can be detected by noting discrepancies between the timestamps of data in the replica and timestamps associated with the corresponding data from the source in the original data store. A lost write either in the original data store or in the replica data store can be inferred by comparing these timestamps with the timestamps in a number of other replica data stores. Additionally, check entries can be added to the replicas by the original data store to allow expanded comparison between recently modified data and the source data in the original data store. The check entries can be added to the replication journal after a time delay, thereby increasing effectiveness of the check by decreasing the likelihood that caching in the hardware will defeat the test.
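The timestamp comparison described in this abstract can be sketched as follows. This is only an illustration: it assumes one timestamp per data item and a simple vote among the other replicas, which the abstract does not specify. The function name `locate_lost_write` and its return labels are hypothetical.

```python
from collections import Counter
from typing import Iterable


def locate_lost_write(source_ts: int, replica_ts: int,
                      other_replica_ts: Iterable[int]) -> str:
    """Guess which store lost a write for a single data item.

    Returns "none" when source and replica agree, "source" or "replica"
    when the other replicas side against that store, and "undetermined"
    when the other replicas do not break the tie.
    """
    if source_ts == replica_ts:
        return "none"
    votes = Counter(other_replica_ts)
    if votes[source_ts] > votes[replica_ts]:
        # Most other replicas match the source, so the replica is stale.
        return "replica"
    if votes[replica_ts] > votes[source_ts]:
        # Most other replicas match the replica, so the source is stale.
        return "source"
    return "undetermined"


verdict = locate_lost_write(source_ts=7, replica_ts=5, other_replica_ts=[7, 7])
```

In this call the two other replicas agree with the source, so `verdict` points at the replica as the store that lost the write.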

    Active/passive database management across clusters
    7.
    Granted patent
    Active/passive database management across clusters (status: in force)

    Publication No.: US08984332B2

    Publication date: 2015-03-17

    Application No.: US13491633

    Filing date: 2012-06-08

    IPC classification: G06F11/00

    Abstract: A primary active manager can manage a first copy of a database in a first computer system cluster according to a set of management rules that provide for an active copy and one or more passive copies of the database at a given time. The primary active manager can also manage a second copy of the database in a second computer system cluster according to the rules. The rules can allow the first copy of the database or the second copy of the database to be the active copy if one or more criteria in the rules are met for that active copy. The first copy can be designated as the active copy and the second copy can be designated as a passive copy. A failure of the first copy can be detected, and in response, the second copy can be automatically designated as the active copy.
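The designation of active and passive copies described in this abstract can be sketched under simple assumptions: each copy has a health flag, and the management rules are reduced to a single predicate. `DatabaseCopy`, `designate_active`, and `meets_criteria` are hypothetical names, not taken from the patent.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class DatabaseCopy:
    cluster: str
    healthy: bool = True
    active: bool = False


def designate_active(
    copies: List[DatabaseCopy],
    meets_criteria: Callable[[DatabaseCopy], bool],
) -> Optional[DatabaseCopy]:
    """Keep a healthy active copy; otherwise promote an eligible passive one."""
    current = next((c for c in copies if c.active), None)
    if current is not None and current.healthy:
        return current
    if current is not None:
        # The active copy has failed, so demote it before promoting another.
        current.active = False
    for candidate in copies:
        if candidate.healthy and meets_criteria(candidate):
            candidate.active = True
            return candidate
    return None


first = DatabaseCopy("cluster-1", active=True)
second = DatabaseCopy("cluster-2")
first.healthy = False  # simulate a detected failure of the first copy
promoted = designate_active([first, second], lambda copy: True)
```

After the simulated failure, the copy in `cluster-2` becomes the active copy and the copy in `cluster-1` is no longer marked active.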

    Computing asynchronous transaction log replication progress based on file change notifications
    8.
    Patent application
    Computing asynchronous transaction log replication progress based on file change notifications (status: under examination, published)

    Publication No.: US20070162516A1

    Publication date: 2007-07-12

    Application No.: US11324003

    Filing date: 2005-12-30

    IPC classification: G06F17/30

    CPC classification: G06F16/273

    Abstract: Asynchronous transaction log replication from a source database to a destination database utilizing file change notifications for a source log directory generated by an operating system of a source computing machine and received by a destination computing machine. In response to the received file change notification, a source transaction log in the source log directory is copied to a destination transaction log in a destination log directory of the destination machine. After the copy is completed, transactions contained in the destination transaction log are applied to the destination database.

    Split brain protection in computer clusters
    9.
    Granted patent
    Split brain protection in computer clusters (status: in force)

    Publication No.: US09146705B2

    Publication date: 2015-09-29

    Application No.: US13441922

    Filing date: 2012-04-09

    Abstract: A computer cluster can be divided into a plurality of failure scopes and a voting constraint can be enforced. The voting constraint can allow a portion of the cluster to provide the service if a majority of health votes from cluster members is obtained by that portion. A loss of connectivity between a first failure scope, which has a majority of cluster members in the cluster, and one or more other failure scopes in the cluster can be detected. The loss of connectivity can be such that the first failure scope does not have connectivity to a member in any other failure scope in the cluster. In response to detecting the loss of connectivity, a split brain situation in the cluster can be automatically protected against by preventing the first failure scope from providing the service.
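The two conditions described in this abstract (a majority of votes, plus connectivity to at least one member in another failure scope) can be sketched as a single check. This is a simplified reading of the abstract, not the claimed method: it assumes one vote per member, and the names `may_provide_service`, `reachable`, and `scope_of` are hypothetical.

```python
from typing import Dict, Set


def may_provide_service(reachable: Set[str], scope_of: Dict[str, str]) -> bool:
    """Decide whether a connected portion of the cluster may serve.

    reachable: members this portion can currently reach, including itself.
    scope_of: maps every cluster member to its failure scope.
    """
    total_members = len(scope_of)
    has_majority = len(reachable) > total_members // 2

    scopes_reached = {scope_of[member] for member in reachable}
    all_scopes = set(scope_of.values())
    # A portion confined to one failure scope is blocked, even with a
    # majority, unless the cluster only has a single failure scope.
    spans_scopes = len(scopes_reached) > 1 or len(all_scopes) == 1

    return has_majority and spans_scopes


scope_of = {"a": "dc1", "b": "dc1", "c": "dc1", "d": "dc2", "e": "dc2"}
isolated_majority = may_provide_service({"a", "b", "c"}, scope_of)
connected_majority = may_provide_service({"a", "b", "d"}, scope_of)
```

In this example `isolated_majority` is `False` because the three `dc1` members hold a majority but cannot reach any member in `dc2`, while `connected_majority` is `True`.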

    Managing cluster split-brain in datacenter service site failover
    10.
    Granted patent
    Managing cluster split-brain in datacenter service site failover (status: in force)

    Publication No.: US08001413B2

    Publication date: 2011-08-16

    Application No.: US12115536

    Filing date: 2008-05-05

    IPC classification: G06F11/00

    Abstract: A central controlling service for datacenter activation/deactivation control in a cluster deployment to assist in preventing a split-brain scenario. The central controlling service provides a central point of control in the datacenter for application servers to periodically query as to whether to go offline, online, or normal. Redundancy of the central service facilitates detection of datacenter failure by the redundant services interacting to resolve the state of control information. This control information is then used to answer the server queries. On startup from a datacenter failure, a single instance of the central service queries other redundant instance(s) to determine if the single instance is starting up from a datacenter-wide failure or from operations other than total datacenter failure. If the failure is datacenter-wide, a central service protocol assists in resolving to the single service keeping the associated datacenter servers offline; otherwise, the server queries are answered to go online.