Recovering a file system to any point-in-time in the past with guaranteed structure, content consistency and integrity

    公开(公告)号:US08352523B1

    公开(公告)日:2013-01-08

    申请号:US13242916

    申请日:2011-09-23

    CPC classification number: G06F17/30088 G06F11/1435 G06F11/1471 G06F17/3023

    Abstract: A data management system (DMS) includes a continuous real-time object store that captures all real-time activities, with associated object metadata information. The DMS is capable of reintroducing any point-in-time view of data ranging from a granular object to an entire file system. A set of algorithms (for creation of a file or directory, modification of a file or directory, deletion of a file or directory, and relocation/renaming of a file or directory) are used to generate and maintain a file system history in the DMS and to ensure that a latest version of a directory always refers to a latest version of its children until the directory changed. Any point-in-time recovery is implemented using the file system history in one of various ways to provide strong individual file integrity, exact point-in-time crash consistency, and/or recovery of last version of all files in the file system.

    Method and system for data reduction
    2.
    发明授权
    Method and system for data reduction 有权
    减少数据的方法和系统

    公开(公告)号:US08195628B2

    公开(公告)日:2012-06-05

    申请号:US12901824

    申请日:2010-10-11

    CPC classification number: G06F17/3015

    Abstract: A “forward” delta data management technique uses a “sparse” index associated with a delta file to achieve both delta management efficiency and to eliminate read latency while accessing history data. The invention may be implemented advantageously in a data management system that provides real-time data services to data sources associated with a set of application host servers. A host driver embedded in an application server connects an application and its data to a cluster. The host driver captures real-time data transactions, preferably in the form of an event journal that is provided to the data management system. In particular, the driver functions to translate traditional file/database/block I/O into a continuous, application-aware, output data stream. A given application-aware data stream is processed through a multi-stage data reduction process to produce a compact data representation from which an “any point-in-time” reconstruction of the original data can be made.

    Abstract translation: “前进”增量数据管理技术使用与增量文件相关联的“稀疏”索引来实现增量管理效率,并在访问历史数据时消除读延迟。 本发明可以有利地在向与一组应用主机服务器相关联的数据源提供实时数据服务的数据管理系统中实现。 嵌入在应用程序服务器中的主机驱动程序将应用程序及其数据连接到集群。 主机驱动程序最好以提供给数据管理系统的事件日志的形式捕获实时数据事务。 特别地,驱动程序用于将传统的文件/数据库/块I / O转换成连续的,应用感知的输出数据流。 给定的应用感知数据流通过多级数据缩减处理进行处理,以产生紧凑的数据表示,可以从该数据表示形成原始数据的“任何时间点”重建。

    Method for erasure coding data across a plurality of data stores in a network
    3.
    发明授权
    Method for erasure coding data across a plurality of data stores in a network 有权
    用于在网络中跨多个数据存储器擦除数据的方法

    公开(公告)号:US08086937B2

    公开(公告)日:2011-12-27

    申请号:US12695385

    申请日:2010-01-28

    CPC classification number: H04L1/0057

    Abstract: An efficient method to apply an erasure encoding and decoding scheme across dispersed data stores that receive constant updates. A data store is a persistent memory for storing a data block. Such data stores include, without limitation, a group of disks, a group of disk arrays, or the like. An encoding process applies a sequencing method to assign a sequence number to each data and checksum block as they are modified and updated onto their data stores. The method preferably uses the sequence number to identify data set consistency. The sequencing method allows for self-healing of each individual data store, and it maintains data consistency and correctness within a data block and among a group of data blocks. The inventive technique can be applied on many forms of distributed persistent data stores to provide failure resiliency and to maintain data consistency and correctness.

    Abstract translation: 一种在接收不断更新的分散数据存储中应用擦除编码和解码方案的有效方法。 数据存储器是用于存储数据块的持久存储器。 这样的数据存储包括但不限于一组磁盘,一组磁盘阵列等。 编码过程应用排序方法,将序列号分配给每个数据和校验和块,因为它们被修改和更新到其数据存储上。 该方法优选使用序列号来识别数据集一致性。 排序方法允许每个单独数据存储的自我修复,并且它保持数据块内和数据块组之间的数据一致性和正确性。 本发明的技术可以应用于许多形式的分布式持久数据存储,以提供故障弹性并维持数据一致性和正确性。

    MANAGEMENT INTERFACE FOR A SYSTEM THAT PROVIDES AUTOMATED, REAL-TIME, CONTINUOUS DATA PROTECTION
    4.
    发明申请
    MANAGEMENT INTERFACE FOR A SYSTEM THAT PROVIDES AUTOMATED, REAL-TIME, CONTINUOUS DATA PROTECTION 有权
    提供自动化,实时,连续数据保护的系统的管理界面

    公开(公告)号:US20110252432A1

    公开(公告)日:2011-10-13

    申请号:US12901888

    申请日:2010-10-11

    Abstract: A data management system that protects data into a continuous object store includes a management interface having a time control. The time control allows an administrator to specify a “past” time, such as a single point or range. When the time control is set to a single point, a hierarchical display of data appears on a display exactly as the data existed in the system at that moment in the past. The time control enables the management interface to operate within a history mode in which the display provides a visual representation of a “virtual” point in time in the past during which the data management system has been operative to provide the data protection service.

    Abstract translation: 将数据保护为连续对象存储的数据管理系统包括具有时间控制的管理接口。 时间控制允许管理员指定“过去”时间,例如单个点或范围。 当将时间控制设置为单个点时,数据的分层显示与过去那一刻的系统中存在的数据完全相同。 时间控制使得管理接口能够在历史模式下操作,其中显示器提供过去在数据管理系统已经可操作以提供数据保护服务的“虚拟”时间点的视觉表示。

    Method of creating hierarchical indices for a distributed object system
    5.
    发明授权
    Method of creating hierarchical indices for a distributed object system 有权
    为分布式对象系统创建分层索引的方法

    公开(公告)号:US08429198B1

    公开(公告)日:2013-04-23

    申请号:US13489751

    申请日:2012-06-06

    Abstract: A data management system or “DMS” provides data services to data sources associated with a set of application host servers. The DMS typically comprises one or more regions, with each region having one or more clusters. A given cluster has one or more nodes that share storage. When providing continuous data protection and data distribution, the DMS nodes create distributed active object storage to provide the necessary real-time data management services. The distributed object store can be built above raw storage devices, a traditional file system, a special purpose file system, a clustered file system, and a database. The DMS active object store provides an indexing service to the active objects. In an illustrative embodiment, any object property that has a given attribute is indexed and, as a result, the attribute becomes searchable. The DMS provides hierarchical distributed indexing using index trees to facilitate searching.

    Abstract translation: 数据管理系统或“DMS”向与一组应用主机服务器相关联的数据源提供数据服务。 DMS通常包括一个或多个区域,每个区域具有一个或多个簇。 给定的集群具有一个或多个共享存储的节点。 当提供连续的数据保护和数据分发时,DMS节点创建分布式活动对象存储,以提供必要的实时数据管理服务。 可以在原始存储设备,传统文件系统,专用文件系统,群集文件系统和数据库之上构建分布式对象存储。 DMS活动对象存储为活动对象提供索引服务。 在说明性实施例中,具有给定属性的任何对象属性被索引,并且因此,该属性变为可搜索的。 DMS使用索引树提供分层分布式索引,以便于搜索。

    METHOD AND SYSTEM FOR VIRTUAL ON-DEMAND RECOVERY

    公开(公告)号:US20120266019A1

    公开(公告)日:2012-10-18

    申请号:US13534435

    申请日:2012-06-27

    Abstract: A data management system (“DMS”) provides an automated, continuous, real-time, substantially no downtime data protection service to one or more data sources. A host driver embedded in an application server captures real-time data transactions, preferably in the form of an event journal. The driver functions to translate traditional file/database/block I/O and the like into a continuous, application-aware, output data stream. The host driver includes an event processor that can perform a recovery operation to an entire data source or a subset of the data source using former point-in-time data in the DMS. The recovery operation may have two phases. First, the structure of the host data in primary storage is recovered to the intended recovering point-in-time. Thereafter, the actual data itself is recovered. The event processor enables such data recovery in an on-demand manner, by allowing recovery to happen simultaneously while an application accesses and updates the recovering data.

    Recovering a file system to any point-in-time in the past with guaranteed structure, content consistency and integrity
    7.
    发明授权
    Recovering a file system to any point-in-time in the past with guaranteed structure, content consistency and integrity 有权
    将文件系统恢复到过去任何时间点,保证结构,内容一致性和完整性

    公开(公告)号:US08131723B2

    公开(公告)日:2012-03-06

    申请号:US12059505

    申请日:2008-03-31

    CPC classification number: G06F17/30088 G06F11/1435 G06F11/1471 G06F17/3023

    Abstract: A data management system (DMS) includes a continuous real-time object store that captures all real-time activities, with associated object metadata information. The DMS is capable of reintroducing any point-in-time view of data ranging from a granular object to an entire file system. A set of algorithms (for creation of a file or directory, modification of a file or directory, deletion of a file or directory, and relocation/renaming of a file or directory) are used to generate and maintain a file system history in the DMS and to ensure that a latest version of a directory always refers to a latest version of its children until the directory changed. Any point-in-time recovery is implemented using the file system history in one of various ways to provide strong individual file integrity, exact point-in-time crash consistency, and/or recovery of last version of all files in the file system.

    Abstract translation: 数据管理系统(DMS)包括一个连续的实时对象存储,它捕获所有的实时活动以及关联的对象元数据信息。 DMS能够重新引入从粒状对象到整个文件系统的数据的任何时间点视图。 使用一组算法(用于创建文件或目录,修改文件或目录,删除文件或目录以及文件或目录的重定位/重命名),以生成和维护DMS中的文件系统历史记录 并确保最新版本的目录总是引用其子版本的最新版本,直到目录更改为止。 使用文件系统历史记录以各种方式之一实现任何时间点恢复,以提供强大的单个文件完整性,准确的时间点崩溃一致性和/或文件系统中所有文件的最后版本的恢复。

    METHOD AND SYSTEM FOR DATA REDUCTION
    8.
    发明申请
    METHOD AND SYSTEM FOR DATA REDUCTION 有权
    减少数据的方法和系统

    公开(公告)号:US20110252004A1

    公开(公告)日:2011-10-13

    申请号:US12901824

    申请日:2010-10-11

    CPC classification number: G06F17/3015

    Abstract: A “forward” delta data management technique uses a “sparse” index associated with a delta file to achieve both delta management efficiency and to eliminate read latency while accessing history data. The invention may be implemented advantageously in a data management system that provides real-time data services to data sources associated with a set of application host servers. A host driver embedded in an application server connects an application and its data to a cluster. The host driver captures real-time data transactions, preferably in the form of an event journal that is provided to the data management system. In particular, the driver functions to translate traditional file/database/block I/O into a continuous, application-aware, output data stream. A given application-aware data stream is processed through a multi-stage data reduction process to produce a compact data representation from which an “any point-in-time” reconstruction of the original data can be made.

    Abstract translation: “前进”增量数据管理技术使用与增量文件相关联的“稀疏”索引来实现增量管理效率,并在访问历史数据时消除读延迟。 本发明可以有利地在向与一组应用主机服务器相关联的数据源提供实时数据服务的数据管理系统中实现。 嵌入在应用程序服务器中的主机驱动程序将应用程序及其数据连接到集群。 主机驱动程序最好以提供给数据管理系统的事件日志的形式捕获实时数据事务。 特别地,驱动程序用于将传统的文件/数据库/块I / O转换成连续的,应用感知的输出数据流。 给定的应用感知数据流通过多级数据缩减处理进行处理,以产生紧凑的数据表示,可以从该数据表示形成原始数据的“任何时间点”重建。

    Method of creating hierarchical indices for a distributed object system
    9.
    发明授权
    Method of creating hierarchical indices for a distributed object system 有权
    为分布式对象系统创建分层索引的方法

    公开(公告)号:US07979441B2

    公开(公告)日:2011-07-12

    申请号:US12691616

    申请日:2010-01-21

    Abstract: A data management system or “DMS” provides data services to data sources associated with a set of application host servers. The data management system typically comprises one or more regions, with each region having one or more clusters. A given cluster has one or more nodes that share storage. When providing continuous data protection and data distribution, the DMS nodes create distributed object storage to provide the necessary real-time data management services. The objects created by the DMS nodes are so-called active objects. The distributed object store can be built above raw storage devices, a traditional file system, a special purpose file system, a clustered file system, a database, and so on. According to the present invention, the DMS active object store provides an indexing service to the active objects. In an illustrative embodiment, any object property that has a given attribute is indexed and, as a result, the attribute becomes searchable. The DMS provides hierarchical distributed indexing using index trees to facilitate searching in a highly efficient manner.

    Abstract translation: 数据管理系统或“DMS”向与一组应用主机服务器相关联的数据源提供数据服务。 数据管理系统通常包括一个或多个区域,每个区域具有一个或多个簇。 给定的集群具有一个或多个共享存储的节点。 当提供连续的数据保护和数据分发时,DMS节点创建分布式对象存储以提供必要的实时数据管理服务。 由DMS节点创建的对象是所谓的活动对象。 分布式对象存储可以在原始存储设备,传统文件系统,专用文件系统,集群文件系统,数据库等之上构建。 根据本发明,DMS活动对象存储向活动对象提供索引服务。 在说明性实施例中,具有给定属性的任何对象属性被索引,并且因此,该属性变为可搜索的。 DMS使用索引树提供分层分布式索引,以便以高效的方式进行搜索。

Patent Agency Ranking