Log Structured Content Addressable Deduplicating Storage

    公开(公告)号:US20100082547A1

    公开(公告)日:2010-04-01

    申请号:US12235325

    申请日:2008-09-22

    Abstract: A log structured content addressable deduplicated data storage system may be used to store deduplicated data. Data to be stored is partitioned into data segments. Each unique data segment is associated with a label. The storage system maintains a transaction log. Mutating storage operations are initiated by storing transaction records in the transaction log. Additional transaction records are stored in the log when storage operations are completed. Upon restarting an embodiment of the data storage system, the transaction records from the transaction logs are replayed to recreate the state of the data storage system. The data storage system updates file system metadata with transaction information while a storage operation associated with the file is being processed. This transaction information serves as atomically updated transaction commit points, allowing fully internally consistent snapshots of deduplicated volumes to be taken at any time.

    Log structured content addressable deduplicating storage
    12.
    发明授权
    Log structured content addressable deduplicating storage 有权
    记录结构化内容可寻址的重复数据删除存储

    公开(公告)号:US09208031B2

    公开(公告)日:2015-12-08

    申请号:US12416067

    申请日:2009-03-31

    Abstract: A log structured content addressable deduplicated data storage system may be used to store deduplicated data. Data to be stored is partitioned into data segments. Each unique data segment is associated with a label. The storage system maintains a transaction log. Mutating storage operations are initiated by storing transaction records in the transaction log. Additional transaction records are stored in the log when storage operations are completed. Upon restarting an embodiment of the data storage system, the transaction records from the transaction logs are replayed to recreate the state of the data storage system. The data storage system updates file system metadata with transaction information while a storage operation associated with the file is being processed. This transaction information serves as atomically updated transaction commit points, allowing fully internally consistent snapshots of deduplicated volumes to be taken at any time.

    Abstract translation: 可以使用日志结构化内容可寻址的重复数据删除的数据存储系统来存储重复数据删除的数据。 要存储的数据被分割成数据段。 每个唯一数据段都与一个标签相关联。 存储系统维护事务日志。 通过将事务记录存储在事务日志中来启动变量存储操作。 存储操作完成后,其他事务记录将存储在日志中。 在重新启动数据存储系统的实施例时,重播来自事务日志的事务记录以重新创建数据存储系统的状态。 数据存储系统在处理与文件相关联的存储操作的同时,用事务信息更新文件系统元数据。 此事务信息用作原子更新的事务提交点,允许在任何时间完全内部一致地执行重复数据删除卷的快照。

    DISASTER RECOVERY USING LOCAL AND CLOUD SPANNING DEDUPLICATED STORAGE SYSTEM
    14.
    发明申请
    DISASTER RECOVERY USING LOCAL AND CLOUD SPANNING DEDUPLICATED STORAGE SYSTEM 审中-公开
    使用本地和云计算复制存储系统进行灾难恢复

    公开(公告)号:US20110161723A1

    公开(公告)日:2011-06-30

    申请号:US12942988

    申请日:2010-11-09

    Abstract: A spanning storage interface facilitates the use of cloud storage services by storage clients and may perform data deduplication. The spanning storage interface may include local storage for caching data from storage clients. A disaster recovery application includes at least first and second spanning storage interfaces at first and second network locations. The second spanning storage interface is provided for at least disaster recovery operations. The second spanning storage interface includes second local storage for improving data access performance. A copy of the local cache of the first spanning storage interface is transferred to the second local storage while the first network location is operating. In the event of a disaster affecting the first network location, the second spanning storage interface can provide data access to the first network location's data with improved performance from using the copy of local cache in the second local storage.

    Abstract translation: 跨越存储接口有助于存储客户端使用云存储服务,并可执行重复数据删除。 跨越存储接口可以包括用于从存储客户端缓存数据的本地存储器。 灾难恢复应用程序在第一和第二网络位置处至少包括第一和第二跨越存储接口。 提供了第二个跨越存储接口,用于至少进行灾难恢复操作。 第二个跨越存储接口包括用于提高数据访问性能的第二本地存储。 当第一个网络位置正在运行时,第一个跨接存储接口的本地缓存的副本被传送到第二个本地存储。 在发生影响第一网络位置的灾难的情况下,第二跨越存储接口可以使用第二本地存储器中的本地高速缓存的副本来提供对具有改进性能的第一网络位置的数据的数据访问。

    CLOUD SYNTHETIC BACKUPS
    15.
    发明申请
    CLOUD SYNTHETIC BACKUPS 有权
    云合成备用

    公开(公告)号:US20110161297A1

    公开(公告)日:2011-06-30

    申请号:US12895811

    申请日:2010-09-30

    Applicant: Nitin Parab

    Inventor: Nitin Parab

    Abstract: Synthetic backups are created without accessing previous backup data or retrieving backup data from a cloud storage service. A backup system provides two or more backup data sets to a cloud spanning storage interface for storage in deduplicated form as label maps and data segments in a cloud storage service. A specification defines portions of two or more previous backup data sets to be copied into the synthetic backup. Labels corresponding with the specified portions of previous backup data sets are identified and added to a new label map to create a deduplicated synthetic backup. The completed label map is transferred to the cloud storage service. To provide access to the synthetic backup, the cloud spanning storage interface reconstructs all or a portion of the synthetic backup from the new label map and the data segments created during deduplication of previous backup data sets.

    Abstract translation: 创建合成备份,而无需访问以前的备份数据或从云存储服务检索备份数据。 备份系统为跨云存储接口的云提供两个或多个备份数据集,用于以重复数据删除的形式存储,作为云存储服务中的标签映射和数据段。 规范定义了要复制到合成备份中的两个或多个以前的备份数据集的部分。 标识与先前备份数据集的指定部分相对应的标签,并添加到新标签映射中以创建重复数据删除的综合备份。 完成的标签映射被转移到云存储服务。 为了提供对合成备份的访问,跨云存储接口的云将从新的标签映射和在先前的备份数据集的重复数据删除期间创建的数据段重构合成备份的全部或部分。

    WAN-OPTIMIZED LOCAL AND CLOUD SPANNING DEDUPLICATED STORAGE SYSTEM
    16.
    发明申请
    WAN-OPTIMIZED LOCAL AND CLOUD SPANNING DEDUPLICATED STORAGE SYSTEM 审中-公开
    WAN优化的本地和云计算复制存储系统

    公开(公告)号:US20110161291A1

    公开(公告)日:2011-06-30

    申请号:US12942991

    申请日:2010-11-09

    Abstract: A spanning storage interface facilitates the use of cloud storage services by storage clients. The spanning storage interface presents one or more data interfaces to storage clients at a network location, such as file, object, data backup, archival, and storage block based interfaces. The data interfaces allows storage clients to store and retrieve data using non-cloud based protocols. The spanning storage interface may perform data deduplication on data received from storage clients. The spanning storage interface may transfer the deduplicated version of the data to the cloud storage service. The spanning storage interface may include local storage for storing a copy or all or a portion of the data from storage clients. The local storage may be used as a local cache of frequently accessed data, which may be stored data in its deduplicated form.

    Abstract translation: 跨越存储接口便于存储客户端使用云存储服务。 跨越存储接口向诸如文件,对象,数据备份,归档和基于存储块的接口的网络位置处的存储客户端呈现一个或多个数据接口。 数据接口允许存储客户端使用非基于云的协议来存储和检索数据。 跨越存储接口可以对从存储客户端接收的数据执行重复数据删除。 跨越存储接口可以将重复数据删除的数据版本传输到云存储服务。 跨越存储接口可以包括用于存储副本的本地存储器或来自存储客户端的数据的全部或一部分。 本地存储器可以用作经常访问的数据的本地缓存,其可以以其重复数据删除的形式存储数据。

    VIRTUAL FILE SYSTEM STACK FOR DATA DEDUPLICATION
    17.
    发明申请
    VIRTUAL FILE SYSTEM STACK FOR DATA DEDUPLICATION 审中-公开
    用于数据重复的虚拟文件系统堆栈

    公开(公告)号:US20100088349A1

    公开(公告)日:2010-04-08

    申请号:US12416057

    申请日:2009-03-31

    Applicant: Nitin Parab

    Inventor: Nitin Parab

    CPC classification number: G06F16/1748 G06F3/0641

    Abstract: A data virtualization storage appliance performs data deduplication transformations on the data. The original or non-deduplicated file system is used as shell to hold the directory/file hierarchy and file metadata. The data of the file system is stored by a separate data storage in a transformed and deduplicated form. The deduplicated data store may be implemented as one or more hidden files. The shell file system preserves the hierarchy structure and potentially the file metadata of the original, non-deduplicated file system in its original format, allowing clients to access file metadata and hierarchy information easily. The data of a file may be removed from the shell file system and replaced with a data layout that specifies the arrangement of deduplicated data segments needed to reconstruct the file data. The data layout associated with a file may be stored in a separate data stream in the shell file system.

    Abstract translation: 数据虚拟化存储设备对数据进行重复数据删除转换。 原始或非重复数据删除的文件系统用作shell来保存目录/文件层次结构和文件元数据。 文件系统的数据通过单独的数据存储以转换和重复数据删除的形式存储。 重复数据删除的数据存储可以被实现为一个或多个隐藏文件。 shell文件系统保留原始格式的原始,非重复数据删除的文件系统的层次结构和潜在的文件元数据,从而允许客户端轻松访问文件元数据和层次结构信息。 可以从shell文件系统中删除文件的数据,并替换为指定重建文件数据所需的重复数据删除数据段的排列的数据布局。 与文件相关联的数据布局可以存储在shell文件系统中的单独的数据流中。

Patent Agency Ranking