MANAGING BACKUPS OF DATA OBJECTS IN CONTAINERS
    1.
    发明申请
    MANAGING BACKUPS OF DATA OBJECTS IN CONTAINERS 有权
    管理集装箱数据对象的备份

    公开(公告)号:US20130110784A1

    公开(公告)日:2013-05-02

    申请号:US13285331

    申请日:2011-10-31

    IPC分类号: G06F7/00

    摘要: Containers that store data objects that were written to those containers during a particular backup are accessed. Then, a subset of the containers is identified; the containers in the subset have less than a threshold number of data objects associated with the particular backup. Data objects that are in containers in that subset and that are associated with the backup are copied to one or more other containers. Those other containers are subsequently used to restore data objects associated with the backup.

    摘要翻译: 存储在特定备份期间存储写入这些容器的数据对象的容器。 然后,识别容器的一个子集; 子集中的容器具有小于阈值数量的与特定备份相关联的数据对象。 位于该子集中并与备份关联的容器中的数据对象将复制到一个或多个其他容器。 这些其他容器随后用于还原与备份相关联的数据对象。

    Managing backups of data objects in containers
    2.
    发明授权
    Managing backups of data objects in containers 有权
    管理容器中数据对象的备份

    公开(公告)号:US08874522B2

    公开(公告)日:2014-10-28

    申请号:US13285331

    申请日:2011-10-31

    IPC分类号: G06F17/30 G06F7/00 G06F11/14

    摘要: Containers that store data objects that were written to those containers during a particular backup are accessed. Then, a subset of the containers is identified; the containers in the subset have less than a threshold number of data objects associated with the particular backup. Data objects that are in containers in that subset and that are associated with the backup are copied to one or more other containers. Those other containers are subsequently used to restore data objects associated with the backup.

    摘要翻译: 存储在特定备份期间存储写入这些容器的数据对象的容器。 然后,识别容器的一个子集; 子集中的容器具有小于阈值数量的与特定备份相关联的数据对象。 位于该子集中并与备份关联的容器中的数据对象将复制到一个或多个其他容器。 这些其他容器随后用于还原与备份相关联的数据对象。

    De-duplication storage system with improved reference update efficiency
    3.
    发明授权
    De-duplication storage system with improved reference update efficiency 有权
    重复数据删除存储系统具有改进的参考更新效率

    公开(公告)号:US08914324B1

    公开(公告)日:2014-12-16

    申请号:US12580785

    申请日:2009-10-16

    IPC分类号: G06F17/30

    CPC分类号: G06F11/1453 G06F17/30156

    摘要: A system and method for backing up files to a single-instance storage system are disclosed. The files may be split into segments, and the file data may be stored in the single-instance storage system as individual segments. The single-instance storage system uses the concept of a file region which covers multiple segments of the file. If a region of a file is unchanged from one backup to the next, the system may use a region object to refer to the unchanged region. This avoids the need to update the reference information for each of the segments within the region, thus increasing the efficiency of backing up the new version of the file.

    摘要翻译: 公开了将文件备份到单实例存储系统的系统和方法。 文件可以被分割成段,并且文件数据可以作为单个段存储在单实例存储系统中。 单实例存储系统使用涵盖文件多个段的文件区域的概念。 如果文件的一个区域从一个备份到下一个备份不变,则系统可以使用区域对象来引用未更改的区域。 这避免了需要更新区域内每个段的参考信息,从而提高了备份新版本文件的效率。

    Systems and methods for providing increased scalability in deduplication storage systems
    4.
    发明授权
    Systems and methods for providing increased scalability in deduplication storage systems 有权
    在重复数据删除存储系统中提供更高可扩展性的系统和方法

    公开(公告)号:US08954401B2

    公开(公告)日:2015-02-10

    申请号:US13007301

    申请日:2011-01-14

    摘要: A computer-implemented method for providing increased scalability in deduplication storage systems may include (1) identifying a database that stores a plurality of reference objects, (2) determining that at least one size-related characteristic of the database has reached a predetermined threshold, (3) partitioning the database into a plurality of sub-databases capable of being updated independent of one another, (4) identifying a request to perform an update operation that updates one or more reference objects stored within at least one sub-database, and then (5) performing the update operation on less than all of the sub-databases to avoid processing costs associated with performing the update operation on all of the sub-databases. Various other systems, methods, and computer-readable media are also disclosed.

    摘要翻译: 用于在重复数据删除存储系统中提供增加的可扩展性的计算机实现的方法可以包括(1)识别存储多个参考对象的数据库,(2)确定数据库的至少一个尺寸相关特性已经达到预定阈值, (3)将数据库分割成能够彼此独立地更新的多个子数据库,(4)识别执行更新存储在至少一个子数据库中的一个或多个参考对象的更新操作的请求,以及 然后(5)在小于所有子数据库的情况下执行更新操作,以避免处理与对所有子数据库执行更新操作相关联的成本。 还公开了各种其它系统,方法和计算机可读介质。

    Systems and Methods for Providing Increased Scalability in Deduplication Storage Systems
    5.
    发明申请
    Systems and Methods for Providing Increased Scalability in Deduplication Storage Systems 有权
    在重复数据删除存储系统中提高可扩展性的系统和方法

    公开(公告)号:US20120185447A1

    公开(公告)日:2012-07-19

    申请号:US13007301

    申请日:2011-01-14

    IPC分类号: G06F17/30

    摘要: A computer-implemented method for providing increased scalability in deduplication storage systems may include (1) identifying a database that stores a plurality of reference objects, (2) determining that at least one size-related characteristic of the database has reached a predetermined threshold, (3) partitioning the database into a plurality of sub-databases capable of being updated independent of one another, (4) identifying a request to perform an update operation that updates one or more reference objects stored within at least one sub-database, and then (5) performing the update operation on less than all of the sub-databases to avoid processing costs associated with performing the update operation on all of the sub-databases. Various other systems, methods, and computer-readable media are also disclosed.

    摘要翻译: 用于在重复数据删除存储系统中提供增加的可扩展性的计算机实现的方法可以包括(1)识别存储多个参考对象的数据库,(2)确定数据库的至少一个尺寸相关特性已经达到预定阈值, (3)将数据库分割成能够彼此独立地更新的多个子数据库,(4)识别执行更新存储在至少一个子数据库中的一个或多个参考对象的更新操作的请求,以及 然后(5)在小于所有子数据库的情况下执行更新操作,以避免处理与对所有子数据库执行更新操作相关联的成本。 还公开了各种其它系统,方法和计算机可读介质。

    De-duplication Storage System with Multiple Indices for Efficient File Storage
    6.
    发明申请
    De-duplication Storage System with Multiple Indices for Efficient File Storage 审中-公开
    具有多个索引的重复数据删除存储系统,用于高效的文件存储

    公开(公告)号:US20110093439A1

    公开(公告)日:2011-04-21

    申请号:US12580697

    申请日:2009-10-16

    申请人: Fanglu Guo Weibao Wu

    发明人: Fanglu Guo Weibao Wu

    IPC分类号: G06F12/00 G06F12/16

    CPC分类号: G06F11/1453 G06F11/1464

    摘要: A de-duplication storage system which uses multiple indices is described. A first group of one or more indices may be stored in random access memory (RAM) or another type of fast storage. A second group of one or more indices may be stored on one or more disk drives or another type of storage where large amounts of data can be stored inexpensively. The first group of indices may be used when adding new files to the de-duplication storage system in order to determine whether the file segments of the new files are already stored. The second group of indices may be used when restoring files in order to lookup the segments of the files.

    摘要翻译: 描述了使用多个索引的重复数据删除存储系统。 一个或多个索引的第一组可以存储在随机存取存储器(RAM)或另一类型的快速存储器中。 可以将一个或多个索引的第二组存储在一个或多个磁盘驱动器或其他类型的存储器上,其中可以廉价地存储大量数据。 当将新文件添加到重复数据删除存储系统中时,可以使用第一组索引,以便确定新文件的文件段是否已被存储。 在恢复文件时可以使用第二组索引,以便查找文件的段。

    Systems and methods for removing unreferenced data segments from deduplicated data systems
    7.
    发明授权
    Systems and methods for removing unreferenced data segments from deduplicated data systems 有权
    从重复数据删除的数据系统中删除未引用的数据段的系统和方法

    公开(公告)号:US08224875B1

    公开(公告)日:2012-07-17

    申请号:US12652333

    申请日:2010-01-05

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30303

    摘要: A computer-implemented method for removing unreferenced data segments from deduplicated data systems may include: 1) identifying a deduplicated data system that contains a plurality of data segments, 2) identifying a plurality of containers within the deduplicated data system, with each container containing a subset of the data segments within the deduplicated data system, 3) identifying at least one container within the plurality of containers that is likely to include a large proportion of data segments that are not referenced by data objects within the deduplicated data system, and then, for each identified container, 4) searching for unreferenced data segments within the identified container and 5) removing the unreferenced data segments from the identified container. Various other methods, systems, and computer-readable media are also disclosed.

    摘要翻译: 用于从重复数据删除的数据系统中去除未引用的数据段的计算机实现的方法可以包括:1)识别包含多个数据段的重复数据删除的数据系统,2)识别重复数据删除数据系统内的多个容器,每个容器包含 在重复数据删除的数据系统内的数据段的子集,3)识别多个容器内的至少一个可能包含大量数据段的容器,这些数据段不被重复数据删除的数据系统内的数据对象引用, 对于每个已识别的容器,4)搜索所识别的容器内的未引用的数据段,以及5)从识别的容器中移除未引用的数据段。 还公开了各种其它方法,系统和计算机可读介质。

    Systems and methods for removing unreferenced data segments from deduplicated data systems
    8.
    发明授权
    Systems and methods for removing unreferenced data segments from deduplicated data systems 有权
    从重复数据删除的数据系统中删除未引用的数据段的系统和方法

    公开(公告)号:US08224874B2

    公开(公告)日:2012-07-17

    申请号:US12652300

    申请日:2010-01-05

    IPC分类号: G06F17/30

    摘要: A computer-implemented method for removing unreferenced data segments from deduplicated data systems may include: 1) identifying a deduplicated data system that contains a plurality of data objects, 2) dividing the data objects within the deduplicated data system into a plurality of data object groups, 3) identifying, within the data object groups, at least one data object group that has changed subsequent to a prior garbage-collection operation that removed data segments that were not referenced by data objects within the deduplicated data system, 4) identifying at least one container within the deduplicated data system that contains data segments referenced by data objects within the changed data object group, and then, for each identified container, 5) removing data segments from the identified container that are not referenced by data objects within the deduplicated data system. Various other methods, systems, and computer-readable media are also disclosed.

    摘要翻译: 用于从重复数据删除的数据系统中去除未引用的数据段的计算机实现的方法可以包括:1)识别包含多个数据对象的重复数据删除的数据系统,2)将重复数据删除的数据系统内的数据对象划分为多个数据对象组 ,3)在所述数据对象组内识别在先前的垃圾收集操作之后改变的至少一个数据对象组,所述数据对象组移除了未被重复数据删除的数据系统内的数据对象引用的数据段,4)至少识别 在重复数据删除的数据系统中的一个容器,其中包含由更改的数据对象组内的数据对象引用的数据段,然后针对每个已标识的容器,5)从已识别的容器中删除未被重复数据删除的数据中的数据对象引用的数据段 系统。 还公开了各种其它方法,系统和计算机可读介质。

    Systems and Methods for Removing Unreferenced Data Segments from Deduplicated Data Systems
    9.
    发明申请
    Systems and Methods for Removing Unreferenced Data Segments from Deduplicated Data Systems 有权
    从重复数据删除数据系统中删除未引用的数据段的系统和方法

    公开(公告)号:US20110167096A1

    公开(公告)日:2011-07-07

    申请号:US12652300

    申请日:2010-01-05

    IPC分类号: G06F17/00 G06F12/00 G06F12/16

    摘要: A computer-implemented method for removing unreferenced data segments from deduplicated data systems may include: 1) identifying a deduplicated data system that contains a plurality of data objects, 2) dividing the data objects within the deduplicated data system into a plurality of data object groups, 3) identifying, within the data object groups, at least one data object group that has changed subsequent to a prior garbage-collection operation that removed data segments that were not referenced by data objects within the deduplicated data system, 4) identifying at least one container within the deduplicated data system that contains data segments referenced by data objects within the changed data object group, and then, for each identified container, 5) removing data segments from the identified container that are not referenced by data objects within the deduplicated data system. Various other methods, systems, and computer-readable media are also disclosed.

    摘要翻译: 用于从重复数据删除的数据系统中去除未引用的数据段的计算机实现的方法可以包括:1)识别包含多个数据对象的重复数据删除的数据系统,2)将重复数据删除的数据系统内的数据对象划分为多个数据对象组 ,3)在所述数据对象组内识别在先前的垃圾收集操作之后改变的至少一个数据对象组,所述数据对象组移除了未被重复数据删除的数据系统内的数据对象引用的数据段,4)至少识别 在重复数据删除的数据系统中的一个容器,其中包含由更改的数据对象组内的数据对象引用的数据段,然后针对每个已标识的容器,5)从已识别的容器中删除未被重复数据删除的数据中的数据对象引用的数据段 系统。 还公开了各种其它方法,系统和计算机可读介质。

    Constrained backup image defragmentation optimization within deduplication system

    公开(公告)号:US09928210B1

    公开(公告)日:2018-03-27

    申请号:US13459987

    申请日:2012-04-30

    IPC分类号: G06F12/00 G06F15/80

    CPC分类号: G06F15/8084 G06F17/30

    摘要: The present disclosure provides for defragmenting deduplicated data, such as one or more backup image files, stored in a deduplicated data store. A defragmentation module can be implemented on a deduplication server to reduce fragmentation of backup images and improve processing time for restoring a backup image. A defragmentation module can be configured to defragment a backup image file by migrating portions of data of the backup image file that are stored in various containers at non-contiguous locations throughout deduplicated data store. A defragmentation module can contiguously write the portions to one or more containers, which are stored at one or more new locations in the deduplicated data store. A defragmentation module can be configured to evaluate whether portions of a backup image file meet criteria for defragmentation. A defragmentation module can also be configured to update location information about the portions that are migrated to the new container(s).