FILE SYSTEM WITH INTERNAL DEDUPLICATION AND MANAGEMENT OF DATA BLOCKS
    1.
    Patent application
    FILE SYSTEM WITH INTERNAL DEDUPLICATION AND MANAGEMENT OF DATA BLOCKS (status: in force)

    Publication No.: US20100121825A1

    Publication date: 2010-05-13

    Application No.: US12270101

    Filing date: 2008-11-13

    IPC classification: G06F12/02

    CPC classification: G06F17/30156

    Abstract: A method for deduplicating and managing data blocks within a file system includes: adding a deduplication identifier to each pointer that points to a data block, to indicate whether the data block is deduplicated; detecting duplicate data blocks; when duplicates are detected, determining whether one of the duplicate data blocks has already been deduplicated; if one has, determining that this block is the master copy; if none has, selecting one of the duplicate data blocks to be the master copy and setting its deduplication identifier to indicate deduplication; and determining that the other duplicate data block is a new duplicate, setting its deduplication identifier to indicate deduplication, and directing its pointer to the master copy.
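The master-copy rule in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `BlockPointer` class, the `merge_pair` and `deduplicate` functions, and the use of SHA-256 digests to detect duplicates are all assumptions made for illustration, since the abstract does not say how duplicates are detected or how pointers are stored.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class BlockPointer:
    block_id: int        # physical block this pointer references
    dedup: bool = False  # deduplication identifier carried by the pointer


def merge_pair(a: BlockPointer, b: BlockPointer) -> None:
    """Apply the master-copy rule to two pointers whose blocks hold equal data."""
    if a.block_id == b.block_id:
        return  # already sharing one block
    if a.dedup and b.dedup:
        return  # two existing masters: not covered by the abstract, left alone
    # A pointer that is already deduplicated is the master; otherwise pick `a`.
    master, other = (b, a) if b.dedup else (a, b)
    master.dedup = True
    other.dedup = True
    other.block_id = master.block_id  # direct the pointer to the master copy


def deduplicate(pointers, blocks) -> None:
    """Detect duplicates by content digest (an assumption) and merge them."""
    first_seen = {}  # digest -> first pointer seen with that content
    for ptr in pointers:
        digest = hashlib.sha256(blocks[ptr.block_id]).digest()
        if digest in first_seen:
            merge_pair(first_seen[digest], ptr)
        else:
            first_seen[digest] = ptr
```

Reclaiming the block that the redirected pointer used to reference is deliberately left out, because the abstract does not describe it.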

    Managing storage and migration of backup data
    2.
    Granted patent
    Managing storage and migration of backup data (status: lapsed)

    Publication No.: US08090917B2

    Publication date: 2012-01-03

    Application No.: US12117778

    Filing date: 2008-05-09

    IPC classification: G06F12/00 G06F13/00 G06F13/28

    CPC classification: G06F11/1461

    Abstract: A method and system for client backup data management and storage using virtual tape libraries (VTLs). A VTL controller executing a software method receives metadata that distinguishes among a plurality of different versions of backup data. The VTL controller determines a latest version of the backup data. The VTL controller determines a migration set of zero or more versions of the backup data. The latest version and any version included in the migration set are included in the plurality of different versions. The VTL controller determines that a storage of the latest version in a first storage medium (e.g., magnetic disk) of the VTL is complete. The VTL controller migrates the migration set to a second storage medium (e.g., magnetic tape) of the VTL if the migration set includes at least one version of the backup data.
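As a rough illustration of the control flow described in this abstract, the Python sketch below picks the latest version, builds a migration set, and migrates only once the latest version is completely stored on the first medium. The `BackupVersion` class, the function names, and the `keep_on_disk` retention policy are hypothetical; the abstract does not say how the migration set is chosen.

```python
from dataclasses import dataclass


@dataclass
class BackupVersion:
    number: int             # version metadata supplied with the backup data
    on_disk_complete: bool  # True once fully written to the first medium
    location: str = "disk"  # "disk" (first medium) or "tape" (second medium)


def plan_migration(versions, keep_on_disk=1):
    """Return (latest version, migration set) under a hypothetical policy."""
    ordered = sorted(versions, key=lambda v: v.number, reverse=True)
    latest = ordered[0]
    # Assumed policy: keep the newest `keep_on_disk` versions on disk and
    # migrate every older version that is still on disk. The set may be empty.
    migration_set = [v for v in ordered[keep_on_disk:] if v.location == "disk"]
    return latest, migration_set


def migrate_if_ready(versions, keep_on_disk=1):
    """Migrate the migration set to tape once the latest version is complete."""
    latest, migration_set = plan_migration(versions, keep_on_disk)
    if not latest.on_disk_complete:
        return []  # storage of the latest version is not complete yet
    for version in migration_set:
        version.location = "tape"
    return [v.number for v in migration_set]
```

Calling `migrate_if_ready` a second time with the same versions returns an empty list, which corresponds to the "zero or more versions" case in the abstract.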

    Data deduplication using CRC-seed differentiation between data and stubs
    3.
    Granted patent
    Data deduplication using CRC-seed differentiation between data and stubs (status: in force)

    Publication No.: US08453031B2

    Publication date: 2013-05-28

    Application No.: US12730400

    Filing date: 2010-03-24

    IPC classification: H03M13/00 G06F13/00 G06F17/00

    Abstract: Various embodiments for differentiating between data and stubs pointing to a parent copy of deduplicated data are provided. Undeduplicated data is stored with a first cyclic redundancy check (CRC) seed. A stub pointing to the parent copy of the deduplicated data is stored with a second CRC seed. Subsequent to reading the deduplicated data, the first CRC seed is associated with the undeduplicated data, and the second CRC seed is associated with the stub. A CRC check is performed using one of the first and second CRC seeds. If the CRC check is positive, an I/O operation is allowed to proceed. If the CRC check is negative, an additional CRC check is performed using another one of the first and second CRC seeds.
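The two-seed check can be sketched in Python with `zlib.crc32`, whose optional second argument is the starting CRC value and serves here as the seed. The seed constants and the `store` and `classify` helpers are hypothetical; the abstract does not name the CRC polynomial, the seed values, or the record layout.

```python
import zlib

DATA_SEED = 0x00000000  # hypothetical seed for undeduplicated data
STUB_SEED = 0x5EED5EED  # hypothetical seed for stubs


def store(payload: bytes, is_stub: bool):
    """Return the payload together with a CRC computed under the matching seed."""
    seed = STUB_SEED if is_stub else DATA_SEED
    return payload, zlib.crc32(payload, seed)


def classify(payload: bytes, stored_crc: int) -> str:
    """Check with the first seed, then fall back to the second seed."""
    if zlib.crc32(payload, DATA_SEED) == stored_crc:
        return "data"  # first check positive: the I/O can proceed on real data
    if zlib.crc32(payload, STUB_SEED) == stored_crc:
        return "stub"  # second check positive: follow the stub to the parent copy
    raise ValueError("CRC mismatch under both seeds")
```

For a given payload, two different starting values always produce two different CRCs, so a record that verifies under one seed cannot also verify under the other.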

    File system with internal deduplication and management of data blocks
    4.
    Granted patent
    File system with internal deduplication and management of data blocks (status: in force)

    Publication No.: US08131687B2

    Publication date: 2012-03-06

    Application No.: US12270101

    Filing date: 2008-11-13

    IPC classification: G06F7/14 G06F13/00

    CPC classification: G06F17/30156

    Abstract: A method for deduplicating and managing data blocks within a file system includes: adding a deduplication identifier to each pointer that points to a data block, to indicate whether the data block is deduplicated; detecting duplicate data blocks; when duplicates are detected, determining whether one of the duplicate data blocks has already been deduplicated; if one has, determining that this block is the master copy; if none has, selecting one of the duplicate data blocks to be the master copy and setting its deduplication identifier to indicate deduplication; and determining that the other duplicate data block is a new duplicate, setting its deduplication identifier to indicate deduplication, and directing its pointer to the master copy.

    MANAGING STORAGE AND MIGRATION OF BACKUP DATA
    5.
    Patent application
    MANAGING STORAGE AND MIGRATION OF BACKUP DATA (status: lapsed)

    Publication No.: US20090282203A1

    Publication date: 2009-11-12

    Application No.: US12117778

    Filing date: 2008-05-09

    IPC classification: G06F12/00

    CPC classification: G06F11/1461

    Abstract: A method and system for client backup data management and storage using virtual tape libraries (VTLs). A VTL controller executing a software method receives metadata that distinguishes among a plurality of different versions of backup data. The VTL controller determines a latest version of the backup data. The VTL controller determines a migration set of zero or more versions of the backup data. The latest version and any version included in the migration set are included in the plurality of different versions. The VTL controller determines that a storage of the latest version in a first storage medium (e.g., magnetic disk) of the VTL is complete. The VTL controller migrates the migration set to a second storage medium (e.g., magnetic tape) of the VTL if the migration set includes at least one version of the backup data.

    Apparatus and method to store information
    9.
    Granted patent
    Apparatus and method to store information (status: lapsed)

    Publication No.: US08311663B2

    Publication date: 2012-11-13

    Application No.: US11219451

    Filing date: 2005-08-31

    IPC classification: G06F7/00

    Abstract: A method to store data is disclosed. The method provides a plurality of data storage media, an automated data library comprising one or more data storage devices, a first plurality of storage cells, and a robotic accessor. The method further provides a storage vault comprising a second plurality of storage cells but no data storage devices. The method selects the (i)th data storage medium and sets the (i)th data state, where that (i)th data state is selected from the group consisting of online, offline, and vault. If the method sets the (i)th data state to online, then the method mounts that (i)th data storage medium in one of the data storage devices. If the method sets the (i)th data state to offline, then the method removably places the (i)th data storage medium in one of the first plurality of storage cells. If the method sets the (i)th data state to vault, then the method places the (i)th data storage medium in one of the second plurality of storage cells.
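The three data states and the placement each one triggers can be modelled with a short Python sketch. The `DataState` enum, the pool names (`drives`, `library_cells`, `vault_cells`), and the `place` function are hypothetical names chosen for illustration; the robotic accessor is reduced to list updates.

```python
from enum import Enum


class DataState(Enum):
    ONLINE = "online"
    OFFLINE = "offline"
    VAULT = "vault"


# Which pool of slots each state maps to (pool names are hypothetical).
POOL_FOR_STATE = {
    DataState.ONLINE: "drives",          # data storage devices in the library
    DataState.OFFLINE: "library_cells",  # first plurality of storage cells
    DataState.VAULT: "vault_cells",      # second plurality, in the vault
}


def place(medium: str, state: DataState, slots: dict):
    """Move `medium` into a free slot of the pool selected by `state`."""
    pool_name = POOL_FOR_STATE[state]
    target = slots[pool_name]
    if medium in target:
        return pool_name, target.index(medium)  # already in the right pool
    free = target.index(None)  # raises ValueError when the pool is full
    for pool in slots.values():
        if medium in pool:
            pool[pool.index(medium)] = None  # vacate the previous slot
    target[free] = medium
    return pool_name, free
```

The free slot is located before the medium leaves its current slot, so a full target pool raises an error without losing track of the medium.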

    HAMMING RADIUS SEPARATED DEDUPLICATION LINKS
    10.
    Patent application
    HAMMING RADIUS SEPARATED DEDUPLICATION LINKS (status: under examination, published)

    Publication No.: US20120210192A1

    Publication date: 2012-08-16

    Application No.: US13453062

    Filing date: 2012-04-23

    IPC classification: H03M13/19 G06F11/10 H03M13/29

    Abstract: A data storage system includes a data storage array configured for de-duplication of duplicate data therein by: identification of a plurality of portions of data; a comparison of each portion of the data to identify duplicate data and identification of a link associated with each duplicate data; a determination of whether a Hamming link-separation-distance of the identified link is greater than twice a Hamming radius of an error correction code in the data storage system; and replacement of the duplicate data with the identified link when it is determined that the Hamming link-separation-distance is greater than twice the Hamming radius.
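The distance test in this abstract can be sketched in Python under one stated assumption: that the "Hamming link-separation-distance" of a link is its smallest Hamming distance to any other link in use, with links represented as integers. The function names are hypothetical, and the patent may define the distance differently.

```python
def hamming_distance(a: int, b: int) -> int:
    """Number of bit positions in which two link values differ."""
    return bin(a ^ b).count("1")


def link_is_safe(link: int, existing_links, hamming_radius: int) -> bool:
    """True when the link is more than twice the radius away from every other link."""
    return all(
        hamming_distance(link, other) > 2 * hamming_radius
        for other in existing_links
    )


def deduplicate_portion(portion: bytes, link: int, existing_links, hamming_radius: int):
    """Replace a duplicate portion with its link only when the link is safe."""
    if link_is_safe(link, existing_links, hamming_radius):
        return ("link", link)
    return ("data", portion)  # keep the duplicate data in place
```

The factor of two reflects a standard property of error correction codes: if a code corrects up to r bit errors, two values more than 2r bits apart cannot be mistaken for one another after correction.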
