Identifying modified chunks in a data set for storage
    1.
    发明授权
    Identifying modified chunks in a data set for storage 有权
    识别用于存储的数据集中的修改的块

    公开(公告)号:US08452732B2

    公开(公告)日:2013-05-28

    申请号:US13474624

    申请日:2012-05-17

    IPC分类号: G06F7/00

    摘要: Provided are a computer program product, system, and method for identifying modified chunks in a data set for storage. Modifications are received to at least one of the chunks in the data set. A determination is made of at least one range of least one of the chunks including data affected by the modifications determination is made as to whether at least one chunk outside of the at least one range has changed. For each determined at least one chunk outside of the at least one range that has changed, a determination is made of at least one new chunk and a new digest of the at least one new chunk and information is added on the at least one new chunk and information to locate the new chunk in the data set.

    摘要翻译: 提供了用于识别用于存储的数据集中的修改的块的计算机程序产品,系统和方法。 修改被接收到数据集中的至少一个块。 确定至少一个范围,包括至少一个块,包括受修改影响的数据确定是否至少一个范围之外的至少一个块已经改变。 对于已经改变的至少一个范围之外的每个确定的至少一个块,确定至少一个新块的至少一个新块和新摘要,并且在至少一个新块上添加信息 以及在数据集中定位新块的信息。

    OPTIMIZATION OF FINGERPRINT-BASED DEDUPLICATION
    2.
    发明申请
    OPTIMIZATION OF FINGERPRINT-BASED DEDUPLICATION 有权
    指纹识别优化

    公开(公告)号:US20130138620A1

    公开(公告)日:2013-05-30

    申请号:US13305218

    申请日:2011-11-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30159 G06F17/30156

    摘要: Described are embodiments of an invention for identifying chunk boundaries for optimization of fingerprint-based deduplication in a computing environment. Storage objects that are backed up in a computing environment are often compound storage objects which include many individual storage objects. The computing device of the computing environment breaks the storage objects into chunks of data by determining a hash value on a range of data. The computing device creates an artificial chunk boundary when the end of data of the storage object is reached. When an artificial chunk boundary is created for the end of data of a storage object, the computing device stores a pseudo fingerprint for the artificial chunk boundary. If a hash value matches a fingerprint or a pseudo fingerprint, then the computing device determines that the range of data corresponds to a chunk and the computing system defines the chunk boundaries.

    摘要翻译: 描述了用于识别用于在计算环境中优化基于指纹的重复数据删除的块边界的发明的实施例。 在计算环境中备份的存储对象通常是复合存储对象,包括许多单独的存储对象。 计算环境的计算设备通过确定数据范围上的哈希值来将存储对象分解成数据块。 当到达存储对象的数据结束时,计算设备创建一个人造块边界。 当为存储对象的数据结束创建人造块边界时,计算装置存储用于人造块边界的伪指纹。 如果哈希值与指纹或伪指纹匹配,则计算设备确定数据范围对应于块,并且计算系统定义块边界。

    IDENTIFYING MODIFIED CHUNKS IN A DATA SET FOR STORAGE
    3.
    发明申请
    IDENTIFYING MODIFIED CHUNKS IN A DATA SET FOR STORAGE 失效
    在存储的数据集中识别修改的信息

    公开(公告)号:US20120290537A1

    公开(公告)日:2012-11-15

    申请号:US13103977

    申请日:2011-05-09

    IPC分类号: G06F7/00 G06F17/30

    摘要: Provided are a computer program product, system, and method for identifying modified chunks in a data set for storage. Information is maintained on a data set of variable length chunks, including a digest of each chunk and information to locate the chunk in the data set. Modifications are received to at least one of the chunks in the data set. A determination is made of at least one range of least one of the chunks including data affected by the modifications, wherein each range identifies one chunk or sequential chunks having data affected by the modifications. The at least one chunk in each range is processed to determine at least one new chunk in each range, and for each determined new chunk, a digest of the new chunk. A determination is made as to whether at least one chunk outside of the at least one range has changed. For each determined at least one chunk outside of the at least one range that has changed, a determination is made of at least one new chunk and a new digest of the at least one new chunk. Adding to the set information the new digest information on the at least one new chunk and information to locate the new chunk in the data set.

    摘要翻译: 提供了用于识别用于存储的数据集中的修改的块的计算机程序产品,系统和方法。 信息被保存在可变长度块的数据集上,包括每个块的摘要和用于定位数据集中的块的信息。 修改被接收到数据集中的至少一个块。 确定包括受修改影响的数据的块中的至少一个块的至少一个范围,其中每个范围标识具有受修改影响的数据的一个块或连续块。 处理每个范围中的至少一个块以确定每个范围中的至少一个新块,并且对于每个确定的新块,新块的摘要。 确定至少一个范围之外的至少一个块是否已经改变。 对于已经改变的至少一个范围之外的每个确定的至少一个块,确定至少一个新块的至少一个新块和新摘要。 将关于至少一个新块的新摘要信息和信息添加到集合信息中以在数据集中定位新块。

    Optimization of fingerprint-based deduplication
    4.
    发明授权
    Optimization of fingerprint-based deduplication 有权
    基于指纹的重复数据删除优化

    公开(公告)号:US09047304B2

    公开(公告)日:2015-06-02

    申请号:US13305218

    申请日:2011-11-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30159 G06F17/30156

    摘要: Described are embodiments of an invention for identifying chunk boundaries for optimization of fingerprint-based deduplication in a computing environment. Storage objects that are backed up in a computing environment are often compound storage objects which include many individual storage objects. The computing device of the computing environment breaks the storage objects into chunks of data by determining a hash value on a range of data. The computing device creates an artificial chunk boundary when the end of data of the storage object is reached. When an artificial chunk boundary is created for the end of data of a storage object, the computing device stores a pseudo fingerprint for the artificial chunk boundary. If a hash value matches a fingerprint or a pseudo fingerprint, then the computing device determines that the range of data corresponds to a chunk and the computing system defines the chunk boundaries.

    摘要翻译: 描述了用于识别用于在计算环境中优化基于指纹的重复数据删除的块边界的发明的实施例。 在计算环境中备份的存储对象通常是复合存储对象,包括许多单独的存储对象。 计算环境的计算设备通过确定数据范围上的哈希值来将存储对象分解成数据块。 当到达存储对象的数据结束时,计算设备创建一个人造块边界。 当为存储对象的数据结束创建人造块边界时,计算装置存储用于人造块边界的伪指纹。 如果哈希值与指纹或伪指纹匹配,则计算设备确定数据范围对应于块,并且计算系统定义块边界。

    Identifying modified chunks in a data set for storage
    5.
    发明授权
    Identifying modified chunks in a data set for storage 失效
    识别用于存储的数据集中的修改的块

    公开(公告)号:US08612392B2

    公开(公告)日:2013-12-17

    申请号:US13103977

    申请日:2011-05-09

    IPC分类号: G06F7/00

    摘要: Provided are a computer program product, system, and method for identifying modified chunks in a data set for storage. Information is maintained on a data set of variable length chunks, including a digest of each chunk and information to locate the chunk in the data set. Modifications are received to at least one of the chunks in the data set. A determination is made of at least one range of least one of the chunks including data affected by the modifications, wherein each range identifies one chunk or sequential chunks having data affected by the modifications. The at least one chunk in each range is processed to determine at least one new chunk in each range, and for each determined new chunk, a digest of the new chunk. A determination is made as to whether at least one chunk outside of the at least one range has changed. For each determined at least one chunk outside of the at least one range that has changed, a determination is made of at least one new chunk and a new digest of the at least one new chunk. Adding to the set information the new digest information on the at least one new chunk and information to locate the new chunk in the data set.

    摘要翻译: 提供了用于识别用于存储的数据集中的修改的块的计算机程序产品,系统和方法。 信息被保存在可变长度块的数据集上,包括每个块的摘要和用于定位数据集中的块的信息。 修改被接收到数据集中的至少一个块。 确定包括受修改影响的数据的块中的至少一个块的至少一个范围,其中每个范围标识具有受修改影响的数据的一个块或连续块。 处理每个范围中的至少一个块以确定每个范围中的至少一个新块,并且对于每个确定的新块,新块的摘要。 确定至少一个范围之外的至少一个块是否已经改变。 对于已经改变的至少一个范围之外的每个确定的至少一个块,确定至少一个新块的至少一个新块和新摘要。 将关于至少一个新块的新摘要信息和信息添加到集合信息中以在数据集中定位新块。

    IDENTIFYING MODIFIED CHUNKS IN A DATA SET FOR STORAGE
    6.
    发明申请
    IDENTIFYING MODIFIED CHUNKS IN A DATA SET FOR STORAGE 有权
    在存储的数据集中识别修改的信息

    公开(公告)号:US20120290546A1

    公开(公告)日:2012-11-15

    申请号:US13474624

    申请日:2012-05-17

    IPC分类号: G06F17/30

    摘要: Provided are a computer program product, system, and method for identifying modified chunks in a data set for storage. Modifications are received to at least one of the chunks in the data set. A determination is made of at least one range of least one of the chunks including data affected by the modifications determination is made as to whether at least one chunk outside of the at least one range has changed. For each determined at least one chunk outside of the at least one range that has changed, a determination is made of at least one new chunk and a new digest of the at least one new chunk and information is added on the at least one new chunk and information to locate the new chunk in the data set.

    摘要翻译: 提供了用于识别用于存储的数据集中的修改的块的计算机程序产品,系统和方法。 修改被接收到数据集中的至少一个块。 确定至少一个范围,包括至少一个块,包括受修改影响的数据确定是否至少一个范围之外的至少一个块已经改变。 对于已经改变的至少一个范围之外的每个确定的至少一个块,确定至少一个新块的至少一个新块和新摘要,并且在至少一个新块上添加信息 以及在数据集中定位新块的信息。

    Restoring deduplicated data objects from sequential backup devices
    7.
    发明授权
    Restoring deduplicated data objects from sequential backup devices 有权
    从顺序备份设备恢复重复数据删除的数据对象

    公开(公告)号:US08892603B2

    公开(公告)日:2014-11-18

    申请号:US13479003

    申请日:2012-05-23

    IPC分类号: G06F17/30 G06F11/14

    摘要: Provided are computer program product, system, and method for restoring deduplicated data objects from sequential backup devices. A server stores data objects of extents having deduplicated data in the at least one sequential backup device. The server receives from a client a request for data objects. The server determines extents stored in the at least one sequential backup device for the requested data objects. The server or client sorts the extents according to an order in which they are stored in the at least one sequential backup device to generate a sort list. The server retrieves the extents from the at least one sequential backup device according to the order in the sort list to access the extents sequentially from the sequential backup device in the order in which they were stored. The server returns the retrieved extents to the client and the client reconstructs the requested data objects from the received extents.

    摘要翻译: 提供的是用于从顺序备份设备恢复重复数据删除的数据对象的计算机程序产品,系统和方法。 服务器将具有重复数据删除数据的盘区的数据对象存储在所述至少一个顺序备份设备中。 服务器从客户端接收对数据对象的请求。 服务器确定存储在所请求的数据对象的至少一个顺序备份设备中的区段。 服务器或客户端根据它们存储在至少一个顺序备份设备中的顺序对扩展区进行排序以生成排序列表。 服务器根据排序列表中的顺序从至少一个顺序备份设备中检索扩展数据块,以顺序备份设备按顺序从存储顺序访问扩展数据块。 服务器将检索到的扩展区返回到客户端,客户机从接收到的扩展区重新构建所请求的数据对象。

    Developing software components and capability testing procedures for testing coded software component
    8.
    发明授权
    Developing software components and capability testing procedures for testing coded software component 失效
    开发用于测试编码软件组件的软件组件和功能测试程序

    公开(公告)号:US08561024B2

    公开(公告)日:2013-10-15

    申请号:US11626274

    申请日:2007-01-23

    IPC分类号: G06F9/44

    CPC分类号: G06F8/73 G06F8/20 G06F11/3684

    摘要: Capability test programs are generated implementing the capability test test cases for the components, wherein component developers use the capability test programs to test components during the development and coding of the components. Documentation is generated describing component abstract test cases incorporating information on capability test test cases. Component test programs are generated implementing the component abstract test cases for the components and utilizing capability test programs, wherein each component test program tests one component for at least one test case specified in the component abstract test case documentation for the component. The capability test programs and component test programs are stored in a shared repository. A software development program is deployed to enable the developers and testers to execute groups of component test programs in the shared repository to test the components during different phases of the development of the software product.

    摘要翻译: 生成能力测试程序,实现组件的能力测试测试用例,其中组件开发人员使用能力测试程序在组件的开发和编码期间测试组件。 生成文档,描述组合抽象测试用例,并结合有关能力测试测试用例的信息。 生成组件测试程序,实现组件的组件抽象测试用例并利用能力测试程序,其中每个组件测试程序针对组件的组件抽象测试用例文档中指定的至少一个测试用例测试一个组件。 能力测试程序和组件测试程序存储在共享存储库中。 部署软件开发程序,使开发人员和测试人员能够在共享存储库中执行组件测试程序组,以在软件产品开发的不同阶段测试组件。

    Restoring a restore set of files from backup objects stored in sequential backup devices
    9.
    发明授权
    Restoring a restore set of files from backup objects stored in sequential backup devices 失效
    从存储在顺序备份设备中的备份对象恢复一组还原文件

    公开(公告)号:US08209298B1

    公开(公告)日:2012-06-26

    申请号:US12972291

    申请日:2010-12-17

    IPC分类号: G06F17/30

    摘要: Provided are a computer program product, system, and method for restoring a restore set of files from backup objects stored in sequential backup devices. Backup objects are stored in at least one sequential backup device. A client initiates a restore request to restore a restore set of data in a volume as of a restore point-in-time. A determination is made of backup objects stored in at least one sequential backup device including the restore set of data for the restore point-in-time, wherein the determined backup objects are determined from a set of backup objects including a full volume backup and delta backups providing data in the volume at different points-in-time, and wherein extents in different backup objects providing data for blocks in the volume at different points-in-time are not stored contiguously in the sequential backup device. A determination is made of extents stored in the at least one sequential backup device for the determined backup objects. The determined extents are sorted according to an order in which they are stored in the at least one sequential backup device to generate a sort list. The extents are retrieved from the at least one sequential backup device according to the order in the sort list to access the extents sequentially from the sequential backup device in the order in which they were stored. The retrieved extents are returned to the client and the client reconstructs the restore data set from the received extents.

    摘要翻译: 提供了一种计算机程序产品,系统和方法,用于从存储在顺序备份设备中的备份对象恢复恢复文件集。 备份对象存储在至少一个顺序备份设备中。 客户端启动还原请求,以恢复卷中恢复的还原数据集。 确定存储在至少一个顺序备份设备中的备份对象,包括用于恢复时间点的还原数据集,其中所确定的备份对象由一组备份对象确定,包括全卷备份和增量 在不同时间点在卷中提供数据的备份,并且其中不同备份对象中的不同备份对象中的盘区在不同时间点为块中的块提供数据不会连续存储在顺序备份设备中。 确定存储在所确定的备份对象的至少一个顺序备份设备中的范围。 确定的区段根据它们存储在至少一个顺序备份设备中的顺序进行排序以生成排序列表。 根据排序列表中的顺序从至少一个顺序备份设备检索扩展数据块,以顺序备份设备按顺序从存储顺序访问区段。 检索到的盘区返回到客户端,客户机从收到的盘区重构恢复数据集。

    Encrypting data objects to back-up
    10.
    发明授权
    Encrypting data objects to back-up 失效
    加密数据对象进行备份

    公开(公告)号:US08762743B2

    公开(公告)日:2014-06-24

    申请号:US13484028

    申请日:2012-05-30

    IPC分类号: G06F12/16 H04L9/28

    摘要: Provided are a computer program product, system, and method for encrypting data objects to back-up to a server. A client private key is intended to be maintained only by the client. A data object of chunks to store at the server is generated. A first portion of the chunks in the data object is encrypted with the client private key and the first portion of the chunks in the data object encrypted with the client private key are sent to the server to store. A second portion of the chunks in the data object not encrypted with the client private key are sent to the server to store.

    摘要翻译: 提供了用于加密数据对象以备份到服务器的计算机程序产品,系统和方法。 客户端私钥仅由客户端维护。 生成存储在服务器上的块的数据对象。 数据对象中的块的第一部分用客户端私钥加密,并且使用客户端私钥加密的数据对象中的块的第一部分被发送到服务器进行存储。 没有使用客户端私钥加密的数据对象中的块的第二部分被发送到服务器进行存储。