Efficient construction of synthetic backups within deduplication storage system
    3.
    发明授权
    Efficient construction of synthetic backups within deduplication storage system 有权
    在重复数据删除存储系统中高效构建合成备份

    公开(公告)号:US08682854B2

    公开(公告)日:2014-03-25

    申请号:US13488180

    申请日:2012-06-04

    IPC分类号: G06F7/00

    摘要: A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.

    摘要翻译: 重复数据消除存储系统使新输入数据能够与已构建的合成备份数据进行重复数据删除,为此,根据已综合备份引用的数据摘要,高效地计算正在构建的合成备份的重复数据删除摘要。 对于正在构造的合成备份的多个输入数据段的每个输入数据段,由索引检索由输入数据段引用的存储的数据段的重复数据删除摘要。 每个输入数据段被划分成多个固定大小的数据子段中的每一个。 对数据子段进行重复数据删除摘要的计算,其中计算基于由输入数据子段所参考的多个存储的数据子段的检索到的重复数据删除摘要。

    Incremental modification of an error detection code background of the invention
    4.
    发明授权
    Incremental modification of an error detection code background of the invention 有权
    本发明的错误检测码背景的增量修改

    公开(公告)号:US08839062B2

    公开(公告)日:2014-09-16

    申请号:US13348453

    申请日:2012-01-11

    摘要: Exemplary method, system, and computer program product embodiments for an incremental modification of an error detection code operation are provided. In one embodiment, by way of example only, for a data block requiring a first error detection code (EDC) value to be calculated and verified and is undergoing modification for at least one randomly positioned sub-blocks that becomes available and modified in independent time intervals, a second EDC value is calculated for each of the randomly positioned sub-blocks. An incremental effect of the second EDC value is applied for calculating the first EDC value and for recalculating the first EDC value upon replacing at least one of the randomly positioned sub-blocks. The resource consumption is proportional to the size of at least one of the randomly positioned sub-blocks that are added and modified. Additional system and computer program product embodiments are disclosed and provide related advantages.

    摘要翻译: 提供了用于错误检测码操作的增量修改的示例性方法,系统和计算机程序产品实施例。 在一个实施例中,仅作为示例,对于需要计算和验证第一错误检测码(EDC)值的数据块,并且对于在独立时间变得可用和修改的至少一个随机定位的子块正在进行修改 间隔,为每个随机定位的子块计算第二EDC值。 应用第二EDC值的增量效应来计算第一EDC值,并且在替换随机定位的子块中的至少一个时重新计算第一EDC值。 资源消耗与添加和修改的随机定位的子块中的至少一个的大小成比例。 公开了附加的系统和计算机程序产品实施例并提供相关的优点。

    Efficient construction of synthetic backups within deduplication storage system

    公开(公告)号:US08682873B2

    公开(公告)日:2014-03-25

    申请号:US12958205

    申请日:2010-12-01

    IPC分类号: G06F7/00

    摘要: A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.

    Full exploitation of parallel processors for data processing
    7.
    发明授权
    Full exploitation of parallel processors for data processing 有权
    全面利用并行处理器进行数据处理

    公开(公告)号:US08959524B2

    公开(公告)日:2015-02-17

    申请号:US13536320

    申请日:2012-06-28

    IPC分类号: G06F9/46

    摘要: Exemplary method, system, and computer program product embodiments for full exploitation of parallel processors for data processing are provided. In one embodiment, by way of example only, a set of parallel processors is partitioned into disjoint subsets according to indices of the set of the parallel processors. The size of each of the disjoint subsets corresponds to a number of processors assigned to the processing of the data chunks at one of the layers. Each of the processors are assigned to different layers in different data chunks such that each of processors are busy and the data chunks are fully processed within a number of the time steps equal to the number of the layers. A transition function is devised from the indices of the set of the parallel processors at one time steps to the indices of the set of the parallel processors at a following time step.

    摘要翻译: 提供了用于完全利用用于数据处理的并行处理器的示例性方法,系统和计算机程序产品实施例。 在一个实施例中,仅作为示例,根据并行处理器的集合的索引将一组并行处理器划分为不相交的子集。 每个不相交子集的大小对应于分配给处理一个层上的数据块的处理器的数量。 每个处理器被分配到不同数据块中的不同层,使得每个处理器都忙,并且数据块在等于层的数量的时间步长的数量中被完全处理。 从并行处理器的集合的索引在一个时间步骤设计转换功能到在随后的时间步长的并行处理器的集合的索引。

    Computation of a remainder by division using pseudo-remainders
    8.
    发明授权
    Computation of a remainder by division using pseudo-remainders 有权
    使用伪余数进行除法计算余数

    公开(公告)号:US08819098B2

    公开(公告)日:2014-08-26

    申请号:US12953161

    申请日:2010-11-23

    IPC分类号: G06F7/38

    CPC分类号: G06F7/535

    摘要: Methods, computer systems, and computer program products for calculating a remainder by division of a sequence of bytes interpreted as a first number by a second number is provided. A pseudo-remainder by division associated with a first subsequence of the sequence of bytes is calculated. A property of this pseudo-remainder is that the first subsequence of the sequence of bytes, interpreted as a third number, and the pseudo-remainder by division have the same remainder by division when divided by the second number. A second subsequence of the sequence of bytes interpreted as the first number is appended to the pseudo-remainder, interpreted as a sequence of bytes, so as to create a sequence of bytes interpreted as a fourth number. The first number and the fourth number have the same remainder by division when divided by the second number.

    摘要翻译: 提供了用于通过将被解释为第一数量的字节序列除以第二数来计算余数的方法,计算机系统和计算机程序产品。 计算与字节序列的第一子序列相关联的伪余数。 该伪余数的特征在于,当除以第二个数字时,字节序列的第一个子序列,被解释为第三个数字,并且除法的伪余数除以相同的余数。 解释为第一个数字的字节序列的第二个子序列被附加到伪余数,被解释为字节序列,以便创建被解释为第四个数字的字节序列。 第一个数字和第四个数字除以第二个数字也有相同的余数。

    Full exploitation of parallel processors for data processing
    9.
    发明授权
    Full exploitation of parallel processors for data processing 有权
    全面利用并行处理器进行数据处理

    公开(公告)号:US08959522B2

    公开(公告)日:2015-02-17

    申请号:US13361544

    申请日:2012-01-30

    IPC分类号: G06F9/46

    摘要: For full exploitation of parallel processors for data processing, a set of parallel processors is partitioned into disjoint subsets according to indices of the set of the parallel processors. The size of each of the disjoint subsets corresponds to a number of processors assigned to the processing of the data chunks at one of the layers. Each of the processors are assigned to different layers in different data chunks such that each of processors are busy and the data chunks are fully processed within a number of the time steps equal to the number of the layers. A transition function is devised from the indices of the set of the parallel processors at one time steps to the indices of the set of the parallel processors at a following time step.

    摘要翻译: 为了充分利用并行处理器进行数据处理,根据并行处理器集合的索引,将一组并行处理器划分为不相交的子集。 每个不相交子集的大小对应于分配给处理一个层上的数据块的处理器的数量。 每个处理器被分配到不同数据块中的不同层,使得每个处理器都忙,并且数据块在等于层的数量的时间步长的数量中被完全处理。 从并行处理器的集合的索引在一个时间步骤设计转换功能到在随后的时间步长的并行处理器的集合的索引。

    Parallel computation of a remainder by division of a sequence of bytes
    10.
    发明授权
    Parallel computation of a remainder by division of a sequence of bytes 有权
    通过划分字节序列来对余数进行并行计算

    公开(公告)号:US08935310B2

    公开(公告)日:2015-01-13

    申请号:US13495533

    申请日:2012-06-13

    IPC分类号: G06F7/38 G06F7/52 G06F7/44

    CPC分类号: G06F7/462 G06F7/44 G06F7/72

    摘要: A remainder by division of a sequence of bytes interpreted as a first number by a second number is calculated. A first remainder by division associated with a first subset of the sequence of bytes is calculated with a first processor. A second remainder by division associated with a second subset of the sequence of bytes is calculated with a second processor. The calculating of the second remainder by division may occur at least partially during the calculating of the first remainder by division. A third remainder by division is calculated based on the calculating of the first remainder by division and the calculating of the second remainder by division.

    摘要翻译: 计算被解释为第一个数字的第二个数字的字节序列的余数。 用第一处理器计算与字节序列的第一子集相关联的第一余数除法。 用第二处理器计算与字节序列的第二子集相关联的第二余数。 通过除法计算第二余数可以至少部分地在第一余数除法计算期间进行。 基于通过划分的第一余数的计算和除以第二余数的计算来计算第三余数。