-
公开(公告)号:US11093151B1
公开(公告)日:2021-08-17
申请号:US16780210
申请日:2020-02-03
摘要: A method, a system and a computer program product for performing deduplicating data. A data stream having a plurality of data zones is received. One or more data storage locations in a plurality of data storage locations for deduplicating one or more zones in the plurality of zones is identified. Each data storage location stores its respective deduplicated data zones. A data storage location for deduplicating a first data zone is selected. The first data zone is deduplicated using the selected data storage location.
-
公开(公告)号:US20190251189A1
公开(公告)日:2019-08-15
申请号:US15893163
申请日:2018-02-09
CPC分类号: G06F16/1744 , G06F11/1458 , G06F16/1756 , H03M7/3091
摘要: Delta compression method, system and computer program product. Portions of source and target data files are hashed using a hashing function. A target data file is compared against the source data file to determine at least one delta difference between the files. A source data file hashing table is generated. The table includes hashed portions of the source and target data files stored in corresponding source file offset locations and corresponding target file offset locations, respectively. Portions of the source and target files are compared using corresponding source and target file offset locations. At least one common sequence of characters in the portions of the source and target files is determined based on the comparison. A patch file is generated based on the determined sequence of characters.
-
公开(公告)号:US10067946B2
公开(公告)日:2018-09-04
申请号:US15482376
申请日:2017-04-07
摘要: A method, a system, and a computer program product for performing next level multi-level deduplication. A first zone stamp for a first data zone is generated and compared to a second zone stamp representing a second data zone, where the zones are first level data zones. The first and second data zones are deduplicated when the first zone stamp matches the second zone stamp. A second-level first zone stamp is selected when there is no match between first and second zone stamps. The second-level first zone stamp, representing a second-level first data zone in the first data zone, is compared to the second zone stamp and/or a second-level second zone stamp representing a second-level second data zone. The second-level first zone and one of the second data zone and the second-level second zone are deduplicated when the second-level first zone stamp matches one of the second zone stamp and the second-level second zone stamp.
-
公开(公告)号:US11336295B2
公开(公告)日:2022-05-17
申请号:US16698140
申请日:2019-11-27
发明人: Mark Bennett Hecker , Ashok T. Ramu
IPC分类号: H03M7/30 , G06F16/174
摘要: A system, a method and a computer program product for storing data, which include receiving a data stream having a plurality of transactions that include at least one portion of data, determining whether at least one portion of data within at least one transaction is substantially similar to at least another portion of data within at least one transaction, clustering together at least one portion of data and at least another portion of data within at least one transaction, selecting one of at least one portion of data and at least another portion of data as a representative of at least one portion of data and at least another portion of data in the received data stream, and storing each representative of a portion of data from each transaction in the plurality of transactions, wherein a plurality of representatives is configured to form a chain representing the received data stream.
-
公开(公告)号:US20210240377A1
公开(公告)日:2021-08-05
申请号:US16780210
申请日:2020-02-03
IPC分类号: G06F3/06
摘要: A method, a system and a computer program product for performing deduplicating data. A data stream having a plurality of data zones is received. One or more data storage locations in a plurality of data storage locations for deduplicating one or more zones in the plurality of zones is identified. Each data storage location stores its respective deduplicated data zones. A data storage location for deduplicating a first data zone is selected. The first data zone is deduplicated using the selected data storage location.
-
公开(公告)号:US10452617B2
公开(公告)日:2019-10-22
申请号:US15620246
申请日:2017-06-12
发明人: David G. Therrien , Yee-ching Chao , Thomas G. Hansen , Daniel P. Martinelli , Lucas H. Makosky , Mark B. Hecker , Stephen A. Smith , Adrian VanderSpek
IPC分类号: G06F17/30 , G06F16/174 , G06F16/2455 , G06F11/14
摘要: A method, a system, and a computer-implemented method for performing multi-level deduplication of data are disclosed. A zone stamp is generated for each zone in a plurality of zones contained in at least one data stream. The zone stamp is compared to another zone stamp. The zone stamp and another zone stamp represent zones in the plurality of zones. The comparison is performed for zones at corresponding zone levels based on a determination that a zone stamp of a zone of a preceding zone level is not similar to another zone stamp of another preceding zone level. The zone at the preceding zone level includes at least one zone of a next zone level having a size smaller than or equal to a size of the zone of the preceding zone level. The zone and another zone are deduplicated based on a determination that the zone stamp is similar to another zone stamp.
-
公开(公告)号:US20170277711A1
公开(公告)日:2017-09-28
申请号:US15620246
申请日:2017-06-12
发明人: David G. Therrien , Yee-ching Chao , Thomas G. Hansen , Daniel P. Martinelli , Lucas H. Makosky , Mark B. Hecker , Stephen A. Smith , Adrian VanderSpek
CPC分类号: G06F16/1748 , G06F11/1453 , G06F16/1756 , G06F16/24568
摘要: A method, a system, and a computer-implemented method for performing multi-level deduplication of data are disclosed. A zone stamp is generated for each zone in a plurality of zones contained in at least one data stream. The zone stamp is compared to another zone stamp. The zone stamp and another zone stamp represent zones in the plurality of zones. The comparison is performed for zones at corresponding zone levels based on a determination that a zone stamp of a zone of a preceding zone level is not similar to another zone stamp of another preceding zone level. The zone at the preceding zone level includes at least one zone of a next zone level having a size smaller than or equal to a size of the zone of the preceding zone level. The zone and another zone are deduplicated based on a determination that the zone stamp is similar to another zone stamp.
-
公开(公告)号:US11269733B2
公开(公告)日:2022-03-08
申请号:US16677373
申请日:2019-11-07
IPC分类号: G06F11/14 , G06F16/174
摘要: A method, a system, and a computer program product for executing synthetic backup processes and deduplication backup storage with landing zone. A synthetic backup of a data file is received. A partial re-synthesis of the synthetic backup of the data file is performed. A total size of the partial re-synthesized backup of the data file and the received synthetic backup is determined. A size of a complete re-synthesis of the synthetic backup of the data file is computed. The complete re-synthesis of the synthetic backup of the data file is performed when the determined total size exceeds the computed size of the complete re-synthesis of the synthetic backup of the data file.
-
公开(公告)号:US20200099392A1
公开(公告)日:2020-03-26
申请号:US16698140
申请日:2019-11-27
发明人: Mark Bennett Hecker , Ashok T. Ramu
IPC分类号: H03M7/30 , G06F16/174
摘要: A system, a method and a computer program product for storing data, which include receiving a data stream having a plurality of transactions that include at least one portion of data, determining whether at least one portion of data within at least one transaction is substantially similar to at least another portion of data within at least one transaction, clustering together at least one portion of data and at least another portion of data within at least one transaction, selecting one of at least one portion of data and at least another portion of data as a representative of at least one portion of data and at least another portion of data in the received data stream, and storing each representative of a portion of data from each transaction in the plurality of transactions, wherein a plurality of representatives is configured to form a chain representing the received data stream.
-
公开(公告)号:US20190278750A1
公开(公告)日:2019-09-12
申请号:US16410613
申请日:2019-05-13
IPC分类号: G06F16/174 , G06F16/2455 , G06F11/14
摘要: A method, a system, and a computer program product for performing a backup of data are disclosed. A grid server in a plurality of grid servers is selected for deduplicating a segment of data in a plurality of segments of data contained within a data stream. The segment of data is forwarded to the selected grid server for deduplication. A zone contained within the forwarded segment of data is deduplicated using the selected server. The deduplication is performed based on a listing of a plurality of zone stamps. Each zone stamp in the plurality of zone stamps represents a zone in a plurality of zones deduplicated by at least one server in the plurality of grid servers.
-
-
-
-
-
-
-
-
-