-
公开(公告)号:US08447740B1
公开(公告)日:2013-05-21
申请号:US12291989
申请日:2008-11-14
申请人: Mark Huang , Philip Shilane , Grant Wallace , Nitin Garg , Edward K. Lee , Ming Benjamin Zhu , Kai Li
发明人: Mark Huang , Philip Shilane , Grant Wallace , Nitin Garg , Edward K. Lee , Ming Benjamin Zhu , Kai Li
IPC分类号: G06F17/00
CPC分类号: G06F17/30162 , G06F17/30153 , G06F17/30156 , G06F17/30864
摘要: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
-
公开(公告)号:US08751462B2
公开(公告)日:2014-06-10
申请号:US12291998
申请日:2008-11-14
申请人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
发明人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
CPC分类号: G06F11/1453 , G06F11/1464 , H03M7/30 , H03M7/3091
摘要: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.
摘要翻译: 披露了身份重复数据删除后的增量压缩。 第一数据段被确定为与先前的第一数据段相同。 然后确定未被确定为与第二先前数据段相同的第二数据段以类似于第三先前数据段。
-
公开(公告)号:US20100125553A1
公开(公告)日:2010-05-20
申请号:US12291998
申请日:2008-11-14
申请人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
发明人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
IPC分类号: G06F17/30
CPC分类号: G06F11/1453 , G06F11/1464 , H03M7/30 , H03M7/3091
摘要: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.
摘要翻译: 披露了身份重复数据删除后的增量压缩。 第一数据段被确定为与先前的第一数据段相同。 然后确定未被确定为与第二先前数据段相同的第二数据段以类似于第三先前数据段。
-
公开(公告)号:US08849772B1
公开(公告)日:2014-09-30
申请号:US12291997
申请日:2008-11-14
申请人: Mark Huang , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
发明人: Mark Huang , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
CPC分类号: G06F17/30575 , G06F11/1451 , G06F11/1453 , G06F11/1464 , G06F17/30153 , G06F17/30162 , G06F17/30212
摘要: Data replication with delta compression is disclosed. A primary system and a replica system are determined to both have an identical first data segment that is similar to a second data segment. The second data segment is encoded, wherein the encoding refers to the first data segment.
摘要翻译: 公开了使用增量压缩的数据复制。 确定主系统和副本系统都具有类似于第二数据段的相同的第一数据段。 第二数据段被编码,其中编码是指第一数据段。
-
公开(公告)号:US08312006B2
公开(公告)日:2012-11-13
申请号:US13090166
申请日:2011-04-19
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/00
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
摘要翻译: 披露数据段的存储。 对于每个段,标识与段相似的段,其中类似段已经由集群节点管理。 在类似的段被识别的情况下,引用相似的段和相似的段和段之间的增量,而不是段被存储。
-
公开(公告)号:US20080294660A1
公开(公告)日:2008-11-27
申请号:US12082244
申请日:2008-04-09
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/30
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
摘要翻译: 公开集群存储。 接收数据流或数据块。 数据流或数据块被分成段。 对于每个段,选择集群节点,并且在识别已经由所选择的集群节点管理的与该段相似的段的情况下,对类似段的引用和类似段和段之间的差异是 导致存储在所选群集节点上。
-
公开(公告)号:US20110196869A1
公开(公告)日:2011-08-11
申请号:US13090166
申请日:2011-04-19
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/30
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
摘要翻译: 披露数据段的存储。 对于每个段,标识与段相似的段,其中类似段已经由集群节点管理。 在类似的段被识别的情况下,引用相似的段和相似的段和段之间的增量,而不是段被存储。
-
公开(公告)号:US07962520B2
公开(公告)日:2011-06-14
申请号:US12082244
申请日:2008-04-09
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/00
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
摘要翻译: 公开集群存储。 接收数据流或数据块。 数据流或数据块被分成段。 对于每个段,选择集群节点,并且在识别已经由所选择的集群节点管理的与该段相似的段的情况下,对类似段的引用和类似段和段之间的差异是 导致存储在所选群集节点上。
-
公开(公告)号:US09535624B1
公开(公告)日:2017-01-03
申请号:US11197126
申请日:2005-08-04
IPC分类号: G06F3/06
CPC分类号: G06F3/0641 , G06F3/0608 , G06F3/0673 , G06F11/1453
摘要: A method of managing duplicate segments from a segmented file storage system is disclosed. The method comprises indexing a segment according to a key for the segments wherein the index includes an identification of a first data location where the segment is stored and identifying a duplicate segment having the same key that is stored in a second location. The method further comprises determining that the duplicate segment is an undesired duplicate segment and eliminating the undesired duplicate segment.
摘要翻译: 公开了一种从分段文件存储系统管理重复段的方法。 该方法包括根据段的密钥索引段,其中索引包括存储段的第一数据位置的标识,并且识别具有存储在第二位置中的相同密钥的重复段。 所述方法还包括确定所述重复片段是不期望的重复片段并消除不期望的重复片段。
-
公开(公告)号:US07631144B1
公开(公告)日:2009-12-08
申请号:US10940408
申请日:2004-09-13
IPC分类号: G06F12/00
CPC分类号: G06F11/1453
摘要: A method for storing data is disclosed. The method comprises receiving a data stream comprising a plurality of data segments and preliminarily checking in a memory having a relatively low latency whether one of the plurality of data segments has been stored previously. The method further comprises in the event that the preliminary check does not conclusively determine whether the data segment has been stored previously, limiting checking in a memory having a relatively high latency to conclusively determine whether the data segment has been previously stored, and in the event that checking is limited or in the event that the check in the memory having relatively high latency conclusively determines the data segment has not been previously stored, storing the data segment.
摘要翻译: 公开了一种用于存储数据的方法。 该方法包括:接收包括多个数据段的数据流,并且预先检查具有相对较低等待时间的存储器,以便先前已经存储了多个数据段中的一个。 该方法还包括在初步检查不能最终确定数据段是否已经被先前存储的情况下,限制在具有相对高的等待时间的存储器中的检查以最终确定数据段是否已经被预先存储,并且在事件中 该检查是有限的,或者在具有较高延迟的存储器中的检查结果确定数据段尚未被预先存储的情况下,存储该数据段。
-
-
-
-
-
-
-
-
-