-
公开(公告)号:US08447740B1
公开(公告)日:2013-05-21
申请号:US12291989
申请日:2008-11-14
申请人: Mark Huang , Philip Shilane , Grant Wallace , Nitin Garg , Edward K. Lee , Ming Benjamin Zhu , Kai Li
发明人: Mark Huang , Philip Shilane , Grant Wallace , Nitin Garg , Edward K. Lee , Ming Benjamin Zhu , Kai Li
IPC分类号: G06F17/00
CPC分类号: G06F17/30162 , G06F17/30153 , G06F17/30156 , G06F17/30864
摘要: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.
-
公开(公告)号:US08751462B2
公开(公告)日:2014-06-10
申请号:US12291998
申请日:2008-11-14
申请人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
发明人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
CPC分类号: G06F11/1453 , G06F11/1464 , H03M7/30 , H03M7/3091
摘要: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.
摘要翻译: 披露了身份重复数据删除后的增量压缩。 第一数据段被确定为与先前的第一数据段相同。 然后确定未被确定为与第二先前数据段相同的第二数据段以类似于第三先前数据段。
-
公开(公告)号:US20100125553A1
公开(公告)日:2010-05-20
申请号:US12291998
申请日:2008-11-14
申请人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
发明人: Mark Huang , Edward K. Lee , Kai Li , Philip Shilane , Grant Wallace , Ming Benjamin Zhu
IPC分类号: G06F17/30
CPC分类号: G06F11/1453 , G06F11/1464 , H03M7/30 , H03M7/3091
摘要: Delta compression after identity deduplication is disclosed. A first data segment is determined to be identical to a first previous data segment. A second data segment, not determined to be identical to a second previous data segment, is then determined to be similar to a third previous data segment.
摘要翻译: 披露了身份重复数据删除后的增量压缩。 第一数据段被确定为与先前的第一数据段相同。 然后确定未被确定为与第二先前数据段相同的第二数据段以类似于第三先前数据段。
-
公开(公告)号:US20110196869A1
公开(公告)日:2011-08-11
申请号:US13090166
申请日:2011-04-19
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/30
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
摘要翻译: 披露数据段的存储。 对于每个段,标识与段相似的段,其中类似段已经由集群节点管理。 在类似的段被识别的情况下,引用相似的段和相似的段和段之间的增量,而不是段被存储。
-
公开(公告)号:US07962520B2
公开(公告)日:2011-06-14
申请号:US12082244
申请日:2008-04-09
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/00
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
摘要翻译: 公开集群存储。 接收数据流或数据块。 数据流或数据块被分成段。 对于每个段,选择集群节点,并且在识别已经由所选择的集群节点管理的与该段相似的段的情况下,对类似段的引用和类似段和段之间的差异是 导致存储在所选群集节点上。
-
公开(公告)号:US08312006B2
公开(公告)日:2012-11-13
申请号:US13090166
申请日:2011-04-19
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/00
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
摘要翻译: 披露数据段的存储。 对于每个段,标识与段相似的段,其中类似段已经由集群节点管理。 在类似的段被识别的情况下,引用相似的段和相似的段和段之间的增量,而不是段被存储。
-
公开(公告)号:US20080294660A1
公开(公告)日:2008-11-27
申请号:US12082244
申请日:2008-04-09
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/30
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
摘要翻译: 公开集群存储。 接收数据流或数据块。 数据流或数据块被分成段。 对于每个段,选择集群节点,并且在识别已经由所选择的集群节点管理的与该段相似的段的情况下,对类似段的引用和类似段和段之间的差异是 导致存储在所选群集节点上。
-
公开(公告)号:US20120317381A1
公开(公告)日:2012-12-13
申请号:US13592746
申请日:2012-08-23
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
IPC分类号: G06F12/00
CPC分类号: G06F3/0619 , G06F3/0608 , G06F3/064 , G06F3/0641 , G06F3/065 , G06F3/0683 , G06F3/0689 , G06F11/1435 , G06F11/1453 , G06F11/1464
摘要: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.
摘要翻译: 公开了一种用于提供有效数据存储的系统和方法。 在数据流中接收多个数据段。 系统初步检查具有较低延迟的存储器,该多个数据段中的一个是否可能已经先前存储在数据段存储库中。 具有较低延迟的存储器存储数据段信息。 在初步检查确定多个数据段中的一个可能已经存储在数据段存储库中的情况下,检查具有相对较高等待时间的存储器以确定数据段是否已经先前存储在数据段存储库中 。
-
公开(公告)号:US20110040819A1
公开(公告)日:2011-02-17
申请号:US12910758
申请日:2010-10-22
申请人: Kai Li , Ming Benjamin Zhu
发明人: Kai Li , Ming Benjamin Zhu
IPC分类号: G06F17/15
CPC分类号: G06F11/1451 , G06F3/0608 , G06F3/064 , G06F3/067 , G06F17/15 , H04L9/0643
摘要: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
摘要翻译: 公开了确定汇总特征集。 选择第一段的多个子段。 对于每个子段,计算通过对每个子段应用一组函数的多个值。 从为所有子段计算的所有值中,选择值的第一个子集。
-
公开(公告)号:US07562186B2
公开(公告)日:2009-07-14
申请号:US11402631
申请日:2006-04-11
申请人: Kai Li , Ming Benjamin Zhu , Umesh Maheshwari , Zheng Yang
发明人: Kai Li , Ming Benjamin Zhu , Umesh Maheshwari , Zheng Yang
IPC分类号: G06F13/00
CPC分类号: G06F13/16 , G06F17/30162
摘要: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a prior stored segment. The prior stored segment comprises a segment stored previously from any location in an input data stream. A delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the delta is stored.
摘要翻译: 公开了与数据段相似的存储。 确定新的段类似于先前存储的段。 先前存储的段包括先前从输入数据流中的任何位置存储的段。 确定新分段和先前存储分段之间的增量。 存储至少部分基于该增量的新段的表示。
-
-
-
-
-
-
-
-
-