-
公开(公告)号:US08312006B2
公开(公告)日:2012-11-13
申请号:US13090166
申请日:2011-04-19
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/00
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
摘要翻译: 披露数据段的存储。 对于每个段,标识与段相似的段,其中类似段已经由集群节点管理。 在类似的段被识别的情况下,引用相似的段和相似的段和段之间的增量,而不是段被存储。
-
公开(公告)号:US20080294660A1
公开(公告)日:2008-11-27
申请号:US12082244
申请日:2008-04-09
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/30
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
摘要翻译: 公开集群存储。 接收数据流或数据块。 数据流或数据块被分成段。 对于每个段,选择集群节点,并且在识别已经由所选择的集群节点管理的与该段相似的段的情况下,对类似段的引用和类似段和段之间的差异是 导致存储在所选群集节点上。
-
公开(公告)号:US20080010322A1
公开(公告)日:2008-01-10
申请号:US11483131
申请日:2006-07-06
IPC分类号: G06F17/30
CPC分类号: G06F17/30174 , G06F17/30212
摘要: File system replication includes determining whether one of a plurality of files included in an original file system has been updated since a previous replication, the file having a plurality of data segments, and in the event that the file has been updated, locating among the plurality of data segments a previously stored data segment that is newly referenced by the file, and that does not require replication.
摘要翻译: 文件系统复制包括确定原始文件系统中包括的多个文件中的一个文件是否已经从先前的复制更新,该文件具有多个数据段,并且在文件已被更新的情况下,位于多个 的数据段分段由文件新引用的先前存储的数据段,并且不需要复制。
-
公开(公告)号:US20110196869A1
公开(公告)日:2011-08-11
申请号:US13090166
申请日:2011-04-19
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/30
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
摘要翻译: 披露数据段的存储。 对于每个段,标识与段相似的段,其中类似段已经由集群节点管理。 在类似的段被识别的情况下,引用相似的段和相似的段和段之间的增量,而不是段被存储。
-
公开(公告)号:US07962520B2
公开(公告)日:2011-06-14
申请号:US12082244
申请日:2008-04-09
申请人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
发明人: R. Hugo Patterson , Kai Li , Ming Benjamin Zhu , Sazzala Venkata Reddy , Umesh Maheshwari , Edward K. Lee
IPC分类号: G06F17/00
CPC分类号: G06F3/0604 , G06F3/0644 , G06F3/0683 , G06F17/30091 , G06F17/30138 , G06F17/3015 , G06F17/30312 , G06F17/30489
摘要: Cluster storage is disclosed. A data stream or a data block is received. The data stream or the data block is broken into segments. For each segment, a cluster node is selected, and in the event that a similar segment to the segment is identified that is already managed by the selected cluster node, a reference to the similar segment and a delta between the similar segment and the segment is caused to be stored on the selected cluster node.
摘要翻译: 公开集群存储。 接收数据流或数据块。 数据流或数据块被分成段。 对于每个段,选择集群节点,并且在识别已经由所选择的集群节点管理的与该段相似的段的情况下,对类似段的引用和类似段和段之间的差异是 导致存储在所选群集节点上。
-
公开(公告)号:US07882064B2
公开(公告)日:2011-02-01
申请号:US11483131
申请日:2006-07-06
IPC分类号: G06F17/30
CPC分类号: G06F17/30174 , G06F17/30212
摘要: File system replication includes determining whether one of a plurality of files included in an original file system has been updated since a previous replication, the file having a plurality of data segments, and in the event that the file has been updated, locating among the plurality of data segments a previously stored data segment that is newly referenced by the file, and that does not require replication.
摘要翻译: 文件系统复制包括确定原始文件系统中包括的多个文件中的一个文件是否已经从先前的复制更新,该文件具有多个数据段,并且在文件已被更新的情况下,位于多个 的数据段分段由文件新引用的先前存储的数据段,并且不需要复制。
-
公开(公告)号:US09535624B1
公开(公告)日:2017-01-03
申请号:US11197126
申请日:2005-08-04
IPC分类号: G06F3/06
CPC分类号: G06F3/0641 , G06F3/0608 , G06F3/0673 , G06F11/1453
摘要: A method of managing duplicate segments from a segmented file storage system is disclosed. The method comprises indexing a segment according to a key for the segments wherein the index includes an identification of a first data location where the segment is stored and identifying a duplicate segment having the same key that is stored in a second location. The method further comprises determining that the duplicate segment is an undesired duplicate segment and eliminating the undesired duplicate segment.
摘要翻译: 公开了一种从分段文件存储系统管理重复段的方法。 该方法包括根据段的密钥索引段,其中索引包括存储段的第一数据位置的标识,并且识别具有存储在第二位置中的相同密钥的重复段。 所述方法还包括确定所述重复片段是不期望的重复片段并消除不期望的重复片段。
-
公开(公告)号:US07631144B1
公开(公告)日:2009-12-08
申请号:US10940408
申请日:2004-09-13
IPC分类号: G06F12/00
CPC分类号: G06F11/1453
摘要: A method for storing data is disclosed. The method comprises receiving a data stream comprising a plurality of data segments and preliminarily checking in a memory having a relatively low latency whether one of the plurality of data segments has been stored previously. The method further comprises in the event that the preliminary check does not conclusively determine whether the data segment has been stored previously, limiting checking in a memory having a relatively high latency to conclusively determine whether the data segment has been previously stored, and in the event that checking is limited or in the event that the check in the memory having relatively high latency conclusively determines the data segment has not been previously stored, storing the data segment.
摘要翻译: 公开了一种用于存储数据的方法。 该方法包括:接收包括多个数据段的数据流,并且预先检查具有相对较低等待时间的存储器,以便先前已经存储了多个数据段中的一个。 该方法还包括在初步检查不能最终确定数据段是否已经被先前存储的情况下,限制在具有相对高的等待时间的存储器中的检查以最终确定数据段是否已经被预先存储,并且在事件中 该检查是有限的,或者在具有较高延迟的存储器中的检查结果确定数据段尚未被预先存储的情况下,存储该数据段。
-
公开(公告)号:US07562186B2
公开(公告)日:2009-07-14
申请号:US11402631
申请日:2006-04-11
申请人: Kai Li , Ming Benjamin Zhu , Umesh Maheshwari , Zheng Yang
发明人: Kai Li , Ming Benjamin Zhu , Umesh Maheshwari , Zheng Yang
IPC分类号: G06F13/00
CPC分类号: G06F13/16 , G06F17/30162
摘要: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a prior stored segment. The prior stored segment comprises a segment stored previously from any location in an input data stream. A delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the delta is stored.
摘要翻译: 公开了与数据段相似的存储。 确定新的段类似于先前存储的段。 先前存储的段包括先前从输入数据流中的任何位置存储的段。 确定新分段和先前存储分段之间的增量。 存储至少部分基于该增量的新段的表示。
-
公开(公告)号:US08612696B2
公开(公告)日:2013-12-17
申请号:US13592746
申请日:2012-08-23
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
CPC分类号: G06F3/0619 , G06F3/0608 , G06F3/064 , G06F3/0641 , G06F3/065 , G06F3/0683 , G06F3/0689 , G06F11/1435 , G06F11/1453 , G06F11/1464
摘要: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system preliminarily checks in a memory having a relatively low latency whether one of the plurality of data segments may have been stored previously in a data segment repository. The memory having the relatively low latency stores data segment information. In the event that the preliminary check determines that one of the plurality of data segments may have been stored in the data segment repository, a memory having a relatively higher latency is checked to determine whether the data segment has been stored previously in the data segment repository.
-
-
-
-
-
-
-
-
-