-
公开(公告)号:US07562186B2
公开(公告)日:2009-07-14
申请号:US11402631
申请日:2006-04-11
申请人: Kai Li , Ming Benjamin Zhu , Umesh Maheshwari , Zheng Yang
发明人: Kai Li , Ming Benjamin Zhu , Umesh Maheshwari , Zheng Yang
IPC分类号: G06F13/00
CPC分类号: G06F13/16 , G06F17/30162
摘要: Storage using resemblance of data segments is disclosed. It is determined that a new segment resembles a prior stored segment. The prior stored segment comprises a segment stored previously from any location in an input data stream. A delta between the new segment and the prior stored segment is determined. A representation of the new segment based at least in part on the delta is stored.
摘要翻译: 公开了与数据段相似的存储。 确定新的段类似于先前存储的段。 先前存储的段包括先前从输入数据流中的任何位置存储的段。 确定新分段和先前存储分段之间的增量。 存储至少部分基于该增量的新段的表示。
-
公开(公告)号:US08275955B2
公开(公告)日:2012-09-25
申请号:US12819356
申请日:2010-06-21
申请人: Ming Benjamin Zhu , R. Hugo Patterson , Kai Li
发明人: Ming Benjamin Zhu , R. Hugo Patterson , Kai Li
CPC分类号: G06F3/0619 , G06F3/0608 , G06F3/064 , G06F3/0641 , G06F3/065 , G06F3/0683 , G06F3/0689 , G06F11/1435 , G06F11/1453 , G06F11/1464
摘要: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
摘要翻译: 公开了一种用于提供有效数据存储的系统和方法。 在数据流中接收多个数据段。 系统确定数据段是否先前存储在低延迟存储器中。 在数据段被确定为先前存储的情况下,返回先前存储的数据段的标识符。
-
公开(公告)号:US20100257315A1
公开(公告)日:2010-10-07
申请号:US12819356
申请日:2010-06-21
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
CPC分类号: G06F3/0619 , G06F3/0608 , G06F3/064 , G06F3/0641 , G06F3/065 , G06F3/0683 , G06F3/0689 , G06F11/1435 , G06F11/1453 , G06F11/1464
摘要: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
摘要翻译: 公开了一种用于提供有效数据存储的系统和方法。 在数据流中接收多个数据段。 系统确定数据段是否先前存储在低延迟存储器中。 在数据段被确定为先前存储的情况下,返回先前存储的数据段的标识符。
-
公开(公告)号:US07373464B2
公开(公告)日:2008-05-13
申请号:US11136263
申请日:2005-05-24
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
CPC分类号: G06F3/0619 , G06F3/0608 , G06F3/064 , G06F3/0641 , G06F3/065 , G06F3/0683 , G06F3/0689 , G06F11/1435 , G06F11/1453 , G06F11/1464
摘要: A method for storing data comprising is disclosed. The method comprises receiving a data stream comprising a plurality of data segments wherein each data segment is associated with an identifier. The method further determining using a subset of identifiers that are stored in a low latency memory whether a data segments has been previously stored and returning the identifier for the data segment in the event the data segment is determined to have been stored previously.
摘要翻译: 公开了一种存储数据的方法。 该方法包括接收包括多个数据段的数据流,其中每个数据段与标识符相关联。 该方法进一步确定使用存储在低延迟存储器中的标识符的子集,无论数据段是否已经被预先存储,并且在先前已经确定数据段的事件中返回数据段的标识符。
-
公开(公告)号:US08527568B2
公开(公告)日:2013-09-03
申请号:US12910758
申请日:2010-10-22
申请人: Kai Li , Ming Benjamin Zhu
发明人: Kai Li , Ming Benjamin Zhu
IPC分类号: G06F17/15
CPC分类号: G06F11/1451 , G06F3/0608 , G06F3/064 , G06F3/067 , G06F17/15 , H04L9/0643
摘要: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
摘要翻译: 公开了确定汇总特征集。 选择第一段的多个子段。 对于每个子段,计算通过对每个子段应用一组函数的多个值。 从为所有子段计算的所有值中,选择值的第一个子集。
-
公开(公告)号:US07747581B1
公开(公告)日:2010-06-29
申请号:US11788407
申请日:2007-04-19
申请人: Kai Li , R. Hugo Patterson , Ming Benjamin Zhu , Allan Bricker , Richard Johnsson , Sazzala Reddy , Jeffery Zabarsky
发明人: Kai Li , R. Hugo Patterson , Ming Benjamin Zhu , Allan Bricker , Richard Johnsson , Sazzala Reddy , Jeffery Zabarsky
IPC分类号: G06F17/30
CPC分类号: G06F17/30067 , G06F3/0608 , G06F3/0641 , G06F3/0659 , G06F3/067
摘要: A network file system-based data storage system that converts random I/O requests into a piecewise sequential data structure to facilitate variable length data segment redundancy identification and elimination. For one embodiment of the invention a stateless network file system is employed. For one such embodiment, that provides multiple-client access to stored data, multiple Writes are buffered and then broken into variable length data segments. Redundant segment elimination is then effected. One embodiment of the invention allows sharing of the variable length data segments among files.
-
公开(公告)号:US07844652B2
公开(公告)日:2010-11-30
申请号:US11403154
申请日:2006-04-11
申请人: Kai Li , Ming Benjamin Zhu
发明人: Kai Li , Ming Benjamin Zhu
IPC分类号: G06F17/15
CPC分类号: G06F11/1451 , G06F3/0608 , G06F3/064 , G06F3/067 , G06F17/15 , H04L9/0643
摘要: Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
摘要翻译: 公开了确定汇总特征集。 选择第一段的多个子段。 对于每个子段,计算通过对每个子段应用一组函数的多个值。 从为所有子段计算的所有值中,选择值的第一个子集。
-
公开(公告)号:US20080183767A1
公开(公告)日:2008-07-31
申请号:US12079766
申请日:2008-03-28
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
IPC分类号: G06F17/30
CPC分类号: G06F3/0619 , G06F3/0608 , G06F3/064 , G06F3/0641 , G06F3/065 , G06F3/0683 , G06F3/0689 , G06F11/1435 , G06F11/1453 , G06F11/1464
摘要: A system and method are disclosed for providing efficient data storage. A plurality of data segments is received in a data stream. The system determines whether a data segment has been stored previously in a low latency memory. In the event that the data segment is determined to have been stored previously, an identifier for the previously stored data segment is returned.
摘要翻译: 公开了一种用于提供有效数据存储的系统和方法。 在数据流中接收多个数据段。 系统确定数据段是否先前存储在低延迟存储器中。 在数据段被确定为先前存储的情况下,返回先前存储的数据段的标识符。
-
公开(公告)号:US20080133835A1
公开(公告)日:2008-06-05
申请号:US11974961
申请日:2007-10-16
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
CPC分类号: G06F11/1453 , G06F11/1464 , G06F12/0866 , Y10S707/99952
摘要: A system and method are disclosed for providing efficient data storage. A data stream comprising a plurality of data segments is received. The system determines whether one of the plurality of data segments has been stored previously using a summary in a low latency memory; in the event that the data segment is determined not to have been stored previously, assigning an identifier to the data segment.
摘要翻译: 公开了一种用于提供有效数据存储的系统和方法。 接收包括多个数据段的数据流。 该系统使用低延迟存储器中的概要来确定先前已经存储了多个数据段中的一个; 在确定数据段不被先前存储的情况下,将标识符分配给数据段。
-
公开(公告)号:US07065619B1
公开(公告)日:2006-06-20
申请号:US10325479
申请日:2002-12-20
申请人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
发明人: Ming Benjamin Zhu , Kai Li , R. Hugo Patterson
IPC分类号: G06F12/00
CPC分类号: G06F11/1453 , G06F11/1464 , G06F12/0866 , Y10S707/99952
摘要: A system and method are disclosed for providing efficient data storage. A data stream comprising a plurality of data segments is received. The system determines whether one of the plurality of data segments has been stored previously using a summary in a low latency memory; in the event that the data segment is determined not to have been stored previously, assigning an identifier to the data segment.
-
-
-
-
-
-
-
-
-