Seeding replication
    31.
    Type: Patent application
    Legal status: In force

    Publication number: US20080263109A1

    Publication date: 2008-10-23

    Application number: US11807204

    Filing date: 2007-05-24

    CPC classification number: G06F17/30174 G06F17/30159 G06F17/30212

    Abstract: Seeding replication is disclosed. One or more but not all files stored on a deduplicated storage system are selected to be replicated. One or more segments referred to by the selected one or more but not all files are determined. A data structure is created that is used to indicate that at least the one or more segments are to be replicated. In the event that an indication based at least in part on the data structure indicates that a candidate segment stored on the deduplicating storage system is to be replicated, the candidate segment is replicated.
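
    The selection step in this abstract can be illustrated with a minimal sketch. It assumes a plain Python set as the "data structure" and string segment IDs; the abstract does not say which structure the patent uses, and the names `build_replication_set` and `seed_replica` are hypothetical.

```python
from typing import Dict, Iterable, List, Set


def build_replication_set(file_index: Dict[str, List[str]],
                          selected: Iterable[str]) -> Set[str]:
    """Collect the IDs of every segment referred to by the selected files.

    A plain set stands in for the abstract's "data structure" (an assumption).
    """
    wanted: Set[str] = set()
    for name in selected:
        wanted.update(file_index[name])
    return wanted


def seed_replica(segment_store: Dict[str, bytes],
                 wanted: Set[str]) -> Dict[str, bytes]:
    """Walk the candidate segments and copy only those the set marks."""
    replica: Dict[str, bytes] = {}
    for seg_id, data in segment_store.items():
        if seg_id in wanted:
            replica[seg_id] = data
    return replica


# Three files share a deduplicated store; only "a" and "b" are selected.
file_index = {"a": ["s1", "s2"], "b": ["s2", "s3"], "c": ["s4"]}
segment_store = {"s1": b"1", "s2": b"2", "s3": b"3", "s4": b"4"}
wanted = build_replication_set(file_index, ["a", "b"])
replica = seed_replica(segment_store, wanted)
```

    Segment `s2` is shared by both selected files but is copied once, and `s4` is skipped because only the unselected file refers to it.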

    Locality-based stream segmentation for data deduplication
    32.
    Type: Patent application
    Legal status: In force

    Publication number: US20080013830A1

    Publication date: 2008-01-17

    Application number: US11484881

    Filing date: 2006-07-11

    CPC classification number: G06F11/1451 G06F11/1464 G06F11/1469

    Abstract: Selecting a segment boundary is disclosed. A segmentation window is determined. A plurality of values associated with candidate boundaries within the segmentation window are computed. One of the candidate boundaries is selected based at least in part on a comparison between two or more of the computed values. And, a boundary is determined within the segmentation window.
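
    One way to read this abstract is sketched below, under stated assumptions: the segmentation window spans the positions between a minimum and a maximum segment length, the value for each candidate boundary is a CRC32 of the few bytes preceding it, and the candidate with the largest value wins. The hash choice, the `width` parameter, and the function names are illustrative and not taken from the patent.

```python
import zlib
from typing import List


def select_boundary(data: bytes, start: int, min_len: int, max_len: int,
                    width: int = 4) -> int:
    """Pick the end offset of the segment that begins at `start`."""
    lo = start + min_len
    hi = min(start + max_len, len(data))
    if lo > hi:
        # Fewer than min_len bytes remain: the tail becomes the last segment.
        return len(data)
    best_pos, best_val = lo, -1
    for pos in range(lo, hi + 1):
        # Value associated with the candidate boundary at `pos`.
        val = zlib.crc32(data[max(pos - width, 0):pos])
        # Compare computed values and keep the largest (first one on ties).
        if val > best_val:
            best_pos, best_val = pos, val
    return best_pos


def segment(data: bytes, min_len: int = 8, max_len: int = 32) -> List[bytes]:
    """Split `data` into segments by repeatedly selecting a boundary."""
    segments: List[bytes] = []
    start = 0
    while start < len(data):
        end = select_boundary(data, start, min_len, max_len)
        segments.append(data[start:end])
        start = end
    return segments


segments = segment(bytes(range(256)) * 2)
```

    Because the boundary depends on the content near each candidate position rather than on a fixed offset, the segment lengths vary while staying within the window limits.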

    File system replication
    33.
    Type: Patent application
    Legal status: In force

    Publication number: US20080010322A1

    Publication date: 2008-01-10

    Application number: US11483131

    Filing date: 2006-07-06

    CPC classification number: G06F17/30174 G06F17/30212

    Abstract: File system replication includes determining whether one of a plurality of files included in an original file system has been updated since a previous replication, the file having a plurality of data segments, and in the event that the file has been updated, locating among the plurality of data segments a previously stored data segment that is newly referenced by the file, and that does not require replication.
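
    The update check and segment lookup described here might look like the sketch below. It assumes each file is recorded as a (modification time, segment ID list) pair and that the replica's existing segments are known as a set; these representations and the name `plan_replication` are hypothetical.

```python
from typing import Dict, List, Set, Tuple

# Assumed file record: (modification time, ordered list of segment IDs).
FileRecord = Tuple[int, List[str]]


def plan_replication(files: Dict[str, FileRecord],
                     last_replicated_at: int,
                     replica_segments: Set[str]
                     ) -> Dict[str, Tuple[List[str], List[str]]]:
    """For each file updated since the previous replication, split its
    segments into (must be sent, already stored so only referenced)."""
    plan: Dict[str, Tuple[List[str], List[str]]] = {}
    for name, (modified_at, segments) in files.items():
        if modified_at <= last_replicated_at:
            continue  # not updated since the previous replication
        to_send = [s for s in segments if s not in replica_segments]
        reference_only = [s for s in segments if s in replica_segments]
        plan[name] = (to_send, reference_only)
    return plan


files = {
    "a": (5, ["s1", "s2"]),    # unchanged since the last replication
    "b": (20, ["s2", "s9"]),   # updated; s2 is already on the replica
    "c": (30, ["s7"]),         # updated; nothing is on the replica yet
}
plan = plan_replication(files, last_replicated_at=10,
                        replica_segments={"s1", "s2"})
```

    In this example file `b` newly refers to `s2`, which was stored earlier, so only `s9` would need to cross the wire.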

    Seeding replication
    35.
    Type: Granted patent
    Legal status: In force

    Publication number: US08527455B2

    Publication date: 2013-09-03

    Application number: US12890688

    Filing date: 2010-09-26

    CPC classification number: G06F17/30174 G06F17/30159 G06F17/30212

    Abstract: Seeding replication is disclosed. One or more but not all files stored on a deduplicated storage system are selected to be replicated. One or more segments referred to by the selected one or more but not all files are determined. A data structure is created that is used to indicate that at least the one or more segments are to be replicated. In the event that an indication based at least in part on the data structure indicates that a candidate segment stored on the deduplicating storage system is to be replicated, the candidate segment is replicated.

    Partitioning a data stream using embedded anchors
    36.
    Type: Granted patent
    Legal status: In force

    Publication number: US08234413B2

    Publication date: 2012-07-31

    Application number: US13152110

    Filing date: 2011-06-02

    CPC classification number: G06F17/30156

    Abstract: Selecting a segment boundary within block b is disclosed. A first anchor location j|j+1 is identified wherein a value of f(b[j−A+1 . . . j+B]) satisfies a constraint and wherein A and B are non-negative integers. A segment boundary location k|k+1 is determined wherein k is greater than minimum distance from j.
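
    The anchor test in this abstract can be sketched as follows. The assumptions are that f is a CRC32 of the window b[j−A+1 … j+B] (inclusive, 0-based), that the constraint is divisibility by a chosen `divisor`, and that the boundary is placed at the smallest k whose distance from j exceeds the minimum. None of these choices come from the patent text, and both function names are hypothetical.

```python
import zlib
from typing import Optional


def find_anchor(b: bytes, A: int, B: int, divisor: int,
                start: int = 0) -> Optional[int]:
    """Return the first j >= start whose window satisfies the constraint."""
    # j is limited so that the window b[j-A+1 .. j+B] lies inside the block.
    for j in range(max(start, A - 1), len(b) - B):
        window = b[j - A + 1:j + B + 1]      # A + B bytes
        if zlib.crc32(window) % divisor == 0:
            return j
    return None


def boundary_after(b: bytes, j: int, min_distance: int) -> int:
    """Return k for the boundary k|k+1, with k - j > min_distance."""
    k = j + min_distance + 1
    # Clamp to the end of the block; a clamped k may be closer than asked.
    return min(k, len(b) - 1)


block = b"the quick brown fox jumps over the lazy dog"
j = find_anchor(block, A=4, B=2, divisor=1)   # divisor 1 accepts any window
k = boundary_after(block, j, min_distance=5)
```

    With `divisor=1` every window qualifies, so the first admissible position (j = A − 1) is returned; a larger divisor makes anchors sparser.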

    Efficiently indexing and searching similar data
    37.
    Type: Patent application
    Legal status: In force

    Publication number: US20120041957A1

    Publication date: 2012-02-16

    Application number: US13280195

    Filing date: 2011-10-24

    CPC classification number: G06F17/30964 Y10S707/99956

    Abstract: Techniques for efficiently indexing and searching similar data are described herein. According to one embodiment, in response to a query for one or more terms received from a client, a query index is accessed to retrieve a list of one or more super files. Each super file is associated with a group of similar files. Each super file includes terms and/or sequences of terms obtained from the associated group of similar files. Thereafter, the super files representing groups of similar files are presented to the client, where each of the super files includes at least one of the queried terms. Other methods and apparatuses are also described.
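
    The super-file lookup described in this abstract can be sketched as a small inverted index. The sketch assumes files are already grouped by similarity, that terms are whitespace-separated lowercase words, and that a super file is simply the set of terms from its group; the function names are hypothetical.

```python
from typing import Dict, Iterable, List, Set


def build_super_files(groups: Dict[str, Dict[str, str]]) -> Dict[str, Set[str]]:
    """Merge the terms of every file in a group into one super file."""
    super_files: Dict[str, Set[str]] = {}
    for group_id, files in groups.items():
        terms: Set[str] = set()
        for text in files.values():
            terms.update(text.lower().split())
        super_files[group_id] = terms
    return super_files


def build_query_index(super_files: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Map each term to the IDs of the super files that contain it."""
    index: Dict[str, Set[str]] = {}
    for group_id, terms in super_files.items():
        for term in terms:
            index.setdefault(term, set()).add(group_id)
    return index


def search(index: Dict[str, Set[str]], terms: Iterable[str]) -> List[str]:
    """Return super files that include at least one of the queried terms."""
    hits: Set[str] = set()
    for term in terms:
        hits |= index.get(term.lower(), set())
    return sorted(hits)


groups = {
    "g1": {"a.txt": "alpha beta", "a_v2.txt": "alpha beta gamma"},
    "g2": {"b.txt": "delta"},
}
index = build_query_index(build_super_files(groups))
results = search(index, ["gamma", "delta"])
```

    Indexing one super file per group means two near-identical versions of a file add a single index entry instead of two, which is the saving the abstract points to.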

    Efficiently indexing and searching similar data
    38.
    Type: Granted patent
    Legal status: In force

    Publication number: US08099401B1

    Publication date: 2012-01-17

    Application number: US11779486

    Filing date: 2007-07-18

    CPC classification number: G06F17/30964 Y10S707/99956

    Abstract: Techniques for efficiently indexing and searching similar data are described herein. According to one embodiment, in response to a query for one or more terms received from a client, a query index is accessed to retrieve a list of one or more super files. Each super file is associated with a group of similar files. Each super file includes terms and/or sequences of terms obtained from the associated group of similar files. Thereafter, the super files representing groups of similar files are presented to the client, where each of the super files includes at least one of the queried terms. Other methods and apparatuses are also described.

    Partitioning a data stream using embedded anchors
    39.
    Type: Patent application
    Legal status: In force

    Publication number: US20110302326A1

    Publication date: 2011-12-08

    Application number: US13152110

    Filing date: 2011-06-02

    CPC classification number: G06F17/30156

    Abstract: Selecting a segment boundary within block b is disclosed. A first anchor location j|j+1 is identified wherein a value of f(b[j−A+1 . . . j+B]) satisfies a constraint and wherein A and B are non-negative integers. A segment boundary location k|k+1 is determined wherein k is greater than minimum distance from j.

