Systems and Methods for Version Chain Clustering
    3.
    发明申请
    Systems and Methods for Version Chain Clustering 审中-公开
    版本链聚类的系统和方法

    公开(公告)号:US20130066868A1

    公开(公告)日:2013-03-14

    申请号:US13273080

    申请日:2011-10-13

    IPC分类号: G06F17/30

    摘要: A system, a method and a computer program product for storing data, which include receiving a data stream having a plurality of transactions that include at least one portion of data, determining whether at least one portion of data within at least one transaction is substantially similar to at least another portion of data within at least one transaction, clustering together at least one portion of data and at least another portion of data within at least one transaction, selecting one of at least one portion of data and at least another portion of data as a representative of at least one portion of data and at least another portion of data in the received data stream, and storing each representative of a portion of data from each transaction in the plurality of transactions, wherein a plurality of representatives is configured to form a chain representing the received data stream.

    摘要翻译: 一种用于存储数据的系统,方法和计算机程序产品,其包括接收具有包括至少一部分数据的多个事务的数据流,确定至少一个事务中的至少一部分数据是否基本相似 至少一个交易中的至少另一部分数据,将数据的至少一部分和至少一个交易中的至少另一部分数据聚集在一起,选择数据的至少一部分和至少另一部分数据中的一个 作为所接收的数据流中数据的至少一部分和至少另一部分数据的代表,并且存储每个代表来自多个事务中每个交易的一部分数据的代表,其中多个代表被配置为形成 代表所接收的数据流的链。