METHOD AND APPARATUS FOR PROVIDING A FILTER JOIN ON DATA STREAMS
    1.
    Patent application
    Status: under examination (published)

    Publication No.: US20110131198A1

    Publication date: 2011-06-02

    Application No.: US12627079

    Filing date: 2009-11-30

    IPC classification: G06F17/30

    CPC classification: G06F16/24568 G06F16/2456

    Abstract: A method and apparatus for processing at least one data stream are disclosed. For example, the method receives at least a join query for the at least one data stream, wherein the join query specifies a lifetime for keeping a tuple as a marker for a beginning of a sequence of interest, and receives a tuple from the at least one data stream. If the tuple is the beginning of a sequence of interest, the method marks it as such and stores it. The method applies one or more initial predicates to the tuple and, if the tuple meets those predicates, determines whether the tuple matches a marked tuple. If the tuple meets the one or more initial predicates, the method also determines whether the tuple meets one or more conditions to be outputted.
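
    The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the `Packet` type, the `filter_join` function, the join key, and the choice to drop a marker lazily once its lifetime has passed are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, Iterable, Iterator


@dataclass(frozen=True)
class Packet:
    """Hypothetical stream tuple; the field names are illustrative only."""
    ts: float
    flow: str
    kind: str
    size: int


def filter_join(
    stream: Iterable[Packet],
    lifetime: float,
    key: Callable[[Packet], Hashable],
    is_start: Callable[[Packet], bool],
    initial_pred: Callable[[Packet], bool],
    output_cond: Callable[[Packet], bool],
) -> Iterator[Packet]:
    """Yield tuples that join with a live marker and pass the output condition."""
    markers: Dict[Hashable, Packet] = {}  # join key -> stored marker tuple
    for tup in stream:
        k = key(tup)
        # Mark and store the tuple if it begins a sequence of interest.
        if is_start(tup):
            markers[k] = tup
        # Apply the initial predicates before probing the markers.
        if not initial_pred(tup):
            continue
        marker = markers.get(k)
        if marker is None:
            continue
        # Assumption: a marker older than `lifetime` is dropped when probed.
        if tup.ts - marker.ts > lifetime:
            del markers[k]
            continue
        if output_cond(tup):
            yield tup
```

    In this sketch the only state kept per key is the most recent marker, which is what lets the join act as a filter rather than buffering both inputs.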

    Method and apparatus for data stream sampling
    2.
    Patent application
    Status: under examination (published)

    Publication No.: US20070226188A1

    Publication date: 2007-09-27

    Application No.: US11389851

    Filing date: 2006-03-27

    IPC classification: G06F17/30

    Abstract: In one embodiment, the present invention is a method and apparatus for data stream sampling. In one embodiment, a tuple of a data stream is received from a sampling window of the data stream. The tuple is associated with a group, selected from a set of one or more groups, which reflects a subset of information relating to a sample of the data stream. In addition, the tuple is associated with a supergroup, selected from a set of one or more supergroups, which reflects global information relating to the sample. It is then determined whether receipt of the tuple triggers a cleaning phase in which one or more tuples are shed from the sample. The sampling operator can be implemented to execute a variety of different sampling algorithms, including well-known and experimental algorithms.
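
    A rough Python sketch of the group/supergroup bookkeeping follows. The abstract does not fix a sampling algorithm, so the cleaning rule used here (shed the smallest groups once a supergroup holds more than `max_groups` groups) is purely an assumption, as are the class name `GroupedSampler` and its parameters.

```python
from typing import Any, Callable, Dict, Hashable, List


class GroupedSampler:
    """Keeps a per-supergroup sample and sheds groups when it grows too large."""

    def __init__(
        self,
        max_groups: int,
        group_of: Callable[[Any], Hashable],
        supergroup_of: Callable[[Any], Hashable],
    ) -> None:
        self.max_groups = max_groups
        self.group_of = group_of
        self.supergroup_of = supergroup_of
        # supergroup -> {group -> sampled tuples}; the number of groups in a
        # supergroup is the "global" state consulted by the cleaning trigger.
        self.samples: Dict[Hashable, Dict[Hashable, List[Any]]] = {}

    def add(self, tup: Any) -> bool:
        """Add a tuple; return True if it triggered a cleaning phase."""
        groups = self.samples.setdefault(self.supergroup_of(tup), {})
        groups.setdefault(self.group_of(tup), []).append(tup)
        if len(groups) > self.max_groups:
            self._clean(groups)
            return True
        return False

    def _clean(self, groups: Dict[Hashable, List[Any]]) -> None:
        # Assumed policy: drop the group with the fewest tuples until the
        # budget is met; ties go to the group that was inserted first.
        while len(groups) > self.max_groups:
            victim = min(groups, key=lambda g: len(groups[g]))
            del groups[victim]
```

    Swapping `_clean` for a different shedding rule is one way such an operator could host different sampling algorithms.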

    LINK-BASED CLASSIFICATION OF GRAPH NODES
    3.
    Patent application
    Status: under examination (published)

    Publication No.: US20090132561A1

    Publication date: 2009-05-21

    Application No.: US11943681

    Filing date: 2007-11-21

    IPC classification: G06F17/30

    CPC classification: G06F16/958 G06F16/9024

    Abstract: A method of labeling unlabeled nodes in a graph that represents objects that have an explicit structure between them. A computing device can use a labeling engine to identify labeled nodes in a graph and to identify an unlabeled node in the graph that is structurally associated with the labeled nodes. The labeling engine can label the unlabeled node with the label of a labeled node based on the structural association between the unlabeled node and the labeled node.
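
    One simple way to realize this idea is a single pass of neighbor voting, sketched below in Python. The abstract does not say how the structural association is scored, so the majority vote over directly linked labeled nodes, the function name `label_unlabeled`, and the adjacency-list representation are assumptions.

```python
from collections import Counter
from typing import Dict, Hashable, List


def label_unlabeled(
    adjacency: Dict[Hashable, List[Hashable]],
    labels: Dict[Hashable, str],
) -> Dict[Hashable, str]:
    """Return a copy of `labels` extended to unlabeled nodes with labeled neighbors."""
    result = dict(labels)
    for node, neighbors in adjacency.items():
        if node in labels:
            continue
        # Count only the labels that were given up front, so the outcome
        # does not depend on the order in which nodes are visited.
        votes = Counter(labels[n] for n in neighbors if n in labels)
        if votes:
            result[node] = votes.most_common(1)[0][0]
    return result
```

    Nodes with no labeled neighbor stay unlabeled in this sketch; repeating the pass with the extended labels would spread labels further along the links.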

    MONITORING REGULAR EXPRESSIONS ON OUT-OF-ORDER STREAMS
    4.
    Patent application
    Status: under examination (published)

    Publication No.: US20070226362A1

    Publication date: 2007-09-27

    Application No.: US11554264

    Filing date: 2006-10-30

    IPC classification: G06F15/16

    Abstract: A system, method and computer-readable medium provide for regular expression matching over a plurality of packets. The method embodiment comprises, for each data segment in a flow with no predecessor in a stored list of objects generated from traversing a deterministic finite state automaton (DFA) associated with the regular expression: traversing the DFA using the data segment and a list of all non-accepting states; and if the plurality of packets is not declared as matching, then storing, as a list of equivalence classes, automaton state pairs having different starting states but an identical ending state. Finally, the method comprises determining whether the flow matches the regular expression.
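
    The core trick, running a segment from every non-accepting state and grouping start states by the state they end in, can be sketched in Python as follows. This is a simplified illustration and not the claimed method: it summarizes every segment rather than only those without a stored predecessor, and the names `summarize_segment`, `flow_matches`, and the toy DFA `step_ab` (which accepts any input containing "ab") are hypothetical.

```python
from typing import Callable, Dict, Iterable, List, Set

State = int


def summarize_segment(
    step: Callable[[State, str], State],
    non_accepting: Iterable[State],
    accepting: Set[State],
    segment: str,
) -> Dict[State, Set[State]]:
    """Map each ending state to the set of start states that lead to it."""
    classes: Dict[State, Set[State]] = {}
    for start in non_accepting:
        state = start
        for symbol in segment:
            state = step(state, symbol)
            if state in accepting:
                break  # a match was found; the rest of the segment is moot
        classes.setdefault(state, set()).add(start)
    return classes


def flow_matches(
    ordered_classes: List[Dict[State, Set[State]]],
    start: State,
    accepting: Set[State],
) -> bool:
    """Chain per-segment summaries, given in sequence order, from `start`."""
    state = start
    for classes in ordered_classes:
        if state in accepting:
            return True
        state = next(end for end, starts in classes.items() if state in starts)
    return state in accepting


def step_ab(state: State, symbol: str) -> State:
    """Toy DFA: state 2 (accepting) is reached once 'ab' has been seen."""
    if state == 2:
        return 2
    if symbol == "a":
        return 1
    if state == 1 and symbol == "b":
        return 2
    return 0


# Segment 1 arrives before segment 0; each is summarized on arrival.
arrived = {
    1: summarize_segment(step_ab, [0, 1], {2}, "byy"),
    0: summarize_segment(step_ab, [0, 1], {2}, "xxa"),
}
ordered = [arrived[seq] for seq in sorted(arrived)]
print(flow_matches(ordered, 0, {2}))
```

    Because each summary depends only on its own segment, segments can be processed in arrival order and stitched together once the gaps are filled, without buffering the raw payload.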
