Handling Silent Relations In A Data Stream Management System
    1.
    发明申请
    Handling Silent Relations In A Data Stream Management System 有权
    在数据流管理系统中处理无声关系

    公开(公告)号:US20090100029A1

    公开(公告)日:2009-04-16

    申请号:US11873407

    申请日:2007-10-16

    IPC分类号: G06F7/00 G06F15/16

    CPC分类号: G06F17/30548

    摘要: A new continuous query to a data stream management system (DSMS) may use a silent relation whose source does not provide any heartbeats. During execution of any given operator, the DSMS processes data from one input (“active input”) using its time stamp as if the same time stamp is received from another input whose relation has fallen silent (“empty input”), if one or more predetermined conditions are met. One predetermined condition is that the empty input's operator has all its own inputs silent. And another predetermined condition is that a maximum time stamp that is currently present among all the inputs of the empty input's operator is equal to a time stamp that was last received from these same inputs. When both conditions are met, previously received data from the empty input is time stamped with the same time stamp as the most recently received data from the active input.

    摘要翻译: 对数据流管理系统(DSMS)的新的连续查询可以使用其源不提供任何心跳的无声关系。 在执行任何给定的操作者期间,DSMS使用其时间戳来处理来自一个输入(“活动输入”)的数据,就好像从另一个输入中接收到相同时间标记的关系已经下沉(“空输入”),如果一个或 更符合规定的条件。 一个预定的条件是空输入的运算符具有所有自己的输入静音。 并且另一个预定条件是当前存在于空输入运算符的所有输入中的最大时间戳等于从这些相同输入最后接收的时间戳。 当满足这两个条件时,来自空输入的先前接收到的数据与来自活动输入的最近接收的数据具有相同的时间标记的时间戳。

    Support for user defined aggregations in a data stream management system
    2.
    发明申请
    Support for user defined aggregations in a data stream management system 有权
    支持数据流管理系统中用户定义的聚合

    公开(公告)号:US20090106218A1

    公开(公告)日:2009-04-23

    申请号:US11977439

    申请日:2007-10-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30489 G06F17/30516

    摘要: A computer is programmed to accept a command to create a new aggregation defined by a user during execution of continuous queries on streams of data. The computer is further programmed to thereafter accept and process new continuous queries using the new aggregation, in a manner similar to built-in aggregations. The user typically writes a set of instructions to perform the new aggregation, and identifies in the command, a location of the set of instructions. In response to such a command, the computer creates metadata identifying the new aggregation. The metadata is used to instantiate one aggregation for each group of data in a current window, grouped by an attribute identified in a new query.

    摘要翻译: 计算机被编程为接受命令以在对数据流的连续查询执行期间创建由用户定义的新聚合。 该计算机被进一步编程为此后以类似于内置聚合的方式接受和处理使用新聚合的新的连续查询。 用户通常写一组指令来执行新的聚合,并且在命令中标识该组指令的位置。 响应于这样的命令,计算机创建标识新聚合的元数据。 元数据用于为当前窗口中的每组数据实例化一个聚合,按照新查询中标识的属性进行分组。

    DELETING A CONTINUOUS QUERY FROM A DATA STREAM MANAGEMENT SYSTEM CONTINUING TO OPERATE ON OTHER QUERIES
    3.
    发明申请
    DELETING A CONTINUOUS QUERY FROM A DATA STREAM MANAGEMENT SYSTEM CONTINUING TO OPERATE ON OTHER QUERIES 有权
    从数据流管理系统中删除连续查询,继续运行其他查询

    公开(公告)号:US20090106215A1

    公开(公告)日:2009-04-23

    申请号:US11874850

    申请日:2007-10-18

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30516

    摘要: A first continuous query is deleted from several continuous queries that are being executed, by performing different actions that depend on how resources are shared therebetween. As an example, a second continuous query is deleted if a view is referenced by the first continuous query, and if the view is built on the second continuous query, and if the view is not referenced by any query other than the first continuous query. Such deletion is followed by compiling the second continuous query, followed by updating the view to refer to a newly-compiled version of the second continuous query. As another example, if an operator in the first continuous query is not used by any other query, this operator is de-scheduled from execution, and any exclusively-owned resources are released. Also, data to be read by the operator and data generated by the operator, if present in shared resources, are deleted therefrom.

    摘要翻译: 通过执行取决于资源如何共享的不同动作,从正在执行的几个连续查询中删除第一个连续查询。 作为示例,如果第一连续查询引用了视图,并且视图是建立在第二连续查询上,并且视图没有被第一连续查询之外的任何查询引用,则删除第二连续查询。 这样的删除之后是编译第二连续查询,随后更新视图以引用第二连续查询的新编译版本。 作为另一示例,如果第一连续查询中的运算符不被任何其他查询使用,则该运算符从执行中去调度,并且释放任何专有资源。 此外,由用户读取的数据和由运营商产生的数据(如果存在于共享资源中)将被删除。

    Support for sharing computation between aggregations in a data stream management system
    4.
    发明申请
    Support for sharing computation between aggregations in a data stream management system 有权
    支持在数据流管理系统中的聚合之间共享计算

    公开(公告)号:US20090106198A1

    公开(公告)日:2009-04-23

    申请号:US11977440

    申请日:2007-10-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30516 G06F17/30489

    摘要: A computer is programmed to process a continuous query that is known to perform a new aggregation on one or more stream(s) of data, using one or more other aggregations on the stream(s). The computer creates an operator to execute the continuous query, and schedules the operator for execution in a specific order. In several embodiments, the computer determines the order based on dependency of the new aggregation on other aggregation(s), and on the order of performance of the other aggregation(s). The new aggregation is scheduled for performance after performance of each of the other aggregations. The computer is further programmed to pass results of the other aggregations to the new aggregation, by execution of a predetermined function. Support for use of the other aggregations results within the new aggregation eliminates redundant computation of the other aggregations within the new aggregation. The new aggregation may be user defined or built-in.

    摘要翻译: 计算机被编程为处理已知的在一条或多条数据流上执行新聚合的连续查询,使用流上的一个或多个其他聚合。 计算机创建一个运算符来执行连续查询,并按照特定顺序安排运营商执行。 在几个实施例中,计算机基于新聚合对其他聚合的依赖性以及其他聚合的性能的顺序来确定顺序。 新的聚合计划在执行每个其他聚合后执行性能。 计算机还被编程为通过执行预定功能将其他聚合的结果传递给新聚合。 在新集合中支持使用其他聚合结果可以消除新聚合中其他聚合的冗余计算。 新聚合可以是用户定义的或内置的。

    Support for user defined aggregations in a data stream management system
    5.
    发明授权
    Support for user defined aggregations in a data stream management system 有权
    支持数据流管理系统中用户定义的聚合

    公开(公告)号:US08204875B2

    公开(公告)日:2012-06-19

    申请号:US13184528

    申请日:2011-07-16

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30489 G06F17/30516

    摘要: A computer is programmed to accept a command to create a new aggregation defined by a user during execution of continuous queries on streams of data. The computer is further programmed to thereafter accept and process new continuous queries using the new aggregation, in a manner similar to built-in aggregations. The user typically writes a set of instructions to perform the new aggregation, and identifies in the command, a location of the set of instructions. In response to such a command, the computer creates metadata identifying the new aggregation. The metadata is used to instantiate one aggregation for each group of data in a current window, grouped by an attribute identified in a new query.

    摘要翻译: 计算机被编程为接受命令以在对数据流的连续查询执行期间创建由用户定义的新聚合。 该计算机被进一步编程为此后以类似于内置聚合的方式接受和处理使用新聚合的新的连续查询。 用户通常写一组指令来执行新的聚合,并且在命令中标识该组指令的位置。 响应于这样的命令,计算机创建标识新聚合的元数据。 元数据用于为当前窗口中的每组数据实例化一个聚合,按照新查询中标识的属性进行分组。

    Support for incrementally processing user defined aggregations in a data stream management system
    6.
    发明申请
    Support for incrementally processing user defined aggregations in a data stream management system 有权
    支持在数据流管理系统中逐步处理用户定义的聚合

    公开(公告)号:US20090106440A1

    公开(公告)日:2009-04-23

    申请号:US11977437

    申请日:2007-10-20

    IPC分类号: G06F15/16

    CPC分类号: G06F17/30412 G06F17/30516

    摘要: A computer is programmed to accept a command for creation of a new aggregation defined by a user to process data incrementally, one tuple at a time. One or more incremental function(s) in a set of instructions written by the user to implement the new aggregation maintain(s) locally any information that is to be passed between successive invocations, to support computing the aggregation for a given set of tuples as a whole. The user writes a set of instructions to perform the aggregation incrementally, including a plus function which is repeatedly invoked, only once, for each addition to a window of a message. The user also writes a minus function to be invoked with the message, to return the value of incremental aggregation over the window after removal of the message. In such embodiments, the computer does not maintain copies of messages in the window for use by aggregation function(s).

    摘要翻译: 计算机被编程为接受用于创建由用户定义的新聚合的命令以逐步处理数据,一次一个元组。 由用户写入的用于实现新聚合的一组指令中的一个或多个递增函数在本地维护将在连续调用之间传递的任何信息,以支持计算给定的一组元组的聚合, 整个。 用户写入一组指令以递增地执行聚合,包括对于每个添加到消息窗口的一次,重复调用一次的加号功能。 用户还可以使用该消息写入要调用的减号函数,以便在删除消息后返回窗口上的增量聚合值。 在这样的实施例中,计算机不维护窗口中的消息副本以供聚合功能使用。

    ADDING NEW CONTINUOUS QUERIES TO A DATA STREAM MANAGEMENT SYSTEM OPERATING ON EXISTING QUERIES
    7.
    发明申请
    ADDING NEW CONTINUOUS QUERIES TO A DATA STREAM MANAGEMENT SYSTEM OPERATING ON EXISTING QUERIES 有权
    对现有问题的数据流管理系统增加新的连续性查询

    公开(公告)号:US20090106214A1

    公开(公告)日:2009-04-23

    申请号:US11874202

    申请日:2007-10-17

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30516

    摘要: A new continuous query to a data stream management system (DSMS) may use a stream or a relation which may or may not be used by continuous queries previously registered in the DSMS. The DSMS is programmed to modify an execution plan to accommodate execution of the new query while continuing to execute the previously registered continuous queries. The modified execution plan may include new operators and/or share existing operators. The DSMS is programmed to cause operators which output a relation to propagate a current state of the relation to each newly-coupled operator that uses the relation. The current state is propagated only to operators that have been newly coupled and have thus not yet received any state information previously. After propagation of current state to newly-coupled operators, results of processing any new data for the relation are supplied to all operators coupled thereto, including newly-coupled operators and existing operators.

    摘要翻译: 对数据流管理系统(DSMS)的新的连续查询可以使用可以或可能不被先前在DSMS中注册的连续查询使用的流或关系。 DSMS被编程为修改执行计划以适应新查询的执行,同时继续执行先前注册的连续查询。 修改的执行计划可以包括新的运营商和/或共享现有运营商。 DSMS被编程为使得输出关系的操作者将关系的当前状态传播到使用该关系的每个新耦合的运算符。 当前状态仅传播到已经新耦合的操作符,并且因此以前尚未接收到任何状态信息。 在将当前状态传播到新耦合的运营商之后,将与关联关系的任何新数据进行处理的结果提供给所有与之耦合的运营商,包括新耦合运营商和现有运营商。

    Dynamically sharing a subtree of operators in a data stream management system operating on existing queries
    8.
    发明授权
    Dynamically sharing a subtree of operators in a data stream management system operating on existing queries 有权
    在现有查询操作的数据流管理系统中动态共享运算符子树

    公开(公告)号:US08296316B2

    公开(公告)日:2012-10-23

    申请号:US11874197

    申请日:2007-10-17

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30516

    摘要: A new continuous query to a data stream management system (DSMS) may use several operators that are equivalent to operators currently being used by continuous queries that have been previously registered in the DSMS. To determine equivalence of operators, the DSMS checks at least the function and the data input to the operators. On finding equivalence, the DSMS modifies a global plan being executed, to use at least an existing subtree of operators during execution of the new continuous query, to generate a modified plan. The DSMS is also programmed to cause each relation source operator (which outputs a relation) to propagate a current state of the relation to each operator newly coupled to the relational operator. After propagation of current state to newly-coupled operators, each operator in the modified plan processes any new data and supplies the result to all operators coupled thereto, including newly-coupled operators and existing operators.

    摘要翻译: 对数据流管理系统(DSMS)的新的连续查询可以使用几个等效于先前在DSMS中注册的连续查询使用的运算符的运算符。 为了确定运算符的等效性,DSMS至少检查运算符的功能和数据输入。 在发现等价性时,DSMS修改正在执行的全局计划,以便在执行新的连续查询期间至少使用现有的运算符子树来生成修改的计划。 DSMS还被编程为使得每个关系源操作符(其输出关系)将关系的当前状态传播到新耦合到关系运算符的每个运算符。 在将当前状态传播到新耦合的运营商之后,修改的计划中的每个运营商处理任何新的数据,并将结果提供给与其耦合的所有运营商,包括新耦合运营商和现有运营商。

    SUPPORT FOR USER DEFINED FUNCTIONS IN A DATA STREAM MANAGEMENT SYSTEM
    9.
    发明申请
    SUPPORT FOR USER DEFINED FUNCTIONS IN A DATA STREAM MANAGEMENT SYSTEM 有权
    支持数据流管理系统中用户定义的功能

    公开(公告)号:US20120041934A1

    公开(公告)日:2012-02-16

    申请号:US13244272

    申请日:2011-09-23

    IPC分类号: G06F17/00 G06F17/30

    CPC分类号: G06F8/20

    摘要: A data stream management system (DSMS) is designed to support a new user-defined function, by creating and using at least two structures as follows. A first structure (“metadata entry”) is created in response to a command for creation of the new function, and maps a single instance of a class to the function's name. A second structure is created with creation of an operator on receipt of each new continuous query that uses the new function. The second structure (“operator specific data structure”) contains a path to the newly-created instance, which path is obtained by looking up the first structure. Additional second structures are created on receipt of additional continuous queries which use the new function, but all second structures contain the same path. All continuous queries use the same instance. Repeated use of a single instance to compile and execute multiple queries eliminates repeated instantiation of the same function.

    摘要翻译: 数据流管理系统(DSMS)旨在通过如下创建和使用至少两个结构来支持新的用户定义的功能。 响应于用于创建新功能的命令创建第一个结构(“元数据条目”),并将类的单个实例映射到函数的名称。 在接收到使用新功能的每个新的连续查询时,创建第二个结构。 第二个结构(“运算符特定数据结构”)包含新创建的实例的路径,通过查找第一个结构获取哪个路径。 附加的第二个结构是在接收到使用新功能的附加连续查询时创建的,但所有第二个结构都包含相同的路径。 所有连续查询使用相同的实例。 重复使用单个实例来编译和执行多个查询消除了相同功能的重复实例化。

    SUPPORT FOR USER DEFINED AGGREGATIONS IN A DATA STREAM MANAGEMENT SYSTEM
    10.
    发明申请
    SUPPORT FOR USER DEFINED AGGREGATIONS IN A DATA STREAM MANAGEMENT SYSTEM 有权
    支持数据流管理系统中用户定义的聚合

    公开(公告)号:US20110270879A1

    公开(公告)日:2011-11-03

    申请号:US13184528

    申请日:2011-07-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30489 G06F17/30516

    摘要: A computer is programmed to accept a command to create a new aggregation defined by a user during execution of continuous queries on streams of data. The computer is further programmed to thereafter accept and process new continuous queries using the new aggregation, in a manner similar to built-in aggregations. The user typically writes a set of instructions to perform the new aggregation, and identifies in the command, a location of the set of instructions. In response to such a command, the computer creates metadata identifying the new aggregation. The metadata is used to instantiate one aggregation for each group of data in a current window, grouped by an attribute identified in a new query.

    摘要翻译: 计算机被编程为接受命令以在对数据流的连续查询执行期间创建由用户定义的新聚合。 该计算机被进一步编程为此后以类似于内置聚合的方式接受和处理使用新聚合的新的连续查询。 用户通常写一组指令来执行新的聚合,并且在命令中标识该组指令的位置。 响应于这样的命令,计算机创建标识新聚合的元数据。 元数据用于为当前窗口中的每组数据实例化一个聚合,按照新查询中标识的属性进行分组。