EXTENSIBILITY MODEL FOR STREAM-BASED OPERATORS AND AGGREGATES
    1.
    发明申请
    EXTENSIBILITY MODEL FOR STREAM-BASED OPERATORS AND AGGREGATES 有权
    基于流水线算子和集合的可扩展模型

    公开(公告)号:US20110283295A1

    公开(公告)日:2011-11-17

    申请号:US12777259

    申请日:2010-05-11

    IPC分类号: G06F9/46

    CPC分类号: G06F9/542 G06F8/24

    摘要: An extensibility framework that allows a user to write user-defined modules that include user-defined operators (UDO) and user-defined aggregators (UDA) in a non-temporal fashion without the need to worry about temporal attributes of events (or event types). The temporal domain is managed on behalf of the user, and allows the user to write operators and aggregates in the temporal data streaming domain as well as to port existing libraries of non-temporal UDOs/UDAs to the temporal data streaming domain. Temporal attributes and event types are managed for non-temporal UDOs/UDAs by the extensibility framework on behalf of the UDO/UDA writer. Windows can be employed to bridge the gap between the non-temporal domains and temporal domains. Support for complex event processing (CEP) is provided in UDOs/UDAs for base classes related to a CEP operator, CEP aggregate, CEP time sensitive operator, and CEP time sensitive aggregate.

    摘要翻译: 一种可扩展性框架,允许用户以非时间方式编写包含用户定义的运算符(UDO)和用户定义聚合器(UDA)的用户定义模块,而无需担心事件(或事件类型)的时间属性 )。 时域是代表用户管理的,并允许用户在时间数据流域中编写运算符和聚合,并将现有的非时间UDO / UDA库链接到时间数据流域。 代表UDO / UDA作者的可扩展性框架为非时间UDO / UDAs管理时间属性和事件类型。 Windows可用于弥合非时间域和时域之间的差距。 对于与CEP运算符,CEP聚合,CEP时间敏感运算符和CEP时间敏感聚合相关的基类,在UDO / UDA中提供了对复杂事件处理(CEP)的支持。

    Streaming operator placement for distributed stream processing
    2.
    发明授权
    Streaming operator placement for distributed stream processing 有权
    用于分布式流处理的流式操作员位置

    公开(公告)号:US08060614B2

    公开(公告)日:2011-11-15

    申请号:US12141914

    申请日:2008-06-19

    IPC分类号: G06F15/16

    CPC分类号: G06F9/5083

    摘要: A streaming operator assignment system and method for determining a streaming operator assignment that minimizes overload in a data processing system. Embodiments of the streaming operator assignment system include an optimization goals definition module, which defines optimization goals in terms of fundamental quantities that system administrators and application writers want to control, such as minimizing the worst case latency over all periods of time, or minimizing how much the system is backlogged with work. Embodiments of the streaming operator assignment system also include an optimization goals solution module that optimizes and solves a selected optimization goal. A specialized optimization technique is used to find the best operator (or load) assignment using the optimization goals to measure of the value of the assignment. This technique minimizes an optimization goal by iterating over all possible operators assignments over all possible nodes to find the operator assignment that minimizes the desired optimization goal.

    摘要翻译: 一种用于确定使数据处理系统中的过载最小化的流操作员分配的流操作员分配系统和方法。 流操作者分配系统的实施例包括优化目标定义模块,其定义系统管理员和应用程序编写者想要控制的基本量的优化目标,例如最小化所有时间段内的最坏情况延迟,或最小化多少 系统用工作积压。 流式操作员分配系统的实施例还包括优化和解决所选优化目标的优化目标解决方案模块。 使用专门的优化技术来使用优化目标来找到最佳的运算符(或加载)分配,以度量分配值。 该技术通过遍历所有可能节点上的所有可能的运算符分配来最小化优化目标,以找到最小化期望的优化目标的运算符分配。

    Extensibility model for stream-based operators and aggregates
    3.
    发明授权
    Extensibility model for stream-based operators and aggregates 有权
    基于流的运算符和聚合的可扩展性模型

    公开(公告)号:US08381233B2

    公开(公告)日:2013-02-19

    申请号:US12777259

    申请日:2010-05-11

    IPC分类号: G06F3/00

    CPC分类号: G06F9/542 G06F8/24

    摘要: An extensibility framework that allows a user to write user-defined modules that include user-defined operators (UDO) and user-defined aggregators (UDA) in a non-temporal fashion without the need to worry about temporal attributes of events (or event types). The temporal domain is managed on behalf of the user, and allows the user to write operators and aggregates in the temporal data streaming domain as well as to port existing libraries of non-temporal UDOs/UDAs to the temporal data streaming domain. Temporal attributes and event types are managed for non-temporal UDOs/UDAs by the extensibility framework on behalf of the UDO/UDA writer. Windows can be employed to bridge the gap between the non-temporal domains and temporal domains. Support for complex event processing (CEP) is provided in UDOs/UDAs for base classes related to a CEP operator, CEP aggregate, CEP time sensitive operator, and CEP time sensitive aggregate.

    摘要翻译: 一种可扩展性框架,允许用户以非时间方式编写包含用户定义的运算符(UDO)和用户定义聚合器(UDA)的用户定义模块,而无需担心事件(或事件类型)的时间属性 )。 时域是代表用户管理的,并允许用户在时间数据流域中编写运算符和聚合,并将现有的非时间UDO / UDA库链接到时间数据流域。 代表UDO / UDA作者的可扩展性框架为非时间UDO / UDAs管理时间属性和事件类型。 Windows可用于弥合非时间域和时域之间的差距。 对于与CEP运算符,CEP聚合,CEP时间敏感运算符和CEP时间敏感聚合相关的基类,在UDO / UDA中提供了对复杂事件处理(CEP)的支持。

    Recursive processing in streaming queries
    6.
    发明授权
    Recursive processing in streaming queries 有权
    流式查询中的递归处理

    公开(公告)号:US09229986B2

    公开(公告)日:2016-01-05

    申请号:US13298159

    申请日:2011-11-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30516

    摘要: The described implementations relate to recursive streaming queries. One technique processes a recursive streaming query through a query graph. The technique also detects when output produced by executing the query graph advances to a specific point.

    摘要翻译: 所描述的实现涉及递归流查询。 一种技术通过查询图来处理递归流式查询。 该技术还检测通过执行查询图形产生的输出何时前进到特定点。

    Operator state checkpoint markers and rehydration
    7.
    发明授权
    Operator state checkpoint markers and rehydration 有权
    操作员状态检查点标记和补液

    公开(公告)号:US08726076B2

    公开(公告)日:2014-05-13

    申请号:US13117165

    申请日:2011-05-27

    IPC分类号: G06F11/00 G06F11/16

    CPC分类号: G06F11/1438 G06F11/1471

    摘要: A checkpoint marker can be received at a first operator. The first operator can process the checkpoint marker by sending the checkpoint marker to a second operator and sending state checkpoint information representing a state of the first operator to a checkpoint writer. The checkpoint information can be used to rehydrate the state of one or more operators. For example, after a system failure, system shutdown, etc., checkpoint information can be received from a reader unit at a checkpoint information input queue of the first operator. A state of the first operator can be rehydrated using the checkpoint information. Processing of information in a data input queue of the first operator can be suspended while the checkpoint information is used to rehydrate the state of the first operator. Other operators in a system with the first operator (e.g., the second operator) may be checkpointed and rehydrated in the same manner as the first operator.

    摘要翻译: 可以在第一个操作员处接收检查点标记。 第一个操作员可以通过将检查点标记发送到第二个操作符来处理检查点标记,并将表示第一个操作符的状态的状态检查点信息发送到检查点编写器。 检查点信息可用于对一个或多个操作员的状态进行补水。 例如,在系统故障,系统关机等之后,可以在第一运营商的检查点信息输入队列处从读取单元接收检查点信息。 可以使用检查点信息对第一操作者的状态进行再水化。 当使用检查点信息来补充第一操作者的状态时,可以暂停处理第一操作者的数据输入队列中的信息。 具有第一操作者(例如,第二操作者)的系统中的其他操作者可以以与第一操作者相同的方式进行检查点和再水化。

    RECURSIVE PROCESSING IN STREAMING QUERIES
    8.
    发明申请
    RECURSIVE PROCESSING IN STREAMING QUERIES 有权
    循环查询中的回归处理

    公开(公告)号:US20120084322A1

    公开(公告)日:2012-04-05

    申请号:US13298159

    申请日:2011-11-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30516

    摘要: The described implementations relate to recursive streaming queries. One technique processes a recursive streaming query through a query graph. The technique also detects when output produced by executing the query graph advances to a specific point.

    摘要翻译: 所描述的实现涉及递归流查询。 一种技术通过查询图来处理递归流式查询。 该技术还检测通过执行查询图形产生的输出何时前进到特定点。

    Optimizing parameterized queries in a relational database management system
    9.
    发明授权
    Optimizing parameterized queries in a relational database management system 有权
    在关系数据库管理系统中优化参数化查询

    公开(公告)号:US08032522B2

    公开(公告)日:2011-10-04

    申请号:US11467377

    申请日:2006-08-25

    IPC分类号: G06F17/30 G06F15/16

    CPC分类号: G06F17/30474

    摘要: Parameterized queries are optimized by a transformational optimizer. The optimizer produces a dynamic plan that embeds multiple plan options that may be selected to execute a particular query. Parameter distribution improves query execution efficiency and performance by exploring a sample parameter space representative of the parameter values actually used. The dynamic plans can be simplified while maintaining an acceptable level of optimality by reducing the number of plan options. The reduction is achieved by eliminating switch unions to alternatives that are close in cost. Both approaches of parameter space exploration and dynamic plan generation are deeply integrated into the query optimizer.

    摘要翻译: 参数化查询由转换优化器进行优化。 优化器生成嵌入可以选择执行特定查询的多个计划选项的动态计划。 参数分布通过探索代表实际使用的参数值的样本参数空间来提高查询执行效率和性能。 通过减少计划选项的数量,可以简化动态计划,同时保持可接受的最优性水平。 这种减少是通过消除开关工会到成本接近的替代方案来实现的。 参数空间探索和动态计划生成的两种方法都深入集成到查询优化器中。

    CONSISTENCY SENSITIVE STREAMING OPERATORS
    10.
    发明申请
    CONSISTENCY SENSITIVE STREAMING OPERATORS 有权
    一致的敏感流程操作员

    公开(公告)号:US20090125635A1

    公开(公告)日:2009-05-14

    申请号:US11937118

    申请日:2007-11-08

    IPC分类号: G06F15/16

    CPC分类号: G06F11/0793 G06F11/0721

    摘要: A new approach for handling stream imperfections based on speculative execution involves the retraction of incorrect events facilitated using operators to remove speculatively produced incorrect output. Additionally, parameters are disclosed that define a spectrum of consistency levels. A first parameter, maximum blocking time, exposes a tradeoff between a degree of speculation and latency. A second parameter, the maximum time data is remembered before being purged from the system, exposes a tradeoff between state size and correctness. Varying these two parameters produces a spectrum of consistency levels (e.g., strong, middle, weak) which address the specific tradeoffs built into other systems. Retraction is accomplished using operators that include Select, AlterLifetime, Join, Sum, Align, and Finalize.

    摘要翻译: 基于推测执行的处理流瑕疵的新方法涉及使用运算符来消除误导事件的撤销,以消除推测出错的输出。 另外,公开了定义一致性级别的参数。 第一个参数,最大阻塞时间,暴露了一定程度的推测和延迟之间的折衷。 第二个参数是在从系统清除之前记住的最大时间数据暴露了状态大小和正确性之间的折衷。 改变这两个参数产生了一系列一致性级别(例如,强,中,弱),其解决了内置到其他系统中的特定权衡。 使用包括Select,AlterLifetime,Join,Sum,Align和Finalize的操作符来实现退回。