REAL-TIME-READY BEHAVIORAL TARGETING IN A LARGE-SCALE ADVERTISEMENT SYSTEM
    1.
    发明申请
    REAL-TIME-READY BEHAVIORAL TARGETING IN A LARGE-SCALE ADVERTISEMENT SYSTEM 有权
    大规模广告系统中的实时行为指标

    公开(公告)号:US20110313844A1

    公开(公告)日:2011-12-22

    申请号:US12818100

    申请日:2010-06-17

    IPC分类号: G06Q30/00 G06F17/30

    摘要: A “Real-Time-Ready Analyzer” combines a data stream management system (DSMS) with a map-reduce (M-R) framework to construct a streaming map-reduce framework that is suitable for real-time Behavioral Targeting (BT) (or other temporal queries). The Real-Time-Ready Analyzer allows users to write “dual-intent” temporal analysis queries for BT. These queries are succinct and easy to express, scale well on large-scale offline data, and can also work over real-time data. Further, the Real-Time-Ready Analyzer uses the aforementioned streaming map-reduce framework to provide dual-intent algorithms for end-to-end BT phases. Experiments using real data from an advertisement system show that the Real-Time-Ready Analyzer is very efficient and incurs orders-of-magnitude lower development effort than conventional systems.

    摘要翻译: “实时就绪分析器”将数据流管理系统(DSMS)与地图缩减(MR)框架相结合,构建适用于实时行为定位(BT)(或其他)的流映射缩减框架 时间查询)。 实时就绪分析器允许用户为BT写入“双重意图”时间分析查询。 这些查询简洁易懂,在大型离线数据上可以很好地扩展,也可以处理实时数据。 此外,实时就绪分析器使用上述流映射缩减框架来提供端到端BT阶段的双重意图算法。 使用广告系统的实际数据的实验表明,实时就绪分析器非常有效,并且比传统系统产生了比数字级更低的开发工作。

    Real-time-ready behavioral targeting in a large-scale advertisement system
    2.
    发明授权
    Real-time-ready behavioral targeting in a large-scale advertisement system 有权
    在大规模广告系统中实时就绪的行为定位

    公开(公告)号:US08442863B2

    公开(公告)日:2013-05-14

    申请号:US12818100

    申请日:2010-06-17

    IPC分类号: G06Q30/00

    摘要: A “Real-Time-Ready Analyzer” combines a data stream management system (DSMS) with a map-reduce (M-R) framework to construct a streaming map-reduce framework that is suitable for real-time Behavioral Targeting (BT) (or other temporal queries). The Real-Time-Ready Analyzer allows users to write “dual-intent” temporal analysis queries for BT. These queries are succinct and easy to express, scale well on large-scale offline data, and can also work over real-time data. Further, the Real-Time-Ready Analyzer uses the aforementioned streaming map-reduce framework to provide dual-intent algorithms for end-to-end BT phases. Experiments using real data from an advertisement system show that the Real-Time-Ready Analyzer is very efficient and incurs orders-of-magnitude lower development effort than conventional systems.

    摘要翻译: “实时就绪分析器”将数据流管理系统(DSMS)与地图缩减(MR)框架相结合,构建适用于实时行为定位(BT)(或其他)的流映射缩小框架 时间查询)。 实时就绪分析器允许用户为BT写入“双重意图”时间分析查询。 这些查询简洁易懂,在大型离线数据上可以很好地扩展,也可以处理实时数据。 此外,实时就绪分析器使用上述流映射缩减框架来提供端到端BT阶段的双重意图算法。 使用广告系统的实际数据的实验表明,实时就绪分析器非常有效,并且比传统系统产生了比数字级更低的开发工作。

    ESTIMATING LATENCIES FOR QUERY OPTIMIZATION IN DISTRIBUTED STREAM PROCESSING
    3.
    发明申请
    ESTIMATING LATENCIES FOR QUERY OPTIMIZATION IN DISTRIBUTED STREAM PROCESSING 审中-公开
    估计分布式流程中查询优化的延迟

    公开(公告)号:US20100030896A1

    公开(公告)日:2010-02-04

    申请号:US12573108

    申请日:2009-10-03

    IPC分类号: G06F15/173

    CPC分类号: G06F16/24568

    摘要: A “Query Optimizer” provides a cost estimation metric referred to as “Maximum Accumulated Overload” (MAO). MAO is approximately equivalent to maximum system latency in a data stream management system (DSMS). Consequently, MAO is directly relevant for use in optimizing latencies in real-time streaming applications running multiple continuous queries (CQs) over high data-rate event sources. In various embodiments, the Query Optimizer computes MAO given knowledge of original operator statistics, including “operator selectivity” and “cycles/event” in combination with an expected event arrival workload. Beyond use in query optimization to minimize worst-case latency, MAO is useful for addressing problems including admission control, system provisioning, user latency reporting, operator placements (in a multi-node environment), etc. In addition, MAO, as a surrogate for worst-case latency, is generally applicable beyond streaming systems, to any queue-based workflow system with control over the scheduling strategy.

    摘要翻译: “查询优化器”提供了称为“最大累积过载”(MAO)的成本估算度量。 MAO大致相当于数据流管理系统(DSMS)中的最大系统延迟。 因此,MAO与通过高数据速率事件源运行多个连续查询(CQ)的实时流应用的优化延迟有直接关系。 在各种实施例中,查询优化器结合预期的事件到达工作负载来计算给定原始运营商统计信息的MAO,包括“运营商选择性”和“周期/事件”。 除了在查询优化中使用以最小化最坏情况的延迟之外,MAO可用于解决问题,包括准入控制,系统配置,用户延迟报告,运营商布置(在多节点环境中)等。此外,MAO作为替代 对于最差情况下的延迟,通常适用于流式传输系统以及具有对调度策略的控制的任何基于队列的工作流系统。

    Dynamic pattern matching over ordered and disordered data streams
    4.
    发明授权
    Dynamic pattern matching over ordered and disordered data streams 有权
    有序和无序数据流的动态模式匹配

    公开(公告)号:US08762297B2

    公开(公告)日:2014-06-24

    申请号:US12780939

    申请日:2010-05-17

    IPC分类号: G06N5/00

    摘要: Architecture introduces a new pattern operator referred to as called an augmented transition network (ATN), which is a streaming adaptation of non-reentrant, fixed-state ATNs for dynamic patterns. Additional user-defined information is associated with automaton states and is accessible to transitions during execution. ATNs are created that directly model complex pattern continuous queries with arbitrary cycles in a transition graph. The architecture can express the desire to ignore some events during pattern detection, and can also detect the absence of data as part of a pattern. The architecture facilitates efficient support for negation, ignorable events, and state cleanup based on predicate punctuations.

    摘要翻译: 架构引入了称为增强转换网络(ATN)的新的模式运算符,其是用于动态模式的非可重入固定状态ATN的流适配。 其他用户定义的信息与自动机状态相关联,并且在执行期间可以转换。 创建ATN,在转换图中直接建模具有任意周期的复杂模式连续查询。 该架构可以表示在模式检测期间忽略某些事件的愿望,并且还可以检测不存在数据作为模式的一部分。 该架构有助于有效支持基于谓词标点符号的否定,可忽略事件和状态清理。

    ASYNCHRONOUS TRANSFER OF STATE INFORMATION BETWEEN CONTINUOUS QUERY PLANS
    5.
    发明申请
    ASYNCHRONOUS TRANSFER OF STATE INFORMATION BETWEEN CONTINUOUS QUERY PLANS 有权
    连续查询计划之间的状态信息异步传输

    公开(公告)号:US20120166417A1

    公开(公告)日:2012-06-28

    申请号:US12977107

    申请日:2010-12-23

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30463 G06F17/30516

    摘要: Described herein are technologies pertaining to migrating state information of operators in a first continuous query plan to a second continuous query plan in an asynchronous manner, such that the first continuous query plan need not cease executing during the migrating of the state information. State information pertaining to stateful operators, such as join operators, is migrated from the first continuous query plan to the second continuous query plan by way of a transformation plan. State matching is utilized to generate the transformation plan.

    摘要翻译: 这里描述的是涉及将第一连续查询计划中的运营商的状态信息以异步方式迁移到第二连续查询计划的技术,使得在迁移状态信息期间第一连续查询计划不需要停止执行。 通过转换方案,将有状态运算符(例如加入运算符)的状态信息从第一个连续查询计划迁移到第二个连续查询计划。 利用状态匹配来生成转换计划。

    Operator state checkpoint markers and rehydration
    7.
    发明授权
    Operator state checkpoint markers and rehydration 有权
    操作员状态检查点标记和补液

    公开(公告)号:US08726076B2

    公开(公告)日:2014-05-13

    申请号:US13117165

    申请日:2011-05-27

    IPC分类号: G06F11/00 G06F11/16

    CPC分类号: G06F11/1438 G06F11/1471

    摘要: A checkpoint marker can be received at a first operator. The first operator can process the checkpoint marker by sending the checkpoint marker to a second operator and sending state checkpoint information representing a state of the first operator to a checkpoint writer. The checkpoint information can be used to rehydrate the state of one or more operators. For example, after a system failure, system shutdown, etc., checkpoint information can be received from a reader unit at a checkpoint information input queue of the first operator. A state of the first operator can be rehydrated using the checkpoint information. Processing of information in a data input queue of the first operator can be suspended while the checkpoint information is used to rehydrate the state of the first operator. Other operators in a system with the first operator (e.g., the second operator) may be checkpointed and rehydrated in the same manner as the first operator.

    摘要翻译: 可以在第一个操作员处接收检查点标记。 第一个操作员可以通过将检查点标记发送到第二个操作符来处理检查点标记,并将表示第一个操作符的状态的状态检查点信息发送到检查点编写器。 检查点信息可用于对一个或多个操作员的状态进行补水。 例如,在系统故障,系统关机等之后,可以在第一运营商的检查点信息输入队列处从读取单元接收检查点信息。 可以使用检查点信息对第一操作者的状态进行再水化。 当使用检查点信息来补充第一操作者的状态时,可以暂停处理第一操作者的数据输入队列中的信息。 具有第一操作者(例如,第二操作者)的系统中的其他操作者可以以与第一操作者相同的方式进行检查点和再水化。

    OPERATOR STATE CHECKPOINTS
    10.
    发明申请
    OPERATOR STATE CHECKPOINTS 有权
    操作员状态检查

    公开(公告)号:US20120304003A1

    公开(公告)日:2012-11-29

    申请号:US13117165

    申请日:2011-05-27

    IPC分类号: G06F11/14

    CPC分类号: G06F11/1438 G06F11/1471

    摘要: A checkpoint marker can be received at a first operator. The first operator can process the checkpoint marker by sending the checkpoint marker to a second operator and sending state checkpoint information representing a state of the first operator to a checkpoint writer. The checkpoint information can be used to rehydrate the state of one or more operators. For example, after a system failure, system shutdown, etc., checkpoint information can be received from a reader unit at a checkpoint information input queue of the first operator. A state of the first operator can be rehydrated using the checkpoint information. Processing of information in a data input queue of the first operator can be suspended while the checkpoint information is used to rehydrate the state of the first operator. Other operators in a system with the first operator (e.g., the second operator) may be checkpointed and rehydrated in the same manner as the first operator.

    摘要翻译: 可以在第一个操作员处接收检查点标记。 第一个操作员可以通过将检查点标记发送到第二个操作符来处理检查点标记,并将表示第一个操作符的状态的状态检查点信息发送到检查点编写器。 检查点信息可用于对一个或多个操作员的状态进行补水。 例如,在系统故障,系统关机等之后,可以在第一运营商的检查点信息输入队列处从读取单元接收检查点信息。 可以使用检查点信息对第一操作者的状态进行再水化。 当使用检查点信息来补充第一操作者的状态时,可以暂停处理第一操作者的数据输入队列中的信息。 具有第一操作者(例如,第二操作者)的系统中的其他操作者可以以与第一操作者相同的方式进行检查点和再水化。