Method and Apparatus for Maintaining and Processing Provenance Data in Data Stream Processing System
    1.
    发明申请
    Method and Apparatus for Maintaining and Processing Provenance Data in Data Stream Processing System 有权
    数据流处理系统中维护和处理出处数据的方法和装置

    公开(公告)号:US20090292729A1

    公开(公告)日:2009-11-26

    申请号:US12125212

    申请日:2008-05-22

    IPC分类号: G06F7/00

    CPC分类号: G06N5/042 G16H50/20

    摘要: Techniques are disclosed for maintaining and processing provenance data in such data stream processing systems. For example, a method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, comprising the following steps. A portion of data associated with the data stream is maintained. The maintained data comprises inputs to each processing element that contributed to an output of each processing element. In response to an alert generated by one of the processing elements, a scheduler is triggered to determine when a pre-calculation of a prospective query related to the alert should be executed. In response to the scheduler, at least a portion of the maintained data is used to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof.

    摘要翻译: 公开了用于在这种数据流处理系统中维护和处理原始数据的技术。 例如,一种用于处理与由数据流处理系统接收的数据流相关联的数据的方法,其中所述系统包括多个处理元件,包括以下步骤。 保持与数据流相关联的一部分数据。 维护的数据包括对每个处理元件的输入,每个处理元件有助于每个处理元件的输出。 响应于由处理元件之一生成的警报,触发调度器以确定何时应执行与警报相关的预期查询的预计算。 响应于调度器,使用维护数据的至少一部分来确定有助于警报的一组数据,使得可以使用提醒贡献的数据集来抵达预期查询。

    Method and apparatus for maintaining and processing provenance data in data stream processing system
    2.
    发明授权
    Method and apparatus for maintaining and processing provenance data in data stream processing system 有权
    用于在数据流处理系统中维护和处理原始数据的方法和装置

    公开(公告)号:US08301626B2

    公开(公告)日:2012-10-30

    申请号:US12125212

    申请日:2008-05-22

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06N5/042 G16H50/20

    摘要: Techniques are disclosed for maintaining and processing provenance data in such data stream processing systems. For example, a method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, comprising the following steps. A portion of data associated with the data stream is maintained. The maintained data comprises inputs to each processing element that contributed to an output of each processing element. In response to an alert generated by one of the processing elements, a scheduler is triggered to determine when a pre-calculation of a prospective query related to the alert should be executed. In response to the scheduler, at least a portion of the maintained data is used to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof.

    摘要翻译: 公开了用于在这种数据流处理系统中维护和处理原始数据的技术。 例如,一种用于处理与由数据流处理系统接收的数据流相关联的数据的方法,其中所述系统包括多个处理元件,包括以下步骤。 保持与数据流相关联的一部分数据。 维护的数据包括对每个处理元件的输入,每个处理元件有助于每个处理元件的输出。 响应于由处理元件之一生成的警报,触发调度器以确定何时应执行与警报相关的预期查询的预计算。 响应于调度器,使用维护数据的至少一部分来确定有助于警报的一组数据,使得可以使用提醒贡献的数据集来抵达预期查询。

    Method and Apparatus for Determining and Validating Provenance Data in Data Stream Processing System
    3.
    发明申请
    Method and Apparatus for Determining and Validating Provenance Data in Data Stream Processing System 有权
    用于确定和验证数据流处理系统中的出处数据的方法和装置

    公开(公告)号:US20090292818A1

    公开(公告)日:2009-11-26

    申请号:US12125219

    申请日:2008-05-22

    IPC分类号: G06F15/16

    CPC分类号: G06K9/00496 G16H50/20

    摘要: Techniques are disclosed for determining and validating provenance data in such data stream processing systems. For example, a method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, comprises the following steps. Input data elements and output data elements associated with at least one processing element of the plurality of processing elements are obtained. One or more intervals are computed for the processing element using data representing observations of associations between inputs elements and output elements of the processing element, wherein, for a given one of the intervals, one or more particular input elements contained within the given interval are determined to have contributed to a particular output element. In another method, intervals are specified, and then validated by comparing the specified intervals against intervals computed based on observations.

    摘要翻译: 公开了用于确定和验证这种数据流处理系统中的来源数据的技术。 例如,一种用于处理与由数据流处理系统接收的数据流相关联的数据的方法,其中所述系统包括多个处理元件,包括以下步骤。 获得与多个处理元件中的至少一个处理元件相关联的输入数据元素和输出数据元素。 使用表示输入元件和处理元件的输出元件之间的关联的观察值的数据来计算处理元件的一个或多个间隔,其中对于给定的一个间隔,确定包含在给定间隔内的一个或多个特定输入元素 为特定的输出元素做出了贡献。 在另一种方法中,指定间隔,然后通过将指定的间隔与基于观察值计算的间隔进行比较来验证。

    Determining and validating provenance data in data stream processing system
    4.
    发明授权
    Determining and validating provenance data in data stream processing system 有权
    确定和验证数据流处理系统中的来源数据

    公开(公告)号:US08775344B2

    公开(公告)日:2014-07-08

    申请号:US12125219

    申请日:2008-05-22

    IPC分类号: G06F17/00 G06N3/00

    CPC分类号: G06K9/00496 G16H50/20

    摘要: Techniques are disclosed for determining and validating provenance data in such data stream processing systems. For example, a method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, comprises the following steps. Input data elements and output data elements associated with at least one processing element of the plurality of processing elements are obtained. One or more intervals are computed for the processing element using data representing observations of associations between inputs elements and output elements of the processing element, wherein, for a given one of the intervals, one or more particular input elements contained within the given interval are determined to have contributed to a particular output element. In another method, intervals are specified, and then validated by comparing the specified intervals against intervals computed based on observations.

    摘要翻译: 公开了用于确定和验证这种数据流处理系统中的来源数据的技术。 例如,一种用于处理与由数据流处理系统接收的数据流相关联的数据的方法,其中所述系统包括多个处理元件,包括以下步骤。 获得与多个处理元件中的至少一个处理元件相关联的输入数据元素和输出数据元素。 使用表示输入元件和处理元件的输出元件之间的关联的观察值的数据来计算处理元件的一个或多个间隔,其中对于给定的一个间隔,确定包含在给定间隔内的一个或多个特定输入元素 为特定的输出元素做出了贡献。 在另一种方法中,指定间隔,然后通过将指定的间隔与基于观察值计算的间隔进行比较来验证。

    Methods and Apparatus for Functional Model-Based Data Provenance in Stream Processing Environments
    5.
    发明申请
    Methods and Apparatus for Functional Model-Based Data Provenance in Stream Processing Environments 失效
    流处理环境中基于功能模型的数据发现的方法与装置

    公开(公告)号:US20080307104A1

    公开(公告)日:2008-12-11

    申请号:US11760070

    申请日:2007-06-08

    IPC分类号: G06F15/16

    CPC分类号: G06F17/30516

    摘要: Techniques for deriving a provenance of one or more of a plurality of output data elements generated from a given output port of a PC are provided. At least one dependency function is created that relates the one or more output data elements to a set of one or more input ports of the PC and a corresponding plurality of input data elements. The dependency function comprises an encoding of at least one of one or more temporal filters and one or more sequence filters relating to the plurality of input data elements. The at least one dependency function is stored. A history of stream-level bindings of one or more input streams to one or more input ports of the processing component and one or more output streams from one or more output ports of the processing component is stored. The plurality of input data elements belonging to the one or more input streams and the plurality of output data elements belonging to the one or more output streams are stored. The set of one or more input data elements from the plurality of input data elements are determined that relate to the one or more output data elements in accordance with at least one dependency function and the history of stream-level bindings.

    摘要翻译: 提供了用于导出从PC的给定输出端口生成的多个输出数据元素中的一个或多个的来源的技术。 创建至少一个依赖函数,其将一个或多个输出数据元素与PC的一个或多个输入端口的集合和相应的多个输入数据元素相关联。 依赖函数包括一个或多个时间滤波器和与多个输入数据元素相关的一个或多个序列滤波器中的至少一个的编码。 存储至少一个依赖函数。 存储一个或多个输入流到处理组件的一个或多个输入端口和来自处理组件的一个或多个输出端口的一个或多个输出流的流级绑定的历史。 存储属于一个或多个输入流的多个输入数据元素和属于一个或多个输出流的多个输出数据元素。 根据至少一个依赖函数和流级绑定的历史,确定来自多个输入数据元素的一个或多个输入数据元素的集合,其与一个或多个输出数据元素相关。

    Methods and apparatus for functional model-based data provenance in stream processing environments
    6.
    发明授权
    Methods and apparatus for functional model-based data provenance in stream processing environments 失效
    在流处理环境中基于功能模型的数据来源的方法和装置

    公开(公告)号:US07539753B2

    公开(公告)日:2009-05-26

    申请号:US11760070

    申请日:2007-06-08

    IPC分类号: G06F11/34

    CPC分类号: G06F17/30516

    摘要: Techniques for deriving a provenance of one or more of a plurality of output data elements generated from a given output port of a PC are provided. At least one dependency function is created that relates the one or more output data elements to a set of one or more input ports of the PC and a corresponding plurality of input data elements. The dependency function comprises an encoding of at least one of one or more temporal filters and one or more sequence filters relating to the plurality of input data elements. The at least one dependency function is stored. A history of stream-level bindings of one or more input streams to one or more input ports of the processing component and one or more output streams from one or more output ports of the processing component is stored. The plurality of input data elements belonging to the one or more input streams and the plurality of output data elements belonging to the one or more output streams are stored. The set of one or more input data elements from the plurality of input data elements are determined that relate to the one or more output data elements in accordance with at least one dependency function and the history of stream-level bindings.

    摘要翻译: 提供了用于导出从PC的给定输出端口生成的多个输出数据元素中的一个或多个的来源的技术。 创建至少一个依赖函数,其将一个或多个输出数据元素与PC的一个或多个输入端口的集合和相应的多个输入数据元素相关联。 依赖函数包括一个或多个时间滤波器和与多个输入数据元素相关的一个或多个序列滤波器中的至少一个的编码。 存储至少一个依赖函数。 存储一个或多个输入流到处理组件的一个或多个输入端口和来自处理组件的一个或多个输出端口的一个或多个输出流的流级绑定的历史。 存储属于一个或多个输入流的多个输入数据元素和属于一个或多个输出流的多个输出数据元素。 根据至少一个依赖函数和流级绑定的历史,确定来自多个输入数据元素的一个或多个输入数据元素的集合,其与一个或多个输出数据元素相关。

    Fast computation of spatial queries in location-based services
    10.
    发明授权
    Fast computation of spatial queries in location-based services 有权
    基于位置的服务中空间查询的快速计算

    公开(公告)号:US07007013B2

    公开(公告)日:2006-02-28

    申请号:US10207683

    申请日:2002-07-26

    IPC分类号: G06F17/30

    摘要: This invention provides methods, systems and apparatus for performing fast computation of metric queries. To achieve this, in an example embodiment, the present invention segments metric regions into disjoint primitive atomic shapes. It then represents these primitive atomic shapes and then performs off-line computation of their relevant properties. As a result of the off-line computation, the execution of a query requires a minimal number of on-line calculations resulting in a very fast query. Further optimization occurs via storage of query histories and prioritization of queries with respect to the access frequency of a metric space's primitive atomic shapes.

    摘要翻译: 本发明提供用于执行度量查询的快速计算的方法,系统和装置。 为了实现这一点,在示例性实施例中,本发明将度量区域分割成不相交的原始原子形状。 然后它代表这些原始原子形状,然后执行其相关属性的离线计算。 作为离线计算的结果,查询的执行需要最少数量的在线计算,导致非常快的查询。 进一步优化通过存储查询历史和查询的优先级相对于度量空间的原始原子形状的访问频率而发生。