QUERYING A DATA SOURCE ON A NETWORK
    1.
    发明公开

    公开(公告)号:EP3553670A1

    公开(公告)日:2019-10-16

    申请号:EP19176570.0

    申请日:2016-02-16

    IPC分类号: G06F16/2452

    摘要: A computer-implemented method, performed by a data processing system, of executing a computer program is described. The method includes receiving (602) a SQL query, where the SQL query includes an identifier associated with a resource that is external to a data processing system and generating (604) a computer program based on the SQL query. The generating includes: identifying (614) a catalog containing data corresponding to the resource, the data representing configuration data usable to configure a computer program to transmit instructions to the resource; generating (632) a request to a query planner based on the SQL query; providing the request to the query planner; receiving (634) a query plan generated by the query planner based on the request, the query plan including a description of one or more steps to be performed by a system managing a relational database, and generating the computer program based on the query plan, wherein the generated computer program is configured to communicate with the resource based on the data of the catalog. The method further includes receiving, from the identified catalog, the configuration data; and executing (606) the computer program on an executing system, based on the configuration data, the execution of the computer program causing: transmitting (622) one or more instructions to the resource, the instructions defining operations other than operations of the SQL query, and receiving (624) data from the resource in response to the instructions.

    PROCESSING QUERIES CONTAINING A UNION-TYPE OPERATION
    3.
    发明公开
    PROCESSING QUERIES CONTAINING A UNION-TYPE OPERATION 审中-公开
    处理包含联合式操作的查询

    公开(公告)号:EP3218823A1

    公开(公告)日:2017-09-20

    申请号:EP15797570.7

    申请日:2015-11-10

    IPC分类号: G06F17/30 G06F9/44

    摘要: Among other things, a method of generating a computer program based on an SQL query includes receiving a SQL query, including a reference to a first data set stored at a first data source, and including a reference to a second data set stored at a second data source different from the first data source, determining that the SQL query includes two or more commands, the commands including a first union-type operation, and a first aggregation operation, and determining that the SQL query describes that the first union-type operation shall be applied to at least a portion of data from the first data set, and applied to at least a portion of data from the second data set, determining that the SQL query describes that the first aggregation operation shall be applied to data resulting from the first union-type operation, and generating the computer program.

    摘要翻译: 除其他之外,基于SQL查询生成计算机程序的方法包括接收SQL查询,该SQL查询包括对存储在第一数据源处的第一数据集的引用,并且包括对存储在第二数据集处的第二数据集的引用 数据源与第一数据源不同,确定SQL查询包括两个或更多个命令,所述命令包括第一联合类型操作和第一聚合操作,并且确定SQL查询描述第一联合类型操作 将应用于来自第一数据集的至少一部分数据,并应用于来自第二数据集的至少一部分数据,确定SQL查询描述了将第一聚合操作应用于由 第一联合型操作,并生成计算机程序。

    QUERYING A DATA SOURCE ON A NETWORK
    5.
    发明公开
    QUERYING A DATA SOURCE ON A NETWORK 有权
    在网络上查询数据源

    公开(公告)号:EP3259686A1

    公开(公告)日:2017-12-27

    申请号:EP16708842.6

    申请日:2016-02-16

    IPC分类号: G06F17/30

    摘要: Among other things, we describe a computer-implemented method, performed by a data processing system, of executing a computer program based on a query that is expressed in accordance with a query language applicable to a relational database, the computer program executed based at least in part on data stored in a tangible, non-transitory computer-readable medium, the executing including receiving a SQL query, where the SQL query includes an identifier associated with a resource that is external to the data processing system, generating a computer program based on the SQL query, and executing the computer program, causing transmitting one or more instructions to the resource, the instructions defining operations other than operations of the SQL query, and receiving data from the resource in response to the instructions.

    PROCESSING DATA FROM MULTIPLE SOURCES
    6.
    发明公开
    PROCESSING DATA FROM MULTIPLE SOURCES 审中-公开
    DATENVERARBEITUNG VON MEHREREN QUELLEN

    公开(公告)号:EP3132348A1

    公开(公告)日:2017-02-22

    申请号:EP15719559.5

    申请日:2015-04-16

    IPC分类号: G06F9/50

    摘要: In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.

    摘要翻译: 在第一方面,一种方法包括在Hadoop簇的节点处存储在HDFS数据存储中存储数据的第一部分的节点,执行能够从外部的数据源接收数据的数据处理引擎的第一实例 Hadoop集群,由数据处理引擎接收计算机可执行程序,由数据处理引擎的第一个实施程序的至少一部分,由数据处理引擎从外部数据源接收第二部分数据 存储除HDFS存储之外的第二数据部分,并且由数据处理引擎执行至少使用数据的第一部分和数据的第二部分由程序识别的数据处理操作。

    MANAGING DATA QUERIES
    7.
    发明公开
    MANAGING DATA QUERIES 审中-公开
    VERWALTUNG VON DATENABFRAGEN

    公开(公告)号:EP2524328A1

    公开(公告)日:2012-11-21

    申请号:EP11702533.8

    申请日:2011-01-14

    IPC分类号: G06F17/30

    摘要: One method includes receiving a database query, receiving information about a database table in data storage populated with data elements, producing a structural representation of the database table that includes a formatted data organization reflective of the database table and is absent the data elements of the database table, and providing the structural representation and the database query to a plan generator capable of producing a query plan representing operations for executing the database query on the database table. Another method includes receiving a query plan from a plan generator, the plan representing operations for executing a database query on a database table, and producing a dataflow graph from the query plan, wherein the dataflow graph includes at least one node that represents at least one operation represented by the query plan, and includes at least one link that represents at least one dataflow associated with the query plan.

    摘要翻译: 一种方法包括接收数据库查询,在数据存储器中接收关于数据库表的信息,生成包含反映数据库表的格式化数据组织的数据库表的结构表示,并且不存在数据库的数据元素 表,并将结构表示和数据库查询提供给能够生成表示用于在数据库表上执行数据库查询的操作的查询计划的计划生成器。 另一种方法包括从计划生成器接收查询计划,所述计划表示用于在数据库表上执行数据库查询的操作,以及从所述查询计划生成数据流图,其中所述数据流图包括至少一个表示至少一个 由查询计划表示的操作,并且包括表示与查询计划相关联的至少一个数据流的至少一个链接。