Parallelizing SQL user defined transformation functions

    Publication number: US10885031B2

    Publication date: 2021-01-05

    Application number: US15114913

    Application date: 2014-03-10

    Abstract: Example embodiments relate to parallelizing structured query language (SQL) user defined transformation functions. In example embodiments, a subquery of a query is received from a query engine, where each of the subqueries is associated with a distinct magic number in a magic table. A user defined transformation function that includes local, role-based functionality may then be executed, where the magic number triggers parallel execution of the user defined transformation function. At this stage, the results of the user defined transformation function are sent to the query engine, where the query engine unions the results with other results that are obtained from the other database nodes.
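The flow described in this abstract (one subquery per magic number, parallel execution of the transformation function, then a union at the query engine) can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration: `MAGIC_TABLE`, `transform`, and `run_query` are hypothetical, threads stand in for database nodes, and the modulo-based role assignment is a stand-in for whatever role-based logic an actual function would contain.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical magic table: one distinct magic number per database node.
MAGIC_TABLE = [0, 1, 2, 3]


def transform(magic, rows):
    """Hypothetical user defined transformation function.

    The magic number selects the role: each invocation handles only the
    rows whose index maps to its magic number (an assumed partitioning).
    """
    n = len(MAGIC_TABLE)
    return [r.upper() for i, r in enumerate(rows) if i % n == magic]


def run_query(rows):
    """Stand-in for the query engine: one subquery per magic number, then union."""
    with ThreadPoolExecutor(max_workers=len(MAGIC_TABLE)) as pool:
        partials = list(pool.map(lambda m: transform(m, rows), MAGIC_TABLE))
    result = []
    for part in partials:
        result.extend(part)  # union of the per-node results
    return result
```

Because each magic number covers a disjoint slice of the input, the union contains every transformed row exactly once, although not necessarily in input order.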

    BASE USER DEFINED FUNCTIONS
    14.
    Patent application

    Publication number: US20180046659A1

    Publication date: 2018-02-15

    Application number: US15555634

    Application date: 2015-03-20

    CPC classification number: G06F16/2291 G06F16/245

    Abstract: Methods, devices, and techniques for base user defined functions in a database management system are discussed herein. For example, in one aspect, a query request is received from a computer device. The query request may include a query operator representing a specialized user defined function (SUDF). The SUDF may then be executed. Executing the SUDF may include executing a base operation of a base user defined function (BUDF). The base operation may interact with an application programming interface (API) of the query engine to obtain a tuple stored in the database. Executing the SUDF may further include executing a specialized operation that processes the tuple according to an analytics function. The specialized operation may generate a result. Then, a query result may be returned to the computer device. The query result can include the result.
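The split between a base operation and a specialized operation can be pictured as a base class and a subclass. The sketch below is only an illustration under stated assumptions: `BaseUDF`, `RowSumUDF`, and the `engine_api` argument are hypothetical names, and a plain iterable of tuples stands in for the query engine API.

```python
class BaseUDF:
    """Hypothetical base user defined function (BUDF)."""

    def __init__(self, engine_api):
        # Assumption: any iterable of tuples stands in for the query engine API.
        self.engine_api = engine_api

    def fetch_tuples(self):
        # Base operation: obtain stored tuples through the engine API.
        yield from self.engine_api

    def process(self, tup):
        # Specialized operation, supplied by each SUDF.
        raise NotImplementedError

    def execute(self):
        # Run the base operation, then the specialized one on each tuple.
        return [self.process(t) for t in self.fetch_tuples()]


class RowSumUDF(BaseUDF):
    """Hypothetical specialized UDF (SUDF) with a toy analytics function."""

    def process(self, tup):
        return sum(tup)
```

A call such as `RowSumUDF([(1, 2), (3, 4)]).execute()` reuses the base tuple-fetching logic and only adds the analytics step.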

    ACCESSING ELECTRONIC DATABASES
    15.
    Patent application

    Publication number: US20180011905A1

    Publication date: 2018-01-11

    Application number: US15202636

    Application date: 2016-07-06

    CPC classification number: G06F16/24554 G06F16/21 G06F16/2433

    Abstract: Examples disclosed herein relate to accessing electronic databases. Some examples disclosed herein may include partitioning a computation task into subtasks. A processing node of a computation engine may generate a database query for retrieving an electronic data segment associated with at least one of the subtasks from a database. The database query may include pre-processing instructions for a database management system (DBMS) associated with the database to pre-process the electronic data segment before providing the electronic data segment to the processing node. The pre-processing instructions may include at least one of: filtering, projection, join, aggregation, count, and user-defined instructions. The generated query may be provided to the DBMS.
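As a rough illustration of the idea, the sketch below partitions a task into id ranges and generates one query per subtask, with filtering, projection, and aggregation pushed into the query so that the DBMS pre-processes each segment. It is not taken from the application: SQLite stands in for the DBMS, and the `readings` table, its columns, and the helpers `make_subtask_query` and `run` are hypothetical.

```python
import sqlite3


def make_subtask_query(lo, hi):
    """Build the query for one subtask.

    Filtering (WHERE), projection (selected columns), and aggregation
    (COUNT, SUM, GROUP BY) are all performed by the DBMS.
    Table and column names are hypothetical.
    """
    sql = (
        "SELECT sensor, COUNT(*), SUM(value) FROM readings "
        "WHERE id >= ? AND id < ? AND value > 0 "
        "GROUP BY sensor ORDER BY sensor"
    )
    return sql, (lo, hi)


def run(conn, total_ids, chunk):
    """Partition the id range into subtasks and issue one query per subtask."""
    results = []
    for lo in range(0, total_ids, chunk):
        sql, params = make_subtask_query(lo, min(lo + chunk, total_ids))
        results.append(conn.execute(sql, params).fetchall())
    return results
```

Each processing node then receives only the already-reduced segment for its subtask rather than the raw rows.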

    ABSTRACTION LAYER BETWEEN A DATABASE QUERY ENGINE AND A DISTRIBUTED FILE SYSTEM
    16.
    Patent application
    Status: Under examination (published)

    Publication number: US20160267132A1

    Publication date: 2016-09-15

    Application number: US15033163

    Application date: 2013-12-17

    Abstract: A system includes a distributed file system to control storage of data across storage nodes and a database query engine to receive a database query for access of data, the database query engine to process the database query using an index, and using a buffer pool to cache data retrieved in response to the database query and to store updated data. An abstraction layer is provided between the database query engine and the distributed file system, the abstraction layer to read and write data of the distributed file system in response to the database query.
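One way to picture such a layer is a small class that serves reads from a buffer pool and falls back to the file system on a miss. The sketch below makes several assumptions that the abstract does not state: a plain dict stands in for the distributed file system, the pool uses least-recently-used eviction, and writes are write-through. The class name `AbstractionLayer` and its methods are hypothetical.

```python
from collections import OrderedDict


class AbstractionLayer:
    """Hypothetical layer between a query engine and a distributed file system.

    `dfs` is a plain dict standing in for the file system (block id -> bytes).
    """

    def __init__(self, dfs, pool_size=2):
        self.dfs = dfs
        self.pool = OrderedDict()  # buffer pool, least recently used first
        self.pool_size = pool_size
        self.dfs_reads = 0  # counts reads that reached the file system

    def _cache(self, block_id, data):
        self.pool[block_id] = data
        self.pool.move_to_end(block_id)
        if len(self.pool) > self.pool_size:
            self.pool.popitem(last=False)  # evict the least recently used block

    def read(self, block_id):
        if block_id in self.pool:
            self.pool.move_to_end(block_id)
            return self.pool[block_id]
        self.dfs_reads += 1
        data = self.dfs[block_id]
        self._cache(block_id, data)
        return data

    def write(self, block_id, data):
        # Assumed write-through policy: update the file system and the pool.
        self.dfs[block_id] = data
        self._cache(block_id, data)
```

Repeated reads of the same block touch the file system only once, and updated data stays available in the pool after a write.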

