QUERIES BASED ON RANGES OF HASH VALUES
    1.
    发明申请

    公开(公告)号:US20180268030A1

    公开(公告)日:2018-09-20

    申请号:US15762586

    申请日:2015-09-25

    Abstract: A system includes a database client, and a distributed database comprising database nodes. The distributed database may receive a database query from the client, determine that the query comprises a range of hash values of a table partition stored by a node of the distributed database, and determine that the range of hash values is not stored by other nodes of the distributed database. Responsive to determining that the range of hash values of the query is stored by the node and not by the other nodes, the database may generate an optimized distributed execution plan that includes the node that stores the range of hash values and excludes the nodes that do not include the range of hash values.

    PARALLELIZING SQL ON DISTRIBUTED FILE SYSTEMS
    2.
    发明申请
    PARALLELIZING SQL ON DISTRIBUTED FILE SYSTEMS 审中-公开
    在分布式文件系统上并行SQL

    公开(公告)号:US20170011090A1

    公开(公告)日:2017-01-12

    申请号:US15114328

    申请日:2014-03-31

    Abstract: Example embodiments relate to parallelizing structured query language (SQL) on distributed file systems. In example embodiments, a subquery of a distributed file system is received from a query engine, where the subquery is one of multiple subqueries that are scheduled to execute on a cluster of server nodes. At this stage, a user defined function that comprises local, role-based functionality is executed, where the partitioned magic table triggers parallel execution of the user defined function. The execution of the UDF determines a sequence number based on a quantity of the cluster of server nodes and retrieve nonconsecutive chunks from a file of the distributed file system, where each of the nonconsecutive chunks is offset by the sequence number.

    Abstract translation: 示例性实施例涉及在分布式文件系统上并行化结构化查询语言(SQL)。 在示例实施例中,从查询引擎接收分布式文件系统的子查询,其中子查询是被调度为在服务器节点集群上执行的多个子查询之一。 在此阶段,执行包含本地,基于角色的功能的用户定义的功能,其中分区魔术表触发并行执行用户定义的功能。 UDF的执行基于服务器节点的簇的数量来确定序列号,并从分布式文件系统的文件中检索非连续的块,其中每个非连续的块被序列号偏移。

    Accessing electronic databases
    3.
    发明授权

    公开(公告)号:US10909119B2

    公开(公告)日:2021-02-02

    申请号:US15202636

    申请日:2016-07-06

    Abstract: Examples disclosed herein relate to accessing electronic databases. Some examples disclosed herein may include partitioning a computation task into subtasks. A processing node of a computation engine may generate a database query for retrieving an electronic data segment associated with at least one of the subtasks from a database. The database query may include pre-processing instructions for a database management system (DBMS) associated with the database to pre-process the electronic data segment before providing the electronic data segment to the processing node. The pre-processing instructions may include at least one of: filtering, projection, join, aggregation, count, and user-defined instructions. The generated query may be provided to the DBMS.

    Parallelizing SQL on distributed file systems

    公开(公告)号:US10534770B2

    公开(公告)日:2020-01-14

    申请号:US15114328

    申请日:2014-03-31

    Abstract: Example embodiments relate to parallelizing structured query language (SQL) on distributed file systems. In example embodiments, a subquery of a distributed file system is received from a query engine, where the subquery is one of multiple subqueries that are scheduled to execute on a cluster of server nodes. At this stage, a user defined function that comprises local, role-based functionality is executed, where the partitioned magic table triggers parallel execution of the user defined function. The execution of the UDF determines a sequence number based on a quantity of the cluster of server nodes and retrieve nonconsecutive chunks from a file of the distributed file system, where each of the nonconsecutive chunks is offset by the sequence number.

    DATA STREAM PROCESSING BASED ON A BOUNDARY PARAMETER
    6.
    发明申请
    DATA STREAM PROCESSING BASED ON A BOUNDARY PARAMETER 审中-公开
    基于边界参数的数据流处理

    公开(公告)号:US20160253219A1

    公开(公告)日:2016-09-01

    申请号:US15032884

    申请日:2013-12-13

    CPC classification number: G06F9/52 G06F17/18

    Abstract: In one implementation, a system for processing a data stream can comprise a station engine, an execution engine, and a synchronize engine. A station engine can provide a stream operator to receive application logic, punctuate the data stream, and determine a number of input channels for parallel processing. The execution engine can perform a behavior of the application logic during a process operation. The synchronize engine can hold data of the data stream associated with a window until each input channel has reached a data boundary based on a boundary parameter.

    Abstract translation: 在一个实现中,用于处理数据流的系统可以包括站引擎,执行引擎和同步引擎。 站引擎可以提供流操作者来接收应用逻辑,标记数据流,并确定用于并行处理的多个输入通道。 执行引擎可以在进程操作期间执行应用程序逻辑的行为。 同步引擎可以保持与窗口相关联的数据流的数据,直到每个输入通道基于​​边界参数到达数据边界。

    Processing streaming data with open executors
    9.
    发明授权
    Processing streaming data with open executors 有权
    使用开放执行程序处理流数据

    公开(公告)号:US09348580B2

    公开(公告)日:2016-05-24

    申请号:US14416610

    申请日:2012-10-26

    Abstract: Processing streaming data with open executors includes receiving input data at a computation dataflow station where the computation dataflow station contains a computation file and an open executor that accepts code plug-ins, converting contents of the computation file into a program string with the code plug-ins from a system library, and launching the program string together with the input data to calculate an output with a graphics processing unit.

    Abstract translation: 使用开放式执行器处理流数据包括在计算数据流站接收输入数据,其中计算数据流站包含计算文件和接受代码插件的开放执行器,将计算文件的内容转换成具有代码插件的程序串, 并将程序串与输入数据一起启动,以使用图形处理单元计算输出。

    DATABASE COMPARISON OPERATION TO IDENTIFY AN OBJECT

    公开(公告)号:US20180293311A1

    公开(公告)日:2018-10-11

    申请号:US15570878

    申请日:2015-05-19

    Abstract: Examples disclosed herein relate to a database comparison operation to identify an object. For example, a processor may enroll a set of object templates in a storage based on objects within input content and enroll a target object template in the storage based on a target object in target content. The processor may identify an object within the input content associated with the target object based on a database comparison operation of the stored set of object templates to the stored target object template. The processor may output object recognition information related to the identified object.

Patent Agency Ranking