DISTRIBUTING PARTIAL RESULTS FROM AN EXTERNAL DATA SYSTEM BETWEEN WORKER NODES

    Publication No.: US20190147084A1

    Publication date: 2019-05-16

    Application No.: US16051304

    Filing date: 2018-07-31

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed and generates a subquery for the external data system indicating that the results of the subquery are to be sent to one worker node of multiple worker nodes. The system instructs the one worker node to distribute the results received from the external data system to multiple worker nodes for processing.
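
The flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the subquery dictionary layout, and the round-robin distribution policy are all assumptions made for illustration, since the abstract does not say how the receiving worker node splits the results.

```python
from collections import defaultdict


def build_subquery(query_text, target_worker):
    """Hypothetical subquery descriptor naming the one worker that receives results."""
    return {"query": query_text, "send_results_to": target_worker}


def distribute(results, workers):
    """Spread the received records across all workers (round-robin is an assumption)."""
    assignment = defaultdict(list)
    for i, record in enumerate(results):
        assignment[workers[i % len(workers)]].append(record)
    return dict(assignment)


workers = ["worker-0", "worker-1", "worker-2"]

# The coordinator directs the external system to send everything to one worker.
subquery = build_subquery("SELECT * FROM external_table", workers[0])

# Stand-in for the records the external data system returns to that worker.
external_results = [{"id": n} for n in range(7)]

# The receiving worker then fans the records out to all workers for processing.
shards = distribute(external_results, workers)
```

Sending the results to a single node keeps the external system's side of the exchange simple, at the cost of one extra hop before processing is parallelized.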

    RESOURCE ALLOCATION FOR MULTIPLE DATASETS
    Type: Invention application

    Publication No.: US20180089258A1

    Publication date: 2018-03-29

    Application No.: US15665187

    Filing date: 2017-07-31

    Applicant: Splunk Inc.

    CPC classification number: G06F16/2425 G06F16/2272 G06F16/24535

    Abstract: Systems and methods are disclosed for processing queries against multiple dataset sources. One dataset source can include indexers that index and store data. The system can receive a query that identifies a set of data to be processed and a manner of processing the set of data. The set of data can include a first dataset that is accessible by one or more indexers and a second dataset that is accessible by one or more other dataset sources. A query coordinator can define a query processing scheme for obtaining and processing the set of data that includes a dynamic allocation of multiple layers of partitions. The partitions can operate on multiple worker nodes. The query can then be executed based on the query processing scheme.
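
As a rough illustration of a query processing scheme with several layers of partitions placed on worker nodes, the sketch below sizes an intake layer from the data volume and derives smaller downstream layers from it. The layer names, the `records_per_partition` threshold, the halving rule, and the round-robin placement are assumptions chosen for the example. The abstract does not specify how the allocation is computed.

```python
import math


def plan_layers(dataset_sizes, workers, records_per_partition=1000):
    """Return a mapping of layer name to the worker hosting each partition."""
    total = sum(dataset_sizes.values())
    # Size the first layer from the data volume (hypothetical heuristic).
    intake = max(1, math.ceil(total / records_per_partition))
    layers = [("intake", intake), ("process", max(1, intake // 2)), ("collect", 1)]

    scheme = {}
    slot = 0
    for name, count in layers:
        scheme[name] = []
        for _ in range(count):
            # Place partitions on workers in round-robin order.
            scheme[name].append(workers[slot % len(workers)])
            slot += 1
    return scheme


# One dataset reachable through indexers, one through another dataset source.
scheme = plan_layers({"indexers": 2500, "external": 1200}, ["w0", "w1"])
```

Because the partition counts are derived from the dataset sizes at planning time, rerunning `plan_layers` with different sizes yields a different allocation, which is the sense in which this sketch is dynamic.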

    Execution of a query received from a data intake and query system

    Publication No.: US11314753B2

    Publication date: 2022-04-26

    Application No.: US16051310

    Filing date: 2018-07-31

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for receiving and executing a query received from a data intake and query system and providing results to a first group of worker nodes in a distributed execution environment. The query identifies a set of data to be processed and a manner of processing the set of data. Based on the query, the system defines a query processing scheme, and generates instructions for a second group of worker nodes to obtain the set of data from one or more dataset sources and to process the set of data. The system communicates results of the query to the first group of worker nodes.
