Addressing memory limits for partition tracking among worker nodes

    公开(公告)号:US11989194B2

    公开(公告)日:2024-05-21

    申请号:US16657867

    申请日:2019-10-18

    Applicant: Splunk Inc.

    CPC classification number: G06F16/2471 G06F16/278

    Abstract: Systems and methods are described for distributed processing a query in a first query language utilizing a query execution engine intended for single-device execution. While distributed processing provides numerous benefits over single-device processing, distributed query execution engines can be significantly more difficult to develop that single-device engines. Embodiments of this disclosure enable the use of a single-device engine to support distributed processing, by dividing a query into multiple stages, each of which can be executed by multiple, concurrent executions of a single-device engine. Between stages, data can be shuffled between executions of the engine, such that individual executions of the engine are provided with a complete set of records needed to implement an individual stage. Because single-device engines can be significantly less difficult to develop, use of the techniques described herein can enable a distributed system to rapidly support multiple query languages.

    Generating a subquery for an external data system using a configuration file

    公开(公告)号:US11636105B2

    公开(公告)日:2023-04-25

    申请号:US17196577

    申请日:2021-03-09

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for receiving, at a data intake and query system, a query that includes an indication to process data managed by a third-party data storage and processing system that supports a different query language than the data intake and query system. The data intake and query system identifies a third-party data storage and processing system that manages the data to be processed and generates a subquery for execution by the third-party data storage and processing system, generates instructions for one or more worker nodes to receive and process results of the subquery from the third-party data storage and processing system, and instructs the worker nodes to provide results of the processing to the data intake and query system.

    EXECUTING COMMANDS FROM A DISTRIBUTED EXECUTION MODEL

    公开(公告)号:US20230049579A1

    公开(公告)日:2023-02-16

    申请号:US17960286

    申请日:2022-10-05

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for generating a distributed execution model with untrusted commands. The system can receive a query, and process the query to identify the untrusted commands. The system can use data associated with the untrusted command to identify one or more files associated with the untrusted command. Based on the files, the system can generate a data structure and include one or more identifiers associated with the data structure in the distributed execution model. The system can distribute the distributed execution model to one or more nodes in a distributed computing environment for execution.

Patent Agency Ranking