Data fabric service system architecture

    公开(公告)号:US10353965B2

    公开(公告)日:2019-07-16

    申请号:US15276717

    申请日:2016-09-26

    Applicant: Splunk Inc.

    Abstract: Disclosed is a technique that can be performed in a distributed computer network. The technique can include a data index and query system that receives search query, defines a search scheme for applying the search query on distributed data storage systems including an internal data storage system of the data index and query system and an external data storage system. The internal data storage system stores data as time-indexed events including respective segments of raw machine data. The data index and query system can transfer a portion of the search scheme to a search service, which can return search results obtained by application of the search scheme to the distributed data storage systems including the internal data storage system and the external data storage system. Lastly, the search results or data indicative of the search results can be output on a display device to the user.

    SEARCH FUNCTIONALITY OF WORKER NODES IN A DATA FABRIC SERVICE SYSTEM

    公开(公告)号:US20190171678A1

    公开(公告)日:2019-06-06

    申请号:US16264462

    申请日:2019-01-31

    Applicant: Splunk Inc.

    Abstract: Disclosed is a technique that can be performed in a distributed computer network. The technique can include a worker node that receives search instructions defined by a search service based on at least a portion of a search scheme defined by a data intake and query system, to cause the worker node to obtain search results from distributed data storage systems communicatively coupled to the worker node over a network. The distributed data storage systems include an external data storage system and/or an internal data storage system of the data intake and query system. The worker node obtains the search results by searching the distributed data storage systems in accordance with the search instructions, and communicating, over the network to the search service, a combination of search results based on the search results to cause an output by the data intake and query system in accordance with the search scheme.

    DISTRIBUTING PARTIAL RESULTS TO WORKER NODES FROM AN EXTERNAL DATA SYSTEM

    公开(公告)号:US20190147092A1

    公开(公告)日:2019-05-16

    申请号:US16051223

    申请日:2018-07-31

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed, and generates a subquery for the external data system indicating that the results of the subquery are to be sent to multiple worker nodes. The system also generates instructions for multiple worker nodes to receive and process results of the subquery from the external data system.

    EXECUTING A DISTRIBUTED EXECUTION MODEL WITH UNTRUSTED COMMANDS

    公开(公告)号:US20190095488A1

    公开(公告)日:2019-03-28

    申请号:US15714133

    申请日:2017-09-25

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for executing a distributed execution model with untrusted commands. The distributed execution model can be distributed to multiple nodes in a distributed computing environment. At least one node can process the distributed execution model to identify an untrusted command. The node can use data associated with the untrusted command to identify one or more files associated with the untrusted command. Based on the files, the node can generate a data structure, and execute at least a portion of the data structure.

    QUERY ACCELERATION DATA STORE
    130.
    发明申请

    公开(公告)号:US20180089306A1

    公开(公告)日:2018-03-29

    申请号:US15665279

    申请日:2017-07-31

    Applicant: Splunk Inc.

    Abstract: Systems and methods for a data index and query system that utilize a query acceleration data store. An example method includes receiving a query identifying a set of data to be processed and a manner of processing the set of data. A query processing scheme for obtaining and processing the set of data is defined. First partial results of the query stored in a data store are identified, with the first partial results corresponding to a first portion of the set of data. One or more partitions are dynamically allocated to obtain a second portion of the set of data from different data sources. The second portion of the set of data is processed to obtain second partial results. The first partial results and second partial results are combined. The query is executed based on the query processing scheme.

Patent Agency Ranking