Determining a record generation estimate of a processing task

    公开(公告)号:US11442935B2

    公开(公告)日:2022-09-13

    申请号:US16397930

    申请日:2019-04-29

    Applicant: Splunk Inc.

    Abstract: Systems and methods are described for determining a record generation estimate related to a particular processing task. The system obtains a sample set of data that includes multiple records. The system applies a processing task, such as a transform or regular expression rule to the sample set of data and determines how many records are generated by the processing task. Based on the number of records generated, the system determines a record generation estimate. The system can use the record generation estimate to allocate compute resources or determine a query execution time for at least a portion of the query based on the record generation estimate.

    Multi-threaded processing of search responses returned by search peers

    公开(公告)号:US11310313B2

    公开(公告)日:2022-04-19

    申请号:US17014244

    申请日:2020-09-08

    Applicant: Splunk Inc.

    Abstract: Multi-threaded processing of search responses returned by search peers is disclosed. An example method may include transmitting, by a computer system, a search request to a plurality of search peers of a data aggregation and analysis system; receiving, by a first processing thread, a plurality of data packets from the plurality of search peers; parsing, by a second processing thread operating asynchronously with respect to the first processing thread, one or more data packets of the plurality of data packets, to produce a partial response to the search request; splitting the partial response into two or more fields; and generating, based on the two or more fields of the partial response, an aggregated response to the search request.

    Multi-thread processing of messages

    公开(公告)号:US11184467B2

    公开(公告)日:2021-11-23

    申请号:US16668808

    申请日:2019-10-30

    Applicant: Splunk Inc.

    Abstract: Systems and methods for multi-thread processing of messages. An example method comprises: receiving, by a first processing thread, a plurality of network packets from a server; processing the plurality of network packets to produce a message; writing the message to a message queue; retrieving, by a second processing thread, the message from the message queue; producing a memory data structure based on the message; placing the memory data structure into a result queue; and responsive to determining that a total size of messages in the message queue exceeds a certain threshold, causing the first processing thread to suspend receiving network packets.

    External dataset capability compensation

    公开(公告)号:US11163758B2

    公开(公告)日:2021-11-02

    申请号:US15665248

    申请日:2017-07-31

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for processing queries against an external data source utilizing dynamically allocated partitions operating on one or more worker nodes. The external data source can include data that has not been processed by the system. To query the external data source, a query coordinator can generate a subquery for the external data source based on determined functionality of the data source. The subquery can identify data in the external data source for processing and a manner for processing the data. In addition, the query coordinator can dynamically allocate partitions operating on worker nodes to retrieve and intake results of the subquery. In some cases, number of partitions allocated can be based on a number of partitions supported by the external data source.

    Converting and modifying a subquery for an external data system

    公开(公告)号:US11023463B2

    公开(公告)日:2021-06-01

    申请号:US16146990

    申请日:2018-09-28

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for receiving, at a data intake and query system, a query that includes a subquery that is to be executed at an external data system that supports a different query language than the data intake and query system. The data intake and query system converts the subquery from the query language supported by the external data system to the query language supported by the data intake and query system. The data intake and query system then processes the query including the translated subquery. The translated subquery is then translated back to the language supported by the external data system including any processing or optimizations performed with respect to the subquery.

    MANAGEMENT OF DISTRIBUTED COMPUTING FRAMEWORK COMPONENTS IN A DATA FABRIC SERVICE SYSTEM

    公开(公告)号:US20210117425A1

    公开(公告)日:2021-04-22

    申请号:US16657899

    申请日:2019-10-18

    Applicant: Splunk Inc.

    Abstract: Systems and methods are described for establishing and managing components of a distributed computing framework implemented in a data intake and query system. The distributed computing framework may include a master and a plurality of worker nodes. The master may selectively operate on a search head captain that is chosen from the search heads of the data intake and query system. The search head captain may distribute configuration information for the master and the distributed computing framework to the other search heads, which in turn, may distribute that configuration information to indexers of the data intake and query system. Worker nodes may be selectively activated for operation on the indexers based on the configuration information, and the worker nodes may additionally use the configuration information to contact the master and join the distributed computing framework. This approach may provide numerous benefits, including improved security, flexibility in the selection of worker nodes, and redundancy for failures of physical components of the data intake and query system.

    Batch searches in data fabric service system

    公开(公告)号:US10592563B2

    公开(公告)日:2020-03-17

    申请号:US15339853

    申请日:2016-10-31

    Applicant: Splunk Inc.

    Abstract: The disclosed embodiments include a technique to obtain search results from the application of transformation operations on partial search results obtained from across internal and/or external data sources. Examples of transformation operations include arithmetic operations such as an average, mean, count, or the like. Examples of reporting transformations include join operations, statistics, sort, top head. Hence, the search results of a search query can be derived from partial search result rather than include the actual partial search results. In this case, the ordering of the search results may be nonessential. An example of a search query that requires a transformation operation is a “batch” or “reporting” search query. The related disclosed techniques involve obtaining data stored in the bid data ecosystem, and returning that data or data derived from that data.

Patent Agency Ranking