-
Publication No.: US10067876B2
Publication Date: 2018-09-04
Application No.: US15402105
Filing Date: 2017-01-09
Applicant: Splunk, Inc.
Inventor: Ledion Bitincka , Alexandros Batsakis , Paul J. Lucas , Nicholas Robert Romito
IPC: G06F12/00 , G06F12/0875 , G06F17/30 , G06F12/0862 , G06F3/06 , G06F12/0873 , G06F12/0802 , G06F12/0866 , G06F12/0868 , G06F12/0871
CPC classification number: G06F12/0875 , G06F16/172 , G06F16/951 , G06F16/9574 , G06F2212/1021 , G06F2212/45 , G06F2212/6024 , G06F2212/6026 , G06F2212/6028
Abstract: Embodiments are disclosed for a prefetching method that may include copying, in response to a search query, a first bucket from a remote storage to a cache. The first bucket may include first data associated with the search query. The method may further include identifying a first file type associated with a first file in the first bucket. The first file may be associated with a usage status. The method may further include accessing, based on the search query, a second bucket from the remote storage. The second bucket may include second data associated with the search query. The method may further include identifying a second file in the second bucket having the first file type, and copying, in response to the usage status indicating that the first file was used in processing the search query, the second file from the remote storage to the cache.
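The prefetching rule in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class names, the file-type labels, and the dictionary standing in for the cache are all hypothetical. It only shows the stated idea that a file in the second bucket is copied to the cache when a file of the same type in the first bucket was actually used in processing the query.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class BucketFile:
    name: str
    file_type: str       # hypothetical labels such as "index" or "rawdata"
    used: bool = False   # usage status: was this file used to process the query?


@dataclass
class Bucket:
    bucket_id: str
    files: list[BucketFile] = field(default_factory=list)


def used_file_types(bucket: Bucket) -> set[str]:
    """Return the file types whose files were used while processing the query."""
    return {f.file_type for f in bucket.files if f.used}


def prefetch(first: Bucket, second: Bucket, cache: dict[str, BucketFile]) -> list[str]:
    """Copy files of `second` into the cache if their type was used in `first`."""
    wanted = used_file_types(first)
    copied = []
    for f in second.files:
        key = f"{second.bucket_id}/{f.name}"
        if f.file_type in wanted and key not in cache:
            cache[key] = f   # stands in for a copy from remote storage
            copied.append(key)
    return copied
```

Files of a type that the query never touched in the first bucket are left in remote storage, which is the point of keying the decision on the usage status.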
-
Publication No.: US20180196824A1
Publication Date: 2018-07-12
Application No.: US15402119
Filing Date: 2017-01-09
Applicant: Splunk, Inc.
Inventor: Ledion Bitincka , Alexandros Batsakis , Paul J. Lucas , Nicholas Robert Romito
IPC: G06F17/30
CPC classification number: G06F12/0875 , G06F12/0802 , G06F12/0862 , G06F12/0866 , G06F12/0868 , G06F12/0871 , G06F12/0873 , G06F17/30106 , G06F17/30132 , G06F17/30864 , G06F17/30902 , G06F2212/1021 , G06F2212/45 , G06F2212/6024 , G06F2212/6026 , G06F2212/6028
Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage are identified for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
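The ordering described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: `cache_aware_search`, `fetch`, and `search_file` are hypothetical names, the cache is a plain dictionary, and fetching is done sequentially rather than in the background.

```python
def cache_aware_search(file_ids, cache, fetch, search_file):
    """Search cached files first, then the rest once each is stored in the cache.

    file_ids    -- identifiers of the files needed by the query
    cache       -- dict mapping file identifier to file content
    fetch       -- callable that retrieves a file from remote storage (assumed)
    search_file -- callable that searches one file's content (assumed)
    """
    # Snapshot of cache membership at the moment the query is received.
    cached = [f for f in file_ids if f in cache]
    pending = [f for f in file_ids if f not in cache]

    results = []
    for file_id in cached:
        results.append((file_id, search_file(cache[file_id])))
    for file_id in pending:
        cache[file_id] = fetch(file_id)   # the file is now stored in the cache
        results.append((file_id, search_file(cache[file_id])))
    return results
```

A real system would presumably overlap the remote fetches with the search over cached files; the sequential loop here only makes the ordering visible.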
-
Publication No.: US12093272B1
Publication Date: 2024-09-17
Application No.: US17661528
Filing Date: 2022-04-29
Applicant: Splunk Inc.
Inventor: Alexandros Batsakis , Nitilaksha Satyaveera Halakatti , Ningxuan He , Prem Kumar Jayaraj , Manuel Gregorio Martinez , Balaji Rao , Jianming Zhang , Steve Yu Zhang
IPC: G06F16/2458
CPC classification number: G06F16/2471
Abstract: A computing device can receive a query that identifies a set of data to be processed and determine that a portion of the set of data resides in an external data system. The query system can request data identifiers associated with data objects of the set of data from the external data system and communicate the data identifiers to a data queue. The computing device can instruct one or more search nodes to retrieve the identifiers from the data queue. The search nodes can use the data identifiers to retrieve data objects from the external data system and process the data objects according to instructions received from the computing device. The search nodes can provide results of the processing to the computing device.
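The queue-based flow in this abstract can be sketched with the Python standard library. The sketch assumes an in-process `queue.Queue` as the data queue, threads as search nodes, and a dictionary as the external data system; all function and parameter names are hypothetical.

```python
import queue
import threading


def run_query(identifiers, external_store, process, num_nodes=2):
    """Distribute data identifiers through a queue to worker 'search nodes'."""
    data_queue = queue.Queue()
    for ident in identifiers:          # coordinator communicates identifiers
        data_queue.put(ident)

    results = []
    lock = threading.Lock()

    def search_node():
        while True:
            try:
                ident = data_queue.get_nowait()
            except queue.Empty:
                return                 # no identifiers left to process
            obj = external_store[ident]    # retrieve the data object by identifier
            out = process(obj)             # apply the coordinator's instructions
            with lock:
                results.append((ident, out))

    nodes = [threading.Thread(target=search_node) for _ in range(num_nodes)]
    for node in nodes:
        node.start()
    for node in nodes:
        node.join()
    return results                     # order depends on thread scheduling
```

Because each node pulls work when it is free, a slow object delays only the node handling it, not the whole query.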
-
Publication No.: US11892976B2
Publication Date: 2024-02-06
Application No.: US17163047
Filing Date: 2021-01-29
Applicant: SPLUNK INC.
Inventor: Alexandros Batsakis , Ankit Jain , Manu Jose , Jonah Pan , Hailun Yan
IPC: G06F16/00 , G06F16/14 , G06F16/182 , G06F16/738
CPC classification number: G06F16/144 , G06F16/156 , G06F16/1824 , G06F16/738
Abstract: Embodiments described herein facilitate enhancement of data model acceleration, including generating data model summaries and performing searches in an accelerated manner. In one implementation, a search query is obtained from a user device. A determination may be made to execute a search, in association with the search query, via an external computing service. As such, the search query, or a variant thereof, can be provided to the external computing service, wherein the external computing service executes the search using data model summaries stored in a remote data store that is separate from a set of events from which the data model summaries were generated. A set of search results is received from the external computing service, and such search results are provided to the user device.
-
Publication No.: US11720537B2
Publication Date: 2023-08-08
Application No.: US17661510
Filing Date: 2022-04-29
Applicant: Splunk Inc.
Inventor: Tameem Anwar , Tianyi Gou , Alexandros Batsakis , Abhinav Prasad Nekkanti , Sai Krishna Sajja , Jiahan Wang
CPC classification number: G06F16/2228 , G06F16/14 , G06F16/16
Abstract: Systems and methods are disclosed for scalable bucket merging in a data intake and query system. Various components of a bucket manager can be used to monitor recently-created buckets of data in common storage that are associated with a particular tenant and a particular index, apply a comprehensive bucket merge policy to determine groups of buckets that qualify for merging, merge those groups of buckets into merged buckets to be stored in the common storage, and update any information associated with the merged buckets and pre-merged buckets. These components may be shared across multiple tenants, and some of these components may be dynamically scalable based on need. This approach may also provide many additional benefits, including improved search performance from merged buckets, efficient resource utilization associated with discriminate merging, and redundancy in case of component failure.
-
Publication No.: US11693710B1
Publication Date: 2023-07-04
Application No.: US17589722
Filing Date: 2022-01-31
Applicant: Splunk Inc.
Inventor: Bharath Kishore Reddy Aleti , Alexandros Batsakis , Mitchell Neuman Blank , Rama Gopalan , Hongxun Liu , Anish Shrigondekar
IPC: G06F16/00 , G06F7/00 , G06F9/50 , G06F16/2455 , G06F16/242 , G06F16/22
CPC classification number: G06F9/505 , G06F16/22 , G06F16/2425 , G06F16/24552
Abstract: Resource management includes storing, for multiple workload pools of a data intake and query system, a workload pool hierarchy arranged in multiple workload pool layers. After the hierarchy is stored, a processing request is assigned to a selected subset of workload pools in a second layer of the workload pool hierarchy based on the type of the processing request. The processing request is then assigned to an individual workload pool in the selected subset to obtain a selected workload pool. Execution of the processing request is initiated on the selected workload pool.
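The two-step assignment in this abstract can be sketched as below. The abstract does not say how the individual pool is chosen within the subset, so the least-loaded rule used here is an assumption, as are the pool names, the request types, and the nested-dictionary layout of the hierarchy.

```python
# Hypothetical hierarchy: request type -> subset of pools -> current load.
HIERARCHY = {
    "search": {"search_adhoc": 3, "search_scheduled": 1},
    "ingest": {"ingest_default": 0},
}


def assign_pool(request_type, hierarchy):
    """Pick a pool subset by request type, then one pool inside that subset."""
    subset = hierarchy[request_type]     # step 1: subset chosen by request type
    # Step 2 (assumption): choose the pool with the lowest current load.
    selected = min(subset, key=subset.get)
    subset[selected] += 1                # record that the request now runs there
    return selected
```

For example, `assign_pool("search", HIERARCHY)` would select `search_scheduled` with the loads shown, since it is the less loaded of the two search pools.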
-
Publication No.: US20230147068A1
Publication Date: 2023-05-11
Application No.: US18051481
Filing Date: 2022-10-31
Applicant: Splunk Inc.
Inventor: Balaji Rao , Jindrich Dinga , Kieran Cairney , Manuel Martinez , Nitilaksha Halakatti , Ningxuan He , Arindam Bhattacharjee , Sourav Pal , Alexandros Batsakis
IPC: G06F16/2453 , G06F16/2458 , H04L9/08 , H04L41/0806 , H04L67/10 , G06F8/61 , H04L67/52
CPC classification number: G06F16/24547 , G06F16/2465 , H04L9/0866 , H04L41/0806 , H04L67/10 , G06F8/61 , H04L67/52
Abstract: Systems and methods are described for establishing and managing components of a distributed computing framework implemented in a data intake and query system. The distributed computing framework may include a master and a plurality of worker nodes. The master may selectively operate on a search head captain that is chosen from the search heads of the data intake and query system. The search head captain may distribute configuration information for the master and the distributed computing framework to the other search heads, which in turn, may distribute that configuration information to indexers of the data intake and query system. Worker nodes may be selectively activated for operation on the indexers based on the configuration information, and the worker nodes may additionally use the configuration information to contact the master and join the distributed computing framework. This approach may provide numerous benefits, including improved security, flexibility in the selection of worker nodes, and redundancy for failures of physical components of the data intake and query system.
-
Publication No.: US11645210B2
Publication Date: 2023-05-09
Application No.: US17652635
Filing Date: 2022-02-25
Applicant: Splunk Inc.
Inventor: Ledion Bitincka , Alexandros Batsakis , Paul J. Lucas , Nicholas Robert Romito
IPC: G06F12/00 , G06F12/0875 , G06F16/172 , G06F16/951 , G06F16/957 , G06F3/06 , G06F12/0802 , G06F16/14 , G06F12/0862 , G06F12/0866 , G06F12/0868 , G06F12/0871 , G06F12/0873
CPC classification number: G06F12/0875 , G06F3/061 , G06F3/0611 , G06F12/0802 , G06F12/0862 , G06F12/0866 , G06F12/0868 , G06F12/0871 , G06F12/0873 , G06F16/148 , G06F16/172 , G06F16/951 , G06F16/9574 , G06F2212/1021 , G06F2212/45 , G06F2212/6024 , G06F2212/6026 , G06F2212/6028
Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage are identified for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
-
Publication No.: US11630695B1
Publication Date: 2023-04-18
Application No.: US17163160
Filing Date: 2021-01-29
Applicant: Splunk Inc.
Inventor: Bharath Kishore Reddy Aleti , Alexandros Batsakis , Mitchell Neuman Blank , Rama Gopalan , Hongxun Liu , Anish Shrigondekar
Abstract: Dynamic reassignment of search processes into workload pools includes receiving a search query to search at least one data store, assigning the search query to a first workload pool, and executing the search query using a first hardware resource in the first workload pool, the first hardware resource corresponding to a first portion of a hardware device. Dynamic reassignment further includes receiving, while executing the search query, an update command to move the search query to a second workload pool, moving, while executing the search query, the search query to the second workload pool, and continuing execution of the search query using a second hardware resource in the second workload pool. The second hardware resource corresponds to a second portion of the hardware device.
-
Publication No.: US20220382755A1
Publication Date: 2022-12-01
Application No.: US17652620
Filing Date: 2022-02-25
Applicant: Splunk Inc.
Inventor: Alexandros Batsakis , Scott Calvert , Alexander Douglas James , Bei Li , Ashish Mathew , James Monschke , Sogol Moshtaghi , Christopher Madden Pride , Xiaowei Wang
IPC: G06F16/2453 , G06F16/13
Abstract: Systems and methods are disclosed for mapping search nodes to a search head in a data intake and query system based on a tenant identifier in order to execute a query received by the data intake and query system. The mapping may allow same or similar search nodes to be used to execute queries that are associated with a particular tenant identifier, in order to take advantage of caching and local data stored with those search nodes. In some cases, search nodes can be mapped based on the tenant identifier using a hashing algorithm, such as a consistent hashing algorithm.
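The abstract names consistent hashing as one possible mapping. A minimal sketch of that technique follows; it is a generic hash ring, not Splunk's implementation, and the class name, the SHA-256 hash choice, and the number of virtual points per node are assumptions.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a 64-bit integer position on the ring."""
    return int.from_bytes(hashlib.sha256(key.encode("utf-8")).digest()[:8], "big")


class HashRing:
    def __init__(self, nodes, replicas=64):
        # Each node is placed at `replicas` virtual points to even out the load.
        self._ring = sorted(
            (_hash(f"{node}#{i}"), node) for node in nodes for i in range(replicas)
        )
        self._keys = [h for h, _ in self._ring]

    def nodes_for(self, tenant_id: str, count: int = 1):
        """Return up to `count` distinct search nodes for a tenant identifier."""
        if not self._ring:
            return []
        start = bisect.bisect(self._keys, _hash(tenant_id))
        chosen = []
        for offset in range(len(self._ring)):   # walk clockwise from the tenant
            node = self._ring[(start + offset) % len(self._ring)][1]
            if node not in chosen:
                chosen.append(node)
                if len(chosen) == count:
                    break
        return chosen
```

The same tenant identifier always maps to the same nodes, and removing a node that a tenant was not mapped to leaves that tenant's mapping unchanged, which is what lets repeated queries benefit from data cached on those nodes.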