-
Publication No.: US12131233B1
Publication date: 2024-10-29
Application No.: US17074407
Filing date: 2020-10-19
Applicant: SPLUNK Inc.
Inventor: Chinmay Madhav Kulkarni , Lin Ma , Amir Malekpour , Mohan Rajagopalan , John C. Reed , Ram Sriharsha
IPC: G06N20/00 , G06F16/245 , G06F18/214 , G06N3/08 , G06N5/025 , G06N5/04
CPC classification number: G06N20/00 , G06F16/245 , G06F18/2148 , G06N3/08 , G06N5/025 , G06N5/04
Abstract: A method for deployment of machine-learning based operators within a query is described. For this embodiment, a sequence of operators associated with a query is identified, which includes at least a first operator and at least a second operator. The second operator is configured to perform operations, in accordance with a machine learning (ML) component, on data received as input from execution of the first operator. Schemas associated with the machine learning component are retrieved along with schemas associated with other operators within the sequence. Compatibility between at least an output schema associated with the first operator and an input schema associated with the second operator associated with the ML component is determined. Thereafter, a portion of the sequence of operators including at least the second operator and another operator of the sequence of operators successive to the second operator may be stored within a data store for subsequent use.
-
Publication No.: US11921720B1
Publication date: 2024-03-05
Application No.: US17978684
Filing date: 2022-11-01
Applicant: SPLUNK Inc.
Inventor: Chinmay Madhav Kulkarni , Lin Ma , Amir Malekpour , Mohan Rajagopalan , John C. Reed , Ram Sriharsha
IPC: G06F16/2453 , G06F16/2455 , G06N20/00
CPC classification number: G06F16/24549 , G06F16/2455 , G06N20/00
Abstract: A computer-implemented method is disclosed that includes operations of parsing a query comprised of a sequence of operators to detect each operator of the sequence of operators, where the sequence of operators includes a machine learning (ML) operator representing a trained ML model. Additionally, a schema of the ML operator is determined through metadata. A filter or a projection is generated based on the schema of the ML operator, where the filter or projection is configured to reduce an amount of data retrieved upon application of the filter or the projection to an operator of the sequence of operators comprising the query. The schema of the ML operator indicates a schema of input data to be provided to the ML operator and a schema of output data to be provided by the ML operator following processing.
-
Publication No.: US11567735B1
Publication date: 2023-01-31
Application No.: US17074280
Filing date: 2020-10-19
Applicant: SPLUNK Inc.
Inventor: Chinmay Madhav Kulkarni , Lin Ma , Amir Malekpour , Mohan Rajagopalan , John C. Reed , Ram Sriharsha
IPC: G06F8/30 , G06F16/21 , G06F16/953 , G06N20/00 , G06F8/41
Abstract: According to one embodiment, a method that supports queries deploying operators based on multiple programming languages is described. A sequence of operators associated with a query is identified, where the sequence of operators includes at least two neighboring operators including a first operator based on a first programming language and a second operator based on a second programming language that is different from the first programming language. Thereafter, a schema associated with the first operator and a schema associated with the second operator are determined, along with the compatibility between the schema of the first operator and the schema of the second operator. A query error message is generated in response to incompatibility between the first operator schema and the second operator schema. Compatibility is determined when an output generated by execution of the first operator provides machine data needed as input for execution of the second operator.
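Illustrative sketch (not taken from the patent): the compatibility test described in this abstract can be pictured as comparing the output schema of one operator with the input schema of its neighbor and producing error messages on a mismatch. The `Operator` class, the `check_compatibility` function, the schema representation (field name mapped to a type name), and the sample operators are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Operator:
    """Hypothetical operator record; schemas map field name -> type name."""
    name: str
    language: str
    input_schema: dict[str, str]
    output_schema: dict[str, str]


def check_compatibility(first: Operator, second: Operator) -> list[str]:
    """Return query error messages; an empty list means the pair is compatible.

    Assumption: the pair is compatible when every field the second operator
    needs is produced by the first operator with the same type name.
    """
    errors = []
    for field_name, field_type in second.input_schema.items():
        produced = first.output_schema.get(field_name)
        if produced is None:
            errors.append(
                f"{second.name} needs field '{field_name}' "
                f"which {first.name} does not output"
            )
        elif produced != field_type:
            errors.append(
                f"{second.name} expects '{field_name}' as {field_type}, "
                f"but {first.name} outputs {produced}"
            )
    return errors


# Two neighboring operators written in different languages (made-up examples).
parse_logs = Operator("parse_logs", "spl", {}, {"host": "string", "latency_ms": "number"})
score = Operator("score", "python", {"host": "string", "latency_ms": "number"}, {"anomaly": "number"})
bad_score = Operator("bad_score", "python", {"latency_ms": "string", "region": "string"}, {})

ok_errors = check_compatibility(parse_logs, score)
bad_errors = check_compatibility(parse_logs, bad_score)
```

Here `ok_errors` is empty, while `bad_errors` holds one message for the type mismatch on `latency_ms` and one for the missing `region` field.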
-
Publication No.: US11537951B2
Publication date: 2022-12-27
Application No.: US17146339
Filing date: 2021-01-11
Applicant: SPLUNK INC.
Inventor: Lin Ma , Jacob Leverich , Adam Oliner , Alex Cruise , Hongyang Zhang
IPC: G06F7/08 , G06N20/00 , H04L67/10 , H04L9/40 , G06F16/28 , G06F16/951 , G06F16/2455 , G06F16/903 , H04L41/14
Abstract: Embodiments of the present invention are directed to facilitating distributed data processing for machine learning. In accordance with aspects of the present disclosure, a set of commands in a query to process at an external computing service is identified. For each command in the set of commands, at least one compute unit including at least one operation to perform at the external computing service is identified. Each of the at least one compute unit associated with each command is analyzed to identify an optimized manner in which to execute the set of commands at the external computing service. An indication of the optimized manner in which to execute the set of commands and a corresponding set of data is provided to the external computing service to utilize for executing the set of commands at the external computing service.
-
Publication No.: US11727007B1
Publication date: 2023-08-15
Application No.: US17073752
Filing date: 2020-10-19
Applicant: SPLUNK Inc.
Inventor: Chinmay Madhav Kulkarni , Lin Ma , Amir Malekpour , Mohan Rajagopalan , John C. Reed , Ram Sriharsha
IPC: G06F16/21 , G06F16/2455 , G06F16/2453 , G06F16/242 , G06N20/00 , G06F16/248 , G06N5/01
CPC classification number: G06F16/24553 , G06F16/248 , G06F16/2423 , G06F16/24534 , G06N5/01 , G06N20/00
Abstract: A computer-implemented method is disclosed including operations of receiving a request to store a representation of a machine learning model in a non-transitory computer-readable medium, validating the representation of the machine learning model, storing the representation of the machine learning model, receiving a query from a web-based programming application, the query including a sequence of operators, parsing the query to detect and identify each operator within the sequence of operators, converting the query to a directed acyclic graph (DAG), and providing the DAG to a distributed processing engine configured to execute the DAG. The computer-implemented method includes further operations of, prior to converting the query to the DAG, altering the query to improve efficiency of execution of the DAG. Altering the query may include at least one of consolidating at least two operators, applying a filter operation to an operator, or applying a projection to the operator.
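Illustrative sketch (not taken from the patent): the parse, alter, and convert-to-DAG steps named in this abstract might look roughly like the following. The pipe-delimited query syntax, the `where` keyword, and the function names `parse_query`, `consolidate_filters`, and `to_dag` are assumptions made for this example; only `graphlib.TopologicalSorter` is a real standard-library API (Python 3.9+).

```python
from graphlib import TopologicalSorter


def parse_query(query: str) -> list[str]:
    """Split a pipe-delimited query (hypothetical syntax) into operators."""
    return [part.strip() for part in query.split("|") if part.strip()]


def consolidate_filters(ops: list[str]) -> list[str]:
    """Merge adjacent 'where' operators into one, as one possible alteration."""
    merged: list[str] = []
    for op in ops:
        if merged and op.startswith("where ") and merged[-1].startswith("where "):
            merged[-1] = f"{merged[-1]} AND {op[len('where '):]}"
        else:
            merged.append(op)
    return merged


def to_dag(ops: list[str]) -> dict[int, set[int]]:
    """Map each operator index to the set of indices it depends on.

    A linear pipeline yields a chain, which is the simplest possible DAG.
    """
    return {i: ({i - 1} if i > 0 else set()) for i in range(len(ops))}


query = "from logs | where status=500 | where host='a' | stats count"
ops = consolidate_filters(parse_query(query))
dag = to_dag(ops)
# A stand-in for handing the DAG to an execution engine: a valid run order.
order = list(TopologicalSorter(dag).static_order())
```

With this input, the two `where` operators collapse into one, so `ops` has three entries and `order` is `[0, 1, 2]`.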
-
Publication No.: US12164522B1
Publication date: 2024-12-10
Application No.: US17476323
Filing date: 2021-09-15
Applicant: Splunk Inc.
IPC: G06F16/2455 , G06F11/34 , G06F16/23 , G06N20/00
Abstract: An interface and improved data intake and query system is described herein that allows users to define metrics and that aggregates metric values regardless of the level at which a metric is defined and/or the level at which metric values are available. The improved data intake and query system can initialize a sketch in response to a user providing one or more metric definitions. The initialized sketch includes one or more instances, where each instance produces an output and collects metric value(s), appends the metric value(s) to the output, and forwards the appended data to a process function downstream in a data processing pipeline. The process function separates the output and the metric value(s), sending the output further downstream in the data processing pipeline and sending the metric value(s) to a parallel process function that sits outside the data processing pipeline. The parallel process function can persist the metric value(s).
-
Publication No.: US11921693B1
Publication date: 2024-03-05
Application No.: US17305650
Filing date: 2021-07-12
Applicant: Splunk Inc.
Inventor: Itay Alfred Neeman , Glenn Block , Lin Ma , Mitch Blank , Vishal Patel
IPC: G06F16/23 , G06F16/22 , G06F16/28 , G06F16/951
CPC classification number: G06F16/2322 , G06F16/22 , G06F16/282 , G06F16/951
Abstract: A data intake and query system receives a message including raw machine data via an internet protocol (IP) such as the hypertext transfer protocol (HTTP). The message includes a distinct payload portion and a distinct custom field portion. The payload portion includes raw machine data, while the custom field portion includes values for fields. An event that includes the raw machine data and the values is generated from the payload portion and the values are extracted from the custom field portion. The event is then stored such that the values are associated with the event.
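Illustrative sketch (not taken from the patent): one way to picture a message with a distinct payload portion and a distinct custom field portion is a JSON body with two top-level keys. The key names `payload` and `fields`, the `_raw` event key, and the `build_event` function are assumptions for this example and do not describe any actual Splunk API.

```python
import json


def build_event(message_body: str) -> dict:
    """Build an event from a JSON message body (hypothetical wire format).

    The raw machine data comes from the payload portion; the field values
    come from the separate custom field portion and stay attached to the event.
    """
    message = json.loads(message_body)
    raw = message["payload"]
    fields = dict(message.get("fields", {}))
    return {"_raw": raw, "fields": fields}


body = '{"payload": "GET /index.html 200", "fields": {"team": "web", "env": "prod"}}'
event = build_event(body)
```

The resulting `event` keeps the raw text untouched under `_raw` and carries the custom values alongside it, so they do not need to be parsed out of the raw data later.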
-
Publication No.: US11748634B1
Publication date: 2023-09-05
Application No.: US17074206
Filing date: 2020-10-19
Applicant: SPLUNK Inc.
Inventor: Chinmay Madhav Kulkarni , Lin Ma , Amir Malekpour , Mohan Rajagopalan , John C. Reed , Ram Sriharsha
IPC: G06F18/214 , G06F16/28 , G06F16/2455 , G06F16/248 , G06F8/41 , G06N5/025 , G06N20/00
CPC classification number: G06N5/025 , G06F18/2148 , G06N20/00 , G06F8/4452 , G06F16/248 , G06F16/24553 , G06F16/287
Abstract: A computer-implemented method for integration of machine learning components within a pipelined search query to generate a visualization is described. Herein, an interface is provided for receipt of pipelined code into a web-based programming application. The pipelined code features a series of operators configured to perform one or more tasks based on collective operations by the series of operators, wherein a first operator of the series of operators is to receive input data from a selected data source and each remaining operator of the series of operators is to receive input based on an output from a preceding operator of the remaining operators. The task(s) performed by the pipelined code generate results including visualizations. The visualization is rendered in a manner that allows the pipelined code to be scrolled to display the pipelined code or the visualization.
-
Publication No.: US11500871B1
Publication date: 2022-11-15
Application No.: US17074100
Filing date: 2020-10-19
Applicant: SPLUNK Inc.
Inventor: Chinmay Madhav Kulkarni , Lin Ma , Amir Malekpour , Mohan Rajagopalan , John C. Reed , Ram Sriharsha
IPC: G06F16/2453 , G06N20/00 , G06F16/2455
Abstract: A computer-implemented method is disclosed that includes operations of receiving a query to be executed, the query including an indication of a data source at which input data is to be obtained, wherein the query is to be executed on the input data, determining a schema of the input data, determining fields of the input data that are required for execution of the query by analyzing a sequence of operators forming the query, determining one or more alterations to the query to improve efficiency of the execution of the query based on the fields of input data required for the execution, and generating an altered query by altering the query in accordance with the one or more alterations. The method may further include converting the query to a directed acyclic graph (DAG) and providing the DAG to a distributed processing engine configured to execute the DAG.
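Illustrative sketch (not taken from the patent): determining the required input fields by analyzing the operator sequence can be approximated with a backward pass over the operators, followed by a projection that drops unused fields at the source. The `Op` class, its `reads`/`writes` sets, and both function names are hypothetical. The sketch assumes every operator passes all fields through and that the final operator names the fields the result needs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Op:
    """Hypothetical operator: the fields it reads and the new fields it creates."""
    name: str
    reads: frozenset
    writes: frozenset = frozenset()


def required_source_fields(ops: list) -> set:
    """Walk the operators from last to first, tracking fields needed upstream."""
    needed: set = set()
    for op in reversed(ops):
        needed -= op.writes  # fields created here are not needed from upstream
        needed |= op.reads   # fields read here must arrive from upstream
    return needed


def add_projection(schema: list, ops: list) -> list:
    """Return an altered query that starts with a projection, if one helps."""
    needed = required_source_fields(ops) & set(schema)
    if needed == set(schema):
        return list(ops)  # every source field is used; nothing to prune
    return [Op("project", reads=frozenset(needed)), *ops]


schema = ["host", "status", "bytes", "user_agent"]
ops = [
    Op("where", reads=frozenset({"status"})),
    Op("eval", reads=frozenset({"bytes"}), writes=frozenset({"kb"})),
    Op("stats", reads=frozenset({"host", "kb"})),
]
needed = required_source_fields(ops)
altered = add_projection(schema, ops)
```

In this example `user_agent` is never read, so the altered query begins with a `project` operator restricted to `host`, `status`, and `bytes`.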
-
Publication No.: US10922625B2
Publication date: 2021-02-16
Application No.: US15885395
Filing date: 2018-01-31
Applicant: Splunk Inc.
Inventor: Lin Ma , Jacob Leverich , Adam Oliner , Alex Cruise , Hongyang Zhang
IPC: G06F17/00 , G06N20/00 , G06F7/08 , H04L29/08 , H04L29/06 , G06F16/28 , G06F16/951 , G06F16/2455 , G06F16/903 , H04L12/24
Abstract: Embodiments of the present invention are directed to facilitating distributed data processing for machine learning. In accordance with aspects of the present disclosure, a set of commands in a query to process at an external computing service is identified. For each command in the set of commands, at least one compute unit including at least one operation to perform at the external computing service is identified. Each of the at least one compute unit associated with each command is analyzed to identify an optimized manner in which to execute the set of commands at the external computing service. An indication of the optimized manner in which to execute the set of commands and a corresponding set of data is provided to the external computing service to utilize for executing the set of commands at the external computing service.