-
公开(公告)号:US20200042626A1
公开(公告)日:2020-02-06
申请号:US16050487
申请日:2018-07-31
Applicant: SPLUNK INC.
Inventor: Kristal Lyn Curtis , Archana Sulochana Ganapathi , Adam Oliner , Steve Yu Zhang
IPC: G06F17/30
Abstract: Embodiments of the present invention are directed to identifying related data, in particular, data associated with different source types. In embodiments, a first source type related to a second source type associated with a search query is identified. Field set pairs are identified from a first data set associated with the first source type and a second data set associated with the second source type. Each field set pair can include one field set associated with the first source type and another field set associated with the second source type. For each field set pair, an extent of similarity is determined between the corresponding field sets. Based on the extent of similarities between the corresponding field sets, at least one pair of related field sets is identified. An indication of the at least one pair of related field sets is provided, for example, for presentation to a user.
-
公开(公告)号:US10552728B2
公开(公告)日:2020-02-04
申请号:US15224493
申请日:2016-07-29
Applicant: Splunk, Inc.
Inventor: Adam Oliner , Zidong Yang , Sinduja Sreshta
IPC: G06N3/04 , G06F17/27 , G06Q10/06 , G06F16/2453
Abstract: Described herein is a technology that facilitates the production of and the use of automated datagens for event-based systems. A datagen (i.e., data-generator or data generation system) is a component, module, or subsystem of computer systems that searches, monitors, and analyzes machine data. Existing datagens are not capable of detecting an anomaly in machine data. An anomaly is a variance in the input data stream that exceeds some acceptable amount of deviation from the norm (i.e., standard, expectation, etc.). An embodiment of datagen, in accordance with the technology described herein, detects anomalies in the input machine data.
-
43.
公开(公告)号:US20190034484A1
公开(公告)日:2019-01-31
申请号:US15663726
申请日:2017-07-29
Applicant: Splunk Inc.
Inventor: Dipock Das , Dayanand Pochugari , Neeraj Verma , Nikesh Padakanti , Aungon Nag Radon , Anand Srinivasabagavathar , Adam Oliner
CPC classification number: G06F16/24534 , G06F16/24522 , G06F16/248 , G06N3/08 , G06N5/022 , G06N5/046 , G06N20/00 , G06N20/10
Abstract: In various embodiments, a natural language (NL) application implements functionality that enables users to more effectively access various data storage systems based on NL requests. As described, the operations of the NL application are guided by, at least in part, on one or more templates and/or machine-learning models. Advantageously, the templates and/or machine-learning models provide a flexible framework that may be readily tailored to reduce the amount of time and user effort associated with processing NL requests and to increase the overall accuracy of NL application implementations.
-
44.
公开(公告)号:US20190034429A1
公开(公告)日:2019-01-31
申请号:US15663720
申请日:2017-07-29
Applicant: Splunk Inc.
Inventor: Dipock Das , Dayanand Pochugari , Neeraj Verma , Nikesh Padakanti , Aungon Nag Radon , Anand Srinivasabagavathar , Adam Oliner
Abstract: In various embodiments, a natural language (NL) application implements functionality that enables users to more effectively access various data storage systems based on NL requests. As described, the operations of the NL application are guided by, at least in part, on one or more templates and/or machine-learning models. Advantageously, the templates and/or machine-learning models provide a flexible framework that may be readily tailored to reduce the amount of time and user effort associated with processing NL requests and to increase the overall accuracy of NL application implementations.
-
公开(公告)号:US11816140B1
公开(公告)日:2023-11-14
申请号:US17659305
申请日:2022-04-14
Applicant: SPLUNK Inc.
Inventor: Adam Oliner
Abstract: Described herein are technologies that facilitate effective use (e.g., indexing and searching) of non-text machine data (e.g., audio/visual data) in an event-based machine-data intake and query system.
-
公开(公告)号:US11755938B2
公开(公告)日:2023-09-12
申请号:US16776302
申请日:2020-01-29
Applicant: SPLUNK INC.
Inventor: Nghi Nguyen , Jacob Leverich , Adam Oliner
Abstract: Methods and systems for determining event probabilities and anomalous events are provided. In one implementation, a method includes: receiving source data, where the source data is configured as a plurality of events with associated timestamps; searching the source data, where the searching provides a search result including N events from the plurality of events, where N is an integer greater than one, where each event of the N events includes a plurality of field values, where at least one event of the N events can include one or more categorical field values and one or more numerical field values; and for an event of the N events, determining a probability of occurrence for each field value of the plurality of field values; and using probabilities determined for the plurality of field values, determining a probability of occurrence for the event.
-
公开(公告)号:US11741396B1
公开(公告)日:2023-08-29
申请号:US17969569
申请日:2022-10-19
Applicant: SPLUNK Inc.
Inventor: Lin Ma , Jacob Leverich , Adam Oliner , Alex Cruise , Hongyang Zhang
IPC: G06F16/00 , G06N20/00 , G06F7/08 , H04L67/10 , H04L9/40 , G06F16/28 , G06F16/951 , G06F16/2455 , G06F16/903 , H04L41/14
CPC classification number: G06N20/00 , G06F7/08 , G06F16/24564 , G06F16/283 , G06F16/90335 , G06F16/951 , H04L41/14 , H04L63/1416 , H04L67/10
Abstract: Embodiments of the present invention are directed to facilitating distributed data processing for machine learning. In accordance with aspects of the present disclosure, a set of commands in a query to process at an external computing service is identified. For each command in the set of commands, at least one compute unit including at least one operation to perform at the external computing service is identified. Each of the at least one compute unit associated with each command is analyzed to identify an optimized manner in which to execute the set of commands at the external computing service. An indication of the optimized manner in which to execute the set of commands and a corresponding set of data is provided to the external computing service to utilize for executing the set of commands at the external computing service.
-
公开(公告)号:US11501112B1
公开(公告)日:2022-11-15
申请号:US15967435
申请日:2018-04-30
Applicant: SPLUNK, INC.
Inventor: Adam Oliner , Kristal Curtis , Nghi Huu Nguyen , Alexander Johnson
IPC: G06F16/90 , G06K9/62 , G06F11/07 , G06F17/18 , G06N20/00 , G06F16/907 , G06F16/903 , G06F16/28
Abstract: A computerized method of diagnosing a mislabeling of a source type of a received event. The method comprising operations of receiving an event by a source type analysis logic with a data index and query system, wherein the event includes a portion of raw machine data and is associated with a specific point in time, obtaining an original source type assigned to the event and one or more predicted source types. The one or more predicted source types are determined by analysis of a data representation of the event in view of training data and the training data includes a plurality of data representations corresponding to known source types. Additionally, the computerized method also includes an operation of, determining whether the event has been mislabeled and in response to determining the event has been mislabeled, diagnosing a source of the mislabeling.
-
公开(公告)号:US11232146B2
公开(公告)日:2022-01-25
申请号:US15664991
申请日:2017-07-31
Applicant: SPLUNK, Inc.
Inventor: Adam Oliner
IPC: G06F16/00 , G06F16/43 , G06F16/438
Abstract: Described herein are technologies that facilitate effective use (e.g., indexing and searching) of non-text machine data (e.g., audio/visual data) in an event-based machine-data intake and query system.
-
公开(公告)号:US11106681B2
公开(公告)日:2021-08-31
申请号:US16175636
申请日:2018-10-30
Applicant: Splunk, Inc.
Inventor: Adam Oliner , Eric Sammer , Kristal Curtis , Nghi Nguyen
IPC: G06F17/00 , G06F16/2455 , G06F40/205 , G06F16/248 , G06N5/04
Abstract: Messages of a first data stream may be accessed from an ingestion buffer in communication with a streaming data processor to receive data from the first data stream. At the streaming data processor and using an inference model, a sourcetype associated with one or more messages from the first data stream may be determined. The one or more messages may include a portion of machine data. Using the streaming data processor, a second data stream may be generated from the first data stream. The second data stream may include a subset of messages from the first data stream. A message of the subset of messages may be included in the second data stream based on a condition associated with the sourcetype for the message. At least one processing operation may be performed on at least one of the subset of messages from the second data stream.
-
-
-
-
-
-
-
-
-