SWAPPABLE ONLINE MACHINE LEARNING ALGORITHMS IMPLEMENTED IN A DATA INTAKE AND QUERY SYSTEM

    Publication No.: US20210117868A1

    Publication date: 2021-04-22

    Application No.: US16779509

    Application date: 2020-01-31

    Applicant: Splunk Inc.

    Inventor: Ram Sriharsha

    Abstract: Systems and methods are described for testing one or more machine learning algorithms in parallel with an existing machine learning algorithm implemented within a data processing pipeline. Each machine learning algorithm can train a machine learning model that receives a live stream of raw machine data. The output of the machine learning model trained by the existing machine learning algorithm may be written to an external storage system, but the output of the machine learning model(s) trained by the test machine learning algorithm(s) may not be written to an external storage system. After some time, performance of the test machine learning algorithm(s) and the existing machine learning algorithm is evaluated. If the test machine learning algorithm performs better than the existing machine learning algorithm, then the machine learning algorithms can be swapped without any downtime and without needing to retrain a machine learning model using previously seen raw machine data.
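The parallel-test-then-swap idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names (`RunningMean`, `ExponentialMean`, `ShadowPipeline`), the absolute-error metric, and the in-memory list standing in for the external storage system are all assumptions made for illustration.

```python
class RunningMean:
    """Hypothetical 'existing' online learner: predicts the mean of all values seen."""
    def __init__(self):
        self.n = 0
        self.mean = 0.0

    def predict(self):
        return self.mean

    def update(self, x):
        self.n += 1
        self.mean += (x - self.mean) / self.n


class ExponentialMean:
    """Hypothetical 'test' online learner: exponentially weighted mean."""
    def __init__(self, alpha=0.5):
        self.alpha = alpha
        self.mean = 0.0
        self.seen = False

    def predict(self):
        return self.mean

    def update(self, x):
        if not self.seen:
            self.mean, self.seen = x, True
        else:
            self.mean += self.alpha * (x - self.mean)


class ShadowPipeline:
    """Trains both learners on the same live stream; only the active one writes output."""
    def __init__(self, active, candidate, sink):
        self.active, self.candidate = active, candidate
        self.sink = sink  # stand-in for the external storage system
        self.active_error = 0.0
        self.candidate_error = 0.0

    def process(self, x):
        # Predict before updating, so each error is measured on unseen data.
        a, c = self.active.predict(), self.candidate.predict()
        self.active_error += abs(x - a)
        self.candidate_error += abs(x - c)
        self.sink.append(a)  # the candidate's prediction is never written
        self.active.update(x)
        self.candidate.update(x)

    def maybe_swap(self):
        """Promote the candidate if it has accumulated less error so far."""
        if self.candidate_error < self.active_error:
            self.active, self.candidate = self.candidate, self.active
            self.active_error, self.candidate_error = (
                self.candidate_error, self.active_error)
            return True
        return False


sink = []
pipeline = ShadowPipeline(RunningMean(), ExponentialMean(), sink)
for value in [1, 1, 1, 1, 10, 10, 10, 10]:  # stream with a level shift
    pipeline.process(value)
swapped = pipeline.maybe_swap()
```

Because the candidate has been learning from the same stream all along, promoting it is just a reference swap: its state is already current, so nothing has to be replayed or retrained.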

    SAMPLING-BASED PREVIEW MODE FOR A DATA INTAKE AND QUERY SYSTEM

    Publication No.: US20210117382A1

    Publication date: 2021-04-22

    Application No.: US16779486

    Application date: 2020-01-31

    Applicant: Splunk Inc.

    Inventor: Ram Sriharsha

    Abstract: Systems and methods are described for providing a user interface through which a user can program operation of a data processing pipeline by specifying a graph of nodes that transform data and interconnections that designate routing of data between individual nodes within the graph. In response to a user request, a preview mode can be activated that causes the data processing pipeline to retrieve data from at least one source specified by the graph, transform the data according to the nodes of the graph, sample the transformed data, and display the sampling of the transformed data to at least one node without writing the transformed data to at least one destination specified by the graph.
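A preview mode of this kind can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Pipeline` class and its methods are hypothetical, the graph is reduced to a linear chain of nodes (the simplest case of a graph), and a plain list stands in for the destination.

```python
import random


class Pipeline:
    """Hypothetical pipeline: a source, an ordered chain of transform nodes, a destination."""
    def __init__(self, source, nodes, destination):
        self.source = source            # callable returning an iterable of records
        self.nodes = nodes              # list of (name, transform) pairs
        self.destination = destination  # list standing in for an external sink

    def _apply_nodes(self):
        """Run every node in order; return the final records and each node's output."""
        records = list(self.source())
        per_node = {}
        for name, transform in self.nodes:
            records = transform(records)
            per_node[name] = records
        return records, per_node

    def run(self):
        """Normal mode: transformed data is written to the destination."""
        final, _ = self._apply_nodes()
        self.destination.extend(final)

    def preview(self, sample_size, seed=0):
        """Preview mode: sample each node's output and write nothing."""
        _, per_node = self._apply_nodes()
        rng = random.Random(seed)
        return {
            name: rng.sample(records, min(sample_size, len(records)))
            for name, records in per_node.items()
        }


destination = []
pipeline = Pipeline(
    source=lambda: range(10),
    nodes=[
        ("double", lambda rs: [r * 2 for r in rs]),
        ("keep_multiples_of_4", lambda rs: [r for r in rs if r % 4 == 0]),
    ],
    destination=destination,
)
preview = pipeline.preview(sample_size=3)
```

Calling `preview` returns a small sample per node and leaves `destination` empty, while `run` performs the same transformations and writes the result.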
