File update detection and processing

    公开(公告)号:US09767112B2

    公开(公告)日:2017-09-19

    申请号:US15224649

    申请日:2016-07-31

    Applicant: Splunk Inc.

    CPC classification number: G06F17/30144 G06F17/3015 G06F17/30286

    Abstract: Embodiments are directed towards managing and tracking item identification of a plurality of items to determine if an item is a new or existing item, where an existing item has been previously processed. In some embodiments, two or more item identifiers may be generated. In one embodiment, generating the two or more item identifiers may include analyzing the item using a small item size characteristic, a compressed item, or for an identifier collision. The two or more item identifiers may be employed to determine if the item is a new or existing item. In one embodiment, the two or more item identifiers may be compared to a record about an existing item to determine if the item is a new or existing item. If the item is an existing item, then the item may be further processed to determine if the existing item has actually changed.

    Multi-site clustering
    22.
    发明授权
    Multi-site clustering 有权
    多站点集群

    公开(公告)号:US09124612B2

    公开(公告)日:2015-09-01

    申请号:US14266817

    申请日:2014-04-30

    Applicant: Splunk Inc.

    CPC classification number: H04L67/1097 G06F11/20 G06F17/30575

    Abstract: According to various embodiments, techniques are described for managing data within a multi-site clustered data intake and query system. A data intake and query system as described herein generally refers to a system for collecting, retrieving, and analyzing data. In this context, a clustered data intake and query system generally refers to a system environment that is configured to provide data redundancy and other features that improve the availability of data stored by the system. For example, a clustered data intake and query system may be configured to store multiple copies of data stored by the system across multiple components such that recovery from a failure of one or more of the components is possible by using copies of the data stored elsewhere in the cluster.

    Abstract translation: 根据各种实施例,描述了用于管理多站点群集数据访问和查询系统内的数据的技术。 本文所述的数据采集和查询系统通常是指用于收集,检索和分析数据的系统。 在这种情况下,集群数据采集和查询系统通常是指被配置为提供数据冗余和提高系统存储的数据的可用性的其他特征的系统环境。 例如,集群数据采集和查询系统可以被配置为存储由多个组件存储的系统的多个副本,以便可以通过使用其他地方存储的数据的副本来从一个或多个组件的故障中恢复 集群。

    Clustering for high availability and disaster recovery
    23.
    发明授权
    Clustering for high availability and disaster recovery 有权
    群集高可用性和灾难恢复

    公开(公告)号:US08788459B2

    公开(公告)日:2014-07-22

    申请号:US13648116

    申请日:2012-10-09

    Applicant: Splunk Inc.

    CPC classification number: H04L67/1097 G06F11/2097 G06F17/30312

    Abstract: Embodiments are directed towards managing within a cluster environment having a plurality of indexers for data storage using redundancy the data being managed using a generation identifier, such that a primary indexer is designated for a given generation of data. When a master device for the cluster fails, data may continue to be stored using redundancy, and data searches performed may still be performed.

    Abstract translation: 实施例旨在在具有多个索引器的集群环境内管理,用于使用生成标识符来管理数据的冗余来进行数据存储,从而为指定的生成数据指定主索引器。 当集群的主设备发生故障时,可以继续使用冗余来存储数据,并且仍然可以执行数据搜索。

    Ingest health monitoring
    24.
    发明授权

    公开(公告)号:US12061533B1

    公开(公告)日:2024-08-13

    申请号:US17877725

    申请日:2022-07-29

    Applicant: Splunk Inc.

    CPC classification number: G06F11/3476 G06F3/0619 G06F2201/81

    Abstract: Ingest health monitoring includes receiving an event stream of events in a data intake and query system to store on at least one storage system and obtaining an event from the event stream. Ingest health monitoring further includes transmitting the event to a selected ingest module queue for the event, updating an output rate indicator counter for the selected ingest module queue when failure to store the event in the ingest module queue occurs, obtaining the event from the selected ingest module queue, processing the event to generate a file for the event, and transmitting the file to the at least one storage system. Ingest health monitoring further includes updating the write failure indicator counter for a storage system of the at least one storage system when failure to transmit to the storage system occurs and updating the user interface based on the output rate indicator counter and the write failure indicator counter.

    Systems and methods for load balancing in a system providing dynamic indexer discovery

    公开(公告)号:US11550829B2

    公开(公告)日:2023-01-10

    申请号:US16353886

    申请日:2019-03-14

    Applicant: Splunk Inc

    Abstract: The present invention is related to a method for providing dynamic indexer discovery. The method comprises receiving, from an index manager, a status indication associated with a plurality of indexers, wherein each of the plurality of indexers indexes events of raw machine-generated data received from a plurality of data collectors. The method further comprises determining a weight associated with each of the plurality of indexers and selecting an indexer from the plurality of indexers. Subsequently, the method comprises allocating data to the indexer in accordance with a respective weight assigned to the indexer and transmitting the allocated data to the indexer.

    DISTRIBUTED TASK ASSIGNMENT IN A CLUSTER COMPUTING SYSTEM

    公开(公告)号:US20220398128A1

    公开(公告)日:2022-12-15

    申请号:US17343508

    申请日:2021-06-09

    Applicant: Splunk Inc.

    Abstract: A processing node selects a first task from a task list and sends, to a task assignment repository, a first write operation with a first task identifier of the first task to assign the first task to the processing node. The processing node detects failure of the first write operation based on the first task already being assigned and selects a second task from the task list. The processing node sends, to the task assignment repository, a second write operation with a second task identifier of the second task to assign the second task to the processing node. The processing node detects success of the second write operation and executes the second task.

    Periodically processing data in files identified using checksums

    公开(公告)号:US10860537B2

    公开(公告)日:2020-12-08

    申请号:US15663652

    申请日:2017-07-28

    Applicant: Splunk Inc.

    Abstract: Embodiments are directed towards managing and tracking item identification of a plurality of items to determine if an item is a new or existing item, where an existing item has been previously processed. In some embodiments, two or more item identifiers may be generated. In one embodiment, generating the two or more item identifiers may include analyzing the item using a small item size characteristic, a compressed item, or for an identifier collision. The two or more item identifiers may be employed to determine if the item is a new or existing item. In one embodiment, the two or more item identifiers may be compared to a record about an existing item to determine if the item is a new or existing item. If the item is an existing item, then the item may be further processed to determine if the existing item has actually changed.

Patent Agency Ranking