Bucket merging for a data intake and query system using size thresholds

    Publication No.: US11720537B2

    Publication date: 2023-08-08

    Application No.: US17661510

    Filing date: 2022-04-29

    Applicant: Splunk Inc.

    CPC classification number: G06F16/2228 G06F16/14 G06F16/16

    Abstract: Systems and methods are disclosed for scalable bucket merging in a data intake and query system. Various components of a bucket manager can be used to monitor recently created buckets of data in common storage that are associated with a particular tenant and a particular index, apply a comprehensive bucket merge policy to determine groups of buckets that qualify for merging, merge those groups of buckets into merged buckets to be stored in the common storage, and update any information associated with the merged buckets and pre-merged buckets. These components may be shared across multiple tenants, and some of these components may be dynamically scalable based on need. This approach may also provide many additional benefits, including improved search performance from merged buckets, efficient resource utilization associated with discriminate merging, and redundancy in case of component failure.
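
The merge flow in this abstract (group buckets by tenant and index, apply a policy, merge the qualifying groups) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `Bucket` fields, the `plan_merges` and `merge_group` names, and the greedy size-threshold policy are hypothetical and are not taken from the patent's claims or from any Splunk API.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Bucket:
    # Hypothetical bucket metadata; field names are illustrative only.
    bucket_id: str
    tenant: str
    index: str
    size_bytes: int


def plan_merges(buckets, max_merged_bytes, min_group_size=2):
    """Return groups of buckets that qualify for merging.

    Assumed policy: buckets are only merged with buckets of the same
    (tenant, index), and a group's total size may not exceed
    max_merged_bytes.
    """
    by_key = defaultdict(list)
    for b in buckets:
        by_key[(b.tenant, b.index)].append(b)

    groups = []
    for key in sorted(by_key):
        current, current_size = [], 0
        for b in by_key[key]:  # keep arrival order within a key
            if b.size_bytes >= max_merged_bytes:
                continue  # already at or above the threshold; leave as-is
            if current and current_size + b.size_bytes > max_merged_bytes:
                # Adding this bucket would overflow the group: close it.
                if len(current) >= min_group_size:
                    groups.append(current)
                current, current_size = [], 0
            current.append(b)
            current_size += b.size_bytes
        if len(current) >= min_group_size:
            groups.append(current)
    return groups


def merge_group(group):
    """Build the metadata record for a merged bucket (no data is moved here)."""
    first = group[0]
    return Bucket(
        bucket_id="merged(" + "+".join(b.bucket_id for b in group) + ")",
        tenant=first.tenant,
        index=first.index,
        size_bytes=sum(b.size_bytes for b in group),
    )


buckets = [
    Bucket("a", "t1", "main", 40),
    Bucket("b", "t1", "main", 50),
    Bucket("c", "t1", "main", 30),
    Bucket("d", "t2", "main", 20),
    Bucket("e", "t1", "main", 200),
]
groups = plan_merges(buckets, max_merged_bytes=100)
merged = [merge_group(g) for g in groups]
```

In this sketch, single leftover buckets are not merged, which is one simple way to model merging only where it pays off; a real policy would likely weigh more signals, such as bucket age or time range.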

    Processing data associated with different tenant identifiers

    Publication No.: US11416465B1

    Publication date: 2022-08-16

    Application No.: US16513378

    Filing date: 2019-07-16

    Applicant: Splunk Inc.

    Abstract: Systems and methods are described for processing incoming data. The system can receive, from a first partition manager of a data intake and query system, first data that is associated with a first tenant identifier, and can receive, from a second partition manager of the data intake and query system, second data that is associated with a second tenant identifier. The system can process the first data and store first results of said processing in one or more first buckets associated with the first tenant identifier. The system can process the second data and store second results of said processing in one or more second buckets associated with the second tenant identifier.
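
As a rough illustration of the per-tenant separation described in this abstract, the following Python sketch processes incoming batches and stores each result under the tenant identifier it arrived with. The function names, the batch shape, and the stand-in `process_record` step are all hypothetical; the abstract does not specify what processing is applied or how buckets are represented.

```python
from collections import defaultdict


def process_record(raw):
    # Stand-in for real processing (parsing, indexing, ...); purely illustrative.
    return raw.strip().lower()


def route_to_tenant_buckets(batches):
    """Process each batch and keep results separated by tenant identifier.

    `batches` is an iterable of (tenant_id, records) pairs, e.g. one pair per
    delivery from a partition manager. Results for different tenants never
    share a bucket.
    """
    buckets = defaultdict(list)
    for tenant_id, records in batches:
        for raw in records:
            buckets[tenant_id].append(process_record(raw))
    return dict(buckets)


result = route_to_tenant_buckets([
    ("tenant-a", [" GET /a "]),
    ("tenant-b", ["POST /b"]),
    ("tenant-a", ["PUT /c"]),
])
```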

    Scalable bucket merging for a data intake and query system

    Publication No.: US11334543B1

    Publication date: 2022-05-17

    Application No.: US16657924

    Filing date: 2019-10-18

    Applicant: Splunk Inc.

    Abstract: Systems and methods are disclosed for scalable bucket merging in a data intake and query system. Various components of a bucket manager can be used to monitor recently created buckets of data in common storage that are associated with a particular tenant and a particular index, apply a comprehensive bucket merge policy to determine groups of buckets that qualify for merging, merge those groups of buckets into merged buckets to be stored in the common storage, and update any information associated with the merged buckets and pre-merged buckets. These components may be shared across multiple tenants, and some of these components may be dynamically scalable based on need. This approach may also provide many additional benefits, including improved search performance from merged buckets, efficient resource utilization associated with discriminate merging, and redundancy in case of component failure.
