Identifying an indexing node to process data using a resource catalog

    公开(公告)号:US11892996B1

    公开(公告)日:2024-02-06

    申请号:US16513365

    申请日:2019-07-16

    Applicant: Splunk Inc.

    Abstract: Systems and methods are described for monitoring indexing nodes, populating and maintaining a resource catalog with relevant information, receiving requests for indexing node availability or assignments, identifying indexing nodes that are available to process data, and/or communicating information relating to available indexing nodes. The system can maintain the resource catalog based on communications with each of the containerized indexing nodes. The system can receive, from a partition manager of a data intake and query system, a request for a containerized indexing node that the partition manager can assign to process data received by the partition manager. The system can identify an available containerized indexing node to process the data. The system can communicate, to the partition manager, an indexing node identifier associated with the available containerized indexing node.

    EXTERNALLY DISTRIBUTED BUCKETS FOR EXECUTION OF QUERIES

    公开(公告)号:US20250028698A1

    公开(公告)日:2025-01-23

    申请号:US18414157

    申请日:2024-01-16

    Applicant: Splunk Inc.

    Abstract: A data intake and query system can manage the search of data stored at an external location relative to the data intake and query system using one or more indexers. The data intake and query system can receive data stored at the external location. The data intake and query system can process the data and generate an index using the one or more indexers. The data intake and query system can discard the data and store the index and a location identifier of the external location in one or more buckets. In response to a query, the data intake and query system can identify that at least a subset of the data is responsive to the query using the index and can obtain the at least the subset of the data from the external location using the location identifier.

    Bucket merging for a data intake and query system using size thresholds

    公开(公告)号:US11720537B2

    公开(公告)日:2023-08-08

    申请号:US17661510

    申请日:2022-04-29

    Applicant: Splunk Inc.

    CPC classification number: G06F16/2228 G06F16/14 G06F16/16

    Abstract: Systems and methods are disclosed for scalable bucket merging in a data intake and query system. Various components of a bucket manager can be used to monitor recently-created buckets of data in common storage that are associated with a particular tenant and a particular index, apply a comprehensive bucket merge policy to determine groups of buckets that qualify for merging, merge those group of buckets into merged buckets to be stored in the common storage, and update any information associated with the merged buckets and pre-merged buckets. These components may be shared across multiple tenants, and some of these components may be dynamically scalable based on need. This approach may also provide many additional benefits, including improved search performance from merged buckets, efficient resource utilization associated with discriminate merging, and redundancy in case of component failure.

    Processing data associated with different tenant identifiers

    公开(公告)号:US11416465B1

    公开(公告)日:2022-08-16

    申请号:US16513378

    申请日:2019-07-16

    Applicant: Splunk Inc.

    Abstract: Systems and methods are described for processing incoming data. The system can receive, from a first partition manager of a data intake and query system, first data that is associated with a first identifier, and can receive, from a second partition manager of the data intake and query system, second data that is associated with a second identifier. The system can process the first data and store first results of said processing the first data in one or more first buckets associated with the first tenant identifier. The system can process the second data and store second results of said processing the second data in one or more second buckets associated with the second tenant identifier.

Patent Agency Ranking