-
公开(公告)号:US11995079B2
公开(公告)日:2024-05-28
申请号:US18181900
申请日:2023-03-10
申请人: Splunk Inc.
发明人: Sourav Pal , Arindam Bhattacharjee
IPC分类号: G06F16/00 , G06F16/17 , G06F16/22 , G06F16/242 , G06F16/2453 , G06F16/2458 , G06F16/25
CPC分类号: G06F16/24535 , G06F16/1734 , G06F16/22 , G06F16/2425 , G06F16/24545 , G06F16/2471 , G06F16/258
摘要: Systems and methods are disclosed for receiving, at a data intake and query system, a query that includes an indication to process data managed by a third-party data storage and processing system that supports a different query language than the data intake and query system. The data intake and query system identifies a third-party data storage and processing system that manages the data to be processed and generates a subquery for execution by the third-party data storage and processing system, generates instructions for one or more worker nodes to receive and process results of the subquery from the third-party data storage and processing system, and instructs the worker nodes to provide results of the processing to the data intake and query system.
-
公开(公告)号:US11797618B2
公开(公告)日:2023-10-24
申请号:US17810165
申请日:2022-06-30
申请人: Splunk Inc.
发明人: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC分类号: G06F16/00 , G06F16/951 , G06F16/21 , G06F16/25 , G06F16/904 , G06F16/901 , G06F16/9038 , G06F16/903 , G06F16/248 , G06F16/2458 , G06F16/27 , G06F16/2455
CPC分类号: G06F16/951 , G06F16/211 , G06F16/212 , G06F16/248 , G06F16/2455 , G06F16/2471 , G06F16/252 , G06F16/258 , G06F16/27 , G06F16/904 , G06F16/9024 , G06F16/9038 , G06F16/90335
摘要: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
-
公开(公告)号:US20230147068A1
公开(公告)日:2023-05-11
申请号:US18051481
申请日:2022-10-31
申请人: Splunk Inc.
发明人: Balaji Rao , Jindrich Dinga , Kieran Cairney , Manuel Martinez , Nitilaksha Halakatti , Ningxuan He , Arindam Bhattacharjee , Sourav Pal , Alexandros Batsakis
IPC分类号: G06F16/2453 , G06F16/2458 , H04L9/08 , H04L41/0806 , H04L67/10 , G06F8/61 , H04L67/52
CPC分类号: G06F16/24547 , G06F16/2465 , H04L9/0866 , H04L41/0806 , H04L67/10 , G06F8/61 , H04L67/52
摘要: Systems and methods are described for establishing and managing components of a distributed computing framework implemented in a data intake and query system. The distributed computing framework may include a master and a plurality of worker nodes. The master may selectively operate on a search head captain that is chosen from the search heads of the data intake and query system. The search head captain may distribute configuration information for the master and the distributed computing framework to the other search heads, which in turn, may distribute that configuration information to indexers of the data intake and query system. Worker nodes may be selectively activated for operation on the indexers based on the configuration information, and the worker nodes may additionally use the configuration information to contact the master and join the distributed computing framework. This approach may provide numerous benefits, including improved security, flexibility in the selection of worker nodes, and redundancy for failures of physical components of the data intake and query system.
-
公开(公告)号:US11593377B2
公开(公告)日:2023-02-28
申请号:US16397922
申请日:2019-04-29
申请人: Splunk Inc.
IPC分类号: G06F16/00 , G06F16/2455 , G06F16/2458
摘要: Systems and methods are described for assigning a processing task from one component of a data intake and query system to a different component of the data intake and query system. As part of processing a query, the system can determine that a particular processing task is to be executed by a particular component of the data intake and query system. Based on the characteristics of the component that is to execute the processing task, the system can assign the task or a supplemental task to one or more other components of the data intake and query system.
-
公开(公告)号:US20220405331A1
公开(公告)日:2022-12-22
申请号:US17810165
申请日:2022-06-30
申请人: Splunk Inc.
发明人: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC分类号: G06F16/951 , G06F16/21 , G06F16/25 , G06F16/904 , G06F16/901 , G06F16/9038 , G06F16/903 , G06F16/248 , G06F16/2458 , G06F16/27 , G06F16/2455
摘要: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
-
公开(公告)号:US20220327125A1
公开(公告)日:2022-10-13
申请号:US17658792
申请日:2022-04-11
申请人: Splunk Inc.
发明人: Sourav Pal , Arindam Bhattacharjee , Nikhil Roy
IPC分类号: G06F16/2453 , G06F16/242 , G06F16/2458 , G06F16/17
摘要: Systems and methods are described for scheduling a query for execution. The system receives and parses a query to identify one or more portions of the query. The system determines a resource allocation for each portion of the query, and determines an availability of compute resources for the different portions of the query. Based on the resource allocation and the availability of compute resources, the system schedules the query.
-
公开(公告)号:US11461334B2
公开(公告)日:2022-10-04
申请号:US15665197
申请日:2017-07-31
申请人: Splunk Inc.
IPC分类号: G06F16/2455 , H04L43/08 , G06F11/20 , H04L43/12 , H04L69/22 , H04L67/1097 , G06F16/27 , G06F16/951 , G06F16/2458 , G06F16/903 , H04L43/028 , H04L43/00 , G06F3/06 , G06F11/34
摘要: Systems and methods are disclosed for processing queries against one or more dataset sources utilizing dynamically allocated partitions operating on one or more worker nodes. The results of the processing are stored in a dataset destination. The queries can identify data in the one or more dataset sources for processing and a manner for processing the data. In addition, the queries can identify the dataset destination for storing results of the query. To process the query, a query coordinator can dynamically allocate partitions operating on worker nodes to retrieve data for processing, process the data, and communicate the data to the dataset sources. In addition, the query coordinator can dynamically allocate partitions based on an identification of the dataset destination.
-
公开(公告)号:US11341131B2
公开(公告)日:2022-05-24
申请号:US16398031
申请日:2019-04-29
申请人: Splunk Inc.
发明人: Sourav Pal , Arindam Bhattacharjee , Nikhil Roy
IPC分类号: G06F16/00 , G06F16/2453 , G06F16/242 , G06F16/2458 , G06F16/17
摘要: Systems and methods are described for scheduling a query for execution. The system receives and parses a query to identify one or more portions of the query. The system determines a resource allocation for each portion of the query, and determines an availability of compute resources for the different portions of the query. Based on the resource allocation and the availability of compute resources, the system schedules the query.
-
公开(公告)号:US11294941B1
公开(公告)日:2022-04-05
申请号:US16000688
申请日:2018-06-05
申请人: Splunk Inc.
IPC分类号: G06F16/00 , G06F16/31 , G06F16/38 , G06F16/951 , G06F16/33
摘要: Systems and methods are described for preprocessing data later ingested into an indexing system. The preprocessing can include receiving messages published to a first publish-subscribe messaging system, the messages containing raw machine data generated by one or more components in an information technology environment, performing one or more processing operations on at least some of the messages to generate preprocessed messages, republishing the preprocessed messages to a second publish-subscribe messaging system, and providing to the indexing system, a subset of the messages from the second publish-subscribe messaging system.
-
公开(公告)号:US11243963B2
公开(公告)日:2022-02-08
申请号:US16051223
申请日:2018-07-31
申请人: Splunk Inc.
发明人: Sourav Pal , Arindam Bhattacharjee
IPC分类号: G06F17/00 , G06F16/2458 , H04L29/08 , G06F16/25 , G06F16/9032 , G06F16/903
摘要: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed, and generates a subquery for the external data system indicating that the results of the subquery are to be sent to multiple worker nodes. The system also generates instructions for multiple worker nodes to receive and process results of the subquery from the external data system.
-
-
-
-
-
-
-
-
-