Resegmenting chunks of data based on source type to facilitate load balancing

    公开(公告)号:US11599396B2

    公开(公告)日:2023-03-07

    申请号:US17237904

    申请日:2021-04-22

    申请人: SPLUNK INC.

    IPC分类号: G06F9/50 G06F3/06

    摘要: Resegmenting chunks of data for load balancing is disclosed. A plurality of first chunks of data is received. The plurality of first chunks of data includes one or more entries that include raw data produced by a component of an information technology environment and that reflects activity in the information technology environment. The plurality of first chunks of data is resegmented into a plurality of second chunks of data based on a source type of the plurality of first chunks. A first subset of the plurality of second chunks of data is distributed to a first indexer of a set of indexers. An occurrence of a trigger event is determined, and in response to the trigger event, a second subset of the plurality of second chunks of data is distributed to a second indexer of the set of indexers.

    Resegmenting chunks of data for efficient load balancing across indexers

    公开(公告)号:US11016821B2

    公开(公告)日:2021-05-25

    申请号:US16703236

    申请日:2019-12-04

    申请人: SPLUNK INC.

    IPC分类号: G06F9/50 G06F3/06

    摘要: Resegmenting chunks of data for load balancing is disclosed. A plurality of first chunks of data is received. The plurality of first chunks of data includes one or more entries that include raw data produced by a component of an information technology environment and that reflects activity in the information technology environment. The plurality of first chunks of data is resegmented into a plurality of second chunks of data based on entry boundaries in at least some of the plurality of first chunks of data. A first subset of the plurality of second chunks of data is distributed to a first indexer of a set of indexers. An occurrence of a trigger event is determined, and in response to the trigger event, a second subset of the plurality of second chunks of data is distributed to a second indexer of the set of indexers.

    Utilizing persistent and non-persistent connections for distributed tasks for performing a job

    公开(公告)号:US10255322B2

    公开(公告)日:2019-04-09

    申请号:US15965278

    申请日:2018-04-27

    申请人: Splunk Inc.

    摘要: A method to assist with processing distributed jobs by retrieving and/or synchronizing supplemental job data. The method includes receiving a request to perform a job and opening a first connection (e.g., persistent connection) between a primary machine and a secondary machine, and transmitting by the primary machine a request pertaining to the job to the secondary machine using a second connection, the job to be performed by the secondary machine. The method also includes receiving by the primary machine using the second connection a task request for supplemental information pertaining to the job, transmitting by the primary machine a task response including the supplemental information to the secondary machine, and receiving a job result for the job using the second connection.

    Utilizing multiple connections for generating a job result

    公开(公告)号:US11252224B2

    公开(公告)日:2022-02-15

    申请号:US16922632

    申请日:2020-07-07

    申请人: Splunk Inc.

    摘要: A method to assist with processing distributed jobs by retrieving and/or synchronizing supplemental job data. The method includes receiving a request pertaining to a job from a first virtualized execution environment using a non-persistent connection between the first virtualized execution environment and a second virtualized execution environment, transmitting, by the secondary machine using a persistent connection between the first virtualized execution environment and the second virtualized execution environment, a task request for supplemental information pertaining to the job, generating a job result for the job based on the supplemental information received from the first virtualized execution environment via the persistent connection, and transmitting, to the first virtualized execution environment, the job result for the job using the non-persistent connection.

    Distributed tasks for retrieving supplemental job information
    6.
    发明授权
    Distributed tasks for retrieving supplemental job information 有权
    用于检索补充工作信息的分布式任务

    公开(公告)号:US09128779B1

    公开(公告)日:2015-09-08

    申请号:US14448048

    申请日:2014-07-31

    申请人: Splunk Inc.

    IPC分类号: G06F9/52 G06F17/30 H04L29/08

    摘要: A method to assist with processing distributed jobs by retrieving and/or synchronizing supplemental job data. The method includes receiving a request to perform a job and opening a long-lived connection and a short-lived connection between a primary machine and a secondary machine, and transmitting by the primary machine a request pertaining to the job to the secondary machine using the short-lived connection, the job to be performed by the secondary machine. The method also includes receiving by the primary machine using the long-lived connection a task request for supplemental information pertaining to the job, transmitting by the primary machine a task response including the supplemental information to the secondary machine, and receiving a job result for the job using the short-lived connection.

    摘要翻译: 一种通过检索和/或同步补充作业数据来协助处理分布式作业的方法。 该方法包括接收执行作业的请求并打开长时间的连接以及主机和次机之间的短暂连接,并且由主机通过使用本机将与作业相关的请求发送到辅助机器 短暂的连接,由二次机执行的作业。 所述方法还包括由所述主机使用所述长寿命连接接收与所述作业有关的补充信息的任务请求,由所述主计算机发送包括所述补充信息的任务响应到所述次机器,以及接收所述辅助机器的作业结果 工作使用短命连接。

    Utilizing persistent and non-persistent connections for generating a job result for a job

    公开(公告)号:US10713245B2

    公开(公告)日:2020-07-14

    申请号:US16296685

    申请日:2019-03-08

    申请人: Splunk Inc.

    摘要: A method to assist with processing distributed jobs by retrieving and/or synchronizing supplemental job data. The method includes receiving a request pertaining to a job from a first virtualized execution environment using a non-persistent connection between the first virtualized execution environment and a second virtualized execution environment, transmitting, by the secondary machine using a persistent connection between the first virtualized execution environment and the second virtualized execution environment, a task request for supplemental information pertaining to the job, generating a job result for the job based on the supplemental information received from the first virtualized execution environment via the persistent connection, and transmitting, to the first virtualized execution environment, the job result for the job using the non-persistent connection.

    Resegmenting chunks of data for efficient load balancing across indexers

    公开(公告)号:US10545798B2

    公开(公告)日:2020-01-28

    申请号:US15420590

    申请日:2017-01-31

    申请人: Splunk Inc.

    IPC分类号: G06F9/50 G06F3/06

    摘要: Resegmenting chunks of data for load balancing is disclosed. A plurality of first chunks of data is received. The plurality of first chunks of data includes one or more entries that include raw data produced by a component of an information technology environment and that reflects activity in the information technology environment. The plurality of first chunks of data is resegmented into a plurality of second chunks of data based on entry boundaries in at least some of the plurality of first chunks of data. A first subset of the plurality of second chunks of data is distributed to a first indexer of a set of indexers. An occurrence of a trigger event is determined, and in response to the trigger event, a second subset of the plurality of second chunks of data is distributed to a second indexer of the set of indexers.