Distributed data processing for machine learning

    公开(公告)号:US10922625B2

    公开(公告)日:2021-02-16

    申请号:US15885395

    申请日:2018-01-31

    Applicant: Splunk Inc.

    Abstract: Embodiments of the present invention are directed to facilitating distributed data processing for machine learning. In accordance with aspects of the present disclosure, a set of commands in a query to process at an external computing service is identified. For each command in the set of commands, at least one compute unit including at least one operation to perform at the external computing service is identified. Each of the at least one compute unit associated with each command is analyzed to identify an optimized manner in which to execute the set of commands at the external computing service. An indication of the optimized manner in which to execute the set of commands and a corresponding set of data is provided to the external computing service to utilize for executing the set of commands at the external computing service.

    Clustering events based on extraction rules

    公开(公告)号:US10909140B2

    公开(公告)日:2021-02-02

    申请号:US15276693

    申请日:2016-09-26

    Applicant: SPLUNK INC.

    Abstract: Systems and methods include causing presentation of a first cluster in association with an event of the first cluster, the first cluster from a first set of clusters of events. Each event includes a time stamp and event data. Based on the presentation of the first cluster, an extraction rule corresponding to the event of the first cluster is received from a user. Similarities in the event data between the events are determined based on the received extraction rule. The events are grouped into a second set of clusters based on the determined similarities. Presentation is caused of a second cluster in association with an event of the second cluster, where the second cluster is from the second set of clusters.

    Graphical user interface indicating anomalous events

    公开(公告)号:US11755938B2

    公开(公告)日:2023-09-12

    申请号:US16776302

    申请日:2020-01-29

    Applicant: SPLUNK INC.

    CPC classification number: G06N7/01 G06F3/00 G06N20/00

    Abstract: Methods and systems for determining event probabilities and anomalous events are provided. In one implementation, a method includes: receiving source data, where the source data is configured as a plurality of events with associated timestamps; searching the source data, where the searching provides a search result including N events from the plurality of events, where N is an integer greater than one, where each event of the N events includes a plurality of field values, where at least one event of the N events can include one or more categorical field values and one or more numerical field values; and for an event of the N events, determining a probability of occurrence for each field value of the plurality of field values; and using probabilities determined for the plurality of field values, determining a probability of occurrence for the event.

    DISTRIBUTED DATA PROCESSING FOR MACHINE LEARNING

    公开(公告)号:US20190095817A1

    公开(公告)日:2019-03-28

    申请号:US15885395

    申请日:2018-01-31

    Applicant: Splunk Inc.

    Abstract: Embodiments of the present invention are directed to facilitating distributed data processing for machine learning. In accordance with aspects of the present disclosure, a set of commands in a query to process at an external computing service is identified. For each command in the set of commands, at least one compute unit including at least one operation to perform at the external computing service is identified. Each of the at least one compute unit associated with each command is analyzed to identify an optimized manner in which to execute the set of commands at the external computing service. An indication of the optimized manner in which to execute the set of commands and a corresponding set of data is provided to the external computing service to utilize for executing the set of commands at the external computing service.

Patent Agency Ranking