Data field extraction by a data intake and query system

    公开(公告)号:US12205022B2

    公开(公告)日:2025-01-21

    申请号:US16945415

    申请日:2020-07-31

    Applicant: Splunk Inc.

    Abstract: Systems and methods are described for extracting data fields from logs ingested in a data processing pipeline or otherwise stored. For example, a log can be applied as an input to an artificial intelligence model trained to infer a log sourcetype of logs, and the artificial intelligence model can output an inferred log sourcetype of the log. The inferred log sourcetype can be used to select another artificial intelligence model trained to extract data fields from logs having the inferred log sourcetype, and the log can then be applied as an input to the other artificial intelligence model. The other artificial intelligence model may then output one or more data fields extracted from the log.

    Swappable online machine learning algorithms implemented in a data intake and query system

    公开(公告)号:US11615102B2

    公开(公告)日:2023-03-28

    申请号:US16779509

    申请日:2020-01-31

    Applicant: Splunk Inc.

    Inventor: Ram Sriharsha

    Abstract: Systems and methods are described for testing one or more machine learning algorithms in parallel with an existing machine learning algorithm implemented within a data processing pipeline. Each machine learning algorithm can train a machine learning model that receives a live stream of raw machine data. The output of the machine learning model trained by the existing machine learning algorithm may be written to an external storage system, but the output of the machine learning model(s) trained by the test machine learning algorithm(s) may not be written to an external storage system. After some time, performance of the test machine learning algorithm(s) and the existing machine learning algorithm is evaluated. If the test machine learning algorithm performs better than the existing machine learning algorithm, then the machine learning algorithms can be swapped without any downtime and without needed to re-train a machine learning model using previously seen raw machine data.

    Systems and methods for integration of multiple programming languages within a pipelined search query

    公开(公告)号:US11567735B1

    公开(公告)日:2023-01-31

    申请号:US17074280

    申请日:2020-10-19

    Applicant: SPLUNK Inc.

    Abstract: According to one embodiment, a method that supports queries deploying operators based on multiple programming languages is described. A sequence of operators associated with a query is identified, where the sequence of operators includes at least two neighboring operators including a first operator based on a first programming language and a second operator based on a second programming language that is different from the first programming language. Thereafter, a schema associated with the first operator and a schema associated with the second operator is determined along with the compatibility between the schema of the first operator and the schema of the second operator. A query error message is generated in response to incompatibility between the first operator schema and the second operator schema. Compatibility is determined when an output generated by execution of the first operator provides machine data needed as input for execution of the second operator.

Patent Agency Ranking