Patent search ap:("Cisco Technology Page Inc.") AND inv:"Jan Brabec"

1.

发明授权
Distributed random forest training with a predictor trained to balance tasks 有权

公开(公告)号：US11625640B2

公开(公告)日：2023-04-11

申请号：US16152578

申请日：2018-10-05

Applicant: Cisco Technology, Inc.

Inventor： Radek Starosta , Jan Brabec , Lukas Machlica

IPC: G06N20/00 , G06N7/00 , G06N5/00

Abstract: In one embodiment, a device distributes sets of training records from a training dataset for a random forest-based classifier among a plurality of workers of a computing cluster. Each worker determines whether it can perform a node split operation locally on the random forest by comparing a number of training records at the worker to a predefined threshold. The device determines, for each of the split operations, a data size and entropy measure of the training records to be used for the split operation. The device applies a machine learning-based predictor to the determined data size and entropy measure of the training records to be used for the split operation, to predict its completion time. The device coordinates the workers of the computing cluster to perform the node split operations in parallel such that the node split operations in a given batch are grouped based on their predicted completion times.

2.

发明申请
DEVICE DETECTION IN NETWORK TELEMETRY WITH TLS FINGERPRINTING 有权

公开(公告)号：US20210152526A1

公开(公告)日：2021-05-20

申请号：US16686364

申请日：2019-11-18

Applicant: Cisco Technology, Inc.

Inventor： Jan Kohout , Martin Kopp , Jan Brabec , Lukas Bajer

IPC: H04L29/06 , H04L12/26

Abstract: In one embodiment, a traffic analysis service obtains telemetry data regarding encrypted traffic associated with a particular device in the network, wherein the telemetry data comprises Transport Layer Security (TLS) features of the traffic. The service determines, based on the TLS features from the obtained telemetry data, a set of one or more TLS fingerprints for the traffic associated with the particular device. The service calculates a measure of similarity between the set of one or more TLS fingerprints for the traffic associated with the particular device and a set of one or more TLS fingerprints of traffic associated with a second device. The service determines, based on the measure of similarity, that the particular device and the second device were operated by the same user.

3.

发明授权
Scalable training of random forests for high precise malware detection 有权

公开(公告)号：US10885469B2

公开(公告)日：2021-01-05

申请号：US15722412

申请日：2017-10-02

Applicant: Cisco Technology, Inc.

Inventor： Jan Brabec , Lukas Machlica

IPC: G06N20/00 , G06F21/56 , G06N5/04 , G06K9/62 , G06N5/02 , H04L29/06 , G06N5/00 , G06N20/20

Abstract: In one embodiment, a device trains a machine learning-based malware classifier using a first randomly selected subset of samples from a training dataset. The classifier comprises a random decision forest. The device identifies, using at least a portion of the training dataset as input to the malware classifier, a set of misclassified samples from the training dataset that the malware classifier misclassifies. The device retrains the malware classifier using a second randomly selected subset of samples from the training dataset and the identified set of misclassified samples. The device adjusts prediction labels of individual leaves of the random decision forest of the retrained malware classifier based in part on decision changes in the forest that result from assessing the entire training dataset with the classifier. The device sends the malware classifier with the adjusted prediction labels for deployment into a network.

4.

发明授权
Bayesian tree aggregation in decision forests to increase detection of rare malware 有权

公开(公告)号：US10728271B2

公开(公告)日：2020-07-28

申请号：US16437417

申请日：2019-06-11

Applicant: Cisco Technology, Inc.

Inventor： Jan Brabec , Lukas Machlica

IPC: G06F21/55 , G06N7/00 , H04L29/06 , G06N20/00 , G06N5/00 , G06N20/20

Abstract: In one embodiment, a computing device provides a feature vector as input to a random decision forest comprising a plurality of decision trees trained using a training dataset, each decision tree being configured to output a classification label prediction for the input feature vector. For each of the decision trees, the computing device determines a conditional probability of the decision tree based on a true classification label and the classification label prediction from the decision tree for the input feature vector. The computing device generates weightings for the classification label predictions from the decision trees based on the determined conditional probabilities. The computing device applies a final classification label to the feature vector based on the weightings for the classification label predictions from the decision trees.

5.

发明公开
LEARNING OF MALICIOUS BEHAVIOR VOCABULARY AND THREAT DETECTION 审中-公开

公开(公告)号：US20240106836A1

公开(公告)日：2024-03-28

申请号：US18225517

申请日：2023-07-24

Applicant: Cisco Technology, Inc.

Inventor： Petr Somol , Martin Kopp , Jan Kohout , Jan Brabec , Marc René Jacques Marie Dupont , Cenek Skarda , Lukas Bajer , Danila Khikhlukha

IPC: H04L9/40 , G06N3/045 , G06N3/08

CPC classification number: H04L63/14 , G06N3/045 , G06N3/08

Abstract: In one embodiment, a device obtains input features for a neural network-based model. The device pre-defines a set of neurons of the model to represent known behaviors associated with the input features. The device constrains weights for a plurality of outputs of the model. The device trains the neural network-based model using the constrained weights for the plurality of outputs of the model and by excluding the pre-defined set of neurons from updates during the training.

6.

发明公开
MULTIPLE INSTANCE LEARNING MODELS FOR CYBERSECURITY USING JAVASCRIPT OBJECT NOTATION (JSON) TRAINING DATA 审中-公开

公开(公告)号：US20230376836A1

公开(公告)日：2023-11-23

申请号：US17749740

申请日：2022-05-20

Applicant: Cisco Technology, Inc.

Inventor： Tomas Komarek , Stepan Dvorak , Jan Brabec

IPC: G06N20/00 , H04L9/40

CPC classification number: G06N20/00 , H04L63/1441

Abstract: Techniques and architecture are described for converting tree structured data such as, for example, JavaScript Object Notation (JSON) data, into multiple feature vectors to train multiple instance learning (MIL) models for providing cybersecurity in networks. In particular, a data set is provided, wherein the data set comprises a sample configured as a hierarchal tree. The sample is converted into a set of path and value pairs, e.g., flattened into a set of path and value pairs, where the path is a sequence of field names and array indices encoding a position of a value. Each path and value pair of the set of path and value pairs is converted into a respective feature vector to form a set of feature vectors. The set of feature vectors is used to train a multiple instance learning (MIL) model, wherein each feature vector has a same, fixed length.

7.

发明授权
Malware detection using inverse imbalance subspace searching 有权

公开(公告)号：US11799904B2

公开(公告)日：2023-10-24

申请号：US17117942

申请日：2020-12-10

Applicant: Cisco Technology, Inc.

Inventor： Tomas Komarek , Jan Brabec , Cenek Skarda

IPC: H04L9/40

CPC classification number: H04L63/1466 , H04L63/1416 , H04L63/1425 , H04L63/1433 , H04L63/20

Abstract: Inverse imbalance subspace searching techniques are used to detect potential malware among samples of network communication data. A large number of samples of network communication data, such as proxy log data and/or network flows, are received and analyzed by a malware detection system. A number of the samples are associated with known malware, while other unlabeled samples are either benign or may be associated with unknown malware. An inverse imbalance subspace search may be performed, in which the sample sets are divided into subsets based on random feature thresholds, and each subset is evaluated based on the ratio of known malware samples to unlabeled samples. Unlabeled samples within subsets having high malware sample ratios may be identified, aggregated, and processed as potential malware.

8.

发明授权
Learning of malicious behavior vocabulary and threat detection through behavior matching 有权

公开(公告)号：US11750621B2

公开(公告)日：2023-09-05

申请号：US16831197

申请日：2020-03-26

Applicant: Cisco Technology, Inc.

Inventor： Petr Somol , Martin Kopp , Jan Kohout , Jan Brabec , Marc René Jacques Marie Dupont , Cenek Skarda , Lukas Bajer , Danila Khikhlukha

IPC: H04L9/40 , G06N3/08 , G06N3/045

CPC classification number: H04L63/14 , G06N3/045 , G06N3/08

Abstract: In one embodiment, a device obtains input features for a neural network-based model. The device pre-defines a set of neurons of the model to represent known behaviors associated with the input features. The device constrains weights for a plurality of outputs of the model. The device trains the neural network-based model using the constrained weights for the plurality of outputs of the model and by excluding the pre-defined set of neurons from updates during the training.

9.

发明申请
LEARNING OF MALICIOUS BEHAVIOR VOCABULARY AND THREAT DETECTION THROUGH BEHAVIOR MATCHING 有权

公开(公告)号：US20210306350A1

公开(公告)日：2021-09-30

申请号：US16831197

申请日：2020-03-26

Applicant: Cisco Technology, Inc.

Inventor： Petr Somol , Martin Kopp , Jan Kohout , Jan Brabec , Marc René Jacques Marie Dupont , Cenek Skarda , Lukas Bajer , Danila Khikhlukha

IPC: H04L29/06 , G06N3/08 , G06N3/04

Abstract: In one embodiment, a device obtains input features for a neural network-based model. The device pre-defines a set of neurons of the model to represent known behaviors associated with the input features. The device constrains weights for a plurality of outputs of the model. The device trains the neural network-based model using the constrained weights for the plurality of outputs of the model and by excluding the pre-defined set of neurons from updates during the training.

10.

发明申请
SCALABLE TRAINING OF RANDOM FORESTS FOR HIGH PRECISE MALWARE DETECTION 审中-公开

公开(公告)号：US20190102337A1

公开(公告)日：2019-04-04

申请号：US15722412

申请日：2017-10-02

Applicant: Cisco Technology, Inc.

Inventor： Jan Brabec , Lukas Machlica

IPC: G06F15/18 , G06F21/56 , G06N99/00 , H04L29/06 , G06K9/62 , G06N5/02 , G06N5/04

Abstract: In one embodiment, a device trains a machine learning-based malware classifier using a first randomly selected subset of samples from a training dataset. The classifier comprises a random decision forest. The device identifies, using at least a portion of the training dataset as input to the malware classifier, a set of misclassified samples from the training dataset that the malware classifier misclassifies. The device retrains the malware classifier using a second randomly selected subset of samples from the training dataset and the identified set of misclassified samples. The device adjusts prediction labels of individual leaves of the random decision forest of the retrained malware classifier based in part on decision changes in the forest that result from assessing the entire training dataset with the classifier. The device sends the malware classifier with the adjusted prediction labels for deployment into a network.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification