-
公开(公告)号:US20170091190A1
公开(公告)日:2017-03-30
申请号:US14869859
申请日:2015-09-29
Applicant: Cisco Technology, Inc.
Inventor: ROBERTO ATTIAS , Alberto Gonzalez Prieto
IPC: G06F17/30
Abstract: A data processing method includes receiving a stream of digital data with a plurality of objects and, in response to receiving an object, tokenizing the object to create a tokenized object, and storing the tokenized object in a token database. The method further includes comparing the tokenized object to a plurality of other tokenized objects stored in the token database, computing a pattern associated with the tokenized object, storing the pattern in a pattern database, and managing a size of the pattern database by identifying, a subset of patterns that are eligible for deletion from the pattern database based on an age of each pattern, ranking each pattern of the subset based on a quality and a popularity metric, identifying, based on the ranking and from the subset, a second pattern and deleting the second pattern from the pattern database to produce an updated database.