- Patent Title: Computer system programmed to identify common subsequences in logs
- Application No.: US14869859
- Application Date: 2015-09-29
- Publication No.: US10664481B2
- Publication Date: 2020-05-26
- Inventor: Roberto Attias, Alberto Gonzalez Prieto
- Applicant: Cisco Technology, Inc.
- Applicant Address: US CA San Jose
- Assignee: Cisco Technology, Inc.
- Current Assignee: Cisco Technology, Inc.
- Current Assignee Address: US CA San Jose
- Agency: Lee & Hayes, P.C.
- Main IPC: G06F16/2457
- IPC: G06F16/2457; G06F16/2455; G06F16/17; G06F16/33; G06F16/2458; G06F16/903; G06F16/215; G06F11/34; G06N5/02; G06F17/40; G06F40/211; G06F40/284

Abstract:
A data processing method includes receiving a stream of digital data with a plurality of objects and, in response to receiving an object, tokenizing the object to create a tokenized object and storing the tokenized object in a token database. The method further includes comparing the tokenized object to a plurality of other tokenized objects stored in the token database, computing a pattern associated with the tokenized object, storing the pattern in a pattern database, and managing a size of the pattern database by identifying a subset of patterns that are eligible for deletion from the pattern database based on an age of each pattern, ranking each pattern of the subset based on a quality and a popularity metric, identifying, based on the ranking and from the subset, a second pattern, and deleting the second pattern from the pattern database to produce an updated database.
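The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name here (`tokenize`, `common_subsequence`, `Pattern`, `PatternStore`, `max_patterns`, `min_age`) is hypothetical, and several choices are assumptions the abstract does not specify: whitespace tokenization, the longest common subsequence as the "pattern", pattern length as the quality metric, hit count as the popularity metric, and pruning that repeats until the database fits.

```python
from dataclasses import dataclass


def tokenize(obj: str) -> list[str]:
    """Assumed tokenizer: split a log line on whitespace."""
    return obj.split()


def common_subsequence(a: list[str], b: list[str]) -> list[str]:
    """Longest common subsequence of two token lists (standard dynamic programming)."""
    m, n = len(a), len(b)
    # dp[i][j] = length of the LCS of a[i:] and b[j:]
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m - 1, -1, -1):
        for j in range(n - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    out, i, j = [], 0, 0
    while i < m and j < n:
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out


@dataclass
class Pattern:
    tokens: tuple[str, ...]
    created: float  # timestamp used to compute the pattern's age
    hits: int = 1   # assumed popularity metric: how often the pattern was derived

    @property
    def quality(self) -> int:
        # Assumed quality metric: longer patterns are treated as more specific.
        return len(self.tokens)


class PatternStore:
    def __init__(self, max_patterns: int, min_age: float):
        self.max_patterns = max_patterns
        self.min_age = min_age  # patterns younger than this are not eligible for deletion
        self.token_db: list[list[str]] = []
        self.patterns: dict[tuple[str, ...], Pattern] = {}

    def ingest(self, obj: str, now: float) -> None:
        tokens = tokenize(obj)
        # Compare the new tokenized object against those already stored.
        for other in self.token_db:
            seq = tuple(common_subsequence(tokens, other))
            if not seq:
                continue
            if seq in self.patterns:
                self.patterns[seq].hits += 1
            else:
                self.patterns[seq] = Pattern(seq, created=now)
        self.token_db.append(tokens)
        self.prune(now)

    def prune(self, now: float) -> None:
        while len(self.patterns) > self.max_patterns:
            # Step 1: only patterns that are old enough are eligible.
            eligible = [p for p in self.patterns.values() if now - p.created >= self.min_age]
            if not eligible:
                break
            # Step 2: rank by (quality, popularity) and delete the lowest-ranked one.
            victim = min(eligible, key=lambda p: (p.quality, p.hits))
            del self.patterns[victim.tokens]
```

In this sketch, age acts as a gate rather than a score: a recently computed pattern is never deleted, however low it ranks, so new patterns get time to accumulate hits before they compete with older ones.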
Public/Granted literature
- US20170091190A1 COMPUTER SYSTEM PROGRAMMED TO IDENTIFY COMMON SUBSEQUENCES IN LOGS, Public/Granted day: 2017-03-30