- 专利标题: High volume message classification and distribution
-
申请号: US16225038申请日: 2018-12-19
-
公开(公告)号: US10956672B1公开(公告)日: 2021-03-23
- 发明人: Ron Ben-Natan , Derek Difilippo , Uri Hershenhorn , Roman Krashanitsa , Luigi Labigalini , Ury Segal
- 申请人: Ron Ben-Natan , Derek Difilippo , Uri Hershenhorn , Roman Krashanitsa , Luigi Labigalini , Ury Segal
- 申请人地址: US MA Lexington; US MA Lexington; US MA Lexington; US MA Lexington; US MA Lexington; CA Vancouver
- 专利权人: Ron Ben-Natan,Derek Difilippo,Uri Hershenhorn,Roman Krashanitsa,Luigi Labigalini,Ury Segal
- 当前专利权人: Ron Ben-Natan,Derek Difilippo,Uri Hershenhorn,Roman Krashanitsa,Luigi Labigalini,Ury Segal
- 当前专利权人地址: US MA Lexington; US MA Lexington; US MA Lexington; US MA Lexington; US MA Lexington; CA Vancouver
- 代理机构: Armis IP Law, LLC
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F40/284 ; G06F16/28 ; G06N20/20 ; G06F40/216 ; G06F40/242
摘要:
A log message classifier employs machine learning for identifying a corresponding parser for interpreting the incoming log message and for retraining a classification logic model processing the incoming log messages. Voluminous log messages generate a large amount of data, typically in a text form. Data fields are parseable from the message by a parser that knows a format of the message. The classification logic is trained by a set of messages having a known format for defining groups of messages recognizable by a corresponding parser. The classification logic is defined by a random forest that outputs a corresponding group and confidence value for each incoming message. Groups may be split to define new groups based on a recurring matching tail (latter portion) of the incoming messages. A trend of decreased confidence scores triggers a periodic retraining of the random forest, and may also generate an alert to operators.
信息查询