发明申请
- 专利标题: Code Labeling Based on Tokenized Code Samples
- 专利标题(中): 基于令牌代码示例的代码标签
-
申请号: US14599394申请日: 2015-01-16
-
公开(公告)号: US20160212153A1公开(公告)日: 2016-07-21
- 发明人: Benjamin Livshits , Benjamin G. Zorn , Benjamin Stock
- 申请人: Microsoft Technology Licensing, LLC.
- 主分类号: H04L29/06
- IPC分类号: H04L29/06 ; G06F17/30
摘要:
Disclosed herein are systems and methods for detecting script code malware and generating signatures. A plurality of script code samples are received and transformed into a plurality of tokenized samples. The tokenized samples are based on syntactical elements of the plurality of script code samples. One or more clusters of samples are determined based on similarities in different ones of the plurality of tokenized samples, and known malicious code having a threshold similarity to a representative sample of the cluster of samples is identified. Based on the identifying, the cluster of samples is identified as malicious. Based at least on respective ones of the plurality of tokenized samples associated with the cluster of samples, a generalized code signature usable to identify the script code samples in the cluster of samples is generated.
公开/授权文献
- US10044750B2 Code labeling based on tokenized code samples 公开/授权日:2018-08-07
信息查询