BAYESIAN TREE AGGREGATION IN DECISION FORESTS TO INCREASE DETECTION OF RARE MALWARE

    公开(公告)号:US20190297105A1

    公开(公告)日:2019-09-26

    申请号:US16437417

    申请日:2019-06-11

    Abstract: In one embodiment, a computing device provides a feature vector as input to a random decision forest comprising a plurality of decision trees trained using a training dataset, each decision tree being configured to output a classification label prediction for the input feature vector. For each of the decision trees, the computing device determines a conditional probability of the decision tree based on a true classification label and the classification label prediction from the decision tree for the input feature vector. The computing device generates weightings for the classification label predictions from the decision trees based on the determined conditional probabilities. The computing device applies a final classification label to the feature vector based on the weightings for the classification label predictions from the decision trees.

    SUPERVISED LEARNING SYSTEM
    22.
    发明申请

    公开(公告)号:US20190258965A1

    公开(公告)日:2019-08-22

    申请号:US15901915

    申请日:2018-02-22

    Abstract: In one embodiment, a method including accessing a trained classifier, the trained classifier trained based at least on a first data item and including both decision determination information of the first data item and decision explanation information of at least one second data item, the second data item being distinct from the first data item; receiving an item for classification; using the trained classifier to classify the item for classification; and providing item decision information regarding a reason for classifying the item for classification, the item decision information being based on at least a part of the decision explanation information. Other embodiments are also described.

    ASSESSING DETECTABILITY OF MALWARE RELATED TRAFFIC

    公开(公告)号:US20190253442A1

    公开(公告)日:2019-08-15

    申请号:US15895072

    申请日:2018-02-13

    CPC classification number: H04L63/1425 G06N20/00

    Abstract: In one embodiment, a computing device trains a multi-class classifier (having a plurality of classes) on a training dataset, and evaluates the multi-class classifier on a testing dataset to determine a performance of each of the plurality of classes. The plurality of classes may then be partitioned into either learnable or unlearnable based on whether the performance each particular class surpasses a particular threshold, and then a predicting classifier can be trained on the training dataset, where data of the training dataset is labelled as either learnable or unlearnable based on the particular class to which the data corresponds. Accordingly, the computing device may then use the predicting classifier on a new class to predict whether samples associated with the new class are learnable or unlearnable, and may retrain the multi-class classifier with the samples associated with the new class in response to predicting that the samples are learnable.

    Bayesian tree aggregation in decision forests to increase detection of rare malware

    公开(公告)号:US10356117B2

    公开(公告)日:2019-07-16

    申请号:US15648563

    申请日:2017-07-13

    Abstract: In one embodiment, a computing device provides a feature vector as input to a random decision forest comprising a plurality of decision trees trained using a training dataset, each decision tree being configured to output a classification label prediction for the input feature vector. For each of the decision trees, the computing device determines a conditional probability of the decision tree based on a true classification label and the classification label prediction from the decision tree for the input feature vector. The computing device generates weightings for the classification label predictions from the decision trees based on the determined conditional probabilities. The computing device applies a final classification label to the feature vector based on the weightings for the classification label predictions from the decision trees.

    BAYESIAN TREE AGGREGATION IN DECISION FORESTS TO INCREASE DETECTION OF RARE MALWARE

    公开(公告)号:US20190020670A1

    公开(公告)日:2019-01-17

    申请号:US15648563

    申请日:2017-07-13

    Abstract: In one embodiment, a computing device provides a feature vector as input to a random decision forest comprising a plurality of decision trees trained using a training dataset, each decision tree being configured to output a classification label prediction for the input feature vector. For each of the decision trees, the computing device determines a conditional probability of the decision tree based on a true classification label and the classification label prediction from the decision tree for the input feature vector. The computing device generates weightings for the classification label predictions from the decision trees based on the determined conditional probabilities. The computing device applies a final classification label to the feature vector based on the weightings for the classification label predictions from the decision trees.

    DETECTING USER BEHAVIOR ACTIVITIES OF INTEREST IN A NETWORK

    公开(公告)号:US20180375884A1

    公开(公告)日:2018-12-27

    申请号:US15629906

    申请日:2017-06-22

    Abstract: A user behavior activity detection method is provided in which network traffic relating to user behavior activities in a network is monitored. Data is stored representing network traffic within a plurality of time periods, each of the time periods serving as a transaction. Subsets of the network traffic in the transactions are identified as traffic suspected of relating to certain user behavior activities. The subsets of the network traffic in the transactions are assigned into one or more groups. A determination is made of one or more detection rules for each of the one or more groups based on identifying, for each of the groups, a number of user behavior activities common to each of the subsets of the network traffic. The one or more detection rules are used to monitor future network traffic in the network to detect occurrence of the certain user behavior activities.

    Identifying Malware Communications with DGA Generated Domains by Discriminative Learning
    27.
    发明申请
    Identifying Malware Communications with DGA Generated Domains by Discriminative Learning 有权
    通过歧视性学习识别与DGA生成的域的恶意软件通信

    公开(公告)号:US20170026390A1

    公开(公告)日:2017-01-26

    申请号:US14806236

    申请日:2015-07-22

    Abstract: Techniques are presented to identify malware communication with domain generation algorithm (DGA) generated domains. Sample domain names are obtained and labeled as DGA domains, non-DGA domains or suspicious domains. A classifier is trained in a first stage based on the sample domain names. Sample proxy logs including proxy logs of DGA domains and proxy logs of non-DGA domains are obtained to train the classifier in a second stage based on the plurality of sample domain names and the plurality of sample proxy logs. Live traffic proxy logs are obtained and the classifier is tested by classifying the live traffic proxy logs as DGA proxy logs, and the classifier is forwarded to a second computing device to identify network communication of a third computing device as malware network communication with DGA domains via a network interface unit of the third computing device based on the trained and tested classifier.

    Abstract translation: 提出技术来识别与域生成算法(DGA)生成域的恶意软件通信。 获取样品域名并标记为DGA域,非DGA域或可疑域。 分类器在第一阶段根据样本域名进行培训。 获得包括DGA域的代理日志和非DGA域的代理日志的示例代理日志,以在第二阶段中基于多个示例域名和多个示例代理日志来训练分类器。 获取实时流量代理日志,并通过将实时流量代理日志分类为DGA代理日志来测试分类器,并将分类器转发到第二计算设备,以将第三计算设备的网络通信识别为与DGA域的恶意软件网络通信,通过 基于经过训练和测试的分类器的第三计算设备的网络接口单元。

Patent Agency Ranking