-
公开(公告)号:US20140006405A1
公开(公告)日:2014-01-02
申请号:US14016497
申请日:2013-09-03
申请人: Rishi Bhargava , David P Reese, JR.
发明人: Rishi Bhargava , David P Reese, JR.
IPC分类号: G06F17/30
CPC分类号: G06F17/30705 , G06F17/30017 , H04L41/0893
摘要: A method in one example implementation includes obtaining a plurality of host file inventories corresponding respectively to a plurality of hosts, calculating input data using the plurality of host file inventories, and then providing the input data to a clustering procedure to group the plurality of hosts into one or more clusters of hosts. The method further includes each cluster of hosts being grouped using predetermined similarity criteria. In more specific embodiments, each of the host file inventories includes a set of one or more file identifiers with each file identifier representing a different executable software file on a corresponding one of the plurality of hosts. In other more specific embodiments, calculating the input data includes transforming the host file inventories into a matrix of keyword vectors in Euclidean space. In further embodiments, calculating the input data includes transforming the host file inventories into a similarity matrix.
摘要翻译: 一个示例实现中的方法包括获得分别对应于多个主机的多个主机文件库存,使用多个主机文件库存计算输入数据,然后将输入数据提供给聚类程序以将多个主机分组成 一个或多个主机群集。 该方法还包括使用预定的相似性标准分组的每个主机群。 在更具体的实施例中,每个主机文件库存包括一组一个或多个文件标识符,其中每个文件标识符表示多个主机中对应的一个上的不同的可执行软件文件。 在其他更具体的实施例中,计算输入数据包括将主文件库存变换为欧几里得空间中的关键词向量矩阵。 在另外的实施例中,计算输入数据包括将主机文件库存变换为相似性矩阵。