Technologies for node-degree based clustering of data sets

    公开(公告)号:US10452717B2

    公开(公告)日:2019-10-22

    申请号:US15272976

    申请日:2016-09-22

    Abstract: Technologies for node-degree based clustering include a computing device to construct a graph that includes multiple vertices corresponding to the data points of a data set. The computing device inserts an edge between each pair of vertices that has a corresponding similarity metric that meets a predetermined threshold similarity metric. The computing device determines a node degree for each vertex in the graph and initializes a cutoff node degree as the lowest node degree of the vertices. The computing device selects a test subset of the graph that includes vertices having a node degree less than or equal to the cutoff node degree. The computing device determines whether the test subset covers the graph and if not increases the cutoff node degree. If the test subset covers the graph, the data points corresponding to the vertices of the test subset are the representative cluster. Other embodiments are described and claimed.

    TECHNOLOGIES FOR NODE-DEGREE BASED CLUSTERING OF DATA SETS

    公开(公告)号:US20180081986A1

    公开(公告)日:2018-03-22

    申请号:US15272976

    申请日:2016-09-22

    CPC classification number: G06F16/9024 G06N5/022 G06N20/00

    Abstract: Technologies for node-degree based clustering include a computing device to construct a graph that includes multiple vertices corresponding to the data points of a data set. The computing device inserts an edge between each pair of vertices that has a corresponding similarity metric that meets a predetermined threshold similarity metric. The computing device determines a node degree for each vertex in the graph and initializes a cutoff node degree as the lowest node degree of the vertices. The computing device selects a test subset of the graph that includes vertices having a node degree less than or equal to the cutoff node degree. The computing device determines whether the test subset covers the graph and if not increases the cutoff node degree. If the test subset covers the graph, the data points corresponding to the vertices of the test subset are the representative cluster. Other embodiments are described and claimed.

Patent Agency Ranking