DETECTING NOVEL ASSOCIATIONS IN LARGE DATASETS

    公开(公告)号:US20180349466A1

    公开(公告)日:2018-12-06

    申请号:US15611563

    申请日:2017-06-01

    Abstract: Certain embodiments involve determining and outputting correlations between metrics in large-scale web analytics datasets. For example, a processor identifies pairs of data metrics in a web analytics data set and determines a Maximal Information Coefficient (MIC) score for each pair of data metrics that indicates a strength of a correlation between the pair of data metrics. The processor generates an interactive user interface that graphically displays each pair of correlated data metrics having an MIC score above a threshold and the interactive user interface indicates the strength of the correlation between each displayed pair of correlated data metrics. The processor receives user input indicating an adjustment to the threshold and modifies the interactive user interface in response to receiving the user input by adding pairs of correlated data metrics to, or removing pairs of correlated metrics from, the interactive user interface based on the adjustment to the threshold.

Patent Agency Ranking