Abstract:
A method and apparatus for quantitatively measuring differences between portions of a multivariate, multi-dimensional sample distribution may comprise: summarizing the data by dividing it into clusters, each having a signature representative of the position of the cluster and the fraction of the entire distribution within the cluster; matching each of a plurality of first (supplier) signatures to a respective one of a plurality of second (receiver) signatures using a cost factor indicative of the separation between first signature elements and second signature elements; and determining a measurement of the work required to transform the first signature into the second signature. The step of determining the measurement of work may comprise applying the earth mover's distance (“EMD”) algorithm between the first signature, or elements of the first signature, and the respective second signature, or elements of the respective second signature.
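The work measurement described in this abstract can be illustrated with a minimal sketch of the earth mover's distance between two signatures. The sketch is an illustration under stated assumptions, not the claimed implementation: a signature is taken to be a list of `(position, weight)` pairs, the cost factor is assumed to be the Euclidean distance between cluster positions, and the transportation problem is solved with a simple successive-shortest-path min-cost flow. The function name `emd` and the data layout are hypothetical.

```python
import math

def emd(suppliers, receivers):
    """Earth mover's distance between two signatures.

    Each signature is a list of (position_tuple, weight) pairs.
    Assumption: the cost factor is the Euclidean distance between positions.
    """
    n, m = len(suppliers), len(receivers)
    src, sink, size = n + m, n + m + 1, n + m + 2
    edges = []                       # [to, residual capacity, cost]; k and k ^ 1 are paired
    adj = [[] for _ in range(size)]

    def add_edge(u, v, cap, cost):
        adj[u].append(len(edges)); edges.append([v, cap, cost])
        adj[v].append(len(edges)); edges.append([u, 0.0, -cost])

    for i, (p, w) in enumerate(suppliers):
        add_edge(src, i, w, 0.0)
        for j, (q, _) in enumerate(receivers):
            add_edge(i, n + j, math.inf, math.dist(p, q))
    for j, (_, w) in enumerate(receivers):
        add_edge(n + j, sink, w, 0.0)

    eps = 1e-12
    total_flow = total_work = 0.0
    while True:
        # Bellman-Ford: cheapest augmenting path in the residual graph.
        dist = [math.inf] * size
        via = [-1] * size
        dist[src] = 0.0
        for _ in range(size - 1):
            changed = False
            for u in range(size):
                if dist[u] == math.inf:
                    continue
                for k in adj[u]:
                    v, cap, cost = edges[k]
                    if cap > eps and dist[u] + cost < dist[v] - eps:
                        dist[v], via[v], changed = dist[u] + cost, k, True
            if not changed:
                break
        if dist[sink] == math.inf:
            break                    # no more mass can be moved
        # Find the bottleneck along the path, then push that much flow.
        push, v = math.inf, sink
        while v != src:
            push = min(push, edges[via[v]][1])
            v = edges[via[v] ^ 1][0]
        v = sink
        while v != src:
            k = via[v]
            edges[k][1] -= push
            edges[k ^ 1][1] += push
            v = edges[k ^ 1][0]
        total_flow += push
        total_work += push * dist[sink]
    # Normalize the work by the amount of mass actually moved.
    return total_work / total_flow if total_flow > 0 else 0.0

first = [((0.0, 0.0), 0.5), ((1.0, 0.0), 0.5)]
second = [((0.0, 1.0), 0.5), ((1.0, 1.0), 0.5)]
print(emd(first, second))
```

In this example each cluster of the first signature sits one unit away from its counterpart in the second, so the normalized work is one unit. When the total weights differ, the sketch moves only as much mass as the lighter signature holds.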
Abstract:
Some embodiments provide methods, systems, and computer-readable media that employ adaptive binning and dissimilarity scores based on a quadratic form distance to match clusters in multidimensional data corresponding to different samples. Some embodiments provide methods, systems, and computer-readable media for rendering a first interactive display that includes a two-dimensional plot of at least a portion of a multidimensional data set, and a corresponding second interactive display that includes a plurality of single-parameter charts or histograms, each displaying information corresponding to one-dimensional measurements of a different parameter in the multidimensional data set.
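The cluster matching described in this abstract can be sketched as follows. This is a minimal illustration under assumptions the abstract does not state: each cluster is summarized by adaptive bins given as `(bin_center, weight)` pairs, the bin-similarity matrix uses a Gaussian kernel `exp(-alpha * d^2)` on the Euclidean distance between bin centers, and each cluster is matched to the cluster in the other sample with the lowest dissimilarity score. The names `quadratic_form_distance`, `match_clusters`, and `alpha` are hypothetical.

```python
import math

def quadratic_form_distance(bins_a, bins_b, alpha=1.0):
    """Quadratic form distance sqrt(w^T A w) over the union of both bin sets.

    bins_a, bins_b: lists of (center_tuple, weight) pairs.
    Assumption: A[i][j] = exp(-alpha * ||c_i - c_j||^2).
    """
    centers = [c for c, _ in bins_a] + [c for c, _ in bins_b]
    # Signed weights: the difference vector between the two binned distributions.
    weights = [w for _, w in bins_a] + [-w for _, w in bins_b]
    total = 0.0
    for ci, wi in zip(centers, weights):
        for cj, wj in zip(centers, weights):
            total += wi * wj * math.exp(-alpha * math.dist(ci, cj) ** 2)
    # Clamp tiny negative values caused by floating-point rounding.
    return math.sqrt(max(total, 0.0))

def match_clusters(sample_a, sample_b, alpha=1.0):
    """Map each cluster name in sample_a to its least dissimilar cluster in sample_b."""
    matches = {}
    for name_a, bins_a in sample_a.items():
        best_name, best_score = None, math.inf
        for name_b, bins_b in sample_b.items():
            score = quadratic_form_distance(bins_a, bins_b, alpha)
            if score < best_score:
                best_name, best_score = name_b, score
        matches[name_a] = best_name
    return matches

sample_a = {
    "A1": [((0.0, 0.0), 0.6), ((0.5, 0.0), 0.4)],
    "A2": [((5.0, 5.0), 1.0)],
}
sample_b = {
    "B1": [((5.1, 5.0), 1.0)],
    "B2": [((0.1, 0.0), 0.5), ((0.6, 0.0), 0.5)],
}
print(match_clusters(sample_a, sample_b))
```

Because the bins of the two samples need not coincide, the distance is evaluated over the union of both bin sets rather than over a shared fixed grid.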
Abstract:
The described invention provides a method and/or system for analyzing data using population clustering through density-based merging, and a method for guiding clustering strategy through an entropy-based ranking score.
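The abstract does not define the entropy-based ranking score, so the following is only a hypothetical sketch of one way such a score could be computed. It assumes the score is the Shannon entropy of the cluster-size distribution produced by each candidate clustering, and that candidates are ranked from highest to lowest entropy. Both the definition and the ranking direction are assumptions, and the names `cluster_entropy` and `rank_by_entropy` are illustrative.

```python
import math
from collections import Counter

def cluster_entropy(labels):
    """Shannon entropy (in bits) of the fraction of events in each cluster."""
    counts = Counter(labels)
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def rank_by_entropy(candidates):
    """Order candidate clusterings by entropy, highest first (assumed direction)."""
    return sorted(candidates, key=lambda name: cluster_entropy(candidates[name]), reverse=True)

# Hypothetical cluster labels produced by three candidate strategies.
candidates = {
    "balanced": [0, 0, 1, 1],
    "skewed": [0, 0, 0, 1],
    "single": [0, 0, 0, 0],
}
print(rank_by_entropy(candidates))
```

A clustering that places all events in one population scores zero, while one that splits events evenly across populations scores highest under this assumed definition.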