Invention Grant
- Patent Title: Clustering sparse high dimensional data using sketches
-
Application No.: US14990161Application Date: 2016-01-07
-
Publication No.: US10268749B1Publication Date: 2019-04-23
- Inventor: Gourav Roy , Amit Chandak , Prateek Gupta , Srujana Merugu , Aswin Natarajan , Sathish Kumar Palanisamy , Gowda Dayananda Anjaneyapura Range , Jagannathan Srinivasan , Bharath Venkatesh
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Meyertons, Hood, Kivlin, Kowert & Goetzel, P.C.
- Agent Robert C. Kowert
- Main IPC: G06N5/04
- IPC: G06N5/04 ; G06N99/00 ; G06F17/30

Abstract:
An approximate data structure to represent clusters of observation records of a data set is identified. A hierarchical representation of a plurality of clusters, including the targeted number of clusters among which the observation records are to be distributed, is generated. Each node of the hierarchy comprises an instance of the approximate data structure. Until a set of termination criteria are met, iterations of a selected clustering methodology are run. In a given iteration, distances of observation records from the cluster representatives of a current version of the model are computed using the hierarchical representation, and a new version of the model with modified cluster representatives is generated.
Public/Granted literature
- US3188772A Lock ball roof edging Public/Granted day:1965-06-15
Information query