Invention Grant
- Patent Title: Enabling advanced analytics with large data sets
-
Application No.: US16256519Application Date: 2019-01-24
-
Publication No.: US11562002B2Publication Date: 2023-01-24
- Inventor: Paul Pallath , Rouzbeh Razavi
- Applicant: BUSINESS OBJECTS SOFTWARE LTD.
- Applicant Address: IE Dublin
- Assignee: BUSINESS OBJECTS SOFTWARE LTD.
- Current Assignee: BUSINESS OBJECTS SOFTWARE LTD.
- Current Assignee Address: IE Dublin
- Agency: Fish & Richardson P.C.
- Main IPC: G06F16/28
- IPC: G06F16/28 ; G06F16/2458

Abstract:
The present disclosure describes methods, systems, and computer program products for enabling advanced analytics with large datasets. One computer-implemented method includes receiving, by operation of a computer system, a dataset of multiple data records, each of the plurality of data records comprising one or more features and a target variable; selecting key features among the one or more features based at least on relevance measures of the one or more features with respect to the target variable; dividing the dataset into multiple subsets; for each of the multiple subsets, identifying a number of clusters and respective centroids of the number of clusters based on the key features; identifying a number of final centroids based on the respective centroids of the number of clusters for the each of the number of subsets, the number of final centroids being respective centroids of a number of final clusters; and for each data record in the multiple subsets, assigning the data record to one of the number of final clusters based on distances between the data record and the number of final centroids.
Public/Granted literature
- US20190155824A1 ENABLING ADVANCED ANALYTICS WITH LARGE DATA SETS Public/Granted day:2019-05-23
Information query