-
Publication No.: US20220398246A1
Publication Date: 2022-12-15
Application No.: US17342812
Filing Date: 2021-06-09
Applicant: BUSINESS OBJECTS SOFTWARE LTD.
Inventor: Paul O'HARA , Malte Christian KAUFMANN , Anirban BANERJEE , Ian DENVER , Alan McSHANE
IPC: G06F16/2458 , G06F16/28
Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics for each discrete value of the discrete feature based on values of a continuous feature associated with the discrete value; determination, for each discrete feature, of first summary statistics based on the statistics determined for each discrete value of the discrete feature; determination, for each discrete feature, of a dissimilarity based on the first summary statistics determined for the discrete feature and on the statistics determined for each discrete value of the discrete feature; determination of candidate discrete features of the discrete features based on the determined dissimilarities, the candidate discrete features comprising less than all of the discrete features; determination, for each of the candidate discrete features, of second summary statistics based on values of the continuous feature associated with each discrete value of the candidate discrete feature; determination of a deviation score for each of the candidate discrete features based on the second summary statistics; and presentation of the candidate discrete features based on the determined deviation scores.
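The two-stage flow in this abstract (a cheap dissimilarity pre-filter, then a deviation score on the surviving candidates) can be illustrated with a minimal Python sketch. The abstract does not name the statistics, so every concrete choice here is an assumption made for illustration: per-value means as the statistics, the mean of those means as the first summary statistics, the largest gap as the dissimilarity, and a count-weighted standardized gap as the deviation score. The names `group_values`, `rank_features`, and `top_k`, and the sample data, are hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def group_values(rows, feature, target):
    """Collect the continuous (target) values seen for each discrete value."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[feature]].append(row[target])
    return groups


def rank_features(rows, features, target, top_k):
    # Stage 1 (assumed statistics): per-value mean, a per-feature summary
    # (mean of the per-value means), and a dissimilarity (largest gap between
    # any per-value mean and that summary).
    dissimilarity = {}
    for feature in features:
        means = [mean(xs) for xs in group_values(rows, feature, target).values()]
        summary = mean(means)
        dissimilarity[feature] = max(abs(m - summary) for m in means)

    # Keep only the top_k most dissimilar features (fewer than all features).
    candidates = sorted(features, key=lambda f: dissimilarity[f], reverse=True)[:top_k]

    # Stage 2 (assumed score): count-weighted gap between each group mean and
    # the overall mean, in units of the overall standard deviation.
    values = [row[target] for row in rows]
    overall, spread = mean(values), pstdev(values) or 1.0
    scores = {}
    for feature in candidates:
        groups = group_values(rows, feature, target)
        scores[feature] = sum(
            len(xs) / len(values) * abs(mean(xs) - overall) / spread
            for xs in groups.values()
        )
    # Highest deviation score first, ready for presentation.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical sample data: revenue depends strongly on region, not on color.
rows = [
    {"region": "N", "color": "red", "size": "S", "revenue": 10},
    {"region": "N", "color": "blue", "size": "S", "revenue": 12},
    {"region": "S", "color": "red", "size": "L", "revenue": 100},
    {"region": "S", "color": "blue", "size": "S", "revenue": 98},
]
ranked = rank_features(rows, ["region", "color", "size"], "revenue", top_k=2)
print(ranked)
```

In this toy data set the pre-filter drops `color`, whose per-value means are identical, so the deviation score is computed only for `region` and `size`.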
-
Publication No.: US20230010992A1
Publication Date: 2023-01-12
Application No.: US17367882
Filing Date: 2021-07-06
Applicant: BUSINESS OBJECTS SOFTWARE LTD.
Inventor: Paul O'HARA , Malte Christian KAUFMANN , Alan McSHANE , Anirban BANERJEE , Mark AHERN
IPC: G06F16/2458 , G06F16/28 , G06F16/2457
Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics based on a number of occurrences of each discrete value of the discrete feature in the data; determination of first summary statistics based on the determined statistics; determination of a dissimilarity for each discrete feature based on the first summary statistics and on the statistics determined for the discrete feature; determination of candidate discrete features based on the determined dissimilarities; determination, for each of the candidate discrete features, of second summary statistics based on values of a continuous feature associated with each discrete value of the candidate discrete feature; determination of a deviation score for each of the candidate discrete features based on the second summary statistics; and transmission of the candidate discrete features for display in association with the continuous feature based on the determined deviation scores.
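Here the pre-filter relies on occurrence counts rather than on the continuous feature. A minimal Python sketch of one possible reading follows. The abstract does not specify the statistics or the direction of the filter, so the choices below are assumptions: the per-feature statistic is the number of distinct values, the first summary statistic is the median of that number across features, the dissimilarity is the absolute distance from the median, and the least dissimilar features are kept (which screens out identifier-like columns). The names `select_candidates`, `deviation_score`, and `max_candidates`, and the sample data, are hypothetical.

```python
from collections import Counter, defaultdict
from statistics import mean, median, pstdev


def select_candidates(rows, features, max_candidates):
    # Count-based statistic per feature (assumed): number of distinct values,
    # derived from the occurrence counts of each discrete value.
    distinct = {f: len(Counter(row[f] for row in rows)) for f in features}
    # First summary statistic across all features (assumed): the median.
    typical = median(distinct.values())
    # Dissimilarity (assumed): distance of a feature's statistic from the median.
    dissimilarity = {f: abs(n - typical) for f, n in distinct.items()}
    # Keep the features closest to typical cardinality (assumed direction).
    return sorted(features, key=lambda f: dissimilarity[f])[:max_candidates]


def deviation_score(rows, feature, target):
    # Second summary statistics (assumed): per-value means of the continuous
    # feature, compared with the overall mean in standard-deviation units.
    values = [row[target] for row in rows]
    overall, spread = mean(values), pstdev(values) or 1.0
    groups = defaultdict(list)
    for row in rows:
        groups[row[feature]].append(row[target])
    return max(abs(mean(xs) - overall) / spread for xs in groups.values())


# Hypothetical sample data: order_id is unique per row, so it is filtered out.
rows = [
    {"order_id": 1, "region": "N", "channel": "web", "revenue": 10},
    {"order_id": 2, "region": "N", "channel": "store", "revenue": 12},
    {"order_id": 3, "region": "N", "channel": "web", "revenue": 11},
    {"order_id": 4, "region": "S", "channel": "store", "revenue": 100},
    {"order_id": 5, "region": "S", "channel": "web", "revenue": 98},
    {"order_id": 6, "region": "S", "channel": "store", "revenue": 99},
]
candidates = select_candidates(rows, ["order_id", "region", "channel"], max_candidates=2)
ranked = sorted(
    ((f, deviation_score(rows, f, "revenue")) for f in candidates),
    key=lambda kv: kv[1],
    reverse=True,
)
print(candidates, ranked)
```

Because the candidate step reads only value counts, it never touches the continuous feature; the costlier per-group statistics are computed only for the features that survive it.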
-