- Patent title: Database aggregation query result estimator
- Application number: US11246355
- Filing date: 2005-10-07
- Publication number: US20060036600A1
- Publication date: 2006-02-16
- Inventors: Surajit Chaudhuri, Vivek Narasayya, Rajeev Motwani, Mayur Datar
- Applicants: Surajit Chaudhuri, Vivek Narasayya, Rajeev Motwani, Mayur Datar
- Applicant address: US WA Redmond
- Assignee: Microsoft Corporation
- Current assignee: Microsoft Corporation
- Current assignee address: US WA Redmond
- Main classification: G06F7/00
- IPC classification: G06F7/00
Abstract:
Aggregation queries are performed by first identifying outlier values, aggregating the outlier values, and sampling the remaining data after pruning the outlier values. The sampled data is extrapolated and added to the aggregated outlier values to provide an estimate for each aggregation query. Outlier values are identified by selecting values outside of a selected sliding window of data having the lowest variance. An index is created for the outlier values. The outlier data is removed from the window of data, and separately aggregated. The remaining data without the outliers is then sampled to provide a statistically relevant sample that is then aggregated and extrapolated to provide an estimate for the remaining data. This sampled estimate is combined with the outlier aggregate to form an estimate for the entire set of data.
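The steps in the abstract can be illustrated with a minimal Python sketch for a SUM query. This is not the patented implementation: the function names `split_outliers` and `estimate_sum`, the parameters `num_outliers` and `sample_size`, and the use of a plain list in place of the outlier index are all assumptions made for illustration. The sketch sorts the values, slides a window of fixed size over them, keeps the window with the lowest variance, aggregates the values outside it exactly, and extrapolates a uniform random sample of the values inside it.

```python
import random


def split_outliers(values, num_outliers):
    """Split values into (outliers, rest).

    `rest` is the contiguous window of the sorted values, of size
    len(values) - num_outliers, that has the lowest variance.
    Everything outside that window is treated as an outlier.
    """
    data = sorted(values)
    window = len(data) - num_outliers
    if num_outliers < 0 or window <= 0:
        raise ValueError("num_outliers must be in [0, len(values) - 1]")

    # Prefix sums of values and squares give each window's variance in O(1).
    prefix, prefix_sq = [0.0], [0.0]
    for v in data:
        prefix.append(prefix[-1] + v)
        prefix_sq.append(prefix_sq[-1] + v * v)

    best_start, best_var = 0, float("inf")
    for start in range(num_outliers + 1):
        total = prefix[start + window] - prefix[start]
        total_sq = prefix_sq[start + window] - prefix_sq[start]
        var = total_sq / window - (total / window) ** 2
        if var < best_var:
            best_start, best_var = start, var

    rest = data[best_start:best_start + window]
    outliers = data[:best_start] + data[best_start + window:]
    return outliers, rest


def estimate_sum(values, num_outliers, sample_size, rng=None):
    """Estimate sum(values): exact outlier sum plus an extrapolated sample."""
    if sample_size < 1:
        raise ValueError("sample_size must be at least 1")
    rng = rng or random.Random()
    outliers, rest = split_outliers(values, num_outliers)

    # Outliers are aggregated exactly (a list stands in for the outlier index).
    outlier_sum = sum(outliers)

    # Sample the remaining data uniformly and scale the sample mean
    # up to the number of non-outlier rows.
    k = min(sample_size, len(rest))
    sample = rng.sample(rest, k)
    extrapolated = (sum(sample) / k) * len(rest)

    return outlier_sum + extrapolated


# Hypothetical data: eight typical values and two extreme ones.
values = [10, 11, 9, 10, 12, 10, 11, 9, 10000, 5000]
outliers, rest = split_outliers(values, num_outliers=2)
estimate = estimate_sum(values, num_outliers=2, sample_size=4, rng=random.Random(0))
```

Because the two extreme values are summed exactly, the sampling error in this sketch comes only from the low-variance remainder, which is the motivation the abstract gives for separating outliers before sampling.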
Published/granted documents:
- US07363301B2 Database aggregation query result estimator (publication/grant date: 2008-04-22)