Database aggregation query result estimator

发明申请

US20060036600A1 Database aggregation query result estimator 有权

请登陆查看更多内容

专利标题： Database aggregation query result estimator
申请号： US11246355

申请日： 2005-10-07
公开(公告)号： US20060036600A1

公开(公告)日： 2006-02-16
发明人: Surajit Chaudhuri , Vivek Narasayya , Rajeev Motwani , Mayur Datar
申请人： Surajit Chaudhuri , Vivek Narasayya , Rajeev Motwani , Mayur Datar
申请人地址： US WA Redmond
专利权人： Microsoft Corporation
当前专利权人： Microsoft Corporation
当前专利权人地址： US WA Redmond
主分类号： G06F7/00
IPC分类号： G06F7/00

Database aggregation query result estimator

摘要：

Aggregation queries are performed by first identifying outlier values, aggregating the outlier values, and sampling the remaining data after pruning the outlier values. The sampled data is extrapolated and added to the aggregated outlier values to provide an estimate for each aggregation query. Outlier values are identified by selecting values outside of a selected sliding window of data having the lowest variance. An index is created for the outlier values. The outlier data is removed from the window of data, and separately aggregated. The remaining data without the outliers is then sampled to provide a statistically relevant sample that is then aggregated and extrapolated to provide an estimate for the remaining data. This sampled estimate is combined with the outlier aggregate to form an estimate for the entire set of data.

公开/授权文献

US07363301B2 Database aggregation query result estimator 公开/授权日：2008-04-22

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F7/00	通过待处理的数据的指令或内容进行运算的数据处理的方法或装置（逻辑电路入H03K19/00）