- Title: Systems and methods for quantile determination in a distributed data system using sampling
- Application No.: US15212010
- Filing date: 2016-07-15
- Publication No.: US09703852B2
- Publication date: 2017-07-11
- Inventors: Guy Blanc, Georges H. Guirguis, Xiangqian Hu, Guixian Lin, Scott Pope
- Applicant: SAS Institute Inc.
- Applicant address: Cary, NC, US
- Assignee: SAS INSTITUTE INC.
- Current assignee: SAS INSTITUTE INC.
- Current assignee address: Cary, NC, US
- Agent: Kilpatrick Townsend & Stockton LLP
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
In accordance with the teachings described herein, systems and methods are provided for estimating or determining quantiles for data stored in a distributed system. In one embodiment, an instruction is received to estimate or determine a specified quantile for a variate in a set of data stored at a plurality of nodes in the distributed system. A plurality of data bins for the variate are defined that are each associated with a different range of data values in the set of data. Lower and upper quantile bounds for each of the plurality of data bins are determined based on the total number of data values that fall within each of the plurality of data bins. The specified quantile is estimated or determined based on an identified one of the plurality of data bins that includes the specified quantile based on the lower and upper quantile bounds.
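The binning steps in the abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation: the equal-width bins, the function names `bin_index` and `quantile_from_bins`, the `num_bins` parameter, the modelling of nodes as in-memory lists, and the quantile convention (the smallest value whose cumulative fraction is at least `q`) are all assumptions made for illustration.

```python
import math
from bisect import bisect_right


def bin_index(value, edges):
    """Return the bin holding `value`; the maximum value goes in the last bin."""
    return min(bisect_right(edges, value) - 1, len(edges) - 2)


def quantile_from_bins(nodes, q, num_bins=4):
    """Hypothetical sketch: find the q-quantile (0 < q <= 1) of data split across nodes.

    `nodes` is a list of non-empty lists, each standing in for one node's values.
    Returns (quantile value, (lower bound, upper bound) of the selected bin).
    """
    # Assumption: equal-width bins spanning the global minimum and maximum.
    lo = min(min(n) for n in nodes)
    hi = max(max(n) for n in nodes)
    width = (hi - lo) / num_bins or 1.0  # guard against all-equal data
    edges = [lo + i * width for i in range(num_bins + 1)]

    # Each node counts its values per bin; only the counts are combined.
    totals = [0] * num_bins
    for node in nodes:
        for v in node:
            totals[bin_index(v, edges)] += 1
    n_total = sum(totals)

    # Locate the bin whose cumulative count range contains the target rank.
    rank = math.ceil(q * n_total)  # 1-based order statistic (assumed convention)
    below = 0
    for b, count in enumerate(totals):
        if below < rank <= below + count:
            break
        below += count

    # Lower and upper quantile bounds of the identified bin.
    bounds = (below / n_total, (below + count) / n_total)

    # Only values in the identified bin need to be collected and sorted.
    candidates = sorted(v for node in nodes for v in node
                        if bin_index(v, edges) == b)
    return candidates[rank - below - 1], bounds
```

For example, with `nodes = [[1, 7, 3], [10, 2, 8], [5, 4, 6, 9]]`, calling `quantile_from_bins(nodes, 0.5)` selects the second bin, whose quantile bounds are 0.3 and 0.5, and returns the value 5. In this sketch, only per-bin counts are combined across nodes before the final step, which gathers just the values in one bin.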