SYSTEM AND METHOD FOR DYNAMIC BULK DATA INGESTION PRIORITIZATION

    公开(公告)号:US20200050676A1

    公开(公告)日:2020-02-13

    申请号:US16058724

    申请日:2018-08-08

    摘要: A data system may dynamically prioritize and ingest data so that, regardless of the memory size of the dataset hosted by the data system, it may process and analyze the hosted dataset in constant time. The system and method may implement a first space-efficient probabilistic data structure on the dataset, wherein the dataset includes a plurality of profile data. It may then receive update data corresponding to some of the plurality of profile data and implement a second space-efficient probabilistic data structure on the dataset including the update data. The system and method may then determine a set of non-shared profile data of the second space-efficient probabilistic data structure and prioritize the set of non-shared profile data of the second space-efficient probabilistic data structure over other profile data of the dataset for caching.