System for continuous monitoring of data quality in a dynamic feed environment

    Publication No.: US10977147B2

    Publication date: 2021-04-13

    Application No.: US16257936

    Filing date: 2019-01-25

    Abstract: A system for providing continuous monitoring of data quality in a dynamic feed environment is disclosed. In particular, the system utilizes a feed inspection tool to detect anomalies in data gathering detected from feed metadata and anomalies in data measurement detected based on file contents. In order to do so, the feed inspection tool may aggregate, for a plurality of aggregation intervals, data feeds and associated metadata feeds. Once the data feeds and metadata feeds are aggregated, the feed inspection tool may generate, for a baseline model feed, baseline statistical models by utilizing historical data of the aggregated feeds in sliding windows of different lengths. The feed inspection tool may then identify, for a plurality of monitoring time delays, data outliers by comparing the aggregated feeds with the baseline model feed. A data quality feed based on the data outliers identified may then be generated and published.
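The abstract's pipeline (aggregate per interval, build baselines over sliding windows of different lengths, flag outliers against the baseline) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names `aggregate`, `baseline`, and `find_outliers`, the `(timestamp, size)` record shape, the window lengths, and the mean/standard-deviation threshold rule. The "monitoring time delays" dimension is omitted for brevity.

```python
from collections import defaultdict
from statistics import mean, pstdev


def aggregate(records, interval):
    """Bucket (timestamp, size) records into fixed aggregation intervals.

    Hypothetical record shape: timestamp in seconds, size in bytes.
    Returns {interval_start: {"files": count, "bytes": total}}.
    """
    buckets = defaultdict(lambda: {"files": 0, "bytes": 0})
    for ts, size in records:
        start = ts - ts % interval
        buckets[start]["files"] += 1
        buckets[start]["bytes"] += size
    return dict(sorted(buckets.items()))


def baseline(history, window):
    """Baseline statistics over the last `window` historical values."""
    recent = history[-window:]
    return mean(recent), pstdev(recent)


def find_outliers(series, windows=(4, 8), threshold=3.0):
    """Flag values that deviate from the baseline of any sliding window.

    Assumed rule (not from the patent): a value is an outlier when it is
    more than `threshold` standard deviations from the window mean.
    Returns a list of (index, window_length, value) tuples.
    """
    outliers = []
    for i, value in enumerate(series):
        for w in windows:
            if i < w:
                continue  # not enough history for this window yet
            mu, sigma = baseline(series[:i], w)
            if sigma == 0:
                deviates = value != mu
            else:
                deviates = abs(value - mu) > threshold * sigma
            if deviates:
                outliers.append((i, w, value))
    return outliers


if __name__ == "__main__":
    # Per-interval file counts for a feed; the last interval drops sharply.
    counts = [10, 10, 11, 9, 10, 10, 11, 9, 10, 2]
    print(find_outliers(counts))
```

In a fuller version, the flagged tuples would be written out as the published data quality feed, and the same check would be repeated at several delays after each interval closes so that late-arriving files are taken into account.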

    System For Continuous Monitoring Of Data Quality In A Dynamic Feed Environment

    Publication No.: US20190155822A1

    Publication date: 2019-05-23

    Application No.: US16257936

    Filing date: 2019-01-25

    System for continuous monitoring of data quality in a dynamic feed environment

    Publication No.: US10191962B2

    Publication date: 2019-01-29

    Application No.: US14813403

    Filing date: 2015-07-30

    System For Continuous Monitoring Of Data Quality In A Dynamic Feed Environment
    Invention application; status: under examination (published)

    Publication No.: US20170032015A1

    Publication date: 2017-02-02

    Application No.: US14813403

    Filing date: 2015-07-30

    CPC classification number: G06F17/30592 G06F17/30598
