Abstract:
A system for providing continuous monitoring of data quality in a dynamic feed environment is disclosed. In particular, the system utilizes a feed inspection tool to detect anomalies in data gathering detected from feed metadata and anomalies in data measurement detected based on file contents. In order to do so, the feed inspection tool may aggregate, for a plurality of aggregation intervals, data feeds and associated metadata feeds. Once the data feeds and metadata feeds are aggregated, the feed inspection tool may generate, for a baseline model feed, baseline statistical models by utilizing historical data of the aggregated feeds in sliding windows of different lengths. The feed inspection tool may then identify, for a plurality of monitoring time delays, data outliers by comparing the aggregated feeds with the baseline model feed. A data quality feed based on the data outliers identified may then be generated and published.
Abstract:
A system for providing continuous monitoring of data quality in a dynamic feed environment is disclosed. In particular, the system utilizes a feed inspection tool to detect anomalies in data gathering detected from feed metadata and anomalies in data measurement detected based on file contents. In order to do so, the feed inspection tool may aggregate, for a plurality of aggregation intervals, data feeds and associated metadata feeds. Once the data feeds and metadata feeds are aggregated, the feed inspection tool may generate, for a baseline model feed, baseline statistical models by utilizing historical data of the aggregated feeds in sliding windows of different lengths. The feed inspection tool may then identify, for a plurality of monitoring time delays, data outliers by comparing the aggregated feeds with the baseline model feed. A data quality feed based on the data outliers identified may then be generated and published.
Abstract:
A system for providing continuous monitoring of data quality in a dynamic feed environment is disclosed. In particular, the system utilizes a feed inspection tool to detect anomalies in data gathering detected from feed metadata and anomalies in data measurement detected based on file contents. In order to do so, the feed inspection tool may aggregate, for a plurality of aggregation intervals, data feeds and associated metadata feeds. Once the data feeds and metadata feeds are aggregated, the feed inspection tool may generate, for a baseline model feed, baseline statistical models by utilizing historical data of the aggregated feeds in sliding windows of different lengths. The feed inspection tool may then identify, for a plurality of monitoring time delays, data outliers by comparing the aggregated feeds with the baseline model feed. A data quality feed based on the data outliers identified may then be generated and published.
Abstract:
A system for providing continuous monitoring of data quality in a dynamic feed environment is disclosed. In particular, the system utilizes a feed inspection tool to detect anomalies in data gathering detected from feed metadata and anomalies in data measurement detected based on file contents. In order to do so, the feed inspection tool may aggregate, for a plurality of aggregation intervals, data feeds and associated metadata feeds. Once the data feeds and metadata feeds are aggregated, the feed inspection tool may generate, for a baseline model feed, baseline statistical models by utilizing historical data of the aggregated feeds in sliding windows of different lengths. The feed inspection tool may then identify, for a plurality of monitoring time delays, data outliers by comparing the aggregated feeds with the baseline model feed. A data quality feed based on the data outliers identified may then be generated and published.