摘要:
A method and system are described for including data quality in data streams. An example method may include obtaining a first group of data items, each data item including one or more data attribute values. A first group of data quality items may be determined, each data quality item including one or more data quality attribute values associated with one of the data items of the first group. A first aggregated data quality value may be determined based on the first group of data quality items. A first data stream interval including the first group of data items and the first aggregated data quality value may be output.
摘要:
A system and method to perform data quality driven optimization of data are described. In one embodiment, a method is presented to iteratively test configurations of a data processing path until a configuration that processes data to predefined quality requirements is identified. In one embodiment, a system is presented. The system includes a data quality initialization module (404), a primary data stream processing module (406) and an optimization module (408) that is incorporated in a memory chip on a computer processor.
摘要:
A system and method to delete overload in a data stream are described. A method of an embodiment of the invention may analyze data quality information in a data stream and delete data items that are found to be of lower than a desired data quality. In one embodiment, data items may be evaluated according to maximize a particular aspect of the utility of the data in a data stream. In one embodiment, a system of an embodiment of the invention may evaluate data quality in a data stream to suggest one or more actions to be performed to improve the data quality in the data stream. Further, the system of the embodiment of the invention may evaluate each suggested action to determine how the suggested action may impact the data quality in the data stream if performed.