Integrating Data Quality Analyses for Modeling Metrics

    公开(公告)号:US20220004822A1

    公开(公告)日:2022-01-06

    申请号:US16921579

    申请日:2020-07-06

    Abstract: Techniques for generating a composite score for data quality are disclosed. Univariate analysis is performed on a plurality of data points corresponding to each of a first feature, a second feature, and a third feature of a data set. The univariate analysis includes at least a first type of analysis generating a first score having a first range of possible values, and a second type of analysis generating a second score having a second range of possible values. A first quality score is computed for the data values for the first, second, and third features based on a normalized first score and a normalized second score. Machine learning is performed on the data points corresponding to one or both of the first feature and the second feature having a first quality score above a threshold value to model the third feature.

    Univariate Anomaly Detection in a Sensor Network

    公开(公告)号:US20210097416A1

    公开(公告)日:2021-04-01

    申请号:US16585764

    申请日:2019-09-27

    Abstract: Embodiments determine anomalies in sensor data generated by a sensor by receiving an evaluation time window of clean sensor data generated by the sensor. Embodiments receive a threshold value for determining anomalies. When the clean sensor data has a cyclic pattern, embodiments divide the evaluation time window into a plurality of segments of equal length, wherein each equal length comprises the cyclic pattern. When the clean sensor data does not have the cyclic pattern, embodiments divide the evaluation time window into a pre-defined number of plurality of segments of equal length. Embodiments convert the evaluation time window and each of the plurality of segments into corresponding curves using Kernel Density Estimation (“KDE”). For each of the plurality of segments, embodiments determine a Kullback-Leibler (“KL”) divergence value between corresponding curves of the segment and the evaluation time window to generate a plurality of KL divergence values.

    Action determination using recommendations based on prediction of sensor-based systems

    公开(公告)号:US12223397B2

    公开(公告)日:2025-02-11

    申请号:US17007601

    申请日:2020-08-31

    Abstract: Techniques for providing actionable recommendations for configuring system parameters are disclosed. A set of environmental constraints and a set of values for a set of parameters for a target device is applied to a machine learning model to predict a first performance value of the target device. Candidate values for the set of parameters are identified that are within a threshold range from the first set of values in a multi-dimensional space. For each particular candidate set of values of the candidate sets of values the machine learning model to predicts a performance value of the target device and identifies a subset of the candidate sets of values with corresponding performance values that meet a performance criteria. A subset of candidate sets of values that meets performance criteria is provided as a recommendation.

    ACTION DETERMINATION USING RECOMMENDATIONS BASED ON PREDICTION OF SENSOR-BASED SYSTEMS

    公开(公告)号:US20220067572A1

    公开(公告)日:2022-03-03

    申请号:US17007601

    申请日:2020-08-31

    Abstract: Techniques for providing actionable recommendations for configuring system parameters are disclosed. A set of environmental constraints and a set of values for a set of parameters for a target device is applied to a machine learning model to predict a first performance value of the target device. Candidate values for the set of parameters are identified that are within a threshold range from the first set of values in a multi-dimensional space. For each particular candidate set of values of the candidate sets of values the machine learning model to predicts a performance value of the target device and identifies a subset of the candidate sets of values with corresponding performance values that meet a performance criteria. A subset of candidate sets of values that meets performance criteria is provided as a recommendation.

    IDENTIFYING AND RANKING ANOMALOUS MEASUREMENTS TO IDENTIFY FAULTY DATA SOURCES IN A MULTI-SOURCE ENVIRONMENT

    公开(公告)号:US20210406110A1

    公开(公告)日:2021-12-30

    申请号:US17147737

    申请日:2021-01-13

    Abstract: Techniques for identifying anomalous multi-source data points and ranking the contributions of measurement sources of the multi-source data points are disclosed. A system obtains a data point including a plurality of measurements from a plurality of sources. The system determines that the data point is an anomalous data point based on a deviation of the data point from a plurality of additional data points. The system determines a contribution of two or more measurements, from the plurality of measurements, to the deviation of the data point from the plurality of additional data points. The system ranks the at least the two or more measurements, from the plurality of measurements, based on the respective contribution of each of the two or more measurements to the deviation of the anomalous data point from the plurality of prior data points.

Patent Agency Ranking