USER INTERFACE DATA ANALYZER HIGHLIGHTER

    公开(公告)号:US20250013819A1

    公开(公告)日:2025-01-09

    申请号:US18348172

    申请日:2023-07-06

    Abstract: A data analyzer highlighter highlights elements of a user interface to enable a user to better understand and analyze the data presented. To do this, a first visualization is generated in a user interface. A configuration panel including elements for selecting statistical techniques is also generated in the user interface. Selections are obtained via the user interface of one or more statistical techniques. Then statistics are determined from the dataset using each of the one or more selected statistical techniques. Rows of data or the columns of data are then sorted based on a number of extreme values in the particular row or column, wherein the extreme value is a minimum value, a maximum value, or an outlier value. A second visualization sorted based on the number of extreme values in the particular row or column is then generated in the user interface.

    MULTI-STEP TIME SERIES FORECASTING WITH RESIDUAL LEARNING

    公开(公告)号:US20220172130A1

    公开(公告)日:2022-06-02

    申请号:US17673307

    申请日:2022-02-16

    Abstract: A method includes receiving training data including sequential data, determining a plurality of future time points, generating a first prediction by applying a first forecasting algorithm to the training data, generating a second prediction by applying a second forecasting algorithm to the training data, extracting predicted values from the first prediction and the second prediction that corresponds to a future time point of the plurality of future time points, applying a regression model in sequence on each of the plurality of future time points to generate a final predicted value of each of the plurality of future time points, and outputting the final predicted values of the plurality of future time points.

    AUTOMATIC HOT AREA DETECTION IN HEAT MAP VISUALIZATIONS

    公开(公告)号:US20210349911A1

    公开(公告)日:2021-11-11

    申请号:US16867036

    申请日:2020-05-05

    Abstract: The present disclosure involves systems, software, and computer implemented methods for automatically detecting hot areas in heat map visualizations. One example method includes identifying a two-dimensional heat map. The identified two-dimensional heat map is converted to a one-dimensional heat map. Cells of the one-dimensional heat map are clustered using a density-based clustering algorithm to generate at least one dense region of cells. A mean value of cells in each dense region is calculated and the dense regions are sorted by mean value in descending order. An approach for identifying hot areas is selected and the selected approach is used to identify at least one dense region as a hot area of the one-dimensional heat map.

    TIME SERIES ANALYSIS USING A CLUSTERING BASED SYMBOLIC REPRESENTATION

    公开(公告)号:US20190179835A1

    公开(公告)日:2019-06-13

    申请号:US16277725

    申请日:2019-02-15

    Abstract: Techniques are described for performing a time series analysis using a clustering based symbolic representation. Implementations employ a clustering based symbolic representation applied to time series data. In some implementations, the time series data is discretized into subsequences with regular time intervals, and symbols encoding the time intervals may be derived by performing clustering algorithms on the subsequences. In the new representation, a time series is transformed into a sequence of categorical values. The symbolic representation is suitable to perform time series classification and forecast with higher accuracy and greater efficiency compared to previously used techniques. Through use of the symbolic representation, a dimension reduction is applied to transform the time sequences to a feature space with lower dimensions. As output of such transformation, a new representation is obtained based on the original time series. This new reduced-dimension representation improves the efficiency of time series data mining and forecasting.

    Composite relationship discovery framework

    公开(公告)号:US11693879B2

    公开(公告)日:2023-07-04

    申请号:US17324667

    申请日:2021-05-19

    CPC classification number: G06F16/26 G06F16/2465

    Abstract: Systems and methods include reception of a set of data including continuous features and a discrete feature, each continuous feature associated with a plurality of values and the discrete feature associated with a plurality of discrete values, determine, for each continuous feature, a relationship factor representing a relationship between the discrete feature and the continuous feature based on the plurality of values associated with the continuous feature and the plurality of discrete values, identify one of the continuous features associated with a largest one of the determined relationship factors, generate, for each of the other features, a correlation factor representing a correlation between the continuous feature and the identified continuous feature, determine, for each of the continuous features other than the identified continuous feature, a composite relationship score based on the relationship factor and the correlation factor associated with the feature, and present a visualization associated with the discrete feature, the identified continuous feature, and a continuous feature associated with a largest composite relationship score.

    Time series analysis using a clustering based symbolic representation

    公开(公告)号:US11036766B2

    公开(公告)日:2021-06-15

    申请号:US16277725

    申请日:2019-02-15

    Abstract: Techniques are described for performing a time series analysis using a clustering based symbolic representation. Implementations employ a clustering based symbolic representation applied to time series data. In some implementations, the time series data is discretized into subsequences with regular time intervals, and symbols encoding the time intervals may be derived by performing clustering algorithms on the subsequences. In the new representation, a time series is transformed into a sequence of categorical values. The symbolic representation is suitable to perform time series classification and forecast with higher accuracy and greater efficiency compared to previously used techniques. Through use of the symbolic representation, a dimension reduction is applied to transform the time sequences to a feature space with lower dimensions. As output of such transformation, a new representation is obtained based on the original time series. This new reduced-dimension representation improves the efficiency of time series data mining and forecasting.

    MULTIPLE MACHINE LEARNING MODEL ANOMALY DETECTION FRAMEWORK

    公开(公告)号:US20250013668A1

    公开(公告)日:2025-01-09

    申请号:US18754570

    申请日:2024-06-26

    Abstract: Anomalies may be detected using a multiple machine learning model anomaly detection framework. A clustering model is trained using an unsupervised machine learning algorithm on a historical anomaly dataset. A plurality of clusters of records are determined by applying the historical anomaly dataset to the clustering model. Then it is determined whether each cluster of the plurality of clusters is an anomaly-type cluster or a normal-type cluster. The plurality of labels for the plurality of records are updated based on the particular record's cluster classification. Non-pure clusters are determined from among the plurality of clusters based on a purity threshold. A supervised machine learning model is trained for each of the non-pure clusters using the records in the given cluster and the labels for each of those records. Then, predictions of an anomaly are made using the clustering model and the supervised machine learning models.

    Automatic frequency recommendation for time series data

    公开(公告)号:US11321332B2

    公开(公告)日:2022-05-03

    申请号:US16876463

    申请日:2020-05-18

    Abstract: The present disclosure involves systems, software, and computer implemented methods for automatically recommending one or more frequencies for time series data. One example method includes receiving a request for an insight analysis for an input time series included in a dataset. For each of multiple frequencies to analyze, the input time series is transformed into a frequency time series. An absolute percentage change impact factor and an absolute trend impact factor are determined for each frequency time series. A frequency interest score is determined based on the determined absolute percentage change factors and the determined absolute trend impact factors, for each time frequency time series. The frequency interest score is provided for at least some of the frequency time series.

    ARTIFICIAL IMMUNE SYSTEM FOR FUZZY COGNITIVE MAP LEARNING

    公开(公告)号:US20180285769A1

    公开(公告)日:2018-10-04

    申请号:US15475482

    申请日:2017-03-31

    Abstract: The present disclosure involves systems, software, and computer implemented methods for learning relationships between concepts using an artificial immune system. A method includes identifying a set of concepts; determining a state value for each concept at each of a set of time points; generating an initial state and a system response; designating the system response as an antigen a clonal selection algorithm; generating a set of candidate weight matrices to be used as a population of antibodies in the clonal selection algorithm; determining a system response for each antibody; determining an affinity value for each antibody, using the system response for the antibody, the affinity value for a respective antibody representing how closely the respective antibody fits the antigen; cloning a set of antibodies based on the affinity values; repeating the cloning until a stopping point is reached; and selecting a candidate weight matrix with a highest affinity value.

Patent Agency Ranking