Time series analysis using a clustering based symbolic representation

    公开(公告)号:US10248713B2

    公开(公告)日:2019-04-02

    申请号:US15364681

    申请日:2016-11-30

    Abstract: Techniques are described for performing a time series analysis using a clustering based symbolic representation. Implementations employ a clustering based symbolic representation applied to time series data. In some implementations, the time series data is discretized into subsequences with regular time intervals, and symbols encoding the time intervals may be derived by performing clustering algorithms on the subsequences. In the new representation, a time series is transformed into a sequence of categorical values. The symbolic representation is suitable to perform time series classification and forecast with higher accuracy and greater efficiency compared to previously used techniques. Through use of the symbolic representation, a dimension reduction is applied to transform the time sequences to a feature space with lower dimensions. As output of such transformation, a new representation is obtained based on the original time series. This new reduced-dimension representation improves the efficiency of time series data mining and forecasting.

    TIME SERIES ANALYSIS USING A CLUSTERING BASED SYMBOLIC REPRESENTATION

    公开(公告)号:US20180150547A1

    公开(公告)日:2018-05-31

    申请号:US15364681

    申请日:2016-11-30

    CPC classification number: G06F17/30598 G06F17/30539 G06N99/005

    Abstract: Techniques are described for performing a time series analysis using a clustering based symbolic representation. Implementations employ a clustering based symbolic representation applied to time series data. In some implementations, the time series data is discretized into subsequences with regular time intervals, and symbols encoding the time intervals may be derived by performing clustering algorithms on the subsequences. In the new representation, a time series is transformed into a sequence of categorical values. The symbolic representation is suitable to perform time series classification and forecast with higher accuracy and greater efficiency compared to previously used techniques. Through use of the symbolic representation, a dimension reduction is applied to transform the time sequences to a feature space with lower dimensions. As output of such transformation, a new representation is obtained based on the original time series. This new reduced-dimension representation improves the efficiency of time series data mining and forecasting.

Patent Agency Ranking