摘要:
A method for projection mining comprises performing a first projection on a first data object of a first type comprising a plurality of data entries and a second data object of a second type comprising a plurality of data entries to create definitions of attributes of the first data object and definitions of attributes of the second data object, performing a second projection of the definitions of the attributes of the first data object and the definitions of the attributes of the second data object into a space of meta-attributes based on semantic relationships among the attributes of the first data object and the second data object, learning relationships between the space of meta-attributes formed by the projections of the first data object and the second data object and a space of meta-attributes relating to new data not included in the first data object and the second data object, and generating at least one new data object of the first or second type based on the new data using the learned relationships.
摘要:
An incident predictor system is described herein for predicting impactful incidents in which server computer system operations fail or perform poorly. According to one embodiment of the invention, the incident prediction system trains a generalized linear model (GLM) to predict when a system health indicator will reach a level that represents an incident for the server system.
摘要:
A method is provided for detecting when users are being adversely impacted by poor system performance. A system health indicator is determined that is based on the amount of work that is blocked waiting for each of a set of an external events and combined with a heuristic that is based on the number of users waiting for the work to complete. The system health indicator is compared to a threshold such that an alert is generated when the system health indicator crosses the threshold. However, the system health indicator is designed so that an alert is only generated when a significant user base is or will in the near future experience a problem with the system. Furthermore, the system health indicator is designed to vary smoothly to maintain its suitability for the application of predictive technology.
摘要:
A method is provided for detecting when users are being adversely impacted by poor system performance. A system health indicator is determined that is based on the amount of work that is blocked waiting for each of a set of an external events and combined with a heuristic that is based on the number of users waiting for the work to complete. The system health indicator is compared to a threshold such that an alert is generated when the system health indicator crosses the threshold. However, the system health indicator is designed so that an alert is only generated when a significant user base is or will in the near future experience a problem with the system. Furthermore, the system health indicator is designed to vary smoothly to maintain its suitability for the application of predictive technology.
摘要:
An incident predictor system is described herein for predicting impactful incidents in which server computer system operations fail or perform poorly. According to one embodiment of the invention, the incident prediction system trains a generalized linear model (GLM) to predict when a system health indicator will reach a level that represents an incident for the server system.
摘要:
An implementation of NMF functionality integrated into a relational database management system provides the capability to apply NMF to relational datasets and to sparse datasets. A database management system comprises a multi-dimensional data table operable to store data and a processing unit operable to perform non-negative matrix factorization on data stored in the multi-dimensional data table and to generate a plurality of data tables, each data table being smaller than the multi-dimensional data table and having reduced dimensionality relative to the multi-dimensional data table. The multi-dimensional data table may be a relational data table.
摘要:
A database management provides the capability to perform cluster analysis and provides improved performance in model building and data mining, good integration with the various databases throughout the enterprise, and flexible specification and adjustment of the models being built, but which provides data mining functionality that is accessible to users having limited data mining expertise and which provides reductions in development times and costs for data mining projects. The database management system for in-database clustering comprises a first data table and a second data table, each data table including a plurality of rows of data, means for building an enhanced K-means clustering model using the first data table, and means for applying the enhanced K-means clustering model using the second data table to generate apply output data.
摘要:
A database management system provides the capability to perform cluster analysis and provides improved performance in model building and data mining, good integration with the various databases throughout the enterprise, and flexible specification and adjustment of the models being built, but which provides data mining functionality that is accessible to users having limited data mining expertise and which provides reductions in development times and costs for data mining projects. The database management system for in-database clustering comprises a first data table and a second data table, each data table including a plurality of rows of data, means for building an Orthogonal Partitioning Clustering model using the first data table, and means for applying the Orthogonal Partitioning Clustering model using the second data table to generate apply output data.
摘要:
A system, method, and computer program product for in-database clustering provides the capability to perform cluster analysis and provides improved performance in model building and data mining, good integration with the various databases throughout the enterprise, and flexible specification and adjustment of the models being built, but which provides data mining functionality that is accessible to users having limited data mining expertise and which provides reductions in development times and costs for data mining projects. A database management system for in-database clustering, comprises a first data table and a second data table, each data table including a plurality of rows of data, means for building a clustering model using the first data table, and means for applying the clustering model using the second data table to generate apply output data.
摘要:
A database management system provides the capability to perform cluster analysis and provides improved performance in model building and data mining, good integration with the various databases throughout the enterprise, and flexible specification and adjustment of the models being built, but which provides data mining functionality that is accessible to users having limited data mining expertise and which provides reductions in development times and costs for data mining projects. The database management system for in-database clustering comprises a first data table and a second data table, each data table including a plurality of rows of data, means for building a clustering model using the first data table, means for building a rule-based model using the clustering model, and means for applying the rule-based model using the second data table to generate apply output data.