Abstract:
The disclosed embodiments relate to a system that optimizes a prognostic-surveillance system to achieve a user-selectable functional objective. During operation, the system allows a user to select a functional objective to be optimized from a set of functional objectives for the prognostic-surveillance system. Next, the system optimizes the selected functional objective by performing Monte Carlo simulations, which vary operational parameters for the prognostic-surveillance system while the prognostic-surveillance system operates on synthesized signals, to determine optimal values for the operational parameters that optimize the selected functional objective.
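
The Monte Carlo search described above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, since the abstract names neither the parameters nor the objectives: the single operational parameter is a hypothetical alarm `threshold`, the two selectable objectives are hypothetical false-alarm and missed-alarm rates, and the synthesized signals are plain Gaussian noise with and without a mean shift.

```python
import random

def synthesize(rng, n, mean):
    """Synthesize n Gaussian samples (unit variance) around the given mean."""
    return [rng.gauss(mean, 1.0) for _ in range(n)]

# Hypothetical functional objectives; lower is better for both.
OBJECTIVES = {
    "false_alarm_rate": lambda t, normal, faulty: sum(x > t for x in normal) / len(normal),
    "missed_alarm_rate": lambda t, normal, faulty: sum(x <= t for x in faulty) / len(faulty),
}

def optimize(objective, trials=200, seed=0):
    """Randomly sample the threshold and keep the value that minimizes the objective."""
    rng = random.Random(seed)
    normal = synthesize(rng, 500, 0.0)   # undegraded signal
    faulty = synthesize(rng, 500, 2.0)   # signal with an injected mean shift
    score = OBJECTIVES[objective]
    best = None
    for _ in range(trials):
        threshold = rng.uniform(-4.0, 6.0)
        s = score(threshold, normal, faulty)
        if best is None or s < best[1]:
            best = (threshold, s)
    return best  # (best threshold found, objective value at that threshold)
```

Calling `optimize("false_alarm_rate")` and `optimize("missed_alarm_rate")` returns a different threshold for each objective, which is the point of making the objective user-selectable.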
Abstract:
We describe a system that performs prognostic-surveillance operations based on an inferential model that dynamically adapts to evolving operational characteristics of a monitored asset. During a surveillance mode, the system receives a set of time-series signals gathered from sensors in the monitored asset. Next, the system uses an inferential model to generate estimated values for the set of time-series signals, and then performs a pairwise differencing operation between actual values and the estimated values for the set of time-series signals to produce residuals. Next, the system performs a sequential probability ratio test (SPRT) on the residuals to produce SPRT alarms. When a tripping frequency of the SPRT alarms exceeds a threshold value, which is indicative of an incipient anomaly in the monitored asset, the system triggers an alert. While the prognostic-surveillance system is operating in the surveillance mode, the system incrementally updates the inferential model based on the time-series signals.
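
The residual, SPRT, and tripping-frequency steps can be sketched for a single signal as follows. This is an illustrative sketch, not the disclosed implementation: it assumes Gaussian residuals and a Wald SPRT for a positive mean shift, and the function names, the `shift` and `sigma` parameters, and the 0.05 alert threshold are all hypothetical. The inferential model and its incremental update are outside the scope of the sketch.

```python
import math

def sprt_alarms(residuals, shift, sigma, alpha=0.01, beta=0.01):
    """Return a 0/1 alarm flag per residual using a Wald SPRT for a positive mean shift."""
    upper = math.log((1 - beta) / alpha)   # crossing this accepts the "degraded" hypothesis
    lower = math.log(beta / (1 - alpha))   # crossing this accepts the "normal" hypothesis
    llr = 0.0
    flags = []
    for r in residuals:
        # Log-likelihood ratio increment for N(shift, sigma^2) versus N(0, sigma^2).
        llr += (shift / sigma ** 2) * (r - shift / 2.0)
        if llr >= upper:
            flags.append(1)   # SPRT alarm; restart the test
            llr = 0.0
        else:
            flags.append(0)
            if llr <= lower:
                llr = 0.0     # normal decision; restart the test
    return flags

def tripping_frequency(flags):
    """Fraction of observations on which the SPRT raised an alarm."""
    return sum(flags) / len(flags) if flags else 0.0

def should_alert(actual, estimated, shift=1.0, sigma=1.0, threshold=0.05):
    """Difference actual and estimated values, run the SPRT, and compare the alarm rate."""
    residuals = [a - e for a, e in zip(actual, estimated)]
    return tripping_frequency(sprt_alarms(residuals, shift, sigma)) > threshold
```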
Abstract:
A method and apparatus query a table in a database where the table includes at least one column declared to be sparse. A binary large object may be used to store the sparse column data. The object includes a column-id and column-value pair for each non-null value. To answer a query with a constraint on a sparse column, the object is searched for one or more column ids to obtain the column values. Rows whose column values match the constraint are returned. In another embodiment, an internal table is used. Each tuple in the internal table has a column id and a value array indexed by an ordinal row number. To answer a query with a constraint on a sparse column, the column value in the internal table is found and matched against the constraint. If the match is successful, the index of the column value in the internal table is returned.
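
Both storage layouts can be sketched in a few lines. This is an in-memory illustration only: Python lists and dictionaries stand in for the binary large object and the internal table, and every function name is hypothetical.

```python
def encode_sparse(row, column_ids):
    """First layout: keep only non-null values as (column_id, value) pairs."""
    return [(column_ids[name], value) for name, value in row.items() if value is not None]

def query_pairs(encoded_rows, column_id, predicate):
    """Search each row's pairs for the column id and return matching row numbers."""
    matches = []
    for row_number, pairs in enumerate(encoded_rows):
        for cid, value in pairs:
            if cid == column_id and predicate(value):
                matches.append(row_number)
                break
    return matches

def build_internal_table(rows, column_ids):
    """Second layout: one entry per column id holding a value array indexed by row number."""
    return {cid: [row.get(name) for row in rows] for name, cid in column_ids.items()}

def query_internal(internal, column_id, predicate):
    """Return the indexes in the value array whose non-null values satisfy the constraint."""
    return [i for i, v in enumerate(internal[column_id]) if v is not None and predicate(v)]
```

In the first layout the cost of a lookup grows with the number of non-null values per row; in the second, a constraint on one column touches only that column's array.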
Abstract:
Techniques are disclosed for using sensor data validated by a machine learning model to generate recommendations for remediating issues in a monitored system. A machine learning model is trained to identify correlations among sensors for a monitored system. Upon receiving current sensor data, the machine learning model identifies a subset of the current sensor data that cannot be validated. The system generates estimated values for the sensor data that cannot be validated based on the learned correlations among the sensor values. The system generates the recommendations for remediating the issues in the monitored system based on validated sensor values and the estimated sensor values.
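
As a rough illustration of validating a reading against a learned correlation and substituting an estimate when validation fails, the sketch below fits a least-squares line between two sensors. It is a deliberately simple stand-in for the machine learning model: the linear relationship, the assumption that the reference sensor is trustworthy, the `tolerance` parameter, and the function names are all hypothetical.

```python
def fit_line(xs, ys):
    """Learn a linear correlation y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def validate_or_estimate(reference, reading, slope, intercept, tolerance):
    """Return (value, validated): keep the reading if it agrees with the learned
    correlation, otherwise replace it with the estimate from the reference sensor."""
    predicted = slope * reference + intercept
    if abs(reading - predicted) <= tolerance:
        return reading, True
    return predicted, False
```

Downstream recommendation logic would then consume the returned value in either case, using the `validated` flag to tell measured values from estimated ones.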
Abstract:
The disclosed embodiments relate to a system for analyzing telemetry data. During operation, the system obtains telemetry data gathered from sensors during operation of a monitored system. Next, the system applies a univariate model to the telemetry data to identify an operational phase for the monitored system, wherein the univariate model analyzes an individual signal in the telemetry data without reference to other signals in the telemetry data. The system then selects a phase-specific multivariate model based on the identified operational phase, wherein the phase-specific multivariate model was previously trained based on telemetry data gathered while the system was operating in the identified operational phase. Finally, the system uses the phase-specific multivariate model to monitor the telemetry data to detect incipient anomalies associated with the operation of the monitored system.
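
The two-stage flow, in which one signal identifies the phase and the phase selects the model, can be sketched as below. The phase names, the `idle_max` cutoff, and the `ToleranceModel` class are hypothetical. In particular, `ToleranceModel` is only a placeholder for a trained phase-specific multivariate model: it checks each signal against a per-phase expected value and does not capture correlations between signals.

```python
def identify_phase(load_signal, idle_max=10.0):
    """Univariate step: classify the phase from a single signal, using its mean."""
    mean = sum(load_signal) / len(load_signal)
    return "idle" if mean <= idle_max else "active"

class ToleranceModel:
    """Placeholder for a phase-specific model trained on data from one phase."""

    def __init__(self, expected, tolerances):
        self.expected = expected
        self.tolerances = tolerances

    def anomalies(self, observation):
        """Return the names of signals that deviate beyond their tolerance."""
        return [
            name
            for name, value in observation.items()
            if abs(value - self.expected[name]) > self.tolerances[name]
        ]

def monitor(load_signal, observation, models):
    """Identify the phase, select that phase's model, and check the observation."""
    phase = identify_phase(load_signal)
    return phase, models[phase].anomalies(observation)
```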
Abstract:
The disclosed embodiments relate to a system that certifies provenance of time-series data in a time-series database. During operation, the system retrieves time-series data from the time-series database, wherein the time-series data comprises a sequence of observations comprising sensor readings for each signal in a set of signals. The system also retrieves multivariate state estimation technique (MSET) estimates, which were computed for the time-series data, from the time-series database. Next, the system performs a reverse MSET computation to produce reconstituted time-series data from the MSET estimates. The system then compares the reconstituted time-series data with the time-series data. If the reconstituted time-series data matches the original time-series data, the system certifies provenance for the time-series data.
Abstract:
The disclosed embodiments provide a system that proactively resilvers a disk array when a disk drive in the array is determined to have an elevated risk of failure. The system receives time-series signals associated with the disk array during operation of the disk array. Next, the system analyzes the time-series signals to identify at-risk disk drives that have an elevated risk of failure. If one or more disk drives are identified as being at-risk, the system performs a proactive resilvering operation on the disk array using a background process while the disk array continues to operate using the at-risk disk drives.
Abstract:
Techniques related to an in-memory key-value store for a multi-model database are disclosed. In an embodiment, a relational database may be maintained on persistent storage. The relational database may be managed by a database server and may include a database table. The database table may be stored in a persistent format. Key-value records may be generated within volatile memory accessible to the database server by converting data in the database table to a key-value format. The key-value format may be different from and independent of the persistent format. A database statement referencing the database table may be executed based on determining whether to access one or more key-value records in the volatile memory or to access the data in the database table. In response to determining to access the one or more key-value records, the database server may access the one or more key-value records in the volatile memory.
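
The routing decision between the in-memory key-value records and the persistent table can be illustrated with a small sketch. A list of dictionaries stands in for the database table, and the rule used here (serve equality lookups on the key column from memory, scan the table otherwise) is an assumed policy for illustration; the abstract does not specify how the determination is made. The class and function names are hypothetical.

```python
class KeyValueCache:
    """Key-value records built in memory by converting rows of the table."""

    def __init__(self, table_rows, key_column):
        self.key_column = key_column
        self.records = {row[key_column]: dict(row) for row in table_rows}

def execute_lookup(table_rows, cache, column, value):
    """Answer `column = value`, choosing between the key-value records and the table.

    Returns (path, rows), where path is "kv" or "table" to show which was used.
    """
    if cache is not None and column == cache.key_column:
        record = cache.records.get(value)
        return "kv", [record] if record is not None else []
    return "table", [row for row in table_rows if row[column] == value]
```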