Abstract:
Systems and methods for automated remediation of issues arising in a data management storage system are provided. Deployed assets of a storage solution vendor may deliver telemetry data to the vendor on a regular basis. The received telemetry data may be processed by an AIOps platform to perform predictive analytics and arrive at “community wisdom” from the vendor's installed user base. In one embodiment, an insight-based approach is used to facilitate risk detection and remediation including proactively addressing issues before they turn into more serious problems. For example, based on continuous learning based on the community wisdom and making one or both of a rule set and a remediation set derived therefrom available for use by cognitive computing co-located with a customer's storage system, a risk to which the storage system is exposed may be determined and a corresponding remediation may be deployed to address or mitigate the risk.
Abstract:
Methods and systems for a networked storage system are provided. One method includes receiving a resource identifier identifying a resource of a network storage environment as an input to a processor executable application programming interface (API); and predicting available performance capacity of the resource by using an optimum utilization of the resource, a current utilization and a predicted utilization based on impact of a workload change at the resource, where the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency in processing the workload.
Abstract:
Methods and systems for a networked storage system are provided. One method includes receiving a resource identifier identifying a resource of a network storage environment as an input to a processor executable application programming interface (API); and predicting available performance capacity of the resource by using an optimum utilization of the resource, a current utilization and a predicted utilization based on impact of a workload change at the resource, where the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency in processing the workload.
Abstract:
Methods and systems for a networked storage system are provided. One method includes assigning by a processor executable management module a service level objective (SLO) for a workload, where the SLO is allotted a plurality of performance parameters for tracking performance of the workload for storing data in a networked storage environment; tracking historical performance of the workload to determine a duration when SLO allotment defined by the plurality of performance parameters is being under-utilized; adjusting automatically the SLO allotment for the workload during the duration when the SLO allotment is under-utilized; and re-allocating automatically the available performance capacity of a resource used by the workload to another workload whose assigned SLO is not being under-utilized.
Abstract:
Systems and methods for automated remediation of issues arising in a data management storage system are provided. Deployed assets of a storage solution vendor may deliver telemetry data to the vendor on a regular basis. The received telemetry data may be processed by an AIOps platform to perform predictive analytics and arrive at “community wisdom” from the vendor's installed user base. In one embodiment, an insight-based approach is used to facilitate risk detection and remediation including proactively addressing issues before they turn into more serious problems. For example, based on continuous learning based on the community wisdom and making one or both of a rule set and a remediation set derived therefrom available for use by cognitive computing co-located with a customer's storage system, a risk to which the storage system is exposed may be determined and a corresponding remediation may be deployed to address or mitigate the risk.
Abstract:
Methods and systems for a networked storage system are provided. One method includes receiving a resource identifier identifying a resource of a network storage environment as an input to a processor executable application programming interface (API); and predicting available performance capacity of the resource by using an optimum utilization of the resource, a current utilization and a predicted utilization based on impact of a workload change at the resource, where the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency in processing the workload.
Abstract:
Methods and systems for a networked storage system are provided. One method includes determining by a processor, a demand pattern for a first workload that is assigned a service level objective (SLO) for using a resource of a networked storage system. The SLO is defined by an allotted performance parameter, and the demand pattern identifies a first duration when a SLO allotment for the first workload is underutilized, and a second duration when the SLO allotment is being utilized. The SLO allotment is dynamically adjusted for the first duration by modifying a parameter associated with the performance parameter, while maintaining the SLO allotment for the second duration. This makes additional performance capacity of the resource available for re-allocation. The additional available performance capacity is dynamically allocated for an identified second workload that needs an increase in SLO allotment for a certain duration and/or for provisioning a new workload.
Abstract:
Intelligent self-adjusting metric collection is described. A first rule set is distributed that describes a first set of one or more metrics corresponding to operation of elements of the receiving entities. One or more metrics based on the first rule set are received. A second rule set is generated in response to an indication of a condition change. The second rule set can be generated using machine learning techniques. The second rule set that describes a second set of one or more metrics is distributed. Metrics based on the second rule set are received.
Abstract:
Systems and methods for automated remediation of deviations from best practices in the context of a data management storage system are provided. Deployed assets of a storage solution vendor may periodically deliver telemetry data to the vendor. The telemetry data may be processed by an AIOps platform to perform predictive analytics and arrive at “community wisdom” from the vendor's installed base. In one embodiment, an insight-based approach is used to facilitate risk detection and remediation including proactively addressing deviations from best practices before they turn into more serious problems. Based on the community wisdom and making a rule set and a remediation set derived therefrom available for use by auto-healing service associated with a customer's storage system, a risk (e.g., a deviation from a best practice) to which the storage system is exposed may be determined and a corresponding remediation may be deployed to address or mitigate the risk.
Abstract:
Methods and systems for a networked storage system are provided. One method includes determining by a processor, a demand pattern for a first workload that is assigned a service level objective (SLO) for using a resource of a networked storage system. The SLO is defined by an allotted performance parameter, and the demand pattern identifies a first duration when a SLO allotment for the first workload is underutilized, and a second duration when the SLO allotment is being utilized. The SLO allotment is dynamically adjusted for the first duration by modifying a parameter associated with the performance parameter, while maintaining the SLO allotment for the second duration. This makes additional performance capacity of the resource available for re-allocation. The additional available performance capacity is dynamically allocated for an identified second workload that needs an increase in SLO allotment for a certain duration and/or for provisioning a new workload.