Abstract:
Systems, methods, and machine-readable media for monitoring a storage system and correcting demand imbalances among nodes in a cluster are disclosed. A performance manager for the storage system may detect performance imbalances that occur over a period of time. When operating below an optimal performance capacity, the manager may cause a volume to be moved from a node with a high load to a node with a lower load to achieve a preventive result. When operating at or near optimal performance capacity, the manager may cause a QOS limit to be imposed to prevent the workload from exceeding the performance capacity, to achieve a proactive result. When operating abnormally, the manager may cause a QOS limit to be imposed to throttle the workload to bring the node back within the optimal performance capacity of the node, to achieve a reactive result. These actions may be performed independently, or in cooperation.
Abstract:
Methods and systems for a network storage environment are provided. One method includes retrieving stored performance data associated with a resource used in the networked storage environment, where the stored performance data includes latency data, utilization data and a service time; filtering the stored performance data by removing any observations that are beyond a certain value of the service time; grouping the filtered stored performance into utilization bins and identifying a representative of each utilization bin; generating by the processor a hybrid latency versus utilization curve comprising a first portion that is based on the representative of each utilization bin and a second portion generated using a model based technique and determining by the processor available performance capacity of the resource using the hybrid latency versus utilization curve; where the available performance capacity is based on optimum utilization of the resource and actual utilization of the resource.
Abstract:
Methods and systems for a network storage environment are provided. One method includes retrieving stored performance data associated with a resource used in the networked storage environment, where the stored performance data includes latency data, utilization data and a service time; filtering the stored performance data by removing any observations that are beyond a certain value of the service time; grouping the filtered stored performance into utilization bins and identifying a representative of each utilization bin; generating by the processor a hybrid latency versus utilization curve comprising a first portion that is based on the representative of each utilization bin and a second portion generated using a model based technique and determining by the processor available performance capacity of the resource using the hybrid latency versus utilization curve; where the available performance capacity is based on optimum utilization of the resource and actual utilization of the resource.
Abstract:
Methods and systems for a networked storage system are provided. One method includes receiving a resource identifier identifying a resource of a network storage environment as an input to a processor executable application programming interface (API); and predicting available performance capacity of the resource by using an optimum utilization of the resource, a current utilization and a predicted utilization based on impact of a workload change at the resource, where the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency in processing the workload.
Abstract:
Methods and systems for a networked storage system are provided. One method includes filtering performance data associated with a resource used in a networked storage environment for reading and writing data at a storage device; and determining available performance capacity of the resource using the filtered performance data. The available performance capacity is based on optimum utilization of the resource and actual utilization of the resource, where utilization of the resource is an indicator of an extent the resource is being used at any given time, the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency and latency is an indicator of delay at the resource in processing the workload.
Abstract:
Methods and systems for a networked storage system are provided. One method includes receiving a resource identifier identifying a resource of a network storage environment as an input to a processor executable application programming interface (API); and predicting available performance capacity of the resource by using an optimum utilization of the resource, a current utilization and a predicted utilization based on impact of a workload change at the resource, where the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency in processing the workload.
Abstract:
Systems, methods, and machine-readable media for monitoring a storage system and assigning performance service levels to workloads running on nodes within a cluster are disclosed. A performance manager may estimate the performance demands of each workload within the cluster and assign a performance service level to each workload according to the performance requirements of the workload, and further taking into account an overall budgeting framework. The estimates are performed using historical performance data for each workload. A performance service level may include a service level object, a service level agreement, and latency parameters. These parameters may provide a ceiling to the number of operations per second that a workload may use without guaranteeing the use of the operations per second, a guaranteed number of operations per second that a workload may use before being throttled, and define the permitted delay in completing a request to the workload.
Abstract:
Methods and systems for a networked storage system are provided. One method includes determining by a processor, a demand pattern for a first workload that is assigned a service level objective (SLO) for using a resource of a networked storage system. The SLO is defined by an allotted performance parameter, and the demand pattern identifies a first duration when a SLO allotment for the first workload is underutilized, and a second duration when the SLO allotment is being utilized. The SLO allotment is dynamically adjusted for the first duration by modifying a parameter associated with the performance parameter, while maintaining the SLO allotment for the second duration. This makes additional performance capacity of the resource available for re-allocation. The additional available performance capacity is dynamically allocated for an identified second workload that needs an increase in SLO allotment for a certain duration and/or for provisioning a new workload.
Abstract:
Various embodiments are generally directed to techniques for determining whether one node of a HA group is able to take over for another. An apparatus includes a model derivation component to derive a model correlating node usage level to node data propagation latency through and to node resource utilization from a first model of a first node of a storage cluster system and a second model of a second node of the storage cluster system, the first model based on a first usage level of the first node under a first usage type, and the second model based on a second usage level of the second node under a second usage type; and an analysis component to determine whether the first node is able to take over for the second node based on applying to the derived model a total usage level derived from the first and second usage levels.
Abstract:
Methods and systems for a networked storage system are provided. One method includes filtering performance data associated with a resource used in a networked storage environment for reading and writing data at a storage device; and determining available performance capacity of the resource using the filtered performance data. The available performance capacity is based on optimum utilization of the resource and actual utilization of the resource, where utilization of the resource is an indicator of an extent the resource is being used at any given time, the optimum utilization is an indicator of resource utilization beyond which throughput gains for a workload is smaller than increase in latency and latency is an indicator of delay at the resource in processing the workload.