Workload Analysis For Long-Term Management Via Performance Service Levels

    公开(公告)号:US20220342556A1

    公开(公告)日:2022-10-27

    申请号:US17241887

    申请日:2021-04-27

    Applicant: NetApp, Inc.

    Abstract: Systems, methods, and machine-readable media for monitoring a storage system and assigning performance service levels to workloads running on nodes within a cluster are disclosed. A performance manager may estimate the performance demands of each workload within the cluster and assign a performance service level to each workload according to the performance requirements of the workload, and further taking into account an overall budgeting framework. The estimates are performed using historical performance data for each workload. A performance service level may include a service level object, a service level agreement, and latency parameters. These parameters may provide a ceiling to the number of operations per second that a workload may use without guaranteeing the use of the operations per second, a guaranteed number of operations per second that a workload may use before being throttled, and define the permitted delay in completing a request to the workload.

    Method and system for monitoring and analyzing quality of service in a storage system
    2.
    发明授权
    Method and system for monitoring and analyzing quality of service in a storage system 有权
    存储系统中服务质量的监控和分析方法和系统

    公开(公告)号:US09411834B2

    公开(公告)日:2016-08-09

    申请号:US14154992

    申请日:2014-01-14

    Applicant: NETAPP, INC.

    Abstract: Methods and systems for identifying a victim storage volume from among a plurality of storage volumes based on a comparison of current Quality of Service (QOS) data with a dynamic threshold value that is based on historical QOS collected data for the plurality of storage volumes are provided. A performance manager collects the current and historical QOS data from a storage operating system of the storage system, which includes a response time in which each of the plurality of storage volumes respond to an input/output (I/O) request. The current and historical QOS data for the resources used by the victim storage volume are retrieved and compared with the current QOS data of each resource to an expected range based on the historical QOS data. Another storage volume is identified as a bully when its usage of a resource in contention contributes to creating the victim storage volume.

    Abstract translation: 提供了基于当前服务质量(QOS)数据与基于多个存储卷的历史QOS收集数据的动态阈值的比较来从多个存储卷中识别受害者存储卷的方法和系统 。 性能管理器从存储系统的存储操作系统收集当前和历史的QOS数据,其包括多个存储卷中的每一个对输入/输出(I / O)请求做出响应的响应时间。 根据历史QOS数据,检索受害者存储卷使用的资源的当前和历史QOS数据,并将其与每个资源的当前QOS数据进行比较,达到预期范围。 当资源在争用中的使用有助于创建受害者存储卷时,另一个存储卷被识别为欺凌。

    Systems and Methods for Resource Lifecycle Management

    公开(公告)号:US20220171663A1

    公开(公告)日:2022-06-02

    申请号:US17107361

    申请日:2020-11-30

    Applicant: NetApp, Inc.

    Abstract: Systems, methods, and machine-readable media for monitoring a storage system and correcting demand imbalances among nodes in a cluster are disclosed. A performance manager for the storage system may detect performance imbalances that occur over a period of time. When operating below an optimal performance capacity, the manager may cause a volume to be moved from a node with a high load to a node with a lower load to achieve a preventive result. When operating at or near optimal performance capacity, the manager may cause a QOS limit to be imposed to prevent the workload from exceeding the performance capacity, to achieve a proactive result. When operating abnormally, the manager may cause a QOS limit to be imposed to throttle the workload to bring the node back within the optimal performance capacity of the node, to achieve a reactive result. These actions may be performed independently, or in cooperation.

    METHOD AND SYSTEM FOR MONITORING AND ANALYZING QUALITY OF SERVICE IN A STORAGE SYSTEM
    4.
    发明申请
    METHOD AND SYSTEM FOR MONITORING AND ANALYZING QUALITY OF SERVICE IN A STORAGE SYSTEM 有权
    用于监控和分析存储系统中服务质量的方法和系统

    公开(公告)号:US20150199139A1

    公开(公告)日:2015-07-16

    申请号:US14535587

    申请日:2014-11-07

    Applicant: NETAPP, INC.

    Abstract: Methods and systems for monitoring quality of service (QOS) data for a plurality of storage volumes are provided. QOS data is collected for the plurality of storage volumes and includes a response time in which each of the plurality of storage volumes respond to an input/output (I/O) request. The process determines an average of N collected QOS data points at any given time; and iteratively analyzes each QOS data point to detect if a step-up or a step-down function has occurred, where a step-up function represents an unpredictable increase in value of a data point and a step-down function is an unpredictable decrease in value of the data point. A subset of the N QOS data points based on when the step-up function or step-down function occurs is selected for analysis and an expected range for future QOS data based on the subset of the N QOS data points is generated.

    Abstract translation: 提供了用于监视多个存储卷的服务质量(QOS)数据的方法和系统。 针对多个存储卷收集QOS数据,并且包括其中多个存储卷中的每一个对输入/输出(I / O)请求做出响应的响应时间。 该过程确定在任何给定时间N个收集的QOS数据点的平均值; 并且迭代地分析每个QOS数据点以检测是否发生升压或降压功能,其中升压功能表示数据点的值的不可预测的增加,并且降压功能是不可预测的降低 数据点的值。 选择基于升压功能或降压功能何时发生的NQOS数据点的子集用于分析,并且生成基于N个QOS数据点的子集的未来QOS数据的期望范围。

    Method and system for monitoring and analyzing quality of service in a storage system
    5.
    发明授权
    Method and system for monitoring and analyzing quality of service in a storage system 有权
    存储系统中服务质量的监控和分析方法和系统

    公开(公告)号:US09542103B2

    公开(公告)日:2017-01-10

    申请号:US14535587

    申请日:2014-11-07

    Applicant: NETAPP, INC.

    Abstract: Methods and systems for monitoring quality of service (QOS) data for a plurality of storage volumes are provided. QOS data is collected for the plurality of storage volumes and includes a response time in which each of the plurality of storage volumes respond to an input/output (I/O) request. The process determines an average of N collected QOS data points at any given time; and iteratively analyzes each QOS data point to detect if a step-up or a step-down function has occurred, where a step-up function represents an unpredictable increase in value of a data point and a step-down function is an unpredictable decrease in value of the data point. A subset of the N QOS data points based on when the step-up function or step-down function occurs is selected for analysis and an expected range for future QOS data based on the subset of the N QOS data points is generated.

    Abstract translation: 提供了用于监视多个存储卷的服务质量(QOS)数据的方法和系统。 针对多个存储卷收集QOS数据,并且包括其中多个存储卷中的每一个对输入/输出(I / O)请求做出响应的响应时间。 该过程确定在任何给定时间N个收集的QOS数据点的平均值; 并且迭代地分析每个QOS数据点以检测是否发生升压或降压功能,其中升压功能表示数据点的值的不可预测的增加,并且降压功能是不可预测的降低 数据点的值。 选择基于升压功能或降压功能何时发生的NQOS数据点的子集用于分析,并且生成基于N个QOS数据点的子集的未来QOS数据的期望范围。

    METHOD AND SYSTEM FOR MONITORING AND ANALYZING QUALITY OF SERVICE IN A METRO-CLUSTER
    6.
    发明申请
    METHOD AND SYSTEM FOR MONITORING AND ANALYZING QUALITY OF SERVICE IN A METRO-CLUSTER 有权
    用于监测和分析麦克风中服务质量的方法和系统

    公开(公告)号:US20150199141A1

    公开(公告)日:2015-07-16

    申请号:US14531246

    申请日:2014-11-03

    Applicant: NETAPP, INC.

    CPC classification number: G06F3/061 G06F3/0617 G06F3/0653 G06F3/067

    Abstract: Methods and systems for inter-cluster storage system monitoring and analysis are provided. The method includes monitoring a non-volatile memory delay center for a first storage cluster having a first node and a second node configured to operate as a first high availability pair, where data for a write request to write data to the first node is also written to the second node as well as to a second cluster having a third node and a fourth node, where the third node and the fourth node are also configured to operate as a second high availability pair to store the data for the write request at one or both of the third and fourth node. The non-volatile memory delay center is used to monitor and detect latency due to any delay caused by a non-volatile memory of the first node used as a write cache.

    Abstract translation: 提供了集群间存储系统监控和分析的方法和系统。 该方法包括监视具有第一节点和第二节点的第一存储集群的非易失性存储器延迟中心,第一节点和第二节点被配置为作为第一高可用性对进行操作,其中写入数据到第一节点的写入请求也被写入 到第二节点以及具有第三节点和第四节点的第二集群,其中第三节点和第四节点也被配置为作为第二高可用性对来操作,以将写入请求的数据存储在一个或多个节点 第三和第四节点。 非易失性存储器延迟中心用于监视和检测由于用作写缓存的第一节点的非易失性存储器引起的任何延迟。

    Workload analysis for long-term management via performance service levels

    公开(公告)号:US12135877B2

    公开(公告)日:2024-11-05

    申请号:US17241887

    申请日:2021-04-27

    Applicant: NetApp, Inc.

    Abstract: Systems, methods, and machine-readable media for monitoring a storage system and assigning performance service levels to workloads running on nodes within a cluster are disclosed. A performance manager may estimate the performance demands of each workload within the cluster and assign a performance service level to each workload according to the performance requirements of the workload, and further taking into account an overall budgeting framework. The estimates are performed using historical performance data for each workload. A performance service level may include a service level object, a service level agreement, and latency parameters. These parameters may provide a ceiling to the number of operations per second that a workload may use without guaranteeing the use of the operations per second, a guaranteed number of operations per second that a workload may use before being throttled, and define the permitted delay in completing a request to the workload.

    Balance workloads on nodes based on estimated optimal performance capacity

    公开(公告)号:US12050938B2

    公开(公告)日:2024-07-30

    申请号:US17107361

    申请日:2020-11-30

    Applicant: NetApp, Inc.

    Abstract: Systems, methods, and machine-readable media for monitoring a storage system and correcting demand imbalances among nodes in a cluster are disclosed. A performance manager for the storage system may detect performance imbalances that occur over a period of time. When operating below an optimal performance capacity, the manager may cause a volume to be moved from a node with a high load to a node with a lower load to achieve a preventive result. When operating at or near optimal performance capacity, the manager may cause a QOS limit to be imposed to prevent the workload from exceeding the performance capacity, to achieve a proactive result. When operating abnormally, the manager may cause a QOS limit to be imposed to throttle the workload to bring the node back within the optimal performance capacity of the node, to achieve a reactive result. These actions may be performed independently, or in cooperation.

    Method and system for monitoring and analyzing quality of service in a storage system
    9.
    发明授权
    Method and system for monitoring and analyzing quality of service in a storage system 有权
    存储系统中服务质量的监控和分析方法和系统

    公开(公告)号:US09542346B2

    公开(公告)日:2017-01-10

    申请号:US14154941

    申请日:2014-01-14

    Applicant: NETAPP, INC.

    Abstract: Methods and systems for monitoring quality of service (QOS) data for a plurality of storage volumes from a storage operating system of a storage system are provided. A performance manager collects the QOS data from the storage operating system and the QOS data includes a response time in which each of the plurality of storage volumes respond to an input/output (I/O) request. An expected range for future QOS data is generated based on the collected QOS data. The QOS data is monitored for each storage volume for determining whether a current QOS data for each storage volume is within the expected range.

    Abstract translation: 提供了用于从存储系统的存储操作系统监测多个存储卷的服务质量(QOS)数据的方法和系统。 性能管理器从存储操作系统收集QOS数据,并且QOS数据包括多个存储卷中的每一个对输入/输出(I / O)请求做出响应的响应时间。 基于收集的QOS数据生成未来QOS数据的预期范围。 监视每个存储卷的QOS数据,以确定每个存储卷的当前QOS数据是否在预期范围内。

    Systems And Methods For Resource Lifecyle Management

    公开(公告)号:US20250028574A1

    公开(公告)日:2025-01-23

    申请号:US18787306

    申请日:2024-07-29

    Applicant: NetApp, Inc.

    Abstract: Systems, methods, and machine-readable media for monitoring a storage system and correcting demand imbalances among nodes in a cluster are disclosed. A performance manager for the storage system may detect performance imbalances that occur over a period of time. When operating below an optimal performance capacity, the manager may cause a volume to be moved from a node with a high load to a node with a lower load to achieve a preventive result. When operating at or near optimal performance capacity, the manager may cause a QOS limit to be imposed to prevent the workload from exceeding the performance capacity, to achieve a proactive result. When operating abnormally, the manager may cause a QOS limit to be imposed to throttle the workload to bring the node back within the optimal performance capacity of the node, to achieve a reactive result. These actions may be performed independently, or in cooperation.

Patent Agency Ranking