RESILIENT IMPLEMENTATION OF CLIENT FILE OPERATIONS AND REPLICATION

    Publication number: US20170220258A1

    Publication date: 2017-08-03

    Application number: US15008724

    Application date: 2016-01-28

    Applicant: NetApp Inc.

    Abstract: One or more techniques and/or computing devices are provided for resilient replication of storage operations. For example, a first storage controller may host first storage having a replication relationship with second storage hosted by a second storage controller. To improve resiliency against transient network issues of a network between the storage controllers, the first storage controller may implement a queue and retry mechanism to retry replication operations not acknowledged by the second storage controller within a threshold time. The second storage controller may maintain a cumulative sequence number of a latest replication operation performed in order, an operation response map of replication operations performed out of order, and an operation finder map identifying currently implemented replication operations, which may be used to process incoming replication operations. Single write semantics, write order consistency, and reduction of write amplification may be provided.
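
    Illustrative sketch: the abstract names three pieces of receiver-side state but does not say how they interact, so the following Python is only one plausible reading. The class name, method names, and return strings are all hypothetical, and a plain set stands in for the operation finder map. It shows how a retried operation could be recognized as already done or still in progress rather than executed twice.

```python
from dataclasses import dataclass, field


@dataclass
class ReplicationReceiver:
    """Hypothetical state kept by the second storage controller."""
    cumulative_seq: int = 0                            # latest operation completed in order
    response_map: dict = field(default_factory=dict)   # seq -> result for out-of-order completions
    in_progress: set = field(default_factory=set)      # stand-in for the operation finder map

    def begin(self, seq: int) -> str:
        # A retry of a completed operation is answered without re-executing it.
        if seq <= self.cumulative_seq or seq in self.response_map:
            return "already-done"
        # A retry of an operation that is still executing is not started again.
        if seq in self.in_progress:
            return "in-progress"
        self.in_progress.add(seq)
        return "started"

    def complete(self, seq: int) -> None:
        self.in_progress.remove(seq)
        self.response_map[seq] = "ok"
        # Fold any contiguous run of completed operations into the
        # cumulative sequence number so the map stays small.
        while self.cumulative_seq + 1 in self.response_map:
            self.cumulative_seq += 1
            del self.response_map[self.cumulative_seq]
```

    In this sketch the sender's queue and retry timer are not modeled; a sender would simply call begin again with the same sequence number after its threshold time elapses.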

    Systems and Methods to Maintain Consistent High Availability and Performance in Storage Area Networks

    Publication number: US20170220249A1

    Publication date: 2017-08-03

    Application number: US15011050

    Application date: 2016-01-29

    Applicant: NetApp, Inc.

    Abstract: Embodiments of the present disclosure enable high availability and performance in view of storage controller failure. A storage system includes three or more controllers that may be distributed in a plurality of enclosures. The controllers are in high availability pairs on a per volume basis, with volumes and corresponding mirror targets distributed throughout the storage system. When a controller fails, other controllers in the system detect the failure and assess whether one or more volumes and/or mirror targets are affected. If no volumes/mirror targets are affected, then write-back caching continues. If volume ownership is affected, then a new volume owner is selected so that write-back caching may continue. If mirror target ownership is affected, then a new mirror target is selected so that write-back caching may continue. As a result, write-back caching availability is increased to provide low latency and high throughput in degraded mode as in other modes.
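
    Illustrative sketch: the three cases in the abstract (nothing affected, volume ownership affected, mirror target ownership affected) can be expressed as a per-volume check. The function name, the dictionary layout, and the selection policy (take the first surviving controller that does not already hold the volume's other role) are assumptions made for this example; the abstract does not state how a replacement is chosen.

```python
def handle_controller_failure(failed, survivors, volumes):
    """Reassign owner/mirror roles held by a failed controller.

    volumes maps a volume name to {"owner": controller, "mirror": controller}.
    Returns a list of (volume, role, new_controller) reassignments.
    """
    actions = []
    for name, roles in volumes.items():
        for role, other in (("owner", "mirror"), ("mirror", "owner")):
            if roles[role] != failed:
                continue  # this role is unaffected; write-back caching continues
            # Assumed policy: keep owner and mirror on different controllers.
            candidates = [c for c in survivors if c != roles[other]]
            if not candidates:
                raise RuntimeError(f"no controller available for {name} {role}")
            roles[role] = candidates[0]
            actions.append((name, role, candidates[0]))
    return actions
```

    An empty result corresponds to the case where no volume or mirror target is affected.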

    Removing overlapping ranges from a flat sorted data structure

    Publication number: US09720928B2

    Publication date: 2017-08-01

    Application number: US14518403

    Application date: 2014-10-20

    Applicant: NetApp, Inc.

    Abstract: A system efficiently removes ranges of entries that represent stale fingerprints from a flat sorted data structure. As part of fingerprint verification during deduplication, the system performs an attributes intersect range calculation (AIRC) procedure on the stale fingerprint data structure to compute a set of non-overlapping and latest consistency point (CP) ranges. During the AIRC procedure, an inode associated with a data container is selected and the FBN tuple of each deleted data block in the file is sorted in a predefined FBN order. The AIRC procedure then identifies the most recent fingerprint associated with a deleted data block. The set of non-overlapping and latest CP ranges is then used to remove stale fingerprints associated with that deleted block from the fingerprint database. A single pass through the fingerprint database identifies the set of non-overlapping and latest CP ranges, thereby improving efficiency of the storage system.
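
    Illustrative sketch: this is not the AIRC procedure itself, whose details the abstract does not give. It only shows one generic way to turn overlapping ranges into non-overlapping ranges that each carry the latest CP value covering them. The tuple layout (start, end, cp) with half-open block ranges and the function name are assumptions for this example.

```python
def latest_cp_ranges(ranges):
    """Collapse overlapping (start, end, cp) ranges into non-overlapping ones.

    Ranges are half-open: start is included, end is excluded. Each output
    range carries the highest cp of any input range that covers it.
    """
    # Every range boundary is a point where the covering set can change.
    points = sorted({p for start, end, _ in ranges for p in (start, end)})
    result = []
    for lo, hi in zip(points, points[1:]):
        covering = [cp for start, end, cp in ranges if start <= lo and hi <= end]
        if not covering:
            continue  # gap between input ranges
        cp = max(covering)
        # Merge with the previous segment when it is adjacent and has the same cp.
        if result and result[-1][1] == lo and result[-1][2] == cp:
            result[-1] = (result[-1][0], hi, cp)
        else:
            result.append((lo, hi, cp))
    return result
```

    This version rescans the input for every segment, which keeps it short but quadratic; a single-pass sweep over sorted input would be needed to match the efficiency the abstract describes.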

    Scalable, distributed, fault-tolerant test framework

    Publication number: US09720818B2

    Publication date: 2017-08-01

    Application number: US14844795

    Application date: 2015-09-03

    Applicant: NetApp, Inc.

    CPC classification number: G06F11/3688

    Abstract: A testing framework takes common functionality normally imported by the testing scripts on the client device and instead splits that functionality into standalone, fault-tolerant, scalable services. The scripts use the functionality through APIs, so test drivers that execute a test, build a test environment, or run other testing processes can access the services through an API. Each testing client and test driver therefore does not need to import the functionality and run it in the memory of the client device separately. Rather, multiple tests can use these functionalities, allowing the testing services to be scaled between tests.

    Techniques for performing resynchronization on a clustered system

    Publication number: US09720752B2

    Publication date: 2017-08-01

    Application number: US14518422

    Application date: 2014-10-20

    Applicant: NETAPP, INC.

    Abstract: Various embodiments are generally directed to an apparatus and method for receiving information to write on a clustered system comprising at least a first cluster and a second cluster, determining that a failure event has occurred on the clustered system creating unsynchronized information, the unsynchronized information comprising at least one of inflight information and dirty region information, and performing a resynchronization operation to synchronize the unsynchronized information on the first cluster and the second cluster based on log information in at least one of an inflight tracker log for the inflight information and a dirty region log for the dirty region information.
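
    Illustrative sketch: a log-driven resynchronization can be pictured as copying only the regions named in the two logs instead of comparing the clusters in full. The dictionary model of a cluster, the function name, and the choice of the first cluster as the source of truth are assumptions for this example and are not stated in the abstract.

```python
def resynchronize(first, second, dirty_regions, inflight_regions):
    """Bring `second` in line with `first` for the logged regions only.

    first and second map a region id to its data. The two log arguments are
    collections of region ids taken from the dirty region log and the
    inflight tracker log. Returns the sorted list of regions touched.
    """
    to_copy = sorted(set(dirty_regions) | set(inflight_regions))
    for region in to_copy:
        if region in first:
            second[region] = first[region]
        else:
            # The region no longer exists on the source, so drop it.
            second.pop(region, None)
    return to_copy
```

    Regions that appear in neither log are left untouched, which is what keeps the work proportional to the amount of unsynchronized information.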

    Cluster configuration information replication

    Publication number: US09720626B2

    Publication date: 2017-08-01

    Application number: US14491879

    Application date: 2014-09-19

    Applicant: NetApp Inc.

    CPC classification number: G06F3/067 G06F3/0617 G06F3/0629

    Abstract: One or more techniques and/or systems are provided for cluster configuration information replication, managing cluster-wide service agents, and/or for cluster-wide outage detection. In an example of cluster configuration information replication, a replication workflow corresponding to a storage operation implemented for a storage object (e.g., renaming of a volume) of a first cluster may be transferred to a second storage cluster for selective implementation. In an example of managing cluster-wide service agents, cluster-wide service agents are deployed to nodes of a cluster storage environment, where a master agent actively processes cluster service calls and standby agents passively wait for reassignment as a failover master in the event the master agent fails. In an example of cluster-wide outage detection, a cluster-wide outage may be determined for a cluster storage environment based upon a number of inaccessible nodes satisfying a cluster outage detection metric.
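
    Illustrative sketch: two of the examples in the abstract reduce to small decisions. The functions below assume the outage metric is a simple minimum count of inaccessible nodes and that the replacement master is the first standby in sorted node order; both policies, and all names, are hypothetical.

```python
def is_cluster_outage(reachable, min_inaccessible):
    """Return True when enough nodes are inaccessible to declare an outage.

    reachable maps a node name to True (accessible) or False (inaccessible).
    """
    inaccessible = sum(1 for ok in reachable.values() if not ok)
    return inaccessible >= min_inaccessible


def fail_over_master(roles, failed):
    """Mark a node failed and promote a standby if the master was lost.

    roles maps a node name to "master" or "standby". A new dict is returned;
    the input is not modified.
    """
    updated = dict(roles)
    updated[failed] = "failed"
    if "master" not in updated.values():
        for node in sorted(updated):
            if updated[node] == "standby":
                updated[node] = "master"
                break
    return updated
```

    Losing a standby leaves the existing master in place, so only a master failure triggers a promotion.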
