-
公开(公告)号:US11934280B2
公开(公告)日:2024-03-19
申请号:US17680631
申请日:2022-02-25
Applicant: NetApp, Inc.
Inventor: Wei Sun , Anil Paul Thoppil , Anne Maria Vasu
CPC classification number: G06F11/1662 , G06F3/0622 , G06F3/064 , G06F3/0679 , G06F11/1088 , G06F11/3034 , G06F16/27
Abstract: Systems and methods that make use of cluster-level redundancy within a distributed storage management system to address various node-level error scenarios are provided. Rather than using a generalized one-size-fits-all approach to reduce complexity, an approach tailored to the node-level error scenario at issue may be performed to avoid doing more than necessary. According to one embodiment, responsive to identifying a missing branch of a tree implemented by a KV store of a first node of a cluster of a distributed storage management system, a branch resynchronization process may be performed, including, for each block ID in the range of block IDs of the missing branch (i) reading a data block corresponding to the block ID from a second node of the cluster that maintains redundant information relating to the block ID; and (ii) restoring the block ID within the KV store by writing the data block to the first node.
-
公开(公告)号:US11928005B2
公开(公告)日:2024-03-12
申请号:US17947339
申请日:2022-09-19
Applicant: NetApp Inc.
Inventor: Paul Yuedong Mu , Paul Ngan , Manoj Sundararajan
CPC classification number: G06F11/006 , G06F11/00 , G06F11/2097 , G06F11/2038 , G06F11/2048 , G06F2201/80
Abstract: Various embodiments are generally directed an apparatus and method for receiving information to write on a clustered system comprising at least a first cluster and a second cluster, determining that a failure event has occurred on the clustered system creating unsynchronized information, the unsynchronized information comprising at least one of inflight information and dirty region information, and performing a resynchronization operation to synchronize the unsynchronized information on the first cluster and the second cluster based on log information in at least one of an inflight tracker log for the inflight information and a dirty region log for the dirty region information.
-
公开(公告)号:US11921597B2
公开(公告)日:2024-03-05
申请号:US17828900
申请日:2022-05-31
Applicant: NetApp Inc.
Inventor: Atul Ramesh Pandit , Vijay M. Deshmukh , Michael Lee Federwisch , Ling Zheng , Kiyoshi James Komatsu , Rachita Kothiyal
IPC: G06F11/00 , G06F11/20 , H04L67/1095 , H04L67/1097
CPC classification number: G06F11/2069 , G06F11/2071 , G06F11/2092 , H04L67/1095 , H04L67/1097 , G06F2201/805
Abstract: One or more techniques and/or computing devices are provided for cross-platform replication. For example, a replication relationship may be established between a first storage endpoint and a second storage endpoint, where at least one of the storage endpoints, such as the first storage endpoint, lacks or has incompatible functionality to perform and manage replication because the storage endpoints have different storage platforms that store data differently, use different control operations and interfaces, etc. Accordingly, replication destination workflow, replication source workflow, and/or a proxy representing the first storage endpoint may be implemented at the second storage endpoint comprising the replication functionality. In this way, replication, such as snapshot replication, may be implemented between the storage endpoints by the second storage endpoint using the replication destination workflow, the replication source workflow, and/or the proxy that either locally executes tasks or routes tasks to the first storage endpoint such as for data access.
-
公开(公告)号:US11915053B2
公开(公告)日:2024-02-27
申请号:US17206871
申请日:2021-03-19
Applicant: NetApp, Inc.
Inventor: Idan Schwartz , Ohad Muchnik , Jonathan Cohen , Kevin McGrath , Amiram Shachar
CPC classification number: G06F9/5038 , G06F9/4887 , G06F9/505 , G06F9/5072 , G06F2209/503
Abstract: Systems, methods, and machine-readable media for predicting interruptions to the use of spare cloud resources and rebalancing based on those predictions are disclosed. A computing platform collects data for customers over time. The computing platform runs a machine learning algorithm on the historical data to generate a prediction classifier. The prediction classifier relates to a time window for prediction into the future, on the order of minutes or hours. The prediction classifier is run on monitored data from ongoing activity with a cloud provider to generate a risk score. Each risk score may identify an amount of risk that a spare cloud resource related to new resource metrics data will be interrupted within the future time frame corresponding to that prediction classifier. If predicted to be interrupted, the customer may be assisted in rebalancing to other resources. As a result, interruptions can be predicted hours into the future.
-
公开(公告)号:US20240064198A1
公开(公告)日:2024-02-22
申请号:US18194874
申请日:2023-04-03
Applicant: NetApp, Inc.
Inventor: Umeshkumar Vasantha Rajasekaran , Brett Sander , Wenxin Zhou , Glenn Cook , Rajavardhan Mallepally
IPC: H04L67/1029 , G06F16/27
CPC classification number: H04L67/1029 , G06F16/27
Abstract: Redistribution of files in a containerized distributed file system is disclosed. An indication of at least one remote container to which files from the local container are to be transferred is received from a rebalancing engine in the local container. One or more transfer parameters for use in selecting one or more files in the local container to be transferred the at least one remote container are received from the rebalancing engine. The local container is scanned to identify files that satisfy the one or more transfer parameters. An indication of the identified files to a file transfer mechanism is provided. Operation of the scanner is terminate until triggered in response to a query by the engine of the local container or for a predetermined period of time.
-
公开(公告)号:US20240061807A1
公开(公告)日:2024-02-22
申请号:US18194860
申请日:2023-04-03
Applicant: NetApp, Inc.
Inventor: Umeshkumar Vasantha Rajasekaran , Glenn Cook , Rajavardhan Mallepally , Brett Sander , Wenxin Zhou
IPC: G06F16/13 , G06F16/182 , G06F16/16 , G06F16/14 , G06F16/17
CPC classification number: G06F16/134 , G06F16/183 , G06F16/16 , G06F16/148 , G06F16/1734
Abstract: Redistribution of files in a containerized distributed file system is disclosed. Containers each have an engine and a scanner and each of the containers stores files and parameters for characteristics of files stored on the container. A first engine in a first container monitors characteristics of files stored on the first container and, responsive to determining that the parameters for files on the first container exceed one or more predetermined thresholds, communicates with a second engine in a second container to determine a destination container for one or more files from the first container. The second engine in the second container indicates to the first engine in the first container whether the second container is available to receive one or more files from the first container. The first engine triggers file system scanning by the scanner of the first container to identify files to be moved to the second container.
-
737.
公开(公告)号:US11892982B2
公开(公告)日:2024-02-06
申请号:US17505962
申请日:2021-10-20
Applicant: NetApp, Inc.
Inventor: Murali Subramanian , Sohan Shetty , Rakesh Bhargava , Akhil Kaushik
IPC: G06F16/17 , G06F16/23 , G06F3/06 , G06F11/20 , G06F16/178 , G06F16/172
CPC classification number: G06F16/178 , G06F3/065 , G06F3/0614 , G06F16/172 , G06F16/2365 , G06F11/2069 , G06F11/2092
Abstract: Systems and methods for reducing delays between the time at which a need for a resynchronization of data replication between a volume of a local CG and its peer volume of a remote CG is detected and the time at which the resynchronization is triggered (Reseed Time Period) are provided. According to an example, information indicative of the direction of data replication between the volume and the peer volume is maintained within a cache of a node. Responsive to a disruptive operation (e.g., relocation of the volume from an original node to a new node), the Reseed Time Period is lessened by proactively adding a passive cache entry to a cache within the new node at the time the CG relationship is created when the new node represents an HA partner of the original node and prior to the volume coming online when the new node represents a non-HA partner.
-
738.
公开(公告)号:US20240036770A1
公开(公告)日:2024-02-01
申请号:US18487366
申请日:2023-10-16
Applicant: NetApp Inc.
Inventor: Praveen Kumar Hasti , Christopher Alan Busick
IPC: G06F3/06
CPC classification number: G06F3/0659 , G06F3/067 , G06F3/061
Abstract: Techniques are provided for implementing a distributed control plane to facilitate communication between a container orchestration platform and a distributed storage architecture. The distributed storage architecture hosts worker nodes that manage distributed storage that can be made accessible to applications within the container orchestration platform through the distributed control plane. The distributed control plane includes control plane controllers that are each paired with a single worker node of the distributed storage architecture. The distributed control plane is configured to selectively route commands to control plane controllers that are paired with worker nodes that are current owners of objects targeted by the commands. If ownership of an object has changed from one worker node to another worker node, then subsequent commands will be re-routed to a control plane controller paired with the other worker node now owning the object.
-
739.
公开(公告)号:US20240028486A1
公开(公告)日:2024-01-25
申请号:US18478149
申请日:2023-09-29
Applicant: NetApp, Inc.
Inventor: Wei Sun , Anil Paul Thoppil , Anne Maria Vasu
CPC classification number: G06F11/1662 , G06F16/27 , G06F11/1088 , G06F11/3034 , G06F3/0622 , G06F3/064 , G06F3/0679
Abstract: Systems and methods that make use of cluster-level redundancy within a distributed storage management system to address various node-level error scenarios are provided. According to one embodiment, a first node of multiple nodes of distributed storage system represented in a form of a cluster of the multiple of nodes, identifies the potential existence of an error associated with a Redundant Array of Independent Disks (RAID) stripe. A list of block identifiers (IDs) associated with the RAID stripe may then be identified. Rather than performing a traditional RAID recovery/reconstruction approach that is resource intensive in nature and that requires an excessive amount of rebuild time, a more efficient RAID stripe resynchronization process may be performed to restore data associated with the RAID stripe.
-
公开(公告)号:US11880343B2
公开(公告)日:2024-01-23
申请号:US17106246
申请日:2020-11-30
Applicant: NetApp Inc.
Inventor: Bruce Leverett , Michael Kazar , Alan Stuart Driscoll , Peter Frank Corbett , Devang K. Shah , Srishylam Simharajan , Toby Smith
IPC: G06F16/178 , G06F16/11 , G06F16/182 , G06F3/06
CPC classification number: G06F16/178 , G06F16/128 , G06F16/184 , G06F16/1844
Abstract: A method and apparatus for replicating a data container of a source storage server at the logical level in an unordered stream of individual data units are described. In certain embodiments, the replication operations can be performed without regard to the differences in geometry of physical persistent storage of the source and destination storage servers and without regard to the differences in data container format of the source and destination storage servers. An initial replication operation of the source data container is performed by transferring the data objects within the source data container to a replica data container in the destination storage server as an unordered stream of individual data units while preserving data object identifiers of the source data objects. Afterwards, incremental replication operations can be performed to capture modifications to the source data container over time after the initial replication operation.
-
-
-
-
-
-
-
-
-