Distributed file system that provides scalability and resiliency

    公开(公告)号:US12045207B2

    公开(公告)日:2024-07-23

    申请号:US17449758

    申请日:2021-10-01

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/188 G06F9/5077 G06F16/182

    Abstract: A distributed storage management system comprising nodes that form a cluster, a distributed block layer that spans the nodes in the cluster, and file system instances deployed on the nodes. Each file system instance comprises a data management subsystem and a storage management subsystem disaggregated from the data management subsystem. The storage management subsystem comprises a node block store that forms a portion of the distributed block layer and a storage manager that manages a key-value store and virtualized storage supporting the node block store. A file system volume hosted by the data management subsystem maps to a logical block device hosted by the virtualized storage in the storage management subsystem. The key-value store includes, for a data block of the logical block device, a key that comprises a block identifier for the logical block device and a value that comprises the data block.

    Distributed file system that provides scalability and resiliency

    公开(公告)号:US12038886B2

    公开(公告)日:2024-07-16

    申请号:US18359192

    申请日:2023-07-26

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/188 G06F9/5077 G06F16/182

    Abstract: In various examples, data storage is managed using a distributed storage management system that is resilient. Data blocks of a logical block device may be distributed across multiple nodes in a cluster. The logical block device may correspond to a file system volume associated with a file system instance deployed on a selected node within a distributed block layer of a distributed file system. Each data block may have a location in the cluster identified by a block identifier associated with each data block. Each data block may be replicated on at least one other node in the cluster. A metadata object corresponding to a logical block device that maps to the file system volume may be replicated on at least another node in the cluster. Each data block and the metadata object may be hosted on virtualized storage that is protected using redundant array independent disks (RAID).

    Distributed File System with Disaggregated Data Management and Storage Management Layers

    公开(公告)号:US20230393787A1

    公开(公告)日:2023-12-07

    申请号:US18452814

    申请日:2023-08-21

    Applicant: NetApp, Inc.

    Abstract: Systems and methods for managing data storage using a distributed file system are provided. In one example, a file system instance is deployed virtually in a node of a distributed storage system. The file system instance has a dynamic configuration including a set of services corresponding to a cluster management subsystem and a storage management subsystem. The storage management subsystem operates independently of a data management subsystem of the distributed storage system as a result of disaggregation from the data management subsystem. The data management subsystem performs storage and block management functions based on requests received from an application layer. An additional service corresponding to either the data management subsystem or the storage management subsystem is deployed virtually to meet the demand for the additional service in response to determining the presence of a demand for the additional service and availability a set of resources corresponding to the additional service.

    Selectively storing data into allocation areas using streams

    公开(公告)号:US11409448B2

    公开(公告)日:2022-08-09

    申请号:US16940448

    申请日:2020-07-28

    Applicant: NetApp Inc.

    Abstract: Techniques are provided for selectively storing data into allocation areas using streams. A set of allocation areas (e.g., ranges of block numbers such as virtual block numbers) are defined for a storage device. Data having particular characteristics (e.g., user data, metadata, hot data, cold data, randomly accessed data, sequentially accessed data, etc.) will be sent to the storage device for selective storage in corresponding allocation areas. For example, when a file system receives a write stream of hot data, the hot data may be assigned to a stream. The stream will be tagged using a stream identifier that is used as an indicator to the storage device to process data of the stream using an allocation area defined for hot data. In this way, data having different characteristics will be stored/confined within particular allocation areas of the storage device to reduce fragmentation and write amplification.

    Methods for minimizing fragmentation in SSD within a storage system and devices thereof

    公开(公告)号:US11132129B2

    公开(公告)日:2021-09-28

    申请号:US16584025

    申请日:2019-09-26

    Applicant: NetApp Inc.

    Abstract: A method, non-transitory computer readable medium, and device that assists with reducing memory fragmentation in solid state devices includes identifying an allocation area within an address range to write data from a cache. Next, the identified allocation area is determined for including previously stored data. The previously stored data is read from the identified allocation area when it is determined that the identified allocation area comprises previously stored data. Next, both the write data from the cache and the read previously stored data are written back into the identified allocation area sequentially through the address range.

    Methods for minimizing fragmentation in SSD within a storage system and devices thereof

    公开(公告)号:US10430081B2

    公开(公告)日:2019-10-01

    申请号:US15195093

    申请日:2016-06-28

    Applicant: NetApp, Inc.

    Abstract: A method, non-transitory computer readable medium, and device that assists with reducing memory fragmentation in solid state devices includes identifying an allocation area within an address range to write data from a cache. Next, the identified allocation area is determined for including previously stored data. The previously stored data is read from the identified allocation area when it is determined that the identified allocation area comprises previously stored data. Next, both the write data from the cache and the read previously stored data are written back into the identified allocation area sequentially through the address range.

    Cache modeling using random sampling and a timestamp histogram
    38.
    发明授权
    Cache modeling using random sampling and a timestamp histogram 有权
    使用随机抽样和时间戳直方图进行缓存建模

    公开(公告)号:US09405695B2

    公开(公告)日:2016-08-02

    申请号:US14072639

    申请日:2013-11-05

    Applicant: NETAPP, INC.

    Abstract: A system and method for determining an optimal cache size of a computing system is provided. In some embodiments, the method comprises selecting a portion of an address space of a memory structure of the computing system. A workload of data transactions is monitored to identify a transaction of the workload directed to the portion of the address space. An effect of the transaction on a cache of the computing system is determined, and, based on the determined effect of the transaction, an optimal cache size satisfying a performance target is determined. In one such embodiment the determining of the effect of the transaction on a cache of the computing system includes determining whether the effect would include a cache hit for a first cache size and determining whether the effect would include a cache hit for a second cache size different from the first cache size.

    Abstract translation: 提供了一种用于确定计算系统的最佳高速缓存大小的系统和方法。 在一些实施例中,该方法包括选择计算系统的存储器结构的地址空间的一部分。 监视数据事务的工作负载以识别指向地址空间部分的工作负载的事务。 确定事务对计算系统的缓存的影响,并且基于所确定的事务的效果,确定满足性能目标的最佳高速缓存大小。 在一个这样的实施例中,事务对计算系统的高速缓存的影响的确定包括确定效果是否将包括第一高速缓存大小的高速缓存命中,并且确定该效果是否将包括不同于第二高速缓存大小的高速缓存命中 从第一个缓存大小。

Patent Agency Ranking