Mapping logical identifiers using multiple identifier spaces

    公开(公告)号:US10515055B2

    公开(公告)日:2019-12-24

    申请号:US14859009

    申请日:2015-09-18

    申请人: NetApp, Inc.

    摘要: It is determined that a first data unit is to be written to a storage device and that the first data unit is associated with a first attribute. In response to determining that the first data unit is associated with the first attribute, a first identifier is selected from a first identifier space and the first identifier is associated with the first data unit. It is determined that a second data unit is to be written to the storage device and that the second data unit is associated with the second attribute. In response to determining that the second data unit is associated with the second attribute, a second identifier is selected from a second identifier space and the second identifier is associated with the second data unit.

    NEARSTONE COMPRESSION OF DATA IN A STORAGE SYSTEM
    2.
    发明申请
    NEARSTONE COMPRESSION OF DATA IN A STORAGE SYSTEM 有权
    存储系统中数据的近似数据压缩

    公开(公告)号:US20140081929A1

    公开(公告)日:2014-03-20

    申请号:US14026726

    申请日:2013-09-13

    申请人: NetApp, Inc.

    IPC分类号: H04L29/06

    摘要: A storage server is configured to receive a request to store a data block from a client. The request to store the data block is serviced by the storage server by compressing the data block into a compression group, which includes a number of compressed data blocks. The storage server stores the compression group in a non-volatile memory and flushes the compression group from the non-volatile memory to a physical storage device in response to reaching a consistency point. By compressing data to be stored in system memory of a storage server, the amount of data that can be processed during a given time period by a data storage system is increased. Furthermore, an increase in performance can be achieved at a lower cost, since the cost of additional physical system memory modules can be avoided.

    摘要翻译: 存储服务器被配置为从客户端接收存储数据块的请求。 存储数据块的请求由存储服务器通过将数据块压缩成包括多个压缩数据块的压缩组来服务。 存储服务器将压缩组存储在非易失性存储器中,并响应于达到一致性点将压缩组从非易失性存储器刷新到物理存储设备。 通过压缩要存储在存储服务器的系统存储器中的数据,可以增加数据存储系统在给定时间段内可处理的数据量。 此外,由于可以避免额外的物理系统存储器模块的成本,因此可以以更低的成本实现性能的提高。

    MAPPING LOGICAL IDENTIFIERS USING MULTIPLE IDENTIFIER SPACES

    公开(公告)号:US20170083537A1

    公开(公告)日:2017-03-23

    申请号:US14859009

    申请日:2015-09-18

    申请人: NetApp, Inc.

    IPC分类号: G06F17/30

    摘要: It is determined that a first data unit is to be written to a storage device and that the first data unit is associated with a first attribute. In response to determining that the first data unit is associated with the first attribute, a first identifier is selected from a first identifier space and the first identifier is associated with the first data unit. It is determined that a second data unit is to be written to the storage device and that the second data unit is associated with the second attribute. In response to determining that the second data unit is associated with the second attribute, a second identifier is selected from a second identifier space and the second identifier is associated with the second data unit.

    Nearstore compression of data in a storage system
    4.
    发明授权
    Nearstore compression of data in a storage system 有权
    近端存储系统中数据的压缩

    公开(公告)号:US09319489B2

    公开(公告)日:2016-04-19

    申请号:US14026726

    申请日:2013-09-13

    申请人: NetApp, Inc.

    摘要: A network adapter receives a request to store a data block. The data block is sent from the network adapter to a compression module. The compression module generates a compressed data block from the data block. The compressed data block or a reference to the compressed data block is stored in a buffer cache. The compressed data block is stored in nonvolatile memory. It is determined that the compressed data block should be flushed a storage device. In response to determining that the compressed data block should be flushed to the storage device, the compressed data block is flushed from the nonvolatile memory to the storage device.

    摘要翻译: 网络适​​配器接收存储数据块的请求。 数据块从网络适配器发送到压缩模块。 压缩模块从数据块生成压缩数据块。 压缩数据块或对压缩数据块的引用存储在缓冲区高速缓存中。 压缩数据块被存储在非易失性存储器中。 确定压缩数据块应该被刷新存储设备。 响应于确定压缩数据块应被刷新到存储设备,压缩数据块从非易失性存储器刷新到存储设备。

    PRESERVING STORAGE EFFICIENCY DURING RESTORATION OF DATA FROM THE CLOUD TO A DATA STORAGE SYSTEM

    公开(公告)号:US20240354198A1

    公开(公告)日:2024-10-24

    申请号:US18304583

    申请日:2023-04-21

    申请人: NetApp, Inc.

    IPC分类号: G06F11/14

    CPC分类号: G06F11/1451

    摘要: Systems and methods for preserving storage efficiency during restoration of data from the cloud are provided. In one embodiment, a CBMAP is maintained that maps cloud block numbers (CBNs) to respective corresponding block numbers of a volume of a data storage system in which previously restored data has been stored by a previously restored file. By making use of the CBMAP during the restoration process, storage of duplicate file data blocks on the volume may be avoided by sharing with a current file being restored a reference to the corresponding file data block previously stored on the volume and associated with the previously restored file. In addition to preserving storage efficiency, use of the CBMAP facilitates avoidance of repeated GET operations for data associated with CBNs previously retrieved from the cloud and stored to the volume, thereby reducing data access costs as well as latency of the restore operation.

    Method and system for providing substantially constant-time execution of a copy operation
    6.
    发明授权
    Method and system for providing substantially constant-time execution of a copy operation 有权
    提供基本上恒定时间执行复印操作的方法和系统

    公开(公告)号:US09182921B2

    公开(公告)日:2015-11-10

    申请号:US13875164

    申请日:2013-05-01

    申请人: NetApp, Inc.

    IPC分类号: G06F17/30 G06F3/06

    摘要: A system and method for providing a substantially constant-time copy operation for file system objects managed by a storage server begins by generating a snapshot of at least a portion of a data set managed by the storage server. The system then performs a copy operation in the storage server to generate a copy of the data set separate from the snapshot, on a set of block locations containing a predetermined reference value. During the copy operation to generate the copy of the data set separate from the snapshot, the system can receive from a requester a first read request directed to the copy of the data set that the copy operation is to generate. In response to the first data request, the system provides data from the snapshot to the requester by the storage server.

    摘要翻译: 用于为由存储服务器管理的文件系统对象提供基本上恒定的复制操作的系统和方法开始于生成由存储服务器管理的数据集的至少一部分的快照。 然后,系统在存储服务器中执行复制操作,以在包含预定参考值的一组块位置上生成与快照分离的数据集的副本。 在复制操作期间,生成与快照分开的数据集的副本,系统可以从请求者接收针对复制操作要生成的数据集的副本的第一读请求。 响应于第一个数据请求,系统通过存储服务器将快照中的数据提供给请求者。

    SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION
    7.
    发明申请
    SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION 审中-公开
    用于组织数据以促进数据重复的系统和方法

    公开(公告)号:US20150205816A1

    公开(公告)日:2015-07-23

    申请号:US14552292

    申请日:2014-11-24

    申请人: NetApp, Inc.

    IPC分类号: G06F17/30

    摘要: A technique for organizing data to facilitate data deduplication includes dividing a block-based set of data into multiple “chunks”, where the chunk boundaries are independent of the block boundaries (due to the hashing algorithm). Metadata of the data set, such as block pointers for locating the data, are stored in a tree structure that includes multiple levels, each of which includes at least one node. The lowest level of the tree includes multiple nodes that each contain chunk metadata relating to the chunks of the data set. In each node of the lowest level of the buffer tree, the chunk metadata contained therein identifies at least one of the chunks. The chunks (user-level data) are stored in one or more system files that are separate from the buffer tree and not visible to the user.

    摘要翻译: 用于组织数据以便于重复数据删除的技术包括将基于块的数据集划分成多个“块”,其中块边界独立于块边界(由于散列算法)。 数据集的元数据(例如用于定位数据的块指针)被存储在包括多个级别的树结构中,每个级别包括至少一个节点。 树的最低级别包括多个节点,每个节点包含与数据集块大小相关的块元数据。 在缓冲树的最低级别的每个节点中,其中包含的块元数据标识至少一个块。 块(用户级数据)存储在与缓冲树分开的一个或多个系统文件中,并且对用户不可见。

    SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING BLOCK CHARACTERISTICS IN A COMPUTER DATA STORAGE SYSTEM
    8.
    发明申请
    SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING BLOCK CHARACTERISTICS IN A COMPUTER DATA STORAGE SYSTEM 审中-公开
    用于在计算机数据存储系统中确定块特性的系统,方法和计算机程序产品

    公开(公告)号:US20140344538A1

    公开(公告)日:2014-11-20

    申请号:US13894337

    申请日:2013-05-14

    申请人: Netapp, Inc.

    IPC分类号: G06F3/06

    摘要: Systems, methods, and non-transitory machine readable media for determining block characteristics include one or more processors, a memory for storing instructions for the one or more processors, persistent storage, and a file system implemented in the persistent storage and storing data in the persistent storage using a plurality of blocks. When the stored instructions are executed by the one or more processors, the one or more processors are configured to traverse the plurality of blocks, read contents of a first block selected from the plurality of blocks, determine one or more characteristics of the first block from metadata within the block, and selectively perform or not perform a storage operation with respect to the first data block in response to determining the one or more characteristics. In some embodiments, the storage operation is a replication operation or a deduplication operation.

    摘要翻译: 用于确定块特征的系统,方法和非暂时机器可读介质包括一个或多个处理器,用于存储用于一个或多个处理器的指令的存储器,持久存储器,以及在持久存储器中实现的文件系统, 使用多个块的持久存储。 当所存储的指令由一个或多个处理器执行时,一个或多个处理器被配置成遍历多个块,从多个块中选择的第一块的读取内容,确定第一块的一个或多个特性 块内的元数据,并且响应于确定一个或多个特性而选择性地执行或不执行关于第一数据块的存储操作。 在一些实施例中,存储操作是复制操作或重复数据消除操作。

    METHOD AND SYSTEM FOR PROVIDING SUBSTANTIALLY CONSTANT-TIME EXECUTION OF A COPY OPERATION
    9.
    发明申请
    METHOD AND SYSTEM FOR PROVIDING SUBSTANTIALLY CONSTANT-TIME EXECUTION OF A COPY OPERATION 有权
    提供复制操作的实质性持续执行的方法和系统

    公开(公告)号:US20130311735A1

    公开(公告)日:2013-11-21

    申请号:US13875164

    申请日:2013-05-01

    申请人: NetApp, Inc.

    IPC分类号: G06F3/06

    摘要: A system and method for providing a substantially constant-time copy operation for file system objects managed by a storage server begins by generating a snapshot of at least a portion of a data set managed by the storage server. The system then performs a copy operation in the storage server to generate a copy of the data set separate from the snapshot, on a set of block locations containing a predetermined reference value. During the copy operation to generate the copy of the data set separate from the snapshot, the system can receive from a requester a first read request directed to the copy of the data set that the copy operation is to generate. In response to the first data request, the system provides data from the snapshot to the requester by the storage server.

    摘要翻译: 用于为由存储服务器管理的文件系统对象提供基本上恒定的复制操作的系统和方法开始于生成由存储服务器管理的数据集的至少一部分的快照。 然后,系统在存储服务器中执行复制操作,以在包含预定参考值的一组块位置上生成与快照分离的数据集的副本。 在复制操作期间,生成与快照分开的数据集的副本,系统可以从请求者接收针对复制操作要生成的数据集的副本的第一读请求。 响应于第一个数据请求,系统通过存储服务器将快照中的数据提供给请求者。