TWO-STAGE FRONT END FOR EXTENT MAP DATABASE
    1.
    发明申请

    公开(公告)号:US20170255624A1

    公开(公告)日:2017-09-07

    申请号:US15601388

    申请日:2017-05-22

    Applicant: NetApp, Inc.

    Abstract: Multiple key-value stores may be employed to smooth out random updates (based on the extent ID) to the EMAP database. The updates to the EMAP database occur in a two-stage manner: (i) using an append-only log store for the first stage and (ii) using an on-disk hash store for the second stage. The append-only log store is used to convert the random updates to sequential write operations on the EMAP database. Once full, the contents of the log store are sorted and moved to the on-disk hash store, which holds the updates for a transient period of time to enable batching of the updates. Once sufficient batching of the extent map entries are accumulated, those entries are sorted and moved to the EMAP database. Thereafter, the EMAP database can be scanned to find extent map entries having identical checksum bits to perform data deduplication.

    TWO-STAGE FRONT END FOR EXTENT MAP DATABASE
    2.
    发明申请

    公开(公告)号:US20190324954A1

    公开(公告)日:2019-10-24

    申请号:US16459852

    申请日:2019-07-02

    Applicant: NetApp Inc.

    Abstract: Multiple key-value stores may be employed to smooth out random updates (based on the extent ID) to the EMAP database. The updates to the EMAP database occur in a two-stage manner: (i) using an append-only log store for the first stage and (ii) using an on-disk hash store for the second stage. The append-only log store is used to convert the random updates to sequential write operations on the EMAP database. Once full, the contents of the log store are sorted and moved to the on-disk hash store, which holds the updates for a transient period of time to enable batching of the updates. Once sufficient batching of the extent map entries are accumulated, those entries are sorted and moved to the EMAP database. Thereafter, the EMAP database can be scanned to find extent map entries having identical checksum bits to perform data deduplication.

    SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS PROVIDING CHANGE LOGGING IN A DEDUPLICATION PROCESS
    3.
    发明申请
    SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS PROVIDING CHANGE LOGGING IN A DEDUPLICATION PROCESS 有权
    系统,方法和计算机程序产品提供更改记录过程

    公开(公告)号:US20150026424A1

    公开(公告)日:2015-01-22

    申请号:US14509892

    申请日:2014-10-08

    Applicant: NetApp, Inc.

    CPC classification number: G06F3/0641 G06F3/0619 G06F3/067 G06F11/1453

    Abstract: A method performed in a network storage system, the method including receiving a plurality of data blocks at a secondary storage subsystem from a primary storage subsystem, generating a first log that includes a first plurality of entries, one entry for each of the data blocks, in which each entry of the first plurality of entries includes a name for a respective data block and a fingerprint of the respective data block, receiving metadata at the secondary storage subsystem from the primary storage subsystem, the metadata describing relationships between the plurality of blocks and a plurality of files, generating a second log that includes a second plurality of entries, and merging the first log with the second log to generate a change log.

    Abstract translation: 一种在网络存储系统中执行的方法,所述方法包括从主存储子系统在次存储子系统处接收多个数据块,生成包括第一多个条目的第一日志,每个数据块的一个条目, 其中所述第一多个条目的每个条目包括相应数据块的名称和相应数据块的指纹,从所述主存储子系统接收所述辅助存储子系统的元数据,所述元数据描述所述多个块之间的关系和 多个文件,生成包括第二多个条目的第二日志,以及将第一日志与第二日志合并以生成改变日志。

    Two-stage front end for extent map database

    公开(公告)号:US10353884B2

    公开(公告)日:2019-07-16

    申请号:US15601388

    申请日:2017-05-22

    Applicant: NetApp, Inc.

    Abstract: Multiple key-value stores may be employed to smooth out random updates (based on the extent ID) to the EMAP database. The updates to the EMAP database occur in a two-stage manner: (i) using an append-only log store for the first stage and (ii) using an on-disk hash store for the second stage. The append-only log store is used to convert the random updates to sequential write operations on the EMAP database. Once full, the contents of the log store are sorted and moved to the on-disk hash store, which holds the updates for a transient period of time to enable batching of the updates. Once sufficient batching of the extent map entries are accumulated, those entries are sorted and moved to the EMAP database. Thereafter, the EMAP database can be scanned to find extent map entries having identical checksum bits to perform data deduplication.

    SYSTEM AND METHOD FOR DATA DEDUPLICATION UTILIZING EXTENT ID DATABASE
    5.
    发明申请
    SYSTEM AND METHOD FOR DATA DEDUPLICATION UTILIZING EXTENT ID DATABASE 有权
    使用EXTENT ID数据库的数据分发的系统和方法

    公开(公告)号:US20160162207A1

    公开(公告)日:2016-06-09

    申请号:US14559317

    申请日:2014-12-03

    Applicant: NetApp, Inc.

    Abstract: An extent map (EMAP) database may include one or more extent map entries configured to map extent IDs to PVBNs. Each extent ID may be apportioned into a most significant bit (MSB) portion, i.e., checksum bits, and a least significant bit (LSB) portion, i.e., duplicate bits. A hash may be applied to the data of the extent to calculate the checksum bits, which illustratively represent a fingerprint of the data. The duplicate bits may be configured to denote any reoccurrence of the checksum bits in the EMAP database, i.e., whether there is an existing extent with potentially identical data in a volume of the aggregate. Each extent map entry may be inserted on a node having one or more key/value pairs, wherein the key is the extent ID and the value is the PVBN. The EMAP database may be scanned and utilized to perform data deduplication.

    Abstract translation: 扩展区映射(EMAP)数据库可以包括一个或多个扩展映射条目,被配置为将扩展区ID映射到PVBN。 每个区段ID可以分配到最高有效位(MSB)部分,即校验和位和最低有效位(LSB)部分,即重复位。 可以将哈希值应用于计算校验和位的程度的数据,其示意性地表示数据的指纹。 重复比特可以被配置为表示EMAP数据库中的校验和比特的任何再次出现,即,是否存在在聚合体积中具有潜在相同数据的现有范围。 每个扩展区映射条目可以被插入到具有一个或多个密钥/值对的节点上,其中密钥是扩展ID,并且该值是PVBN。 EMAP数据库可能被扫描并用于执行重复数据删除。

    SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING BLOCK CHARACTERISTICS IN A COMPUTER DATA STORAGE SYSTEM
    6.
    发明申请
    SYSTEMS, METHODS, AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING BLOCK CHARACTERISTICS IN A COMPUTER DATA STORAGE SYSTEM 审中-公开
    用于在计算机数据存储系统中确定块特性的系统,方法和计算机程序产品

    公开(公告)号:US20140344538A1

    公开(公告)日:2014-11-20

    申请号:US13894337

    申请日:2013-05-14

    Applicant: Netapp, Inc.

    CPC classification number: G06F3/061 G06F3/0659 G06F3/0689 G06F16/13

    Abstract: Systems, methods, and non-transitory machine readable media for determining block characteristics include one or more processors, a memory for storing instructions for the one or more processors, persistent storage, and a file system implemented in the persistent storage and storing data in the persistent storage using a plurality of blocks. When the stored instructions are executed by the one or more processors, the one or more processors are configured to traverse the plurality of blocks, read contents of a first block selected from the plurality of blocks, determine one or more characteristics of the first block from metadata within the block, and selectively perform or not perform a storage operation with respect to the first data block in response to determining the one or more characteristics. In some embodiments, the storage operation is a replication operation or a deduplication operation.

    Abstract translation: 用于确定块特征的系统,方法和非暂时机器可读介质包括一个或多个处理器,用于存储用于一个或多个处理器的指令的存储器,持久存储器,以及在持久存储器中实现的文件系统, 使用多个块的持久存储。 当所存储的指令由一个或多个处理器执行时,一个或多个处理器被配置成遍历多个块,从多个块中选择的第一块的读取内容,确定第一块的一个或多个特性 块内的元数据,并且响应于确定一个或多个特性而选择性地执行或不执行关于第一数据块的存储操作。 在一些实施例中,存储操作是复制操作或重复数据消除操作。

    Data deduplication utilizing extent ID database

    公开(公告)号:US09659047B2

    公开(公告)日:2017-05-23

    申请号:US14559317

    申请日:2014-12-03

    Applicant: NetApp, Inc.

    Abstract: An extent map (EMAP) database may include one or more extent map entries configured to map extent IDs to PVBNs. Each extent ID may be apportioned into a most significant bit (MSB) portion, i.e., checksum bits, and a least significant bit (LSB) portion, i.e., duplicate bits. A hash may be applied to the data of the extent to calculate the checksum bits, which illustratively represent a fingerprint of the data. The duplicate bits may be configured to denote any reoccurrence of the checksum bits in the EMAP database, i.e., whether there is an existing extent with potentially identical data in a volume of the aggregate. Each extent map entry may be inserted on a node having one or more key/value pairs, wherein the key is the extent ID and the value is the PVBN. The EMAP database may be scanned and utilized to perform data deduplication.

Patent Agency Ranking