SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION
    12.
    发明申请
    SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION 审中-公开
    用于组织数据以促进数据重复的系统和方法

    公开(公告)号:US20150205816A1

    公开(公告)日:2015-07-23

    申请号:US14552292

    申请日:2014-11-24

    Applicant: NetApp, Inc.

    Abstract: A technique for organizing data to facilitate data deduplication includes dividing a block-based set of data into multiple “chunks”, where the chunk boundaries are independent of the block boundaries (due to the hashing algorithm). Metadata of the data set, such as block pointers for locating the data, are stored in a tree structure that includes multiple levels, each of which includes at least one node. The lowest level of the tree includes multiple nodes that each contain chunk metadata relating to the chunks of the data set. In each node of the lowest level of the buffer tree, the chunk metadata contained therein identifies at least one of the chunks. The chunks (user-level data) are stored in one or more system files that are separate from the buffer tree and not visible to the user.

    Abstract translation: 用于组织数据以便于重复数据删除的技术包括将基于块的数据集划分成多个“块”,其中块边界独立于块边界(由于散列算法)。 数据集的元数据(例如用于定位数据的块指针)被存储在包括多个级别的树结构中,每个级别包括至少一个节点。 树的最低级别包括多个节点,每个节点包含与数据集块大小相关的块元数据。 在缓冲树的最低级别的每个节点中,其中包含的块元数据标识至少一个块。 块(用户级数据)存储在与缓冲树分开的一个或多个系统文件中,并且对用户不可见。

    Co-located Journaling and Data Storage for Write Requests

    公开(公告)号:US20240061603A1

    公开(公告)日:2024-02-22

    申请号:US18497925

    申请日:2023-10-30

    Applicant: NetApp, Inc.

    Abstract: Methods and systems for co-locating journaling and data storage are provided. Separate journal and volume partitions may be maintained within each logical storage unit (e.g., Logical Unit Number (LUN)) of a distributed storage system. Journaling of metadata associated with write requests received from one or more clients may be distributed by identifying a destination logical storage unit to which data associated with a given write request is to be stored and causing the data and metadata to be persisted to disk by journaling the metadata and the data to respective portions of an active log within the journal partition of the destination logical storage unit. By using the same logical storage unit for both journaling of write requests and writing the data associated with such write requests, the bottleneck due to there being only a single device or storage unit handling all metadata for all write requests can be avoided.

    USE OF PREDEFINED BLOCK POINTERS TO REDUCE DUPLICATE STORAGE OF CERTAIN DATA IN A STORAGE SUBSYSTEM OF A STORAGE SERVER

    公开(公告)号:US20190018605A1

    公开(公告)日:2019-01-17

    申请号:US16133284

    申请日:2018-09-17

    Applicant: NETAPP, INC.

    Abstract: A method and system for eliminating the redundant allocation and deallocation of special data on disk, wherein the redundant allocation and deallocation of special data on disk is eliminated by providing an innovate technique for specially allocating special data of a storage system. Specially allocated data is data that is pre-allocated on disk and stored in memory of the storage system. “Special data” may include any pre-decided data, one or more portions of data that exceed a pre-defined sharing threshold, and/or one or more portions of data that have been identified by a user as special. For example, in some embodiments, a zero-filled data block is specially allocated by a storage system. As another example, in some embodiments, a data block whose contents correspond to a particular type document header is specially allocated.

    LOGICAL REPLICATION MAPPING FOR ASYMMETRIC COMPRESSION
    15.
    发明申请
    LOGICAL REPLICATION MAPPING FOR ASYMMETRIC COMPRESSION 审中-公开
    用于不对称压缩的逻辑复制映射

    公开(公告)号:US20160070495A1

    公开(公告)日:2016-03-10

    申请号:US14929018

    申请日:2015-10-30

    Applicant: NetApp, Inc.

    Abstract: A system and method for logically organizing compressed data. In one aspect, a destination storage server receives a write request that includes multiple data blocks and specifies corresponding file block numbers. An extent-based file system executing on the storage server accesses intermediate block entries that each associates one of the file block numbers with a respective extent block number. The file system, in cooperation with a compression engine, compresses the data blocks into a set of one or more compressed data blocks. The file system stores the compressed data blocks at physical locations corresponding to physical block numbers and allocates, within an extent map, pointers from an extent ID to the extent block numbers, and pointers from the extent ID to the physical block numbers.

    Abstract translation: 一种用于逻辑组织压缩数据的系统和方法。 一方面,目的地存储服务器接收包含多个数据块的写请求,并指定相应的文件块号。 在存储服务器上执行的基于范围的文件系统访问中间块条目,每个条目将文件块号码之一与相应的扩展块号码相关联。 与压缩引擎协作的文件系统将数据块压缩成一组一个或多个压缩数据块。 文件系统将压缩数据块存储在对应于物理块号码的物理位置处,并且在盘区图中分配从盘区ID到块号码的指针,以及从盘区ID到物理块号的指针。

    Deduplication in an extent-based architecture
    16.
    发明授权
    Deduplication in an extent-based architecture 有权
    基于范围的架构中的重复数据删除

    公开(公告)号:US09043287B2

    公开(公告)日:2015-05-26

    申请号:US14087345

    申请日:2013-11-22

    Applicant: NetApp, Inc.

    CPC classification number: G06F17/30156 G06F3/0608 G06F3/0641

    Abstract: It is determined that a first data block contains the same data as a second data block. The first data block is associated with a first extent and the second data block is associated with a second extent. In response to determining that the first data block contains the same data as the second data block, the second data block is associated with the first extent and the first data block is disassociated with the second extent.

    Abstract translation: 确定第一数据块包含与第二数据块相同的数据。 第一数据块与第一范围相关联,并且第二数据块与第二范围相关联。 响应于确定第一数据块包含与第二数据块相同的数据,第二数据块与第一扩展区相关联,并且第一数据块与第二扩展区分离。

    USE OF PREDEFINED BLOCK POINTERS TO REDUCE DUPLICATE STORAGE OF CERTAIN DATA IN A STORAGE SUBSYSTEM OF A STORAGE SERVER
    17.
    发明申请
    USE OF PREDEFINED BLOCK POINTERS TO REDUCE DUPLICATE STORAGE OF CERTAIN DATA IN A STORAGE SUBSYSTEM OF A STORAGE SERVER 审中-公开
    使用预定块指针来减少存储服务器存储子系统中某些数据的重复存储

    公开(公告)号:US20150039818A1

    公开(公告)日:2015-02-05

    申请号:US14516308

    申请日:2014-10-16

    Applicant: NetApp, Inc.

    Abstract: A method and system for eliminating the redundant allocation and deallocation of special data on disk, wherein the redundant allocation and deallocation of special data on disk is eliminated by providing an innovate technique for specially allocating special data of a storage system. Specially allocated data is data that is pre-allocated on disk and stored in memory of the storage system. “Special data” may include any pre-decided data, one or more portions of data that exceed a pre-defined sharing threshold, and/or one or more portions of data that have been identified by a user as special. For example, in some embodiments, a zero-filled data block is specially allocated by a storage system. As another example, in some embodiments, a data block whose contents correspond to a particular type document header is specially allocated.

    Abstract translation: 一种消除磁盘上特殊数据的冗余分配和释放的方法和系统,其中通过提供专门分配存储系统的特殊数据的创新技术,消除了磁盘上特殊数据的冗余分配和释放。 特别分配的数据是在磁盘上预先分配并存储在存储系统的存储器中的数据。 “特殊数据”可以包括任何预先决定的数据,超过预定义共享阈值的数据的一个或多个部分,和/或用户已经被特别识别的数据的一个或多个部分。 例如,在一些实施例中,零填充数据块由存储系统专门分配。 作为另一示例,在一些实施例中,特别分配其内容对应于特定类型文档头的数据块。

    Co-located journaling and data storage for write requests

    公开(公告)号:US12223193B2

    公开(公告)日:2025-02-11

    申请号:US18497925

    申请日:2023-10-30

    Applicant: NetApp, Inc.

    Abstract: Methods and systems for co-locating journaling and data storage are provided. Separate journal and volume partitions may be maintained within each logical storage unit (e.g., Logical Unit Number (LUN)) of a distributed storage system. Journaling of metadata associated with write requests received from one or more clients may be distributed by identifying a destination logical storage unit to which data associated with a given write request is to be stored and causing the data and metadata to be persisted to disk by journaling the metadata and the data to respective portions of an active log within the journal partition of the destination logical storage unit. By using the same logical storage unit for both journaling of write requests and writing the data associated with such write requests, the bottleneck due to there being only a single device or storage unit handling all metadata for all write requests can be avoided.

    Co-located journaling and data storage for write requests

    公开(公告)号:US11409457B1

    公开(公告)日:2022-08-09

    申请号:US17239189

    申请日:2021-04-23

    Applicant: NetApp, Inc.

    Abstract: A method and system for co-locating journaling and data storage based on write requests. A write request that includes metadata and data is received from a client. A logical storage unit for storing the metadata and the data is identified. The logical storage unit is divided into a journal partition and a volume partition. The journal partition includes a first log and a second log. Which of the first log and the second log is an active log and which of the first log and the second log is an inactive log are identified. The metadata is recorded in a first location in the active log and the data is recorded in a second location in the active log during a single I/O operation. A reply is sent to the client after the metadata and the data are recorded in the journal partition.

    COMPRESSION FILE STRUCTURE
    20.
    发明申请

    公开(公告)号:US20190050416A1

    公开(公告)日:2019-02-14

    申请号:US16155645

    申请日:2018-10-09

    Applicant: NetApp, Inc.

    Abstract: A file system layout apportions an underlying physical volume into one or more virtual volumes of a storage system. The virtual volumes having a file system and one or more files organized as buffer trees, the buffer trees utilizing indirect blocks to point to the data blocks. The indirect block at the level above the data blocks are grouped into compression groups that point to a set of physical volume block number (pvbn) block pointers.

Patent Agency Ranking