Systems and methods for rebuilding a cache index

    Publication No.: US10331561B1

    Publication date: 2019-06-25

    Application No.: US15196150

    Filing date: 2016-06-29

    Applicant: EMC Corporation

    Abstract: Systems and methods for rebuilding an index for a flash cache are provided. The index is rebuilt by reading headers of containers stored in the cache and inserting information from the headers into the index. The index is enabled while being rebuilt, so that lookup operations can be performed using the index even when the index is incomplete. New containers can be inserted into used or unused regions of the cache while the index is being rebuilt.
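
The rebuild described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Container` and `RebuildableIndex` classes, the slot-by-slot scan order, and the rule that entries added during the rebuild take precedence over scanned headers are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Container:
    """Hypothetical cached container whose header lists the keys it holds."""
    container_id: int
    header_keys: List[str]


@dataclass
class RebuildableIndex:
    """Maps key -> container_id and stays usable while only partly rebuilt."""
    entries: Dict[str, int] = field(default_factory=dict)
    scanned_slots: int = 0  # number of cache slots whose headers were read

    def rebuild_step(self, cache: List[Optional[Container]]) -> bool:
        """Read one slot's header; return False once every slot is scanned."""
        if self.scanned_slots >= len(cache):
            return False
        container = cache[self.scanned_slots]
        if container is not None:
            for key in container.header_keys:
                # Keep any entry already added by an insert during the rebuild.
                self.entries.setdefault(key, container.container_id)
        self.scanned_slots += 1
        return True

    def lookup(self, key: str) -> Optional[int]:
        """Lookups work on the incomplete index; a miss may be a false miss."""
        return self.entries.get(key)

    def insert_container(self, cache: List[Optional[Container]],
                         slot: int, container: Container) -> None:
        """Place a new container in a used or unused slot during the rebuild."""
        old = cache[slot]
        if old is not None:
            for key in old.header_keys:
                if self.entries.get(key) == old.container_id:
                    del self.entries[key]
        cache[slot] = container
        for key in container.header_keys:
            self.entries[key] = container.container_id
```

In this sketch a lookup for a key whose container has not yet been scanned simply misses, which a cache can tolerate because the data can still be fetched from the underlying storage.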

    Linked lists in flash memory
    Granted invention patent

    Publication No.: US10261704B1

    Publication date: 2019-04-16

    Application No.: US15196261

    Filing date: 2016-06-29

    Applicant: EMC Corporation

    IPC classification: G06F3/06

    Abstract: Systems and methods for managing content in a flash memory. A data structure such as a linked list is implemented in flash memory. Nodes can be added to the linked list by overwriting portions of a node when only sets are required to update the node. The nodes may include multiple pointer portions, such that existing pointers can be invalidated and open pointer portions used for the update to the node.
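
A minimal Python sketch of the multi-pointer node idea follows. It assumes a simplified flash model in which a word can be overwritten in place only if no bit goes from 1 to 0, that an all-zero pointer slot is "open", and that addresses are nonzero; the `FlashWord` and `Node` classes and the per-slot invalid flags are hypothetical names, not taken from the patent.

```python
class FlashWord:
    """Simulated flash word: in-place writes may only set bits (0 -> 1)."""

    def __init__(self, value=0):
        self.value = value

    def program(self, new_value):
        if self.value & ~new_value:
            raise ValueError("write would clear a bit; an erase is required")
        self.value = new_value


class Node:
    """Linked-list node with several pointer slots and per-slot invalid flags."""

    def __init__(self, data, slots=3):
        self.data = data
        self.pointers = [FlashWord() for _ in range(slots)]  # 0 means open
        self.invalid = FlashWord()  # bit i set => pointer slot i is invalidated

    def _current_slot(self):
        for i, pointer in enumerate(self.pointers):
            if pointer.value != 0 and not (self.invalid.value >> i) & 1:
                return i
        return None

    def next_address(self):
        slot = self._current_slot()
        return None if slot is None else self.pointers[slot].value

    def set_next(self, address):
        """Repoint the node using only bit sets: invalidate old, fill open slot."""
        if address <= 0:
            raise ValueError("addresses are assumed to be nonzero")
        open_slot = next(
            (i for i, p in enumerate(self.pointers) if p.value == 0), None)
        if open_slot is None:
            raise RuntimeError("no open pointer slot; node must be rewritten")
        current = self._current_slot()
        if current is not None:
            self.invalid.program(self.invalid.value | (1 << current))
        self.pointers[open_slot].program(address)
```

Inserting a new node after an existing one then amounts to writing the new node elsewhere and calling `set_next` on its predecessor, which avoids erasing and rewriting the predecessor until its pointer slots run out.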

    Reducing index operations in a cache

    Publication No.: US10210087B1

    Publication date: 2019-02-19

    Application No.: US14674879

    Filing date: 2015-03-31

    Applicant: EMC Corporation

    Abstract: Systems and methods for reducing input/output operations in a computing system that uses a cache. Input/output operations associated with cache index lookups are reduced by tracking the location of the requested data such that the data can be invalidated without having to access the cache index. Input/output operations can also be reduced by invalidating the entry in the cache index when reading the corresponding data.
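
The two ideas in this abstract can be sketched in Python as follows. This is an illustrative model only: the `Cache` class, the in-memory `valid` flags, and the `index_lookups` counter (standing in for index I/O) are assumptions, not details from the patent.

```python
class Cache:
    """Toy cache whose index lookups are counted as a stand-in for I/O cost."""

    def __init__(self):
        self.index = {}         # key -> location; imagined to live on flash
        self.blocks = []        # location -> data
        self.valid = []         # per-location validity flags kept in memory
        self.index_lookups = 0  # number of index accesses performed

    def insert(self, key, data):
        self.blocks.append(data)
        self.valid.append(True)
        location = len(self.blocks) - 1
        self.index[key] = location
        return location

    def read(self, key):
        """Return (data, location) so the caller can track the location."""
        self.index_lookups += 1
        location = self.index.get(key)
        if location is None or not self.valid[location]:
            return None
        return self.blocks[location], location

    def invalidate_at(self, location):
        """Invalidate by tracked location; the index is not touched."""
        self.valid[location] = False

    def read_and_invalidate(self, key):
        """One index access both finds the data and drops the index entry."""
        self.index_lookups += 1
        location = self.index.pop(key, None)
        if location is None or not self.valid[location]:
            return None
        self.valid[location] = False
        return self.blocks[location]
```

A caller that keeps the location returned by `read` can later call `invalidate_at` without a second index lookup, while `read_and_invalidate` folds the invalidation into the read itself.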

    Memory efficient sanitization of a deduplicated storage system using a perfect hash function
    Granted invention patent (in force)

    Publication No.: US09317218B1

    Publication date: 2016-04-19

    Application No.: US13763522

    Filing date: 2013-02-08

    Applicant: EMC Corporation

    IPC classification: G06F12/00 G06F3/06

    Abstract: Techniques for sanitizing a storage system are described herein. In one embodiment, for each of the fingerprints representing data chunks stored in a first container of the storage system, a lookup operation is performed in a live bit vector, based on the fingerprint, to determine whether the corresponding data chunk is live. In one embodiment, a bit in a copy bit vector (CBV) corresponding to the data chunk is populated based on the lookup operation. In one embodiment, after all of the bits corresponding to the data chunks of the first container have been populated in the CBV, the data chunks represented by the CBV are copied from the first container to a second container, and records of the data chunks in the first container are erased.
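
The copy-forward step in this abstract can be sketched in Python under several assumptions. A plain dictionary stands in for the perfect hash function, the CBV is indexed by chunk position within the container, and the names `build_perfect_hash` and `sanitize_container` are hypothetical.

```python
def build_perfect_hash(fingerprints):
    """Stand-in for a perfect hash: one unique slot per distinct fingerprint."""
    return {fp: slot for slot, fp in enumerate(sorted(set(fingerprints)))}


def sanitize_container(container, perfect_hash, live_bits):
    """Copy live chunks to a new container, then erase the old one.

    container    -- list of (fingerprint, chunk) pairs (the first container)
    perfect_hash -- maps a fingerprint to its slot in the live bit vector
    live_bits    -- live bit vector; True means the chunk is still referenced
    """
    # Populate the copy bit vector (CBV), one bit per chunk in the container.
    cbv = [False] * len(container)
    for position, (fingerprint, _chunk) in enumerate(container):
        cbv[position] = live_bits[perfect_hash[fingerprint]]

    # Only after the whole CBV is populated are the marked chunks copied.
    second_container = [entry for entry, keep in zip(container, cbv) if keep]

    # Erase the records in the first container so dead data is not retained.
    container.clear()
    return second_container, cbv
```

Because the perfect hash gives every fingerprint its own slot, the live bit vector needs no stored keys and no collision handling, which is the source of the memory savings the title refers to.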

    File indexing using an exclusion list of a deduplicated cache system of a storage system
    Granted invention patent (in force)

    Publication No.: US09189414B1

    Publication date: 2015-11-17

    Application No.: US14038687

    Filing date: 2013-09-26

    Applicant: EMC Corporation

    IPC classification: G06F13/12 G06F12/08 G06F12/12

    Abstract: A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with maintaining a file index having a plurality of extent entries, each extent entry corresponding to one of a plurality of file extents stored in a cache memory device that caches data stored in a persistent storage device of a storage system. In response to receiving a request to read a first file region of a first file, the method continues with retrieving a first data block from the persistent storage device that contains it; caching the first data block at a first storage location of the cache memory device; and creating a first extent entry in the file index having at least a first node, where the first node includes an address of the first storage location and a first bitmap indicating which data blocks are valid.
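
A minimal Python sketch of an extent entry with an address and a validity bitmap is shown below. The fixed extent size `BLOCKS_PER_EXTENT`, the single node per extent, and the names `ExtentNode`, `FileIndex`, and `read_block` are assumptions for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass

BLOCKS_PER_EXTENT = 8  # assumed extent size, in data blocks


@dataclass
class ExtentNode:
    cache_address: int     # cache location where the extent's blocks start
    valid_bitmap: int = 0  # bit i set => block i of the extent is cached


class FileIndex:
    """Maps (file_id, extent_no) to the extent's first node."""

    def __init__(self):
        self.extents = {}


def read_block(index, cache, backing, file_id, block_no):
    """Serve a block from the cache, filling it from backing storage on a miss."""
    extent_no, offset = divmod(block_no, BLOCKS_PER_EXTENT)
    node = index.extents.get((file_id, extent_no))
    if node is not None and (node.valid_bitmap >> offset) & 1:
        return cache[node.cache_address + offset]  # cache hit

    data = backing[(file_id, block_no)]  # retrieve from persistent storage
    if node is None:
        # Reserve cache space for the extent and create its index entry.
        node = ExtentNode(cache_address=len(cache))
        cache.extend([None] * BLOCKS_PER_EXTENT)
        index.extents[(file_id, extent_no)] = node
    cache[node.cache_address + offset] = data
    node.valid_bitmap |= 1 << offset  # mark only this block as valid
    return data
```

The bitmap lets one index entry describe a partially cached extent, so a read of a single block does not force the whole extent to be fetched.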

    Method for cleaning a delta storage system

    Publication No.: US09405764B1

    Publication date: 2016-08-02

    Application No.: US14801773

    Filing date: 2015-07-16

    Applicant: EMC Corporation

    IPC classification: G06F17/30

    Abstract: A computer-implemented method for performing garbage collection in a delta compressed data storage system selects a file recipe to traverse in order to identify live data chunks, and selects a chunk identifier from the file recipe. The chunk identifier is added to a set of live data chunks. Delta references in the file recipe corresponding to the chunk identifier are added to the set of live data chunks. Data chunks in the data storage system not identified by the set of live data chunks are discarded.
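
The mark-and-discard flow in this abstract can be sketched in Python as follows. The recipe layout, a list of `(chunk_id, delta_refs)` pairs where `delta_refs` names the base chunks a delta-compressed chunk depends on, and the function name `collect_garbage` are assumptions made for illustration.

```python
def collect_garbage(recipes, store):
    """Discard chunks that no file recipe references, directly or as a delta base.

    recipes -- iterable of file recipes; each recipe is a list of
               (chunk_id, delta_refs) pairs
    store   -- dict mapping chunk_id -> chunk data; modified in place
    """
    live = set()
    for recipe in recipes:
        for chunk_id, delta_refs in recipe:
            live.add(chunk_id)       # the chunk itself is live
            live.update(delta_refs)  # so are the bases needed to decode it

    for chunk_id in list(store):
        if chunk_id not in live:
            del store[chunk_id]      # not reachable from any recipe
    return live
```

Keeping delta bases in the live set matters because a delta-compressed chunk cannot be reconstructed once its base has been discarded, even if no file references the base directly.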