Global access to filesystem content

    公开(公告)号:US12105671B1

    公开(公告)日:2024-10-01

    申请号:US18306969

    申请日:2023-04-25

    申请人: VAST DATA LTD.

    摘要: A method for managing access to a shared file system entity (FSE), the method includes (i) managing access requests for reading and writing accessed parts of the FSE, by a current point of access storage system (POASS), the current POASS currently acts as a point of access (POA) for the access requests; wherein stored parts of the FSE are stored at multiple POASSs, the multiple POASSs comprise the current POASS and former POASSs; (ii) determining, by the current POASS and while managing the access requests, to store the stored parts of the FSE at a fewer number of the former POASSs; and (iii) reducing, by two or more of the former POASSs, the number of former POASSs that store the stored parts of the FSE.

    Data aware compression in a storage system

    公开(公告)号:US11936404B2

    公开(公告)日:2024-03-19

    申请号:US17661264

    申请日:2022-04-28

    申请人: VAST DATA LTD.

    IPC分类号: H03M7/30

    摘要: A method for storage system data aware compression, the method may include pre-compressing data units received by the storage system, by different pre-compression units to provide different pre-compressed versions of the data units; wherein the different pre-compression schemes are associated with different compression schemes, wherein at least some of the different compression schemes are data type specific compression schemes; calculating entropies of the different pre-compressed versions; and selecting a compression scheme out of the different compression schemes based on the entropies of the different pre-compressed versions.

    System and method for using free space to improve erasure code locality

    公开(公告)号:US11886705B2

    公开(公告)日:2024-01-30

    申请号:US17503861

    申请日:2021-10-18

    申请人: Vast Data LTD.

    发明人: Yogev Vaknin

    IPC分类号: G06F3/06

    摘要: A system and method for using free space for recovering erasure coding data sets. The method includes segmenting at least one erasure coding data set into at least one stripe based on an erasure coding scheme, wherein each erasure coding data set includes chunks, wherein the chunks include chunks of systematic data and chunks of parity data; distributing the at least one stripe across non-volatile memory nodes based on the erasure coding scheme, wherein each non-volatile memory node is a unit of a non-volatile storage device; and upon a first failure of one of the non-volatile memory nodes: detecting that at least one of the non-volatile memory nodes that is not failed includes a first free space that does not currently store user data; and performing a first recovery process for recovering a first portion of the at least one erasure coding data set, based on the first free space.

    Striping in a storage system
    4.
    发明授权

    公开(公告)号:US11842074B2

    公开(公告)日:2023-12-12

    申请号:US17207721

    申请日:2021-03-21

    申请人: Vast Data Ltd.

    IPC分类号: G06F3/06

    摘要: A method for striping, the method may include performing, for each period of time out of multiple periods of time, the steps of: (i) determining striping rules; wherein the determining of the striping rules may include selecting one or more selected stripe size values out of multiple stripe size value candidates; wherein the selecting is based on values of storage system parameters that are obtained when applying the multiple stripe size value candidates; wherein the storage system parameters comprise storage space utilization and storage system throughput; and (ii) applying the striping rules by the storage system, during the period of time; wherein the applying comprises obtaining data chunks; converting the data chunks to stripes having at least one of the one or more selected stripe size values; and storing the stripes in the storage system.

    Untying compression related links to stale reference chunks

    公开(公告)号:US11829622B2

    公开(公告)日:2023-11-28

    申请号:US17650245

    申请日:2022-02-07

    申请人: VAST DATA LTD.

    IPC分类号: G06F3/06

    摘要: A method for selectively untying at least one of compression related links to a stale reference chunk, the method may include determining whether a compression effectiveness condition is fulfilled in relation to a set of referring chunks that are compressed using the stale reference chunk and are linked to the to the stale reference chunk by the compression related links; wherein the compression effectiveness condition is responsive to, at least, a number of the referring chunks of the set; and untying the at least one compression related links to the stale reference chunks when the compression effectiveness condition is not fulfilled.

    SNAPSHOT CAPACITY ESTIMATION
    6.
    发明公开

    公开(公告)号:US20230315680A1

    公开(公告)日:2023-10-05

    申请号:US17657339

    申请日:2022-03-30

    申请人: VAST DATA LTD.

    IPC分类号: G06F16/11 G06F16/174

    CPC分类号: G06F16/128 G06F16/1748

    摘要: A method for determining size information related one or more snapshots related to file systems stored in a storage system, the method may include (a) sampling one or more combinations of points in time and logical spaces, the logical spaces are associated with the one or more file systems to provide sampled combinations of sampled portions of file system entities (FSEs) and sampled points in time; (b) searching for relevant snapshots that are relevant to the sampled combinations to provide relevant snapshots at the sampled points in time; and (c) determining physical sizes of the relevant snapshots at the sampled point in time; wherein a number of samples per sampled FSE is indicative of a size of the sampled FSEs

    Database table with a minimum-maximum filter for multiple ranges

    公开(公告)号:US11727008B1

    公开(公告)日:2023-08-15

    申请号:US17810336

    申请日:2022-06-30

    申请人: VAST DATA LTD.

    发明人: Asaf Levy Eyal Gordon

    IPC分类号: G06F16/2455

    CPC分类号: G06F16/24557

    摘要: A method for responding to a tabular database (TD) query, the method may include (i) receiving the TD query, wherein the TD query comprises one or more numerical conditions; (ii) determining, using gap filters and based on the one or more numeral conditions, a relevancy to the TD query of groups of cells of the TD that are associated with the gap filters; wherein different gap filters are associated with different groups of cells of the TD; wherein each gap filter comprises one or more pairs of minimum-maximum values that are defined based on one or more gaps between sorted values of the group of cells, wherein at least one gap filter of the gap filters is set up based on a storage parameter of the gap filter and a filtering parameter of the gap filter; (iii) skip a scanning of one or more groups of cells of the TD that are irrelevant to the TD query; and (iv) generate a response to the TD query, wherein the generating comprises scanning one or more groups of cells of the TD that are relevant to the TD query.

    Reverting to a previous version of a filesystem entity

    公开(公告)号:US11726881B2

    公开(公告)日:2023-08-15

    申请号:US17302129

    申请日:2021-04-23

    申请人: VAST DATA LTD.

    IPC分类号: G06F11/14

    摘要: A method for generating a reconstructed version of a filesystem entity, the method may include (i) generating fallback retrieval metadata for a reconstructed version segment, when the reconstructed version segment is (a) associated with a corresponding intermediate version segment, and (b) the corresponding intermediate version segment is preceded by a corresponding most updated segment that does not exceed a fallback version of the filesystem entity; wherein the reconstructed version segment, the corresponding intermediate version segment and the corresponding most updated segment that does not exceed a fallback version have a same address range; (ii) generating a non-existing indicator for the reconstructed version segment, when the reconstructed version segment is (a) associated with the corresponding intermediate version segment, and (b) the corresponding intermediate version segment is not preceded by any corresponding most updated segment that does not exceed the fallback version; and (iii) maintaining intermediate metadata for retrieving one or more intermediate versions of the filesystem entity.

    TABULAR DATABASE REGROUPING
    9.
    发明公开

    公开(公告)号:US20230244699A1

    公开(公告)日:2023-08-03

    申请号:US17649537

    申请日:2022-01-31

    申请人: VAST DATA LTD.

    发明人: Eyal Gordon Asaf Levy

    IPC分类号: G06F16/28 G06F16/22

    CPC分类号: G06F16/285 G06F16/221

    摘要: There may be provided a method for managing column extents of a tabular database, the method may include (a) generating a multi-snapshot row score to each row of a group of rows of the tabular database multiple rows; wherein the multi-snapshot score is indicative of fullness of the row in the multiple snapshots; wherein the group of rows comprises column extents associated with column extents metadata; and (b) improving a column extent metadata parameter by reordering the rows of the group of rows according to the multi-snapshot row scores to provide a re-ordered group of rows that comprises re-ordered group extents associated with re-ordered column extents associated with re-ordered column extents metadata.

    SCHEDULING STORAGE TASKS
    10.
    发明公开

    公开(公告)号:US20230141122A1

    公开(公告)日:2023-05-11

    申请号:US17453821

    申请日:2021-11-05

    申请人: VAST DATA LTD.

    IPC分类号: G06F3/06 G06F9/48

    摘要: A method for managing tasks in a storage system, the method may include: (a) obtaining, by a scheduler, a shared budget for background storage tasks and foreground storage tasks; (b) obtaining, by the scheduler, a background budget for background storage tasks; wherein the background budget is a fraction of the shared budget; (c) allocating, by the scheduler, resources to pending storage tasks according to the shared budget and the background budget; wherein the allocating comprises (i) allocating the shared budget while prioritizing foreground storage tasks over background storage tasks; and (ii) allocating the background budget to background storage tasks; and (d) participating, by the scheduler, in executing of storage tasks according to the allocation.