ADAPTIVELY PROVIDING UNCOMPRESSED AND COMPRESSED DATA CHUNKS

    公开(公告)号:US20240232138A1

    公开(公告)日:2024-07-11

    申请号:US18614415

    申请日:2024-03-22

    申请人: Cohesity, Inc.

    IPC分类号: G06F16/174

    CPC分类号: G06F16/1744

    摘要: A selected data chunk associated with an object is determined to be sent to a destination. A chunk compression grouping storing the selected data chunk associated with the object is identified. The identified chunk compression grouping includes a plurality of data chunks compressed together. A data content version that includes the selected data chunk associated with the object to be provided to the destination is determined from a plurality of data content versions based at least in part a metric associated with the identified chunk compression grouping.

    Adaptively providing uncompressed and compressed data chunks

    公开(公告)号:US11971857B2

    公开(公告)日:2024-04-30

    申请号:US17545655

    申请日:2021-12-08

    申请人: Cohesity, Inc.

    IPC分类号: G06F16/174

    CPC分类号: G06F16/1744

    摘要: A selected data chunk associated with an object is determined to be sent to a destination. A chunk compression grouping storing the selected data chunk associated with the object is identified. The identified chunk compression grouping includes a plurality of data chunks compressed together. A data content version that includes the selected data chunk associated with the object to be provided to the destination is determined from a plurality of data content versions based at least in part on a metric associated with the identified chunk compression grouping.

    Efficiently storing data in a cloud storage

    公开(公告)号:US11842060B2

    公开(公告)日:2023-12-12

    申请号:US17402198

    申请日:2021-08-13

    申请人: Cohesity, Inc.

    IPC分类号: G06F3/06

    摘要: A specification of content to be stored in a cloud storage is received at a client-side component. A portion of the content is divided into a plurality of data chunks. One or more data chunks of the plurality of data chunks that are to be sent via a network to be stored in the cloud storage are identified. It is determined whether a batch size of the one or more identified data chunks meets a threshold size. Based on the determination of whether the batch size meets the threshold size, a cloud storage destination among a plurality of different cloud storage destinations associated with different performance tiers is selected.

    Efficiently storing data in a cloud storage

    公开(公告)号:US11681454B2

    公开(公告)日:2023-06-20

    申请号:US17402206

    申请日:2021-08-13

    申请人: Cohesity, Inc.

    IPC分类号: G06F3/06

    摘要: A cloud server determines that a size of a first cloud storage element object is at least below a first threshold. In response to the first determination, a client-side component is requested to store additional data in the cloud storage element object including by having the client-side component update the first cloud storage element with an updated version that includes previously existing data of the first cloud storage element and the additional data. The first cloud storage element object is added to a set of one or more cloud storage element objects available for update. The client-side component is configured to generate an updated version of the first cloud storage element object that has a size that is greater than or equal to the first threshold.

    LARGE CONTENT FILE OPTIMIZATION
    6.
    发明申请

    公开(公告)号:US20200349138A1

    公开(公告)日:2020-11-05

    申请号:US16881942

    申请日:2020-05-22

    申请人: Cohesity, Inc.

    IPC分类号: G06F16/22 G06F16/11 G06F11/14

    摘要: A size associated with a content file is determined to be greater than a threshold size. In response to the determination, file metadata of the content file split and stored across a plurality of component file metadata structures. The file metadata of the content file specifies tree structure organizing data components of the content file and each component file metadata structure of the plurality of component file metadata structures stores a portion of the tree structure. A snapshot tree is updated to reference the plurality of component file metadata structures for the content file.

    Incremental virtual machine metadata extraction

    公开(公告)号:US10534759B1

    公开(公告)日:2020-01-14

    申请号:US16110314

    申请日:2018-08-23

    申请人: Cohesity, Inc.

    IPC分类号: G06F16/188 G06F16/11

    摘要: A virtual machine container file is analyzed to determine which portion of the virtual machine container file corresponds to a virtual machine file system metadata of the virtual machine container file. One or more differences between a first version of a virtual machine container file and a second version of the virtual machine container file are determined at least in part by traversing a snapshot structure associated with the virtual machine container file. The determined one or more differences that corresponds to the virtual machine file system metadata portion of the virtual machine container file are identified based at least in part on the analysis of the virtual machine container file. The identified one or more differences corresponding to the virtual machine file system metadata portion of the virtual machine file are utilized to identify one or more changes from the content files included in the first version of the virtual machine container file to content files included in the second version of the virtual machine container file.

    REDUCING MEMORY USAGE IN STORING METADATA
    10.
    发明公开

    公开(公告)号:US20230401003A1

    公开(公告)日:2023-12-14

    申请号:US18239004

    申请日:2023-08-28

    申请人: Cohesity, Inc.

    IPC分类号: G06F3/06

    摘要: Data associated with a source system is ingested. After the data is ingested, a post-processing metadata conversion process is performed including by selecting an entry of a chunk metadata data structure and determining that a data chunk associated with the selected entry is not referenced by at least a threshold number of objects. In response to determining that the data chunk associated with the selected entry is not referenced by at least the threshold number of objects, metadata of a tree data structure node corresponding to a chunk identifier associated with the data chunk is updated to store a reference to a chunk file storing the data chunk and the selected entry is removed from the chunk metadata data structure.