Data storage method in storage system and related device

    Publication number: US11755207B2

    Publication date: 2023-09-12

    Application number: US17840982

    Application date: 2022-06-15

    CPC classification number: G06F3/0613 G06F3/0608 G06F3/0641 G06F3/0683

    Abstract: A data storage method in a storage system and a related system. The method includes: calculating a similar fingerprint of first to-be-stored data to obtain a first similar fingerprint, where the first similar fingerprint is for determining whether the first to-be-stored data is similar to stored data; determining reference data based on the first similar fingerprint, where a similar fingerprint of the reference data is the first similar fingerprint; determining first differential data between the to-be-stored data and the reference data based on the reference data, where the reference data is stored in a first storage unit; and storing the first differential data in a second storage unit, where the first storage unit and the second storage unit belong to a read range of a same read I/O.
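
The flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how the similar fingerprint or the differential data is computed, so the sketch assumes a min-hash over 8-byte shingles and a copy/add delta built with `difflib` from the standard library. The names `similar_fingerprint`, `make_delta`, `apply_delta`, and `DeltaStore` are hypothetical, and placing both storage units inside one read I/O range is not modelled.

```python
import hashlib
from difflib import SequenceMatcher

SHINGLE = 8  # assumed shingle width in bytes (not from the abstract)


def similar_fingerprint(data: bytes) -> int:
    """Min-hash over byte shingles; similar inputs often share the minimum."""
    if len(data) < SHINGLE:
        shingles = [data]
    else:
        shingles = [data[i:i + SHINGLE] for i in range(len(data) - SHINGLE + 1)]
    return min(int.from_bytes(hashlib.sha1(s).digest()[:8], "big") for s in shingles)


def make_delta(reference: bytes, data: bytes) -> list:
    """Describe `data` as copies from `reference` plus literal additions."""
    ops = []
    matcher = SequenceMatcher(None, reference, data, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            ops.append(("copy", i1, i2))
        elif j2 > j1:  # "replace" or "insert": emit the new bytes
            ops.append(("add", data[j1:j2]))
        # "delete" needs no operation: the bytes are simply not copied
    return ops


def apply_delta(reference: bytes, ops: list) -> bytes:
    """Rebuild the original data from the reference and the delta."""
    out = bytearray()
    for op in ops:
        if op[0] == "copy":
            out += reference[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)


class DeltaStore:
    """Toy store: each list slot stands in for one storage unit."""

    def __init__(self):
        self.by_fingerprint = {}  # similar fingerprint -> unit holding reference data
        self.units = []

    def write(self, data: bytes) -> int:
        fp = similar_fingerprint(data)
        ref_unit = self.by_fingerprint.get(fp)
        if ref_unit is None:
            # No stored data shares this fingerprint: keep the data whole.
            self.units.append(("full", data))
            self.by_fingerprint[fp] = len(self.units) - 1
        else:
            reference = self.units[ref_unit][1]
            self.units.append(("delta", ref_unit, make_delta(reference, data)))
        return len(self.units) - 1

    def read(self, unit: int) -> bytes:
        entry = self.units[unit]
        if entry[0] == "full":
            return entry[1]
        return apply_delta(self.units[entry[1]][1], entry[2])
```

Writing the same buffer twice stores the second copy as a delta against the first, and `read` returns the original bytes in either case. Whether two merely similar buffers share a fingerprint depends on the min-hash, so a near-duplicate may still be stored whole.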

    Intelligent deduplication in storage system based on application IO tagging

    Publication number: US11748014B2

    Publication date: 2023-09-05

    Application number: US16790789

    Application date: 2020-02-14

    CPC classification number: G06F3/0641 G06F3/0608 G06F3/0659

    Abstract: Host computers running applications that store data on a block-based storage system such as a SAN provide hints that differentiate IO data based on which application generated the IO. The hints may include tags that are associated with IO commands sent to the block-based storage system. Each host application is associated with a unique identifier that is placed in the tag. Application name-to-identifier mappings may be sent from the hosts to the block-based storage system. Per-identifier/application deduplication statistics are maintained by the block-based storage system and shared with other block-based storage systems. Deduplication is disabled or de-emphasized for IO data generated by applications with statistically low deduplication ratios.
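
A minimal Python sketch of the per-application bookkeeping described in this abstract follows. The class `TaggedDedup`, the thresholds `MIN_RATIO` and `MIN_SAMPLES`, and the return labels are all hypothetical; the abstract does not define how "statistically low" is measured, so the sketch assumes a simple hit ratio after a warm-up period, and it omits the sharing of statistics between storage systems.

```python
import hashlib
from collections import defaultdict

MIN_RATIO = 0.10    # assumed: below this hit ratio, deduplication is skipped
MIN_SAMPLES = 100   # assumed: writes observed before the ratio is trusted


class TaggedDedup:
    def __init__(self):
        self.app_names = {}                       # identifier -> application name
        self.stats = defaultdict(lambda: [0, 0])  # identifier -> [writes, duplicate hits]
        self.blocks = {}                          # fingerprint -> deduplicated block
        self.raw = []                             # blocks stored without deduplication

    def register(self, app_id: int, name: str) -> None:
        """Record a name-to-identifier mapping sent by a host."""
        self.app_names[app_id] = name

    def dedup_enabled(self, app_id: int) -> bool:
        writes, hits = self.stats[app_id]
        return writes < MIN_SAMPLES or hits / writes >= MIN_RATIO

    def write(self, app_id: int, block: bytes) -> str:
        """Handle one tagged write; `app_id` is the identifier carried in the tag."""
        if not self.dedup_enabled(app_id):
            self.raw.append(block)  # skip fingerprinting for this application
            return "bypassed"
        fingerprint = hashlib.sha256(block).digest()
        self.stats[app_id][0] += 1
        if fingerprint in self.blocks:
            self.stats[app_id][1] += 1
            return "deduplicated"
        self.blocks[fingerprint] = block
        return "stored"
```

In this sketch an application that writes only unique blocks is bypassed once `MIN_SAMPLES` writes have been counted, while an application that keeps rewriting the same block stays on the deduplication path.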

    Compression device and decompression device

    Publication number: US11742876B2

    Publication date: 2023-08-29

    Application number: US17688368

    Application date: 2022-03-07

    Abstract: According to one embodiment, an interleaving unit divides a symbol string into first and second symbols. A first coding unit converts the first symbols to first codewords. A first packet generating unit generates first packets including the first codewords. A first request generating unit generates first packet requests including sizes of variable length packets. A second coding unit converts the second symbols to second codewords. A second packet generating unit generates second packets including the second codewords. A second request generating unit generates second packet requests including sizes of variable length packets. A multiplexer outputs a compressed stream including the first and second variable length packets cut out from the first and second packets.

    System and methods for bandwidth-efficient encoding of genomic data

    Publication number: US11734231B2

    Publication date: 2023-08-22

    Application number: US17569500

    Application date: 2022-01-05

    CPC classification number: G06F16/1752 G06F3/067 G06F3/0608 G06F3/0641

    Abstract: A system and methods for bandwidth-efficient encoding of genome and bioinformatic sequence datasets comprising a sequence analyzer configured to: analyze a received sequence dataset to determine a sequence dataset file type, scan the sequence dataset to maintain a count of unique characters contained therein, identify positions where the unique character count increases by a power of two, deconstruct the sequence dataset into a plurality of sourceblocks at the identified positions, and encode the plurality of sourceblocks using a data deconstruction engine and library management module to assign each sourceblock a reference code.
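
The splitting step in this abstract can be sketched in Python as below. The phrase "increases by a power of two" is ambiguous, so the sketch assumes one reading: a cut is made at each position where the running count of distinct characters reaches 2, 4, 8, and so on. The function names and the dictionary used as a library are hypothetical stand-ins for the data deconstruction engine and library management module, and file-type detection is omitted.

```python
def split_points(seq: str) -> list:
    """Positions where the count of distinct characters reaches a power of two."""
    seen = set()
    points = []
    for i, ch in enumerate(seq):
        if ch not in seen:
            seen.add(ch)
            n = len(seen)
            # n == 1 is skipped: a cut at the first character would be empty.
            if n > 1 and (n & (n - 1)) == 0:
                points.append(i)
    return points


def deconstruct(seq: str) -> list:
    """Cut the sequence into sourceblocks at the identified positions."""
    cuts = [0] + split_points(seq) + [len(seq)]
    return [seq[a:b] for a, b in zip(cuts, cuts[1:]) if a < b]


def encode(blocks: list, library: dict) -> list:
    """Assign each sourceblock a reference code, reusing codes for known blocks."""
    return [library.setdefault(block, len(library)) for block in blocks]


def decode(codes: list, library: dict) -> str:
    """Rebuild the sequence from reference codes."""
    by_code = {code: block for block, code in library.items()}
    return "".join(by_code[code] for code in codes)


library = {}
blocks = deconstruct("AAACAAGTN")   # ['AAA', 'CAAG', 'TN']
codes = encode(blocks, library)     # [0, 1, 2]
```

For the input `AAACAAGTN`, the second distinct character (`C`) appears at index 3 and the fourth (`T`) at index 7, which gives the three sourceblocks shown in the comments.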

    GLOBAL DEDUPLICATION
    Published patent application

    Publication number: US20230259495A1

    Publication date: 2023-08-17

    Application number: US18139286

    Application date: 2023-04-25

    Applicant: Cohesity, Inc.

    Abstract: A first fingerprint corresponding to a first chunk associated with a stream of data is generated. It is determined that the first fingerprint matches a second fingerprint of a plurality of fingerprints listed in at least one entry in a deduplication map associated with a plurality of storage systems. A first storage system of the plurality of storage systems is located at a first geographic location and a second storage system of the plurality of storage systems is located at a second geographic location. The first chunk corresponding to the second fingerprint is stored by at least the second storage system. In response to a determination that the first fingerprint matches the second fingerprint, it is determined to store at the first storage system a local copy of the first chunk based in part on one or more deduplication factors. In response to the determination that the one or more deduplication factors indicate to store the local copy of the first chunk, the local copy of the first chunk is stored at the first storage system.
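
The decision described in this abstract can be sketched in Python as follows. The abstract does not name the deduplication factors, so the sketch assumes two purely illustrative ones, the latency to the remote holder and the expected number of reads. `Site`, `should_keep_local_copy`, `ingest`, and the default thresholds are hypothetical.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Site:
    """One storage system at one geographic location."""
    name: str
    chunks: dict = field(default_factory=dict)  # fingerprint -> chunk bytes


def should_keep_local_copy(latency_ms: float, expected_reads: int,
                           max_latency_ms: float = 50.0, hot_reads: int = 10) -> bool:
    """Assumed factors: keep a local copy if the remote site is slow or the chunk is hot."""
    return latency_ms > max_latency_ms or expected_reads >= hot_reads


def ingest(chunk: bytes, local: Site, dedup_map: dict,
           latency_ms: float, expected_reads: int) -> str:
    """Store or reference one chunk; dedup_map maps fingerprint -> set of site names."""
    fingerprint = hashlib.sha256(chunk).hexdigest()
    holders = dedup_map.setdefault(fingerprint, set())
    if local.name in holders:
        return "local-duplicate"
    if holders and not should_keep_local_copy(latency_ms, expected_reads):
        return "remote-reference"  # rely on the copy held at another site
    local.chunks[fingerprint] = chunk
    holders.add(local.name)
    return "stored-locally"
```

With a shared `dedup_map`, a chunk first ingested at one site is stored there; the same chunk arriving at a second site is either referenced remotely or stored again locally, depending on the assumed factors.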
