Continuous learning techniques with machine learning systems to adapt to new input data sets

    公开(公告)号:US11514295B2

    公开(公告)日:2022-11-29

    申请号:US16663682

    申请日:2019-10-25

    Inventor: Sorin Faibish

    Abstract: Continuous learning may include receiving a first neural network trained using a first training data set to predict outputs; determining whether the first neural network has a successful prediction rate greater than a prediction threshold; and responsive to determining the first neural network does not have a successful prediction rate greater than the prediction threshold, performing processing. The processing may include training the first neural network using a second training data set different than the first training data set; determining that a trigger condition has occurred, wherein the trigger condition includes detecting, during training of the first neural network using the second training data set, that a first weight of the first neural network has a corresponding weight change exceeding a threshold weight change; responsive to determining the trigger condition has occurred, reconfiguring the first neural network; and training the reconfigured first neural network using the second training data set.

    Deduplication of large block aggregates using representative block digests

    公开(公告)号:US10921987B1

    公开(公告)日:2021-02-16

    申请号:US16527894

    申请日:2019-07-31

    Abstract: A method of performing deduplication includes (1) receiving a write command that specifies a set of data, the set of data including multiple blocks of data, (2) hashing a subset of the set of data, yielding a representative digest of the set of data, and (3) performing deduplication on the set of data based at least in part on matching the representative digest to a digest already stored in a database which relates digests to locations of data from which the digests were produced. An apparatus, system, and computer program product for performing a similar method are also provided.

    Multi-tier storage system with direct client access to archive storage tier

    公开(公告)号:US10901943B1

    公开(公告)日:2021-01-26

    申请号:US15282076

    申请日:2016-09-30

    Abstract: A multi-tier storage system is provided with direct client access to an archive storage tier for input/output operations. An exemplary method comprises communicating over a network with (i) a cluster file system on a first storage tier, and (ii) a second archive storage tier comprising an object store; providing a client of the cluster file system with access to one or more files in the cluster file system on the first storage tier; and executing a translation shim to provide the client of the cluster file system with one or more of read and write access to one or more files on the second archive storage tier. The translation shim converts between the protocols of the cluster file system and the protocols of the second archive storage tier, to allow unmodified applications to optionally access the second archive storage tier using existing cluster file system protocols.

    SUB-BLOCK DATA DEDUPLICATION
    10.
    发明申请

    公开(公告)号:US20200341668A1

    公开(公告)日:2020-10-29

    申请号:US16393061

    申请日:2019-04-24

    Abstract: Techniques for data processing may include: determining one or more sub-blocks of a target block that match one or more sub-blocks of a candidate block; creating a shared sub-block mapping (SSM) structure having a plurality of entries, wherein each of the plurality of entries corresponds to a different one of the sub-blocks in the candidate block and wherein a value stored in said each entry, corresponding to one of the sub-blocks of the candidate block, identifies a sub-block of the target block matching said one sub-block of the candidate block; and storing the candidate block as a deduplicated block sharing at least one sub-block with the target block. The SSM structure may be stored as a metadata structure of the candidate block to identify deduplicated sub-blocks of the candidate block and to identify sub-blocks of the target block providing content for the deduplicated sub-blocks of the candidate block.

Patent Agency Ranking