Method and system for improving efficiency in the management of data references

    公开(公告)号:US12001411B2

    公开(公告)日:2024-06-04

    申请号:US18161592

    申请日:2023-01-30

    CPC分类号: G06F16/215 G06F16/2358

    摘要: Methods, computer program products, and computer systems for the management of data references in an efficient and effective manner are disclosed. Such methods, computer program products, and computer systems include receiving a change tracking stream at the computer system, identifying a data object group, and performing a deduplication management operation on the data object group. The change tracking stream is received from a client computing system. The change tracking stream identifies one or more changes made to a plurality of data objects of the client computing system. The identifying is based, at least in part, on at least a portion of the change tracking stream. The data object group represents the plurality of data objects.

    Methods and systems for affinity aware container prefetching

    公开(公告)号:US11868214B1

    公开(公告)日:2024-01-09

    申请号:US16836472

    申请日:2020-03-31

    IPC分类号: G06F11/14 G06N20/00

    摘要: Disclosed are techniques that provide for deduplication in an efficient and effective manner. For example, such methods, computer program products, and computer systems can include generating new feature information for one or more portions of a new backup image, generating first container range information by performing a container range calculation using the new feature information, generating existing feature information for one or more portions of an existing backup image, generating second container range information by performing the container range calculation using the existing feature information, determining a container range affinity between the first container range information and the second container range information, identifying at least one portion of the one or more portions of the existing backup image using a result of the determining, and prefetching the one or more fingerprints corresponding to the at least one portion of the one or more portions of the existing backup image.

    Methods and systems for data resynchronization in a replication environment

    公开(公告)号:US11847139B1

    公开(公告)日:2023-12-19

    申请号:US17897583

    申请日:2022-08-29

    IPC分类号: G06F16/00 G06F16/27

    CPC分类号: G06F16/273

    摘要: Methods, computer program products, computer systems, and the like are disclosed that provide for scalable deduplication in an efficient and effective manner. For example, such methods, computer program products, and computer systems can include determining, at a source site, whether metadata has been received from a target site, and, in response to a determination that the metadata has been received at the source site, retrieving the at least one unit of the source data from the source data store using the metadata and sending, from the source site, the at least one unit of source data to the target site.

    METHOD AND SYSTEM FOR DATA CONSISTENCY ACROSS FAILURE AND RECOVERY OF INFRASTRUCTURE

    公开(公告)号:US20230393771A1

    公开(公告)日:2023-12-07

    申请号:US18237296

    申请日:2023-08-23

    IPC分类号: G06F3/06 G06F11/14

    摘要: A method and system for data consistency across failure and recovery of infrastructure. In one embodiment of the method, copies of first data blocks stored in a source memory are sent to a target site via a data link. While sending one or more of the copies of the first data blocks to the target site, source hashes for second data blocks stored in the source memory are calculated, wherein the first data blocks are distinct from the second data blocks. While sending one or more of the copies of the first data blocks to the target site, target hashes of data blocks stored in a target memory of the target site are received. While sending one or more of the copies of the first data blocks to the target site, the source hashes are compared with the target hashes, respectively. After sending the first data blocks to the target site via the data link, copies of only those second data blocks are sent to the target site with source hashes that do not compare equally with respective target hashes.

    METHODS AND SYSTEMS FOR SCALABLE DEDUPLICATION

    公开(公告)号:US20230350863A1

    公开(公告)日:2023-11-02

    申请号:US18347395

    申请日:2023-07-05

    IPC分类号: G06F16/215

    CPC分类号: G06F16/215

    摘要: Methods, computer program products, computer systems, and the like are disclosed that provide for scalable deduplication. Such methods, computer program products, and computer systems can include, in response to receiving a request to perform a lookup operation, performing the lookup operation and, in response to the signature not being found, forwarding the request to a remote node. Further, in response to receiving an indication that the signature was not found at the remote node, processing the subunit of data as a unique subunit of data.

    Systems and methods for producing message search recommendations

    公开(公告)号:US11657093B1

    公开(公告)日:2023-05-23

    申请号:US16906204

    申请日:2020-06-19

    发明人: Mirang Parikh

    摘要: The disclosed computer-implemented method for producing message search recommendations may include (i) providing a search bar for searching a corpus of network messages such that the search bar is configured to enable a user to search the network messages by specifying both a specialized keyword that designates a separate common field for searching the network messages and a value that corresponds to the separate common field, (ii) detecting, as the user types the specialized keyword that the user is inputting the specialized keyword, and (iii) presenting, in response to detecting that the user is inputting the specialized keyword, a recommended different specialized keyword that has been used in conjunction with the detected specialized keyword in search queries rather than simply recommending a value that corresponds to the detected specialized keyword. Various other methods, systems, and computer-readable media are also disclosed.

    Context-driven data backup and recovery

    公开(公告)号:US11409610B1

    公开(公告)日:2022-08-09

    申请号:US16836997

    申请日:2020-04-01

    IPC分类号: G06F11/14

    摘要: Disclosed herein are systems, methods, and processes to perform context-driven (or context-based) data backup and recovery operations. A request to perform a backup operation on a dataset is received. Current external context datasets related to the dataset and generated based on prioritization techniques are collected from computing devices. a saved context dataset is generated based on the current external context datasets. The backup operation is performed by storing a backup image that includes at least a portion of the dataset and the saved context dataset.

    Systems and methods for agentless and accelerated backup of a database

    公开(公告)号:US11372732B2

    公开(公告)日:2022-06-28

    申请号:US16800322

    申请日:2020-02-25

    摘要: The disclosed computer-implemented method for agentless and accelerated backup of a database may include, receiving, by a data backup device from a data server, blocks of data that provide a full backup of data of the data server. The method additionally includes receiving, by the data backup device from the data server, one or more native logs indicating one or more transactions performed by the data server. The method also includes determining, by the data backup device and based on the native logs, one or more changed blocks of the blocks of data. The method further includes providing, by the data backup device, a point in time restore of the data server by creating a synthetic full backup that overlays one or more of the blocks of data with the one or more changed blocks, and that shares remaining blocks of the blocks of data with the full backup.

    Low cost, heterogeneous method of transforming replicated data for consumption in the cloud

    公开(公告)号:US11366724B2

    公开(公告)日:2022-06-21

    申请号:US17161765

    申请日:2021-01-29

    摘要: Disclosed are methods and the like that provide for transforming replicated data for consumption in the cloud, for example. Such methods can include attaching a target gateway node at a secondary site to a storage device at the secondary site, searching for an identifier stored in the storage device, and storing replicated data in the replication volume. The identifier is associated with an offset stored in the storage device, and the offset identifies a starting location of a replication volume in the storage device. The replicated data is received by the target gateway node from a source gateway node at a primary site. A starting location is received with the replicated data. The target gateway node stores the replicated data at a first location in the storage volume, and the first location is determined based, at least in part, on the starting location and the first storage location.