Abstract:
A back-up storage system emulates a sequential storage medium, such as tape, while storing data on a randomly accessible storage medium, such as disk. The back-up storage system includes a randomly accessible storage medium; a front-end interface to emulate the sequential storage medium, to communicate with external devices using a protocol that supports the sequential storage medium, and to receive sequential-format data from the external devices; and a back-end interface to receive the sequential-format data from the front-end interface and to store it on the randomly accessible storage medium.
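The front-end/back-end split described above can be illustrated with a minimal sketch. It assumes a tape is modeled as an ordered list of records, and that any seekable binary file object stands in for the randomly accessible medium. The class and method names (`EmulatedTape`, `write_record`, `read_record`, `rewind`) are hypothetical and are not taken from the abstract, which does not specify a protocol or record layout.

```python
import io


class EmulatedTape:
    """Tape-like sequential interface over a randomly accessible store (illustrative only)."""

    def __init__(self, backing):
        self._backing = backing   # back end: any seekable binary file object
        self._records = []        # (offset, length) of each record written so far
        self._position = 0        # index of the next record, like a tape head

    def write_record(self, data: bytes) -> None:
        # As on a real tape, writing at the head discards everything after it.
        del self._records[self._position:]
        offset = sum(length for _, length in self._records)
        self._backing.seek(offset)
        self._backing.write(data)
        self._backing.truncate()  # cut the store at the end of the new record
        self._records.append((offset, len(data)))
        self._position += 1

    def rewind(self) -> None:
        self._position = 0

    def read_record(self) -> bytes:
        if self._position >= len(self._records):
            raise EOFError("end of recorded data")
        offset, length = self._records[self._position]
        self._backing.seek(offset)  # random access hidden behind sequential reads
        data = self._backing.read(length)
        self._position += 1
        return data


tape = EmulatedTape(io.BytesIO())
tape.write_record(b"abc")
tape.write_record(b"defg")
tape.rewind()
first, second = tape.read_record(), tape.read_record()
```

The caller only ever sees sequential operations; the offsets into the backing store stay private to the emulation layer.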
Abstract:
Described are computer-based methods and apparatuses, including computer program products, for removing redundant data from a storage system. In one example, a data delineation process delineates data targeted for de-duplication into regions using a plurality of markers. The de-duplication system determines which of these regions should be subject to further de-duplication processing by comparing metadata representing the regions to metadata representing regions of a reference data set. The de-duplication system identifies an area of data that incorporates the regions that should be subject to further de-duplication processing and de-duplicates this area with reference to a corresponding area within the reference data set.
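One possible reading of this flow is sketched below. The sketch makes several assumptions the abstract does not confirm: markers are a fixed byte sequence, the metadata for a region is a SHA-256 digest, and regions whose digests also occur in the reference data set are the candidates for further processing. The function names `delineate`, `fingerprint`, and `candidate_area` are hypothetical.

```python
import hashlib


def delineate(data: bytes, marker: bytes = b"\n") -> list[bytes]:
    """Split data into regions, ending each region just after a marker."""
    regions, start = [], 0
    while True:
        idx = data.find(marker, start)
        if idx == -1:
            break
        end = idx + len(marker)
        regions.append(data[start:end])
        start = end
    if start < len(data):
        regions.append(data[start:])  # trailing region with no closing marker
    return regions


def fingerprint(region: bytes) -> str:
    """Metadata representing a region (assumed here to be a SHA-256 digest)."""
    return hashlib.sha256(region).hexdigest()


def candidate_area(data: bytes, reference: bytes, marker: bytes = b"\n"):
    """Return (first, last) region indices spanning all candidate regions, or None."""
    reference_prints = {fingerprint(r) for r in delineate(reference, marker)}
    regions = delineate(data, marker)
    flagged = [i for i, r in enumerate(regions) if fingerprint(r) in reference_prints]
    if not flagged:
        return None
    return flagged[0], flagged[-1]


area = candidate_area(b"x\nb\nc\ny\n", b"a\nb\nc\nd\n")
```

Comparing digests rather than raw bytes keeps the first pass cheap; the byte-level de-duplication of the identified area against the reference is left out of this sketch.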
Abstract:
A method for removing redundant data from a back-up storage system is presented. In one example, the method may include identifying a first back-up data object, identifying a second back-up data object, detecting a first portion of the first back-up data object that is a copy of a second portion of the second back-up data object, and replacing the second portion with a pointer to the first portion.
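The pointer replacement step can be sketched as follows. The sketch assumes fixed-size blocks and exact byte comparison, neither of which is stated in the abstract; `Pointer`, `split_blocks`, `deduplicate`, and `restore` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pointer:
    block_index: int  # index of the matching block in the first object


def split_blocks(data: bytes, size: int) -> list:
    return [data[i:i + size] for i in range(0, len(data), size)]


def deduplicate(first: bytes, second: bytes, size: int = 4) -> list:
    """Replace blocks of `second` that copy a block of `first` with pointers."""
    index = {}
    for i, block in enumerate(split_blocks(first, size)):
        index.setdefault(block, i)  # keep the earliest occurrence
    return [Pointer(index[b]) if b in index else b
            for b in split_blocks(second, size)]


def restore(first: bytes, deduped: list, size: int = 4) -> bytes:
    """Rebuild the second object by following pointers into the first."""
    first_blocks = split_blocks(first, size)
    return b"".join(
        first_blocks[item.block_index] if isinstance(item, Pointer) else item
        for item in deduped
    )


first_object = b"AAAABBBBCCCC"
second_object = b"XXXXBBBBCCCC"
deduped = deduplicate(first_object, second_object)
```

Only the unique block `b"XXXX"` is kept as data; the other two entries are pointers, and `restore` shows that the second object can still be reconstructed in full.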
Abstract:
Systems and methods for efficient storage of data are provided. For instance, a disclosed method includes acts of receiving a data stream to be stored on a first storage system, the data stream comprising a data object and metadata that describes attributes of the data object; extracting the metadata from the data stream; storing the data object on the first storage system; and storing the extracted metadata, separately from the data object, on a computer-readable medium. Additionally, a storage system is disclosed that includes a first storage medium, an interface configured to receive a data stream including a data object to be stored on the first storage medium, and a controller. The controller may be configured to extract metadata from the data stream, the metadata being descriptive of attributes of the data object, and to direct the extracted metadata to be stored on a computer-readable medium.
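The extract-and-store-separately step might look like the sketch below. The stream layout is an assumption made purely for illustration (a 4-byte big-endian length, a JSON metadata header, then the data object); the abstract does not define a format. The two dictionaries stand in for the first storage system and the separate computer-readable medium, and `pack_stream` and `store_separately` are hypothetical helpers.

```python
import io
import json
import struct


def pack_stream(metadata: dict, payload: bytes) -> bytes:
    """Build a stream in the assumed layout: length prefix, JSON header, payload."""
    header = json.dumps(metadata).encode("utf-8")
    return struct.pack(">I", len(header)) + header + payload


def store_separately(stream, object_store: dict, metadata_store: dict, key: str) -> None:
    """Extract metadata from the stream and store it apart from the data object."""
    (header_len,) = struct.unpack(">I", stream.read(4))
    metadata = json.loads(stream.read(header_len).decode("utf-8"))
    object_store[key] = stream.read()  # the data object itself
    metadata_store[key] = metadata     # attributes kept on a separate store


objects, attributes = {}, {}
incoming = io.BytesIO(pack_stream({"name": "a.txt", "size": 5}, b"hello"))
store_separately(incoming, objects, attributes, "obj-1")
```

Keeping the attributes in their own store means they can be searched or compared without reading the data objects back.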
Abstract:
A method for removing redundant data from a back-up storage system is presented. In one example, the method may include receiving an application layer data object; selecting a de-duplication domain from a plurality of de-duplication domains based at least in part on a data object characteristic associated with the de-duplication domain; determining that the application layer data object has the characteristic; and directing the application layer data object to the de-duplication domain.
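The domain selection described above can be sketched as a simple routing function. The sketch assumes the characteristic is a data-type label carried with each object and adds a fallback domain for objects that match nothing; both are illustrative assumptions, and `DedupDomain` and `route` are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class DedupDomain:
    name: str
    characteristic: str  # assumed here to be a data-type label
    objects: list = field(default_factory=list)


def route(obj: dict, domains: list, fallback: DedupDomain) -> DedupDomain:
    """Direct obj to the first domain whose characteristic it has."""
    for domain in domains:
        if obj.get("type") == domain.characteristic:
            domain.objects.append(obj)
            return domain
    fallback.objects.append(obj)  # no domain matched; the fallback is an added assumption
    return fallback


domains = [DedupDomain("db", "database"), DedupDomain("mail", "email")]
fallback = DedupDomain("misc", "other")
chosen = route({"type": "email", "data": b"..."}, domains, fallback)
unmatched = route({"type": "video", "data": b"..."}, domains, fallback)
```

Grouping like objects in one domain means each object is only compared against data that is likely to resemble it.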