Abstract:
A system and method for demand-based data recovery operating in a computerized data processing system that includes a computer platform operable under the control of system data, including system programs, catalogs, directories, inventories and the like, and one or more user applications operable in conjunction with user data. The data processing system includes a data backup and recovery system for periodically transferring data between one or more primary data storage resources and one or more secondary data storage resources. The demand-based system and method operate to recover data from the secondary data storage resources to the primary data storage resources following a disaster event resulting in the loss of all or a portion of the data on the primary data storage resources. In accordance therewith, the system data is first recovered from the secondary data storage resources to the primary data storage resources. Execution of the one or more user applications is next commenced but the user data is not immediately recovered. Only if the application requests non-recovered user data will the requested data be recovered from the secondary data storage resources to the primary data storage resources. The demand-based system and method can be adapted to recover entire data sets (e.g. files) from a previous full backup or it can recover data subsets (e.g. records) in either a full backup mode or a change-data-only mode.
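The recovery order described above (system data first, user data only on request) can be sketched as follows. This is a minimal illustration, not the patented implementation: the `DemandRecovery` class, the dictionary-backed storage, and the file names are all hypothetical.

```python
class DemandRecovery:
    """Recover system data eagerly and user data only when it is requested."""

    def __init__(self, secondary, system_keys):
        self.secondary = secondary   # backup copy: name -> bytes (assumed layout)
        self.primary = {}            # primary storage, empty after the disaster event
        # Step 1: system data is recovered before any application starts.
        for key in system_keys:
            self.primary[key] = secondary[key]

    def read(self, key):
        # Step 2: user data is recovered only on the first request for it.
        if key not in self.primary:
            self.primary[key] = self.secondary[key]
        return self.primary[key]


backup = {"catalog": b"sys", "orders.dat": b"o1", "logs.dat": b"l1"}
store = DemandRecovery(backup, system_keys=["catalog"])
store.read("orders.dat")   # the application touches one user file
```

After this run, `store.primary` holds `catalog` and `orders.dat`, while `logs.dat` stays on secondary storage until something asks for it.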
Abstract:
A system, apparatus, and process create a table of contents (TOC), including one or more TOC entries, to manage data in a hierarchical storage management system. Each TOC entry contains metadata describing the contents and attributes of a data object within an image, which is an aggregation of multiple data objects into a single object for storage management purposes. The TOC is stored in a storage hierarchy, such as magnetic disk, for fast access to and efficient operation on the aggregated TOC entries. The system, apparatus, and process also provide for aggregating the TOC entries from one or more TOCs into a TOC set in the storage management server database. The TOC set may be manipulated and queried in order to find a particular data object or image referenced by a TOC entry. The TOC entries, TOCs, and TOC sets may be dynamically managed by the hierarchical data storage management system through implementation of a set of policy management constructs that define appropriate creation, retention, and movement of the objects within the database and storage hierarchy.
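One way to picture the relationship between TOC entries, TOCs, and a TOC set is the sketch below. The `TocEntry` fields and the `build_toc_set` and `find` helpers are assumptions made for illustration; the abstract does not specify a schema or a query interface.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TocEntry:
    image: str   # image that aggregates the data object
    path: str    # name of the data object inside the image
    size: int    # one example attribute (hypothetical)


def build_toc_set(tocs):
    """Aggregate the entries of several TOCs into one queryable TOC set."""
    return [entry for toc in tocs for entry in toc]


def find(toc_set, path):
    """Return the images that contain the given data object."""
    return [entry.image for entry in toc_set if entry.path == path]


toc_a = [TocEntry("img1", "/home/a.txt", 10), TocEntry("img1", "/home/b.txt", 20)]
toc_b = [TocEntry("img2", "/home/a.txt", 12)]
toc_set = build_toc_set([toc_a, toc_b])
images = find(toc_set, "/home/a.txt")
```

Here the query locates every image holding `/home/a.txt` without opening the images themselves, which is the purpose the abstract gives for keeping the TOC on fast storage.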
Abstract:
Provided are a system and article of manufacture for merging metadata on files in a backup storage. A query is received for information on files from a file system for a client node. A determination is made of metadata for files in a backup storage for the client node stored external to backup sets. At least one backup set in the backup storage for the client node is processed to determine metadata for files included in at least one backup set. The determined metadata for files in the backup storage and for files included in at least one backup set is merged to form merged metadata. The query is executed against the merged metadata to determine files whose metadata satisfies the query. Results including information from the merged metadata on the determined files are returned.
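The merge-then-query flow can be sketched in a few lines. The record layout (dictionaries with `name` and `size`) and the function names are hypothetical; the abstract only states that metadata from both sources is merged before the query runs.

```python
def merge_metadata(external, backup_sets):
    """Combine metadata stored outside backup sets with metadata read from them."""
    merged = list(external)
    for backup_set in backup_sets:
        merged.extend(backup_set)
    return merged


def run_query(merged, predicate):
    """Return the metadata records that satisfy the query predicate."""
    return [meta for meta in merged if predicate(meta)]


external = [{"name": "a.txt", "size": 5}]
backup_sets = [[{"name": "b.txt", "size": 50}], [{"name": "c.txt", "size": 500}]]
merged = merge_metadata(external, backup_sets)
large = run_query(merged, lambda meta: meta["size"] >= 50)
```

Running the query against the merged list means the caller gets one answer regardless of whether a file's metadata lived inside or outside a backup set.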
Abstract:
Provided are techniques for data deduplication. A chunk of data and a mapping of boundaries between file data and meta data in the chunk of data are received. The mapping is used to split the chunk of data into a file data stream and a meta data stream and to store file data from the file data stream in a first file and to store meta data from the meta data stream in a second file, wherein the first file and the second file are separate files. The file data in the first file is deduplicated.
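A minimal sketch of the split-then-deduplicate idea follows. The `(start, end, kind)` mapping format, the in-memory byte strings standing in for the two separate files, and the fixed-size, SHA-256-based deduplication are all assumptions; the abstract does not say how boundaries are encoded or which deduplication scheme is used.

```python
import hashlib


def split_chunk(chunk, mapping):
    """Split a chunk into file data and metadata using (start, end, kind) spans."""
    file_data, meta_data = bytearray(), bytearray()
    for start, end, kind in mapping:
        target = file_data if kind == "file" else meta_data
        target += chunk[start:end]
    return bytes(file_data), bytes(meta_data)


def deduplicate(data, block_size=4):
    """Store each distinct fixed-size block once; return the store and the recipe."""
    store, recipe = {}, []
    for i in range(0, len(data), block_size):
        block = data[i:i + block_size]
        digest = hashlib.sha256(block).hexdigest()
        store.setdefault(digest, block)   # keep only the first copy of a block
        recipe.append(digest)             # order needed to rebuild the data
    return store, recipe


chunk = b"HDR1AAAABBBBHDR2AAAA"
mapping = [(0, 4, "meta"), (4, 12, "file"), (12, 16, "meta"), (16, 20, "file")]
file_data, meta_data = split_chunk(chunk, mapping)
store, recipe = deduplicate(file_data)
```

Separating the streams first matters because interleaved metadata would otherwise shift block boundaries and hide the repeated `AAAA` block from the deduplicator.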
Abstract:
Provided are a method, system, and program for transferring data between system and storage in a shared buffer. An application requests a buffer from a component. The component allocates a buffer and returns to the application a first offset in the buffer for application data and a second offset in the buffer for a first header. The application writes the application data at the first offset in the buffer and writes the first header at the second offset in the buffer. A second header is written to a third offset in the buffer.
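The offset handshake can be illustrated with a single `bytearray`. The header sizes, the layout (second header, then first header, then data), and the helper names are assumptions chosen for the sketch; the abstract does not fix where each region sits.

```python
HEADER1_SIZE = 8   # assumed space reserved for the application's header
HEADER2_SIZE = 4   # assumed space reserved for the second header


def allocate(payload_size):
    """Allocate one buffer and return it with the data and first-header offsets."""
    buf = bytearray(HEADER2_SIZE + HEADER1_SIZE + payload_size)
    header1_offset = HEADER2_SIZE
    data_offset = HEADER2_SIZE + HEADER1_SIZE
    return buf, data_offset, header1_offset


def write_at(buf, offset, data):
    """Copy data into the buffer in place, without reallocating it."""
    buf[offset:offset + len(data)] = data


payload = b"hello"
buf, data_offset, header1_offset = allocate(len(payload))
write_at(buf, data_offset, payload)          # application data at the first offset
write_at(buf, header1_offset, b"APPHDR01")   # first header at the second offset
write_at(buf, 0, b"CMP1")                    # second header at a third offset
```

Because every writer targets its own region of the same buffer, the finished record can be handed to storage without copying the payload into a new, larger buffer for each header.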
Abstract:
Systems, methods, and computer products for separating file data streams for improved progressive incremental processing are provided. The method comprises identifying data in a file, prior to transmission of the data in a data stream to a second system, such that the second system can distinguish first data from second data in the file. The method further comprises processing the first and second data to determine whether the first or second data has changed, and transmitting the first data to the second system in response to determining the first data has changed, so that the first data is stored in a data storage medium in association with third and fourth data previously stored without replacing the third data and fourth data, wherein the third and fourth data comprise older versions of the first and second data, respectively.
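The sketch below shows one reading of the method: a file is treated as two separately identified parts, only a changed part is transmitted, and the receiver appends it as a new version instead of overwriting the old one. The part names (`content`, `attrs`), the digest-based change check, and the `VersionStore` class are hypothetical.

```python
import hashlib


def digest(data):
    """Fingerprint used to decide whether a part changed (an assumed mechanism)."""
    return hashlib.sha256(data).hexdigest()


class VersionStore:
    """Second system: keeps every version of each part instead of replacing it."""

    def __init__(self):
        self.versions = {"content": [], "attrs": []}

    def store(self, part, data):
        self.versions[part].append(data)


def backup(parts, last_digests, target):
    """Send only the parts whose digest changed since the previous backup."""
    sent = []
    for part, data in parts.items():
        if last_digests.get(part) != digest(data):
            target.store(part, data)
            last_digests[part] = digest(data)
            sent.append(part)
    return sent


target, seen = VersionStore(), {}
first_run = backup({"content": b"v1", "attrs": b"rw"}, seen, target)
second_run = backup({"content": b"v2", "attrs": b"rw"}, seen, target)
```

On the second run only `content` is sent, and the store ends up with both `v1` and `v2` of the content next to the single unchanged attribute version.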
Abstract:
Provided are techniques for accessing information. A partial key index is created, wherein the partial key index includes one or more partial key index (PKI) entries, wherein each PKI entry identifies a portion of a metadata object, wherein the metadata object includes metadata entries, each of which corresponds to an associated data object. The partial key index is used to locate one or more PKI entries, wherein each of the one or more PKI entries corresponds to one or more data objects in one of the one or more composite objects. Each of the one or more PKI entries is used to retrieve location information of metadata in a metadata object for each of the one or more composite objects. The metadata from each metadata object is loaded into a database using the location information.
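As a rough illustration, a partial key index can map a key prefix to the slice of a metadata object that holds the matching entries, so that only that slice is loaded into the database. The prefix-based keying, the `(start, end)` location format, and the dictionary standing in for the database are assumptions; the abstract does not define them.

```python
def build_partial_key_index(metadata_entries, prefix_len=1):
    """Map each key prefix to the (start, end) slice of the sorted metadata object."""
    index = {}
    for pos, (key, _meta) in enumerate(metadata_entries):
        prefix = key[:prefix_len]
        start, _end = index.get(prefix, (pos, pos))
        index[prefix] = (start, pos + 1)
    return index


def load_portion(metadata_entries, index, prefix, database):
    """Load only the metadata located by one PKI entry into the database."""
    start, end = index.get(prefix, (0, 0))
    database.update(metadata_entries[start:end])
    return database


# Metadata object: entries sorted by key, one per data object.
metadata_object = [
    ("apple", {"size": 1}),
    ("avocado", {"size": 2}),
    ("banana", {"size": 3}),
]
pki = build_partial_key_index(metadata_object)
database = load_portion(metadata_object, pki, "a", {})
```

The index stays small because it records one location per prefix, not one per data object, and the database receives only the entries a lookup actually needs.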
Abstract:
Disclosed is a system for backing up files in a distributed computing system. A file server maintains files in a shared name space. The file server provides a first backup client program and a second backup client program with access to the files in the shared name space. The first backup client program initiates a backup request to back up a requested file. A determination is made as to whether the requested file is maintained in the shared name space. The backup request is transmitted to the second backup client program upon determining that the requested file is maintained in the shared name space. The second backup client program transmits a message to the file server to provide the requested file. The file server transmits the requested file to the second backup client program. The second backup client program then transmits the requested file to a backup server program. The backup server program stores the requested file in a storage device.
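The request routing can be sketched with three small classes. All class names, the `peer` parameter, and the in-memory dictionaries are hypothetical stand-ins for the file server, the two backup client programs, and the storage device.

```python
class FileServer:
    """Holds files and records which of them live in the shared name space."""

    def __init__(self, files, shared):
        self.files = files     # path -> bytes
        self.shared = shared   # set of paths in the shared name space


class BackupServer:
    def __init__(self):
        self.storage = {}      # stands in for the storage device

    def store(self, path, data):
        self.storage[path] = data


class BackupClient:
    def __init__(self, name, file_server, backup_server, peer=None):
        self.name = name
        self.file_server = file_server
        self.backup_server = backup_server
        self.peer = peer       # client that handles shared-name-space files

    def backup(self, path):
        """Back up one file and return the name of the client that handled it."""
        if self.peer is not None and path in self.file_server.shared:
            return self.peer.backup(path)     # forward the backup request
        data = self.file_server.files[path]   # file server provides the file
        self.backup_server.store(path, data)
        return self.name


server = FileServer({"/shared/a": b"A", "/local/b": b"B"}, shared={"/shared/a"})
vault = BackupServer()
second = BackupClient("second", server, vault)
first = BackupClient("first", server, vault, peer=second)
handled_by = first.backup("/shared/a")
```

Routing every shared file through one designated client means the file reaches the backup server once, however many clients can see it in the shared name space.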