摘要:
A file system maintains a series of read-only snapshot copies of a production file system. A read-write snapshot copy is created based on a selected read-only snapshot copy by maintaining a set of save volume blocks of new data of the read-write snapshot copy. A block of new data is written to the read-write snapshot copy by allocating a save volume block and writing to the save volume block. A specified block is read from the read-write snapshot copy by checking whether there is a respective save volume block allocated to the specified block, and if so, reading from the respective save volume block, and if not, reading from the read-only snapshot copy upon which the read-write snapshot copy is based. The read-write snapshot copy can be refreshed with a specified read-only snapshot copy. The production file can be restored with a specified read-write snapshot copy.
摘要:
A data storage system is constructed to rapidly respond to a backup request by streaming backup data from primary storage to tape. It is desirable to permit the data to be removed from the primary storage at a faster rate than it can be written to tape. The backup data is buffered in a memory buffer, and when the memory buffer becomes substantially full, a portion of the backup data is buffered in disk storage. When the memory buffer becomes substantially empty, the portion of the backup data in the disk storage is written to tape. In a preferred embodiment, the memory buffer is in random access memory of a data mover computer that transfers the backup data from primary storage to a tape library unit. When the memory buffer becomes full, the data mover stores the overflow in a cached disk storage subsystem. When the memory buffer becomes empty, the data mover retrieves the overflow from the cached disk storage subsystem and transmits the overflow to the tape library unit. In this fashion, the sequence of data tracks as stored on the tape can be different from the sequence of data tracks as stored in the primary storage. To facilitate restoration of the primary storage with a backup version, the data tracks as stored in records on the tape are identified by a track number included in each record on the tape.
摘要:
A snapshot copy of a production data set is maintained while a host may continue write access to the production data set. The data storage system responds to a host request to write to a storage location of the production data set by checking whether or not the storage-location has been modified since the time when the snapshot copy was created, and upon finding that the storage location of the production data set has not been modified, copying data from the storage location of the production data set to an allocated storage location of the snapshot copy, and after copying data from the storage location of the production data set to the allocated storage location of the snapshot copy, performing the write operation upon the storage location of the production data set. In the preferred implementation, the data storage system allocates to the snapshot copy a bit map to indicate storage locations in the production data set that have been modified, and a list of pointers to allocated storage locations for the snapshot copy. The snapshot copy facility is useful so that a host write operation upon a storage location being backed up need not be delayed until original data in the storage location is written to secondary storage. The snapshot copy facility is also useful for other applications such as transaction processing and debugging.
摘要:
A method is used in encrypting in deduplication systems. A deduplicating technique is applied to a data object. Applying the deduplicating technique includes computing a digest of the data object. An encryption key is derived from the digest of the data object. The data object is encrypted using the encryption key.
摘要:
A technique of identifying IO hot spots is performed in a data storage apparatus (e.g., a file server). The technique involves updating, in response to host IO operations which lock ranges of extents prior to accessing the ranges of extents, contents of a lock history database based on the ranges of extents which were locked by the host IO operations. The technique further involves receiving a lock history request. The technique further involves providing, in response to the lock history request, the contents of the lock history database to identify, as the IO hot spots, extents which were locked by the host IO operations.
摘要:
A method is used in managing deduplication density. A deduplication density of a data object is determined. The deduplication density is indicated by a number of deduplicated data objects represented by the data object. Based on the deduplication density of the data object, the data object is skipped for applying a deduplicating technique to the data object. An association is created between the data object and another data object. The other data object is used for applying the deduplicating technique where contents of the other data object are identical to contents of the data object.
摘要:
A method is used in recovering in deduplication systems. Metadata of a data object is evaluated for determining deduplication status for the data object. Based on the deduplication status, the data object is recovered.
摘要:
One embodiment of the invention relates to the transfer of content between a host computer that issues OAS access requests and a block I/O storage system. Specifically, a host computer may issue an access request for a content unit that identifies the content unit is an object identifier. The request may be received by a second server, which may determine the block address(es) on the block I/O storage system at which the content unit is stored. A request may then be sent to the block I/O storage system to retrieve the content stored at the requested block address(es) and the block I/O storage system may return the content.
摘要:
One embodiment of the invention is directed to the containerization of content units that are accessed using an identifier. Applicants have appreciated that file systems sometimes impose limits on the number of files that may be stored therein that are too restrictive. Thus, in one embodiment, multiple content units may be stored in a single file, called a container file, in a file system. Each content unit may have an identifier associated with it. When an accessing entity requests access to a previously-stored content unit and provides the identifier for the content unit, the identifier may be used to locate the container file in which the content unit is stored.
摘要:
One embodiment of the invention relates to the transfer of content between a host computer that issues OAS access requests and a block I/O storage system. Specifically, a host computer may issue an access request for a content unit that identifies the content unit is an object identifier. The request may be received by a second server, which may determine the block address(es) on the block I/O storage system at which the content unit is stored. A request may then be sent to the block I/O storage system to retrieve the content stored at the requested block address(es) and the block I/O storage system may return the content.