摘要:
According to one or more embodiments of the present invention, a network cache intercepts data requested by a client from a remote server interconnected with the cache through one or more wide area network (WAN) links (e.g., for Wide Area File Services, or “WAFS”). The network cache stores the data and sends the data to the client. The cache may then intercept a first write request for the data from the client to the remote server, and determine one or more portions of the data in the write request that changed from the data stored at the cache (e.g., according to one or more hashes created based on the data). The network cache then sends a second write request for only the changed portions of the data to the remote server.
摘要:
A technique for eliminating duplicate data is provided. Upon receipt of a new data set, one or more anchor points are identified within the data set. A bit-by-bit data comparison is then performed of the region surrounding the anchor point in the received data set with the region surrounding an anchor point stored within a pattern database to identify forward/backward delta values. The duplicate data identified by the anchor point, forward and backward delta values is then replaced in the received data set with a storage indicator.
摘要:
A method for stream allocation in a data storage system begins by determining whether a stream needs storage space. Storage space for the stream is requested, and a device on which to allocate the storage space is selected. The storage space on the selected device is allocated to the stream.
摘要:
The present invention is a tape library comprising both a small front-end portion and a larger back-end portion. The front-end portion is a visible virtual or physical tape library; and the back-end portion is a large invisible virtual or physical tape vault. The DPA is a typical DPA which only sees the front-end library. If desired, all messages regarding the importing and exporting of tapes are reviewed by an agent. The agent manages the automatic transfer of tapes between the front-end and the back-end, and notifies the user when a manual import or export is needed. This greatly reduces both the amount of hardware required to establish a virtual or physical tape library and also reduces the amount of labor required by a user to physically import and export tapes. If both the front-end library and, the back-end vault are virtual and reside on the same virtual tape library appliance, no data movement is necessary and the tapes can be moved merely by performing small software state changes. In this case, manual operation may be acceptable.
摘要:
According to one or more embodiments of the present invention, a network cache intercepts data requested by a client from a remote server interconnected with the cache through one or more wide area network (WAN) links (e.g., for Wide Area File Services, or “WAFS”). The network cache stores the data and sends the data to the client. The cache may then intercept a first write request for the data from the client to the remote server, and determine one or more portions of the data in the write request that changed from the data stored at the cache (e.g., according to one or more hashes created based on the data). The network cache then sends a second write request for only the changed portions of the data to the remote server.
摘要:
A sampling based technique for eliminating duplicate data (de-duplication) stored on storage resources, is provided. According to the invention, when a new data set, e.g., a backup data stream, is received by a server, e.g., a storage system or virtual tape library (VTL) system implementing the invention, one or more anchors are identified within the new data set. The anchors are identified using a novel anchor detection circuitry in accordance with an illustrative embodiment of the present invention. Upon receipt of the new data set by, for example, a network adapter of a VTL system, the data set is transferred using direct memory access (DMA) operations to a memory associated with an anchor detection hardware card that is operatively interconnected with the storage system. The anchor detection hardware card may be implemented as, for example, a FPGA is to quickly identify anchors within the data set. As the anchor detection process is performed using a hardware assist, the load on a main processor of the system is reduced, thereby enabling line speed de-duplication.
摘要:
According to one or more embodiments of the present invention, a network cache intercepts data requested by a client from a remote server interconnected with the cache through one or more wide area network (WAN) links (e.g., for Wide Area File Services, or “WAFS”). The network cache stores the data and sends the data to the client. The cache may then intercept a first write request for the data from the client to the remote server, and determine one or more portions of the data in the write request that changed from the data stored at the cache (e.g., according to one or more hashes created based on the data). The network cache then sends a second write request for only the changed portions of the data to the remote server.
摘要:
The present invention is a tape library comprising both a small front-end portion and a larger back-end portion. The front-end portion is a visible virtual or physical tape library; and the back-end portion is a large invisible virtual or physical tape vault. The DPA is a typical DPA which only sees the front-end library. If desired, all messages regarding the importing and exporting of tapes are reviewed by an agent. The agent manages the automatic transfer of tapes between the front-end and the back-end, and notifies the user when a manual import or export is needed. This greatly reduces both the amount of hardware required to establish a virtual or physical tape library and also reduces the amount of labor required by a user to physically import and export tapes. If both the front-end library and the back-end vault are virtual and reside on the same virtual tape library appliance, no data movement is necessary and the tapes can be moved merely by performing small software state changes. In this case, manual operation may be acceptable.
摘要:
A method and an apparatus to store data patterns are presented. In one embodiment, the method includes searching a pattern repository to find prior copies of a pattern and to reference one of the prior copies, or insert a new copy, based on the access time of the prior copy and the effect on the sequential stream performance.
摘要:
A technique for eliminating duplicate data is provided. Upon receipt of a new data set, one or more anchor points are identified within the data set. A bit-by-bit data comparison is then performed of the region surrounding the anchor point in the received data set with the region surrounding an anchor point stored within a pattern database to identify forward/backward delta values. The duplicate data identified by the anchor point, forward and backward delta values is then replaced in the received data set with a storage indicator.