摘要:
A method, apparatus, and article of manufacture containing instructions for processing multiple point-in-time consistent data sets. The method consists of creating multiple point-in-time data sets associated with a backup appliance which is associated with backup storage. Upon the transfer of a first update from a primary storage controller to the backup appliance, a determination is made whether the update affects data stored in the backup storage has been unchanged following the creation of the respective data sets. If the update does affect data which is unchanged following the creation of a data set, the current, non-updated data is read from the backup storage. A copy of the current, non-updated data thus read from the backup storage is stored in a storage location which can be one of the data sets or an independent memory location. In addition, pointers are associated with the remaining applicable data sets and directed to the copy at its storage location. Subsequently, the update is applied to the data on the backup storage device. In the event an update is subsequently transferred from the primary storage controller to the backup appliance which affects data stored in backup storage which has already been changed since the creation of an applicable data set, no further data is written to the applicable data set and the update is applied to the backup storage.
摘要:
An apparatus, system, and method are disclosed for efficient creation of point-in-time versions of data using differencing in a read-before-write environment. Included in an embodiment of the invention are a receive module that receives a data block sent from a primary data storage system, a read module that reads a data block from a secondary data storage device into random access memory, the data block corresponding to the data block sent from the primary data storage system, a comparison module that compares the data block sent from the primary data storage system to the data block from the secondary data storage device and creates a differencing data file, a compression module that compresses the differencing data file and to create a compressed differencing data file, and a differencing storage module stores the compressed differencing data file in a differencing storage device.
摘要:
Disclosed is a technique for transferring data using multiple backup components. Responsibility for a portion of data is assigned to a first backup component. When a data update for the portion of data is received at the first backup component from a primary source, the data update is mirrored to a second backup component that is not assigned responsibility for the portion of data. Also disclosed is a technique for processing data updates with a group of backup components. It is determined that a new backup component is active. Near an end of a consistent transactions set formation period, responsibility for one or more portions of data are assigned to each backup component in the group and to the new backup component. During a next consistent transactions set formation period, data updates are processed with each backup component in the group and the new backup component. Moreover, disclosed is a technique for processing data updates with a group of backup components. It is determined that a first backup component in the group is no longer available. Portions of data for which the first backup component had been assigned responsibility are reassigned to each of the other backup components in the group.
摘要:
Disclosed are a system, a method, and a computer program product to optimize the overall latency of transferring data from peer computers to storage devices. The latency optimization occurs after a group of data is received and organized by the peer computers. The average amount of time to transfer data to each particular storage device from the peer computers is used to determine the assignment of data transfers between the peer computers. Each peer computer maintains continuously updated measurements of the average time to transfer data to each storage device. The updated transfer time measurements are based upon a moving average with adjustable weighting of past and present measurements enabling the system to adapt to changing conditions.
摘要:
Disclosed are a system, a method, and a computer program product to provide for the optimization of the output transfer load balance between the peer computers transferring data to one or more storage devices. The peer computers receive, organize and transfer the data to storage devices. The data set is composed of a plurality of data transfers. After an initial division of the data transfers between the two peers, each peer will have assigned responsibility for a number of data transfers. If the one of the peer computers completes offloading transactions earlier than the other peer, then the peer that is still transferring data will employ the other peer to execute a portion of the remaining data transfers. The operation of the system is symmetrical in that either peer may assist the other peer depending upon which peer has idle time. In addition the operation is autonomous and self-adjusting resulting in the peer nodes optimizing the size of the portion of data transfers that are reassigned during the operation of the invention resulting in the minimization of idle time for either peer. The self-adjusting feature allows the system to react to changing conditions that affect data transfer rates to the storage devices.
摘要:
A method to optimize the transmission of data from (N) primary backup appliances interconnected to a plurality of second backup appliances by a single communication link, wherein (N) is greater than 1, by transferring a data set to one or more secondary backup appliances by two or more of the (N) primary backup appliances using the communication link, and completing those transfers of the data sets by the two or more primary backup appliances at the same time.
摘要:
An apparatus, system, and method are disclosed for determining prefetch data. A start module communicates a start of a target software process to a storage device. A learning module learns data blocks accessed for the target software process. In one embodiment, a prefetch module prefetches the learned data blocks in response to the start of the target software process. An end module communicates the end of the target software process to the storage device. In one embodiment, the prefetch module terminates prefetching data blocks and the learning module terminates learning the data blocks accessed for the target software process in response to the end module's communication of the end of the target software process.
摘要:
Provided are a method, system, and article of manufacture wherein a command is received for writing data to a first storage location. A determination is made that previously written data is stored in the first storage location. The previously written data is copied to a second storage location, in response to determining that the previously written data is stored in the first storage location. The data is written to the first storage location, in response to copying the previously written data to the second storage location.
摘要:
In a data processing environment, data is replicated to a remote or secondary storage device in a manner which reduces the adverse performance effects and inefficient bandwidth usage imposed by the conventional one-transaction-at-a-time process. Transactions to be transferred are grouped by a replication manager by selecting transactions having start times earlier than the completion time of a first transaction. Thus, no transaction in a group will be dependent upon any other transaction in the group. Once selected, all transactions in the group may then be transferred to the secondary storage device.
摘要:
Provided are a method, system, and article of manufacture, wherein a controller receives a request from one of a plurality of hosts. The controller determines whether a primary storage control unit coupled to the controller is operational. A response is generated by accessing the primary storage control unit, in response to determining that the primary storage control unit is operational. The response is generated by accessing a secondary storage control unit, in response to determining that the primary storage control unit is not operational, wherein data is replicated synchronously from the primary storage control unit to the secondary storage control unit.