Abstract:
Described are techniques for data storage system modeling. Received at a first data storage system is information representing a workload for I/O operations directed to a set of one or more logical devices having storage provisioned on physical devices of a second data storage system. Information representing the workload may be obtained by performing discovery processing to discover performance characteristics of the physical devices of the second data storage system. Discovery processing may include receiving, at the first data storage system, I/O operations from a client directed to the set of one or more logical devices having storage provisioned on the physical devices of the second data storage system. The I/O operations are then issued to the second data storage system. In response, performance data representing the workload for the I/O operations is obtained at the first data storage system. Performance of the second data storage system is modeled in accordance with the workload.
Abstract:
Described are techniques for performing data storage optimizations. A data storage optimizer on a first data storage system connected to a set of other data storage systems receives performance classification information. The first data storage system includes a first set of storage devices of a first performance classification. The set of other data storage systems includes a second set of storage devices of a second performance classification different from the first performance classification. A write operation is received from a host to write first data to a target address of a virtually provisioned device. It is determined that physical storage is not currently allocated for the target address of the virtually provisioned device. A first portion of physical storage is allocated for storing the first data. The first portion of physical storage is included in a storage device of the second set having the second performance classification.
Abstract:
Identifying data for placement in a storage system having a plurality of storage classes includes: subdividing the data into portions; for each of the portions, independently determining at least one score based on a metric corresponding to access of that portion, where the score for each portion is independent of the scores for the other portions; and identifying sub-portions of data for placement in a particular storage class based on the at least one score of the portion of data corresponding to the sub-portions. The at least one score may be based on short term access statistics and long term access statistics. The access statistics may include read misses, writes, and prefetches.
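
The per-portion scoring described in this abstract can be illustrated with a minimal Python sketch. The abstract gives no formula, so the weights in `WEIGHTS`, the `short_weight` blend factor, and the names `AccessStats`, `portion_score`, and `pick_for_fast_class` are all assumptions made for illustration. Each score depends only on that portion's own statistics.

```python
from dataclasses import dataclass


@dataclass
class AccessStats:
    read_misses: float
    writes: float
    prefetches: float


# Hypothetical weights; the abstract names the statistics but not their weighting.
WEIGHTS = {"read_misses": 1.0, "writes": 0.5, "prefetches": 0.25}


def weighted_rate(stats: AccessStats) -> float:
    """Combine the three access statistics into a single activity figure."""
    return (WEIGHTS["read_misses"] * stats.read_misses
            + WEIGHTS["writes"] * stats.writes
            + WEIGHTS["prefetches"] * stats.prefetches)


def portion_score(short_term: AccessStats, long_term: AccessStats,
                  short_weight: float = 0.5) -> float:
    """Blend short term and long term activity (assumed linear blend)."""
    return (short_weight * weighted_rate(short_term)
            + (1.0 - short_weight) * weighted_rate(long_term))


def pick_for_fast_class(portions: dict, min_score: float) -> list:
    """Return names of portions whose own score meets the cutoff.

    Each portion is scored independently of every other portion.
    """
    return [name for name, (short, long_) in portions.items()
            if portion_score(short, long_) >= min_score]


portions = {
    "p0": (AccessStats(10, 4, 0), AccessStats(2, 2, 0)),
    "p1": (AccessStats(0, 0, 4), AccessStats(0, 0, 0)),
}
selected = pick_for_fast_class(portions, min_score=5.0)
```

Here `p0` is busy in both windows and is selected, while `p1` sees only a few short term prefetches and is not.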
Abstract:
Managing data in a storage system having a plurality of classes of storage includes determining an amount of data to be provided on at least one of the classes of storage according to a policy, dynamically setting a threshold according to the amount of data to be provided on the at least one of the classes or an expected performance based on the threshold, and placing data on particular classes of storage based on the threshold. Dynamically setting a threshold may include sorting data portions according to at least one score thereof and may include determining a particular score corresponding to the amount of data to be provided. The data portions may be provided in a histogram having a horizontal scale corresponding to a score value and a vertical scale corresponding to a number of data portions having a particular value.
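
One way to picture the dynamic threshold is the following Python sketch. It assumes capacity is counted in whole data portions and that a score bucket is either admitted entirely or not at all; neither detail is stated in the abstract, and the function names are hypothetical. The histogram is walked from the highest score downward until the next bucket would exceed the amount of data allowed on the class.

```python
from collections import Counter


def build_histogram(scores, bucket_width=1):
    """Count data portions per score bucket (bucket index = score // width)."""
    return Counter(int(s // bucket_width) for s in scores)


def threshold_for_capacity(histogram, capacity_portions, bucket_width=1):
    """Return the lowest score such that all portions at or above it fit.

    Returns None when even the highest-scoring bucket does not fit.
    """
    placed = 0
    threshold = None
    for bucket in sorted(histogram, reverse=True):
        if placed + histogram[bucket] > capacity_portions:
            break
        placed += histogram[bucket]
        threshold = bucket * bucket_width
    return threshold


scores = [9, 9, 7, 5, 5, 5, 2, 1]
hist = build_histogram(scores)
threshold = threshold_for_capacity(hist, capacity_portions=3)
on_fast_class = [s for s in scores if threshold is not None and s >= threshold]
```

With room for three portions, the two portions scoring 9 and the one scoring 7 fit, but admitting the bucket at score 5 would not, so the threshold settles at 7. Rerunning the function as scores or the policy amount change gives the "dynamic" behavior.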
Abstract:
When migrating data, a first message is received at a target data storage system from a source data storage system. The target data storage system includes a data storage optimizer that performs automated data movement optimizations. The first message requests a reservation of a first amount of storage on a first storage tier for performing a data migration to migrate data from the source to the target data storage system. A first capacity limit of the first storage tier is reduced by the first amount thereby representing the reservation of the first amount of storage for performing the data migration. If the first storage tier does not include an amount of available storage of at least the first amount, processing is performed to increase the amount of available storage of the first storage tier.
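
The reservation step can be sketched in Python as follows. The `Tier` fields and the `reserve_for_migration` function are hypothetical, and the "processing to increase the amount of available storage" is reduced here to decrementing `used`, standing in for whatever data movement the optimizer would actually perform.

```python
from dataclasses import dataclass


@dataclass
class Tier:
    total: int           # physical capacity of the tier
    used: int            # capacity currently holding data
    capacity_limit: int  # how much of the tier the optimizer may fill

    @property
    def available(self) -> int:
        return self.total - self.used


def reserve_for_migration(tier: Tier, amount: int) -> int:
    """Reserve `amount` on the tier; return how much had to be freed."""
    if amount > tier.capacity_limit:
        raise ValueError("reservation exceeds the tier's capacity limit")
    # Lowering the optimizer's limit is what represents the reservation.
    tier.capacity_limit -= amount
    shortfall = max(0, amount - tier.available)
    if shortfall:
        # Assumption: stand-in for moving data off this tier to make room.
        tier.used -= shortfall
    return shortfall


tier = Tier(total=100, used=80, capacity_limit=100)
freed = reserve_for_migration(tier, 30)
```

In this example the tier has only 20 units available when 30 are requested, so 10 units are freed and the optimizer's limit drops from 100 to 70.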
Abstract:
A networked data storage system includes data movement logic and a virtual machine for executing a data management processing plan which is generated by a facility outside the storage array based on statistical information associated with extent level activity metrics provided by the storage array. Dynamically updated and automatically generated data management processing plans are provided to the storage array in the form of units of compiled byte code which the storage array is able to verify and execute using a virtual machine. The virtual machine interfaces with the environment in which it is embedded in a carefully constrained manner, thereby preventing clearly undesirable operations.
Abstract:
Described are techniques for evaluating data movement alternatives. A set of criteria including capacity and performance limits is received. First processing is performed to evaluate a plurality of alternatives for use in data movement with respect to a set of logical devices having data stored on a set of physical storage devices. Each of the plurality of alternatives includes a different set of data movement criteria comprising capacity limits and a different set of performance limits. The set of physical storage devices includes at least a first physical device of one of a plurality of storage tiers and a second physical device of another one of the plurality of storage tiers. One of the sets of performance limits is selected in accordance with the first processing.
Abstract:
Described are modeling techniques. In accordance with one or more criteria, a determination may be made as to whether to use a dynamic model or a static model to model performance of components, such as storage devices, of a data storage system. A system may include first and second data storage systems where the first data storage system includes a computer readable medium with first code that performs processing in connection with data storage movement optimizations using one or more models including a dynamic model, and second code that generates and maintains the dynamic model used to model performance of storage devices. The second code may include code for performing first processing to determine device sets each of which does not share back-end resources of the second data storage system with any other device sets, and performing second processing to determine sets of performance curves corresponding to the device sets.
Abstract:
Described are techniques for configuring data storage. Criteria identifying limits are received for a plurality of storage tiers for one or more logical devices. The storage tiers include a first storage tier and one or more other storage tiers where the first storage tier is higher performing than the other storage tiers. First processing is performed to determine a first set of data portions of the logical devices to be stored on physical devices of the first storage tier subject to limits included in said criteria. First processing includes determining scores for data portions of the logical devices. Each score is calculated using one or more metrics including a first metric weighted based on an expected I/O size. The data portions are ranked in accordance with the scores. The first set of data portions stored in the first storage tier is selected based on the score ranking.
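
The score, rank, and select sequence can be illustrated with a short Python sketch. The abstract does not say how the expected I/O size enters the weighting, so the formula in `score` (discounting portions whose average I/O is larger than the expected size), the single capacity limit, and the greedy fill are all assumptions; `Portion` and its fields are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Portion:
    name: str
    size_gb: int
    iops: float
    avg_io_kb: float


def score(p: Portion, expected_io_kb: float = 8.0) -> float:
    """I/O rate weighted by I/O size (assumed weighting, capped at 1.0)."""
    weight = min(1.0, expected_io_kb / p.avg_io_kb)
    return p.iops * weight


def select_for_first_tier(portions, capacity_gb: int) -> list:
    """Rank by score, then greedily fill the first tier up to its limit."""
    ranked = sorted(portions, key=score, reverse=True)
    chosen, used = [], 0
    for p in ranked:
        if used + p.size_gb <= capacity_gb:
            chosen.append(p.name)
            used += p.size_gb
    return chosen


portions = [
    Portion("a", size_gb=10, iops=100, avg_io_kb=8),
    Portion("b", size_gb=10, iops=300, avg_io_kb=64),
    Portion("c", size_gb=10, iops=80, avg_io_kb=4),
]
first_tier = select_for_first_tier(portions, capacity_gb=20)
```

Portion `b` has the highest raw I/O rate, but under this assumed weighting its large I/Os reduce its score to 37.5, so `a` and `c` are ranked ahead of it and fill the 20 GB limit.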
Abstract:
A write operation writes first data to a target logical address range included in a first logical address range of a virtually provisioned device. It is determined that physical storage is not allocated for the target logical address range. First processing is performed to allocate a first portion of physical storage having a first corresponding logical address subrange that includes the target logical address range. The first portion is allocated from a physical device included in a first storage tier. The first processing includes selecting the first storage tier from multiple storage tiers in accordance with selection criteria including metrics characterizing an expected workload or level of activity for the first corresponding logical address subrange of the virtually provisioned device. Mapping information is updated to indicate that the first corresponding logical address subrange of the virtually provisioned device maps to the first portion of physical storage.
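
The allocate-on-first-write flow can be sketched in Python as below. The extent size `EXTENT`, the `ThinDevice` class, the per-tier rate cutoffs, and the use of a simple per-tier slot counter as the physical allocator are illustrative assumptions and do not come from the abstract. The selection criterion here is a single expected I/O rate per extent.

```python
EXTENT = 8  # hypothetical allocation unit, in logical blocks


class ThinDevice:
    def __init__(self, tiers, expected_rate):
        # tiers: [(tier_name, min_expected_rate)], fastest tier first.
        self.tiers = tiers
        # expected_rate: extent index -> expected I/Os per second.
        self.expected_rate = expected_rate
        self.mapping = {}  # extent index -> (tier_name, physical slot)
        self.next_slot = {name: 0 for name, _ in tiers}

    def select_tier(self, extent):
        """Pick the fastest tier whose cutoff the expected rate meets."""
        rate = self.expected_rate.get(extent, 0.0)
        for name, min_rate in self.tiers:
            if rate >= min_rate:
                return name
        return self.tiers[-1][0]

    def write(self, lba, length):
        """Allocate any unmapped extents touched by the write, then map them."""
        first = lba // EXTENT
        last = (lba + length - 1) // EXTENT
        for extent in range(first, last + 1):
            if extent not in self.mapping:
                tier = self.select_tier(extent)
                slot = self.next_slot[tier]
                self.next_slot[tier] += 1
                # Update mapping: logical extent -> allocated physical storage.
                self.mapping[extent] = (tier, slot)
        return [self.mapping[e] for e in range(first, last + 1)]


dev = ThinDevice(
    tiers=[("flash", 100.0), ("sas", 10.0), ("sata", 0.0)],
    expected_rate={0: 250.0, 1: 5.0},
)
first_write = dev.write(lba=6, length=4)   # spans extents 0 and 1
second_write = dev.write(lba=0, length=2)  # extent 0 is already mapped
```

The first write crosses two extents: extent 0 is expected to be busy and lands on the fastest tier, while extent 1 is expected to be nearly idle and lands on the slowest. The second write finds extent 0 already mapped, so nothing new is allocated.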