Method for removing duplicate data from a storage array
    1.
    发明授权
    Method for removing duplicate data from a storage array 有权
    从存储阵列中删除重复数据的方法

    公开(公告)号:US08930307B2

    公开(公告)日:2015-01-06

    申请号:US13250570

    申请日:2011-09-30

    摘要: A system and method for efficiently removing duplicate data blocks at a fine-granularity from a storage array. A data storage subsystem supports multiple deduplication tables. Table entries in one deduplication table have the highest associated probability of being deduplicated. Table entries may move from one deduplication table to another as the probabilities change. Additionally, a table entry may be evicted from all deduplication tables if a corresponding estimated probability falls below a given threshold. The probabilities are based on attributes associated with a data component and attributes associated with a virtual address corresponding to a received storage access request. A strategy for searches of the multiple deduplication tables may also be determined by the attributes associated with a given storage access request.

    摘要翻译: 一种用于从存储阵列以细粒度有效地去除重复数据块的系统和方法。 数据存储子系统支持多个重复数据删除表。 一个重复数据删除表中的表项具有重复数据删除的相关概率最高。 随着概率的变化,表项可以从一个重复数据删除表移动到另一个。 另外,如果相应的估计概率低于给定阈值,则表条目可以从所有重复数据删除表中逐出。 概率基于与数据组件相关联的属性以及与对应于接收到的存储访问请求的虚拟地址相关联的属性。 用于搜索多个重复数据删除表的策略也可以由与给定存储访问请求相关联的属性来确定。

    Mapping in a storage system
    2.
    发明授权
    Mapping in a storage system 有权
    在存储系统中映射

    公开(公告)号:US08806160B2

    公开(公告)日:2014-08-12

    申请号:US13211288

    申请日:2011-08-16

    IPC分类号: G06F12/00 G06F3/06

    摘要: A system and method for maintaining a mapping table in a data storage subsystem. A data storage subsystem supports multiple mapping tables. Records within a mapping table are arranged in multiple levels which may be logically ordered by time. Each level stores pairs of a key value and a pointer value. New records are inserted in a created new (youngest) level. All levels other than the youngest may be read only. In response to detecting a flattening condition, a data storage controller is configured to identify a group of two or more adjacent levels of the plurality of levels for flattening which are logically adjacent in time. A new level is created and one or more records stored within the group are stored in the new level, in response to detecting each of the one or more records stores a unique key among keys stored within the group.

    摘要翻译: 一种用于在数据存储子系统中维护映射表的系统和方法。 数据存储子系统支持多个映射表。 映射表内的记录以多个级别排列,可以按时间逻辑排序。 每个级别存储键值和指针值的对。 新记录插入到创建的新(最年轻)级别中。 除最小的以外的所有级别可能只读。 响应于检测到平坦化状况,数据存储控制器被配置为识别在时间上在逻辑上相邻的用于平坦化的多个级别中的两个或更多个相邻级别的组。 响应于检测到一个或多个记录中的每一个在存储在组内的密钥之间存储唯一密钥,创建新级别并存储在组内的一个或多个记录在新级中。

    Variable length encoding in a storage system
    3.
    发明授权
    Variable length encoding in a storage system 有权
    存储系统中的可变长度编码

    公开(公告)号:US08793467B2

    公开(公告)日:2014-07-29

    申请号:US13250579

    申请日:2011-09-30

    IPC分类号: G06F12/10 G06F3/06

    摘要: A system and method for maintaining a mapping table in a data storage subsystem. A data storage subsystem supports multiple mapping tables including a plurality of entries. Each of the entries comprise a tuple including a key. A data storage controller is configured to encode each tuple in the mapping table using a variable length encoding. Additionally, the mapping table may be organized as a plurality of time ordered levels, with each level including one or more mapping table entries. Further, a particular encoding of a plurality of encodings for a given tuple may be selected based at least in part on a size of the given tuple as unencoded, a size of the given tuple as encoded, and a time to encode the given tuple.

    摘要翻译: 一种用于在数据存储子系统中维护映射表的系统和方法。 数据存储子系统支持包括多个条目的多个映射表。 每个条目包括包括密钥的元组。 数据存储控制器被配置为使用可变长度编码对映射表中的每个元组进行编码。 另外,映射表可以被组织为多个时间有序的级别,其中每个级别包括一个或多个映射表条目。 此外,可以至少部分地基于给定元组的大小作为未编码,被编码的给定元组的大小和对给定元组进行编码的时间来选择给定元组的多个编码的特定编码。

    Scheduling of reconstructive I/O read operations in a storage environment
    4.
    发明授权
    Scheduling of reconstructive I/O read operations in a storage environment 有权
    在存储环境中调度重构I / O读取操作

    公开(公告)号:US08589625B2

    公开(公告)日:2013-11-19

    申请号:US12882872

    申请日:2010-09-15

    IPC分类号: G06F12/00

    摘要: A system and method for effectively scheduling read and write operations among a plurality of solid-state storage devices. A computer system comprises client computers and data storage arrays coupled to one another via a network. A data storage array utilizes solid-state drives and Flash memory cells for data storage. A storage controller within a data storage array comprises an I/O scheduler. The storage controller is configured to receive a read request targeted to the data storage medium, and identify at least a first storage device of the plurality of storage devices which contains data targeted by the read request. In response to either detecting or predicting the first storage device will exhibit variable performance, the controller is configured to generate a reconstruct read request configured to obtain the data from one or more devices of the plurality of storage devices other than the first storage device.

    摘要翻译: 一种用于在多个固态存储设备之间有效地调度读取和写入操作的系统和方法。 计算机系统包括经由网络彼此耦合的客户端计算机和数据存储阵列。 数据存储阵列利用固态驱动器和闪存单元进行数据存储。 数据存储阵列中的存储控制器包括I / O调度器。 存储控制器被配置为接收针对数据存储介质的读取请求,并且识别包含读取请求所针对的数据的多个存储设备中的至少第一存储设备。 响应于检测或预测第一存储设备将呈现可变性能,控制器被配置为生成重构读取请求,其被配置为从除第一存储设备之外的多个存储设备中的一个或多个设备获得数据。

    METHOD FOR REMOVING DUPLICATE DATA FROM A STORAGE ARRAY
    6.
    发明申请
    METHOD FOR REMOVING DUPLICATE DATA FROM A STORAGE ARRAY 有权
    从存储阵列中删除重复数据的方法

    公开(公告)号:US20130086006A1

    公开(公告)日:2013-04-04

    申请号:US13250570

    申请日:2011-09-30

    IPC分类号: G06F17/30

    摘要: A system and method for efficiently removing duplicate data blocks at a fine-granularity from a storage array. A data storage subsystem supports multiple deduplication tables. Table entries in one deduplication table have the highest associated probability of being deduplicated. Table entries may move from one deduplication table to another as the probabilities change. Additionally, a table entry may be evicted from all deduplication tables if a corresponding estimated probability falls below a given threshold. The probabilities are based on attributes associated with a data component and attributes associated with a virtual address corresponding to a received storage access request. A strategy for searches of the multiple deduplication tables may also be determined by the attributes associated with a given storage access request.

    摘要翻译: 一种用于从存储阵列以细粒度有效地去除重复数据块的系统和方法。 数据存储子系统支持多个重复数据删除表。 一个重复数据删除表中的表项具有重复数据删除的相关概率最高。 随着概率的变化,表项可以从一个重复数据删除表移动到另一个。 另外,如果相应的估计概率低于给定阈值,则表条目可以从所有重复数据删除表中逐出。 概率基于与数据组件相关联的属性以及与对应于接收到的存储访问请求的虚拟地址相关联的属性。 用于搜索多个重复数据删除表的策略也可以由与给定存储访问请求相关联的属性来确定。

    LOGICAL SECTOR MAPPING IN A FLASH STORAGE ARRAY
    7.
    发明申请
    LOGICAL SECTOR MAPPING IN A FLASH STORAGE ARRAY 有权
    闪存存储阵列中的逻辑扇区映射

    公开(公告)号:US20130042052A1

    公开(公告)日:2013-02-14

    申请号:US13208094

    申请日:2011-08-11

    摘要: A system and method for efficiently performing user storage virtualization for data stored in a storage system including a plurality of solid-state storage devices. A data storage subsystem supports multiple mapping tables. Records within a mapping table are arranged in multiple levels. Each level stores pairs of a key value and a pointer value. The levels are sorted by time. New records are inserted in a created newest (youngest) level. No edits are performed in-place. All levels other than the youngest may be read only. The system may further include an overlay table which identifies those keys within the mapping table that are invalid.

    摘要翻译: 一种用于对存储在包括多个固态存储设备的存储系统中的数据有效地执行用户存储虚拟化的系统和方法。 数据存储子系统支持多个映射表。 映射表中的记录以多个级别排列。 每个级别存储键值和指针值的对。 级别按时间排序。 新记录插入到创建的最新(最小)级别中。 没有就地进行编辑。 除最小的以外的所有级别可能只读。 该系统还可以包括标识映射表内无效的那些密钥的覆盖表。

    DYNAMIC RAID GEOMETRIES IN AN SSD ENVIRONMENT
    8.
    发明申请
    DYNAMIC RAID GEOMETRIES IN AN SSD ENVIRONMENT 审中-公开
    固态硬盘环境中的动态RAID几何

    公开(公告)号:US20120084504A1

    公开(公告)日:2012-04-05

    申请号:US12896667

    申请日:2010-10-01

    IPC分类号: G06F12/00

    摘要: A system and method for dynamic RAID geometries. A computer system comprises client computers and data storage arrays coupled to one another via a network. A data storage array utilizes solid-state drives and Flash memory cells for data storage. A storage controller within a data storage array is configured to configure a first subset of the storage devices for use in a first RAID layout, the first RAID layout including a first set of redundant data. The controller further configures a second subset of the storage devices for use in a second RAID layout, the second RAID layout including a second set of redundant data. Additionally, the controller configure an additional device not included in either the first subset or the second subset to store redundant data for both the first RAID layout and the second RAID layout.

    摘要翻译: 用于动态RAID几何的系统和方法。 计算机系统包括经由网络彼此耦合的客户端计算机和数据存储阵列。 数据存储阵列利用固态驱动器和闪存单元进行数据存储。 数据存储阵列中的存储控制器被配置为配置用于第一RAID布局的存储设备的第一子集,所述第一RAID布局包括第一组冗余数据。 控制器还配置用于第二RAID布局的存储设备的第二子集,第二RAID布局包括第二组冗余数据。 此外,控制器配置不包括在第一子集或第二子集中的附加设备以存储用于第一RAID布局和第二RAID布局的冗余数据。

    INTRA-DEVICE DATA PROTECTION IN A RAID ARRAY
    9.
    发明申请
    INTRA-DEVICE DATA PROTECTION IN A RAID ARRAY 有权
    RAID阵列中的设备数据保护

    公开(公告)号:US20120079189A1

    公开(公告)日:2012-03-29

    申请号:US12892535

    申请日:2010-09-28

    IPC分类号: G06F12/16 G06F15/16

    摘要: A system and method for intra-device data protection in a RAID array. A computer system comprises client computers and data storage arrays coupled to one another via a network. A data storage array utilizes solid-state drives and Flash memory cells for data storage. A storage controller within a data storage array is configured to identify a unit of data stored in the data storage subsystem, wherein said unit of data is stored across at least a first storage device and a second storage device of the plurality of storage devices, each of the first storage device and the second storage device storing intra-device redundancy data corresponding to the unit of data; and change an amount of intra-device redundancy data corresponding to the unit of data on only the first storage device.

    摘要翻译: 用于RAID阵列中的设备内数据保护的系统和方法。 计算机系统包括经由网络彼此耦合的客户端计算机和数据存储阵列。 数据存储阵列利用固态驱动器和闪存单元进行数据存储。 数据存储阵列中的存储控制器被配置为识别存储在数据存储子系统中的数据单元,其中所述数据单元被存储在多个存储设备中的至少第一存储设备和第二存储设备上,每个存储控制器 所述第一存储设备和所述第二存储设备存储与所述数据单元对应的设备内冗余数据; 并且仅在第一存储设备上改变对应于数据单元的设备内冗余数据量。

    Dynamic cluster visualization
    10.
    发明授权
    Dynamic cluster visualization 有权
    动态集群可视化

    公开(公告)号:US07599945B2

    公开(公告)日:2009-10-06

    申请号:US11564995

    申请日:2006-11-30

    IPC分类号: G06F7/00 G06F17/00 G06F15/16

    摘要: A method for visually representing a plurality of clusters, the plurality of clusters comprising a plurality of entities with respect to a plurality of entity attributes is provided. The plurality of entities is segmented into the plurality of clusters, such that each individual entity belongs to at least one cluster. A plurality of entity data regarding the plurality of entities is processed to obtain a plurality of characteristics of each cluster of the plurality of clusters with respect to each entity attribute of the plurality of entity attributes. A visual display of the plurality of clusters with respect to the plurality of entity attributes is generated, such that for each cluster of the plurality of clusters and for each entity attribute of the plurality of entity attributes, a portion of the display represents at least two characteristics of the plurality of characteristics simultaneously for that cluster with respect to that entity attribute.

    摘要翻译: 提供了一种用于可视地表示多个聚类的方法,所述多个聚类包括关于多个实体属性的多个实体。 多个实体被分割成多个集群,使得每个单独实体属于至少一个集群。 处理关于多个实体的多个实体数据,以便针对多个实体属性的每个实体属性来获得多个集群中的每个集群的多个特征。 生成关于多个实体属性的多个集群的视觉显示,使得对于多个集群的每个集群以及对于多个实体属性的每个实体属性,显示器的一部分表示至少两个 该集群相对于该实体属性同时具有多个特征的特征。