-
公开(公告)号:US08065273B2
公开(公告)日:2011-11-22
申请号:US11746399
申请日:2007-05-09
申请人: Jedidiah Yueh , Clint L McVey , Thomas B Bolt , Scott Auchmoody , Eric Olsen , William Crosby , Scott Ogata , Jennifer Starling , Robert Ramos
发明人: Jedidiah Yueh , Clint L McVey , Thomas B Bolt , Scott Auchmoody , Eric Olsen , William Crosby , Scott Ogata , Jennifer Starling , Robert Ramos
CPC分类号: G06F11/1469 , G06F11/1451 , G06F11/1464
摘要: A priority restore agent in a data storage system generates a priority restore data set for a client computer system or device by identifying a set of active data sets and/or a set of key data sets within client system data generated by the client computer system. The priority restore agent looks at or processes file system attributes for the client system data and compares these attributes with predefined restore parameters. The restore parameters may indicate that any file that has been accessed, modified, or created within a particular period of time be included in the priority restore data set. The key data sets may be identified in a set of automated restore rules. A data protection application within the data storage system can restore data in the priority restore data set onto the client computer system after a disaster or system crash.
摘要翻译: 数据存储系统中的优先级还原代理通过在客户端计算机系统生成的客户端系统数据内识别一组活动数据集和/或一组密钥数据集,来为客户端计算机系统或设备生成优先级恢复数据集。 优先级还原代理查看或处理客户端系统数据的文件系统属性,并将这些属性与预定义的恢复参数进行比较。 恢复参数可以指示在特定时间段内已被访问,修改或创建的任何文件被包括在优先级恢复数据集中。 可以在一组自动恢复规则中标识关键数据集。 数据存储系统中的数据保护应用程序可以在灾难或系统崩溃后将优先级还原数据集中的数据恢复到客户端计算机系统上。
-
公开(公告)号:US20070294320A1
公开(公告)日:2007-12-20
申请号:US11746399
申请日:2007-05-09
申请人: Jedidiah Yueh , Clint McVey , Thomas Bolt , Scott Auchmoody , Eric Olsen , William Crosby , Scott Ogata , Jennifer Starling , Robert Ramos
发明人: Jedidiah Yueh , Clint McVey , Thomas Bolt , Scott Auchmoody , Eric Olsen , William Crosby , Scott Ogata , Jennifer Starling , Robert Ramos
IPC分类号: G06F17/30
CPC分类号: G06F11/1469 , G06F11/1451 , G06F11/1464
摘要: A priority restore agent in a data storage system generates a priority restore data set for a client computer system or device by identifying a set of active data sets and/or a set of key data sets within client system data generated by the client computer system. The priority restore agent looks at or processes file system attributes for the client system data and compares these attributes with predefined restore parameters. The restore parameters may indicate that any file that has been accessed, modified, or created within a particular period of time be included in the priority restore data set. The key data sets may be identified in a set of automated restore rules. A data protection application within the data storage system can restore data in the priority restore data set onto the client computer system after a disaster or system crash.
摘要翻译: 数据存储系统中的优先级还原代理通过在客户端计算机系统生成的客户端系统数据内识别一组活动数据集和/或一组密钥数据集,来为客户端计算机系统或设备生成优先级恢复数据集。 优先级还原代理查看或处理客户端系统数据的文件系统属性,并将这些属性与预定义的恢复参数进行比较。 恢复参数可以指示在特定时间段内已被访问,修改或创建的任何文件被包括在优先级恢复数据集中。 可以在一组自动恢复规则中标识关键数据集。 数据存储系统中的数据保护应用程序可以在灾难或系统崩溃后将优先级还原数据集中的数据恢复到客户端计算机系统上。
-
公开(公告)号:US09002800B1
公开(公告)日:2015-04-07
申请号:US11772182
申请日:2007-06-30
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
CPC分类号: G06F12/145 , G06F11/1448 , G06F11/1456 , G06F17/30575
摘要: A data storage and protection system includes secondary storage and at least one instance of a high efficiency storage application (“HESA”). The HESA backs up and archives client data stored in primary storage of a client computer system or client node to secondary storage. Archive files generated by the HESA re-use previously backed up client data stored in the secondary storage. In one embodiment, previously backed up client data is re-used for an archive file by organizing the archive file as a hash tree having hash values pointing to the previously backed up client data. In addition, the HESA can maximize available space in the primary storage by replacing previously backed up and/or archived client data in the primary storage with pointers that point to the previously backed up and/or archived client data in secondary storage.
摘要翻译: 数据存储和保护系统包括二级存储和高效率存储应用(“HESA”)的至少一个实例。 HESA将存储在客户端计算机系统或客户机节点的主存储中的客户端数据备份和归档到辅助存储。 由HESA生成的归档文件重新使用以前备份存储在辅助存储中的客户端数据。 在一个实施例中,先前备份的客户端数据通过将存档文件组织为具有指向先前备份的客户端数据的散列值的散列树来重新用于归档文件。 此外,HESA可以通过在主存储器中替换以前备份的和/或存档的客户端数据来指示主存储器中的可用空间,指针指向辅助存储器中的先前备份和/或存档的客户端数据。
-
公开(公告)号:US08880797B2
公开(公告)日:2014-11-04
申请号:US11864756
申请日:2007-09-28
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
CPC分类号: G06F17/30159 , G06F3/0608 , G06F3/0641
摘要: A data de-duplication application de-duplicates redundant data on the primary storage read/write pathway of a virtualized server environment. The virtualized server environment comprises one or more server applications operating on a virtualization layer provided on a computer architecture that includes memory (e.g., RAM, cache memory) for temporarily storing data and storage (e.g., disk storage) for persistently storing data. The one or more server applications use the read-write pathway to read data into memory from storage and to write data to storage from memory. The de-duplication application identifies redundant data in memory, storage, or both, and replaces the redundant data with one or more pointers pointing to a single copy of the data. The de-duplication application can operate on fixed or variable size blocks of data and can de-duplicate data either post-process or in-line.
摘要翻译: 数据重复数据删除应用程序在虚拟化服务器环境的主存储读/写路径上复制冗余数据。 虚拟化服务器环境包括在包括用于临时存储用于持久存储数据的数据和存储(例如,磁盘存储))的存储器(例如,RAM,高速缓冲存储器)的计算机体系结构上提供的虚拟化层上操作的一个或多个服务器应用。 一个或多个服务器应用程序使用读写路径从存储器读取数据到存储器,并将数据写入存储器中的存储器。 重复数据删除应用程序识别内存,存储或两者中的冗余数据,并用指向数据的单个副本的一个或多个指针替换冗余数据。 重复数据删除应用程序可以对固定或可变大小的数据块进行操作,并可以对后处理或在线的数据进行重复复制。
-
公开(公告)号:US08135918B1
公开(公告)日:2012-03-13
申请号:US11968052
申请日:2007-12-31
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
CPC分类号: G06F13/28
摘要: Redundant data is identified and eliminated in a network that implements the iSCSI protocol either in-band at the source, in-band at the target, or out-of-band at the target. For in-band de-duplication, a data block included with a write command is assigned a unique identifier that is compared to a database of unique identifiers corresponding to previously written data. If the unique identifier is identical to an existing unique identifier, this indicates that the data block is redundant and has previously been stored elsewhere, in which case it is not stored again. Instead, the storage address specified in the write command may be added to a routing table showing the equivalence of unique identifiers, actual storage addresses, and duplicate storage addresses. When a read request specifying a duplicate storage address is received, the duplicate storage address can be translated to a corresponding unique identifier which points to the actual storage address.
摘要翻译: 在实现iSCSI协议的网络中识别和删除冗余数据,这些协议在源端带内,目标带内或目标带外实现。 对于带内重复数据删除,与写入命令一起提供的数据块被分配唯一的标识符,其与对应于先前写入的数据的唯一标识符的数据库进行比较。 如果唯一标识符与现有的唯一标识符相同,则表示数据块是冗余的,并且先前已经存储在其他位置,在这种情况下,它不会被再次存储。 相反,写命令中指定的存储地址可以被添加到示出唯一标识符,实际存储地址和重复存储地址的等同性的路由表。 当接收到指定重复存储地址的读取请求时,可以将重复存储地址转换为指向实际存储地址的对应的唯一标识符。
-
公开(公告)号:US07783604B1
公开(公告)日:2010-08-24
申请号:US11968050
申请日:2007-12-31
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
IPC分类号: G06F17/30
CPC分类号: G06F11/1453 , G06F11/1464 , G06F11/2094 , G06F11/2097 , G06F17/30289 , G06F17/30575
摘要: A data protection and management solution includes a storage application operated in a tenant computer system and a data protection application hosted in a host computer system as a SaaS application. The storage application generates storage efficient backups of a raw data set stored on the tenant computer system and replicates the backups to the SaaS application. The SaaS application stores the backups and provides one or more data protection and management services, such as offsite backup, offsite archiving, and information lifecycle management, to the tenant computer system. In the event of loss or corruption of all or a portion of the raw data set on the tenant computer system, a recovery module of the storage application can be used to generate a recovered version on the tenant computer system of the raw data set based on the backups replicated to the SaaS application.
摘要翻译: 数据保护和管理解决方案包括在租户计算机系统中操作的存储应用和作为SaaS应用程序托管在主计算机系统中的数据保护应用。 存储应用程序生成存储在租户计算机系统上的原始数据集的存储高效备份,并将备份复制到SaaS应用程序。 SaaS应用程序存储备份,并向租户计算机系统提供一个或多个数据保护和管理服务,如异地备份,异地存档和信息生命周期管理。 如果在租户计算机系统上设置的全部或部分原始数据发生丢失或损坏,则存储应用程序的恢复模块可用于在原始数据集的租户计算机系统上生成基于 备份复制到SaaS应用程序。
-
公开(公告)号:US20090063795A1
公开(公告)日:2009-03-05
申请号:US11864583
申请日:2007-09-28
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
IPC分类号: G06F12/00
CPC分类号: G06F3/0641 , G06F3/0608 , G06F3/0665 , G06F3/0683 , G06F17/30489 , G06F17/3061
摘要: A data de-duplication application de-duplicates redundant data in the pooled storage capacity of a virtualized storage environment. The virtualized storage environment includes a plurality of storage devices and a virtualization or abstraction layer that aggregates all or a portion of the storage capacity of each storage device into a single pool of storage capacity, all or portions of which can be allocated to one or more host systems. For each host system, the virtualization layer presents a representation of at least a portion of the pooled storage capacity wherein the corresponding host system can read and write data. The data de-duplication application identifies redundant data in the pooled storage capacity and replaces it with one or more pointers pointing to a single instance of the data. The de-duplication application can operate on fixed or variable size blocks of data and can de-duplicate data either post-process or in-line.
摘要翻译: 数据重复数据删除应用程序在虚拟化存储环境的池存储容量中解除复制冗余数据。 虚拟化存储环境包括多个存储设备和将每个存储设备的全部或一部分存储容量聚合成单个存储容量池的虚拟化或抽象层,其全部或部分可分配给一个或多个 主机系统。 对于每个主机系统,虚拟化层呈现了池化存储容量的至少一部分的表示,其中相应的主机系统可以读取和写入数据。 数据重复数据删除应用程序识别池存储容量中的冗余数据,并用指向数据的单个实例的一个或多个指针替换它。 重复数据删除应用程序可以对固定或可变大小的数据块进行操作,并可以对后处理或在线的数据进行重复复制。
-
公开(公告)号:US20080013365A1
公开(公告)日:2008-01-17
申请号:US11688203
申请日:2007-03-19
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
IPC分类号: G11C11/24
CPC分类号: G06F11/1458 , G06F3/0623 , G06F3/0662 , G06F3/0683 , G06F9/45541 , G06F11/1469
摘要: A high efficiency portable archive implements a storage system running on a virtualization layer to archive point-in-time versions of a raw data set and the storage system itself as a virtual system on archive media. The high efficiency portable archive can be implemented in a variety of computer architectures. The virtualization layer presents to the storage system a normalized representation of a set of hardware based on components of the computer architecture, shielding the storage system from the actual hardware components of the computer architecture. The storage system and point-in-time versions of the raw data set can be restored to any hardware subsystem that supports the virtual system.
摘要翻译: 高效率的便携式归档实现了在虚拟化层上运行的存储系统,以将原始数据集的时间点版本和存储系统本身归档在归档介质上作为虚拟系统。 高效率便携式归档可以在各种计算机体系结构中实现。 虚拟化层向存储系统呈现基于计算机架构的组件的一组硬件的归一化表示,从而将存储系统与计算机体系结构的实际硬件组件进行屏蔽。 原始数据集的存储系统和时间点版本可以恢复到支持虚拟系统的任何硬件子系统。
-
公开(公告)号:US20120233425A1
公开(公告)日:2012-09-13
申请号:US13476666
申请日:2012-05-21
申请人: Jedidiah Yueh
发明人: Jedidiah Yueh
IPC分类号: G06F12/16
CPC分类号: G06F3/0641 , G06F3/0608 , G06F3/0665 , G06F3/0683 , G06F17/30489 , G06F17/3061
摘要: A data de-duplication application de-duplicates redundant data in the pooled storage capacity of a virtualized storage environment. The virtualized storage environment includes a plurality of storage devices and a virtualization or abstraction layer that aggregates all or a portion of the storage capacity of each storage device into a single pool of storage capacity, all or portions of which can be allocated to one or more host systems. For each host system, the virtualization layer presents a representation of at least a portion of the pooled storage capacity wherein the corresponding host system can read and write data. The data de-duplication application identifies redundant data in the pooled storage capacity and replaces it with one or more pointers pointing to a single instance of the data. The de-duplication application can operate on fixed or variable size blocks of data and can de-duplicate data either post-process or in-line.
摘要翻译: 数据重复数据删除应用程序在虚拟化存储环境的池存储容量中解除复制冗余数据。 虚拟化存储环境包括多个存储设备和将每个存储设备的全部或一部分存储容量聚合成单个存储容量池的虚拟化或抽象层,其全部或部分可分配给一个或多个 主机系统。 对于每个主机系统,虚拟化层呈现了池化存储容量的至少一部分的表示,其中相应的主机系统可以读取和写入数据。 数据重复数据删除应用程序识别池存储容量中的冗余数据,并用指向数据的单个实例的一个或多个指针替换它。 重复数据删除应用程序可以对固定或可变大小的数据块进行操作,并可以对后处理或在线的数据进行重复复制。
-
公开(公告)号:US08150808B2
公开(公告)日:2012-04-03
申请号:US12603541
申请日:2009-10-21
申请人: Charles Li Zha , Jedidiah Yueh
发明人: Charles Li Zha , Jedidiah Yueh
IPC分类号: G06F17/00
CPC分类号: G06F17/30088 , G06F17/30566 , G06F17/30864 , G06F2201/84
摘要: Information from multiple databases is retrieved and stored on a database storage system. Multiple point-in-time copies are obtained for each database. A point-in-time copy retrieves data changed in the database since the retrieval of a previous point-in-time copy. A virtual database (VDB) is created by creating a set of files in the data storage system. Each file in the set of files created for a VDB is linked to the database blocks on the database storage system associated with a point-in-time copy of the source database. The set of files associated with the VDB are mounted on a database server allowing the database server to read from and write to the set of files. Workflows based on VDBs allow various usage scenarios based on databases to be implemented efficiently, for example, testing and development, backup and recovery, and data warehouse building.
摘要翻译: 来自多个数据库的信息被检索并存储在数据库存储系统上。 为每个数据库获取多个时间点副本。 时间点副本从检索先前的时间点副本后检索数据库中已更改的数据。 通过在数据存储系统中创建一组文件来创建虚拟数据库(VDB)。 为VDB创建的文件集中的每个文件都链接到与源数据库的时间点副本相关联的数据库存储系统上的数据库块。 与VDB相关联的一组文件安装在数据库服务器上,允许数据库服务器从该文件集中读取和写入。 基于VDB的工作流允许基于数据库的各种使用场景被有效地实现,例如测试和开发,备份和恢复以及数据仓库构建。
-
-
-
-
-
-
-
-
-