Systems and methods for validating ownership of deduplicated data
    1.
    Granted patent (status: in force)

    Publication No.: US08769627B1

    Publication date: 2014-07-01

    Application No.: US13314496

    Filing date: 2011-12-08

    IPC classification: G06F7/04 G06F11/14 G06F3/06

    Abstract: A computer-implemented method for validating ownership of deduplicated data may include (1) identifying a request from a remote client to store a data object in a data store that already includes an instance of the data object, (2) in response to the request, verifying that the remote client possesses the data object by (i) issuing a randomized challenge to the remote client, the randomized challenge including a random value which, when combined with at least a portion of the data object, produces an authentication token demonstrating possession of the data object and, in response to the randomized challenge, (ii) receiving the authentication token from the remote client; and, in response to receiving the authentication token from the remote client, (3) storing the data object in the data store on behalf of the remote client. Various other methods and systems are also disclosed.
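
    The challenge-response exchange described in this abstract can be illustrated with a minimal sketch. It assumes the token is an HMAC-SHA256 of the whole data object keyed by the random value; the abstract does not name a specific construction, and the function names `make_challenge`, `make_token`, and `verify_token` are hypothetical.

```python
import hashlib
import hmac
import secrets


def make_challenge() -> bytes:
    """Server side: pick a fresh random value for this storage request."""
    return secrets.token_bytes(32)


def make_token(nonce: bytes, data: bytes) -> bytes:
    """Client side: combine the random value with the data object.

    Assumption: HMAC-SHA256 keyed by the nonce stands in for the
    unspecified "combination" step.
    """
    return hmac.new(nonce, data, hashlib.sha256).digest()


def verify_token(nonce: bytes, stored_data: bytes, token: bytes) -> bool:
    """Server side: recompute the token from the stored instance and compare."""
    expected = make_token(nonce, stored_data)
    return hmac.compare_digest(expected, token)
```

    Because the random value changes on every request, a client that only knows a static fingerprint of the object, or that replays an old token, cannot produce a valid answer.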

    De-duplication storage system with improved reference update efficiency
    2.
    Granted patent (status: in force)

    Publication No.: US08914324B1

    Publication date: 2014-12-16

    Application No.: US12580785

    Filing date: 2009-10-16

    IPC classification: G06F17/30

    CPC classification: G06F11/1453 G06F17/30156

    Abstract: A system and method for backing up files to a single-instance storage system are disclosed. The files may be split into segments, and the file data may be stored in the single-instance storage system as individual segments. The single-instance storage system uses the concept of a file region which covers multiple segments of the file. If a region of a file is unchanged from one backup to the next, the system may use a region object to refer to the unchanged region. This avoids the need to update the reference information for each of the segments within the region, thus increasing the efficiency of backing up the new version of the file.
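
    A rough sketch of the region idea follows. It assumes fixed-size regions of `region_size` segments and a region identifier derived by hashing the segment fingerprints in the region; both choices, and the names `region_id` and `backup_references`, are illustrative and not taken from the patent.

```python
import hashlib
from typing import Dict, List, Tuple


def region_id(fingerprints: List[str]) -> str:
    """Hypothetical region identifier: hash of the segment fingerprints."""
    return hashlib.sha256("\n".join(fingerprints).encode()).hexdigest()


def backup_references(
    segment_fps: List[str],
    known_regions: Dict[str, List[str]],
    region_size: int = 4,
) -> Tuple[List[str], List[str]]:
    """Return (region_refs, segment_refs) that a new backup must record.

    An unchanged region costs one region reference; a new or changed
    region costs one reference per segment and is registered for later.
    """
    region_refs: List[str] = []
    segment_refs: List[str] = []
    for start in range(0, len(segment_fps), region_size):
        chunk = segment_fps[start:start + region_size]
        rid = region_id(chunk)
        if rid in known_regions:
            region_refs.append(rid)
        else:
            known_regions[rid] = chunk
            segment_refs.extend(chunk)
    return region_refs, segment_refs
```

    With eight segments and one changed segment in the second half, the second backup records one region reference plus four segment references instead of eight segment references.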

    MANAGING BACKUPS OF DATA OBJECTS IN CONTAINERS
    3.
    Patent application (status: in force)

    Publication No.: US20130110784A1

    Publication date: 2013-05-02

    Application No.: US13285331

    Filing date: 2011-10-31

    IPC classification: G06F7/00

    Abstract: Containers that store data objects that were written to those containers during a particular backup are accessed. Then, a subset of the containers is identified; the containers in the subset have less than a threshold number of data objects associated with the particular backup. Data objects that are in containers in that subset and that are associated with the backup are copied to one or more other containers. Those other containers are subsequently used to restore data objects associated with the backup.
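
    The selection and copy steps in this abstract might be sketched as below. Containers are modeled as sets of object identifiers, and `sparse_containers`, `consolidate`, and `target_id` are hypothetical names chosen for illustration.

```python
from typing import Dict, List, Set


def sparse_containers(
    containers: Dict[str, Set[str]],
    backup_objects: Set[str],
    threshold: int,
) -> List[str]:
    """Containers holding some, but fewer than `threshold`, objects of the backup."""
    sparse = []
    for container_id, objects in containers.items():
        count = len(objects & backup_objects)
        if 0 < count < threshold:
            sparse.append(container_id)
    return sorted(sparse)


def consolidate(
    containers: Dict[str, Set[str]],
    backup_objects: Set[str],
    threshold: int,
    target_id: str,
) -> Set[str]:
    """Copy the backup's objects out of sparse containers into `target_id`.

    The originals are left in place (a copy, not a move), matching the
    wording of the abstract.
    """
    copied: Set[str] = set()
    for container_id in sparse_containers(containers, backup_objects, threshold):
        copied |= containers[container_id] & backup_objects
    if copied:
        containers.setdefault(target_id, set()).update(copied)
    return copied
```

    A restore can then read the consolidated container once instead of opening many containers that each hold only one or two relevant objects.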

    System and method for high performance deduplication indexing
    4.
    Granted patent (status: in force)

    Publication No.: US08370315B1

    Publication date: 2013-02-05

    Application No.: US12790461

    Filing date: 2010-05-28

    IPC classification: G06F7/00 G06F17/00 G06F17/30

    Abstract: A system and method for efficiently reducing latency of accessing an index for a data segment stored on a server. A server both removes duplicate data and prevents duplicate data from being stored in a shared data storage. The file server is coupled to an index storage subsystem holding fingerprint and pointer value pairs corresponding to a data segment stored in the shared data storage. The pairs are stored in a predetermined order. The file server utilizes an ordered binary search tree to identify a particular block of multiple blocks within the index storage subsystem corresponding to a received memory access request. The index storage subsystem determines whether an entry corresponding to the memory access request is located within the identified block. Based on at least this determination, the file server processes the memory access request accordingly. In one embodiment, the index storage subsystem is a solid-state disk (SSD).
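
    The two-level lookup in this abstract can be approximated as follows. The sketch substitutes a binary search over a sorted list of each block's first fingerprint for the ordered binary search tree, and it assumes fingerprints are unique; `BlockedIndex` and its methods are hypothetical names.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple

Pair = Tuple[bytes, int]  # (fingerprint, pointer)


class BlockedIndex:
    """Sorted fingerprint/pointer pairs grouped into fixed-size blocks."""

    def __init__(self, pairs: List[Pair], block_size: int = 4) -> None:
        ordered = sorted(pairs)
        self.blocks = [
            ordered[i:i + block_size] for i in range(0, len(ordered), block_size)
        ]
        # In-memory summary: the smallest fingerprint of each block.
        self.first_keys = [block[0][0] for block in self.blocks]

    def find_block(self, fingerprint: bytes) -> Optional[int]:
        """Identify the only block that could contain the fingerprint."""
        pos = bisect_right(self.first_keys, fingerprint) - 1
        return pos if pos >= 0 else None

    def lookup(self, fingerprint: bytes) -> Optional[int]:
        """Return the pointer for a fingerprint, or None if it is not indexed."""
        block_no = self.find_block(fingerprint)
        if block_no is None:
            return None
        for fp, pointer in self.blocks[block_no]:
            if fp == fingerprint:
                return pointer
        return None
```

    Only one block needs to be read from the index storage per request, which is the property that makes a low-latency device such as an SSD a natural fit.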

    Managing backups of data objects in containers
    5.
    Granted patent (status: in force)

    Publication No.: US08874522B2

    Publication date: 2014-10-28

    Application No.: US13285331

    Filing date: 2011-10-31

    IPC classification: G06F17/30 G06F7/00 G06F11/14

    Abstract: Containers that store data objects that were written to those containers during a particular backup are accessed. Then, a subset of the containers is identified; the containers in the subset have less than a threshold number of data objects associated with the particular backup. Data objects that are in containers in that subset and that are associated with the backup are copied to one or more other containers. Those other containers are subsequently used to restore data objects associated with the backup.

    Progressive sampling for deduplication indexing
    6.
    Granted patent (status: in force)

    Publication No.: US08311964B1

    Publication date: 2012-11-13

    Application No.: US12617426

    Filing date: 2009-11-12

    IPC classification: G06F17/00 G06N5/00

    Abstract: A system and method for efficiently reducing a number of duplicate blocks of stored data. A file server both removes duplicate data and prevents duplicate data from being stored in the shared storage. A sampling rate may be used to determine which fingerprints, or hash values, are stored in an index. The sampling rate may be modified in response to changes in characteristics of the system, such as a change in the shared storage size, a change in a utilization of the shared storage, a change in the size of the storage unit, and reaching a threshold corresponding to utilization of the index. Also, a small cache may be maintained for holding fingerprint and pointer pair values prefetched from the shared storage. Each prefetched pair may be associated with data corresponding to a previous hit in the index. The association may be related to spatial locality, temporal locality, or otherwise.
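
    One way to picture a sampling rate that adapts to index utilization is sketched below. It assumes a fingerprint is sampled when its leading 64 bits are divisible by `sample_mod`, and that the rate is halved (the modulus doubled) whenever the index exceeds `capacity`; these rules and the `SampledIndex` class are illustrative assumptions, not the patented scheme.

```python
from typing import Dict


class SampledIndex:
    """Index that keeps roughly 1 in `sample_mod` fingerprints."""

    def __init__(self, capacity: int, sample_mod: int = 1) -> None:
        self.capacity = capacity
        self.sample_mod = sample_mod
        self.entries: Dict[bytes, int] = {}

    def _sampled(self, fingerprint: bytes) -> bool:
        # Assumption: sample on the leading 64 bits of the fingerprint.
        return int.from_bytes(fingerprint[:8], "big") % self.sample_mod == 0

    def add(self, fingerprint: bytes, pointer: int) -> None:
        if not self._sampled(fingerprint):
            return
        self.entries[fingerprint] = pointer
        # Utilization threshold reached: sample more sparsely.
        while len(self.entries) > self.capacity and self.sample_mod < 2 ** 64:
            self._coarsen()

    def _coarsen(self) -> None:
        """Halve the sampling rate and evict entries that no longer qualify."""
        self.sample_mod *= 2
        self.entries = {
            fp: ptr for fp, ptr in self.entries.items() if self._sampled(fp)
        }
```

    Doubling the modulus keeps every surviving entry valid under the new rate, so the index never needs to be rebuilt from the stored data.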

    Systems and methods for neutralizing file-format-specific exploits included within files contained within electronic communications
    7.
    Granted patent (status: in force)

    Publication No.: US09009459B1

    Publication date: 2015-04-14

    Application No.: US13418332

    Filing date: 2012-03-12

    IPC classification: G06F15/16 G06F21/56

    Abstract: A computer-implemented method for neutralizing file-format-specific exploits contained within electronic communications may include (1) identifying an electronic communication, (2) identifying at least one file contained within the electronic communication, and then (3) neutralizing any file-format-specific exploits contained within the file. In one example, neutralizing any file-format-specific exploits contained within the file may include applying at least one file-format-conversion operation to the file. Additionally or alternatively, neutralizing any file-format-specific exploits contained within the file may include constructing a sterile version of the file that selectively omits at least a portion of any exploitable content contained within the file. Various other methods, systems, and computer-readable media are also disclosed.

    Network protocol with damage loss resilient congestion control algorithm
    8.
    Granted patent (status: in force)

    Publication No.: US07821937B1

    Publication date: 2010-10-26

    Application No.: US11771204

    Filing date: 2007-06-29

    Applicant: Fanglu Guo

    Inventor: Fanglu Guo

    IPC classification: H04L12/26

    Abstract: Various embodiments of a network protocol that utilizes a congestion control algorithm that distinguishes between congestion loss and damage loss are described. In response to a packet loss on a network, a delay-based detection algorithm may be performed based on RTT (Round-Trip Time) information to determine whether the network is congested. If the delay-based detection algorithm does not determine that the network is congested, then a consistency-based detection algorithm may be performed based on packet loss rate information. If either the delay-based detection algorithm or the consistency-based detection algorithm determines that the network is congested, then the rate of data transmission may be reduced, e.g., by reducing a congestion window size.
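
    The two-stage decision in this abstract can be sketched as below. The abstract does not give the detection rules, so both tests here are assumptions: the delay test compares the current RTT with a multiple of the lowest observed RTT, and the consistency test flags congestion when the recent loss rate rises well above a long-run baseline. `LossClassifier`, `rtt_factor`, and `loss_tolerance` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class LossClassifier:
    base_rtt: float              # lowest RTT observed so far, in seconds
    rtt_factor: float = 1.5      # hypothetical delay threshold multiplier
    loss_tolerance: float = 0.5  # hypothetical allowed relative rise in loss rate

    def delay_says_congested(self, current_rtt: float) -> bool:
        """Stage 1: queueing delay inflates the RTT under congestion."""
        return current_rtt > self.base_rtt * self.rtt_factor

    def consistency_says_congested(self, recent_loss: float, baseline_loss: float) -> bool:
        """Stage 2 (assumed rule): a loss rate well above baseline suggests congestion."""
        return recent_loss > baseline_loss * (1.0 + self.loss_tolerance)

    def on_packet_loss(self, cwnd: int, current_rtt: float,
                       recent_loss: float, baseline_loss: float) -> int:
        """Return the new congestion window after a loss event."""
        congested = self.delay_says_congested(current_rtt)
        if not congested:
            # The second test runs only when the delay test is inconclusive.
            congested = self.consistency_says_congested(recent_loss, baseline_loss)
        # Damage loss leaves the window unchanged; congestion halves it.
        return max(1, cwnd // 2) if congested else cwnd
```

    The point of the ordering is that a lossy but uncongested link keeps its sending rate, whereas a conventional loss-based algorithm would back off on every dropped packet.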

    Systems and methods for providing increased scalability in deduplication storage systems
    9.
    Granted patent (status: in force)

    Publication No.: US08954401B2

    Publication date: 2015-02-10

    Application No.: US13007301

    Filing date: 2011-01-14

    Abstract: A computer-implemented method for providing increased scalability in deduplication storage systems may include (1) identifying a database that stores a plurality of reference objects, (2) determining that at least one size-related characteristic of the database has reached a predetermined threshold, (3) partitioning the database into a plurality of sub-databases capable of being updated independent of one another, (4) identifying a request to perform an update operation that updates one or more reference objects stored within at least one sub-database, and then (5) performing the update operation on less than all of the sub-databases to avoid processing costs associated with performing the update operation on all of the sub-databases. Various other systems, methods, and computer-readable media are also disclosed.
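
    Steps (2) through (5) might look like the sketch below. It assumes the size-related characteristic is an entry count and that keys are assigned to sub-databases by a CRC32 hash; the `ReferenceDB` class and both choices are illustrative only.

```python
import zlib
from typing import Dict, List, Set


class ReferenceDB:
    """Reference store that splits into sub-databases once it grows large."""

    def __init__(self, max_entries: int, partitions: int = 4) -> None:
        self.max_entries = max_entries
        self.partition_count = partitions
        self.subdbs: List[Dict[str, int]] = [{}]

    def _slot(self, key: str) -> int:
        # Assumption: hash partitioning; CRC32 is stable across runs.
        return zlib.crc32(key.encode()) % len(self.subdbs)

    def put(self, key: str, value: int) -> None:
        self.subdbs[self._slot(key)][key] = value
        if len(self.subdbs) == 1 and len(self.subdbs[0]) >= self.max_entries:
            self._partition()

    def _partition(self) -> None:
        """Split the single database into independent sub-databases."""
        old = self.subdbs[0]
        self.subdbs = [{} for _ in range(self.partition_count)]
        for key, value in old.items():
            self.subdbs[self._slot(key)][key] = value

    def update(self, changes: Dict[str, int]) -> Set[int]:
        """Apply changes and return the sub-databases that were touched."""
        touched: Set[int] = set()
        for key, value in changes.items():
            slot = self._slot(key)
            self.subdbs[slot][key] = value
            touched.add(slot)
        return touched
```

    An update that affects one reference object rewrites a single sub-database and leaves the others untouched, which is where the saving in processing cost comes from.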

    SYSTEM AND METHOD FOR SCALABLE REFERENCE MANAGEMENT IN A DEDUPLICATION BASED STORAGE SYSTEM
    10.
    Patent application (status: in force)

    Publication No.: US20120059800A1

    Publication date: 2012-03-08

    Application No.: US12875815

    Filing date: 2010-09-03

    Applicant: Fanglu Guo

    Inventor: Fanglu Guo

    IPC classification: G06F17/00

    Abstract: A system and method for managing a resource reclamation reference list at a coarse level. A storage device is configured to store a plurality of storage objects in a plurality of storage containers, each of said storage containers being configured to store a plurality of said storage objects. A storage container reference list is maintained, wherein for each of the storage containers the storage container reference list identifies which files of a plurality of files reference a storage object within a given storage container. In response to detecting deletion of a given file that references an object within a particular storage container of the storage containers, a server is configured to update the storage container reference list by removing from the storage container reference list an identification of the given file. A reference list associating segment objects with files that reference those segment objects may not be updated in response to the deletion.
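
    A container-level reference list of the kind described here can be sketched in a few lines. The `ContainerRefList` class and its method names are hypothetical; the sketch tracks only which files reference each container and deliberately keeps no per-segment list.

```python
from typing import Dict, List, Set


class ContainerRefList:
    """Coarse reference list: container id -> ids of files that reference it."""

    def __init__(self) -> None:
        self.refs: Dict[str, Set[str]] = {}

    def add_reference(self, container_id: str, file_id: str) -> None:
        """Record that a file uses at least one object in the container."""
        self.refs.setdefault(container_id, set()).add(file_id)

    def delete_file(self, file_id: str) -> List[str]:
        """Drop the file from every container's list.

        Returns the containers left with no referencing files, which are
        candidates for resource reclamation.
        """
        unreferenced = []
        for container_id, files in self.refs.items():
            files.discard(file_id)
            if not files:
                unreferenced.append(container_id)
        return sorted(unreferenced)
```

    Deleting a file costs one set removal per container rather than one update per segment, at the price of reclaiming space only when a whole container becomes unreferenced.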
