Two-dimensional indexes for quick multiple attribute search in a catalog system
    11.
    发明授权
    Two-dimensional indexes for quick multiple attribute search in a catalog system 有权
    用于目录系统中快速多属性搜索的二维索引

    公开(公告)号:US08219564B1

    公开(公告)日:2012-07-10

    申请号:US12111530

    申请日:2008-04-29

    CPC classification number: G06F17/30946

    Abstract: Embodiments of the present invention provide mechanisms that overcome limitations of existing indexes by creating two-dimensional (2D) spatial indexes to quickly locate data containers that match two or more predicates. This is accomplished by representing metadata attributes describing a data container as dimensions in a 2D space so that a data container can be expressed as a point or a cell in a 2D space with coordinates being a pair of values of the selected attributes. A space filling curve is used to traverse the 2D space and convert each pair of the 2D coordinates to a single space filling curve value. A 2D spatial index is then created based on the computed space filling curve values so that one value can be associated with one or more points (data containers) in the index. Advantageously, the created spatial index provides for searching and processing fewer metadata entries, thereby decreasing the time typically used to search for data.

    Abstract translation: 本发明的实施例提供了通过创建二维(2D)空间索引来快速定位与两个或更多个谓词匹配的数据容器来克服现有索引的限制的机制。 这通过将描述数据容器的元数据属性表示为2D空间中的维度来实现,使得数据容器可以被表示为具有坐标是所选属性的一对值的2D空间中的点或单元。 空间填充曲线用于遍历2D空间,并将每对2D坐标转换为单个空间填充曲线值。 然后基于计算的空间填充曲线值创建2D空间索引,使得一个值可以与索引中的一个或多个点(数据容器)相关联。 有利地,创建的空间索引提供搜索和处理较少的元数据条目,从而减少通常用于搜索数据的时间。

    Content repository implemented in a network storage server system
    12.
    发明授权
    Content repository implemented in a network storage server system 有权
    内容存储库在网络存储服务器系统中实现

    公开(公告)号:US08180813B1

    公开(公告)日:2012-05-15

    申请号:US12633718

    申请日:2009-12-08

    CPC classification number: G06F17/30997

    Abstract: A network storage server system includes a distributed object store, a presentation layer, a metadata subsystem, and a content management subsystem. The object store has no namespace and provides location-independent addressing of data objects. The presentation layer provides multiple interfaces for accessing data stored in the object store, including a NAS interface and a Web Service interface, and provides at least one namespace for accessing data via the NAS interface or the Web Service interface. The Web Service interface allows access to stored data via the namespace or without using the namespace (“raw object” access). The metadata subsystem stores user-specified and/or system-generated metadata relating to data objects and allows data objects to be identified and retrieved by searching on the metadata. The content management subsystem autonomously manages lifecycles of data objects according to user-specified policies, based on metadata associated with the data objects and tracked by the metadata subsystem.

    Abstract translation: 网络存储服务器系统包括分布式对象存储,表示层,元数据子系统和内容管理子系统。 对象存储没有命名空间,并提供数据对象的位置无关寻址。 表示层提供多个接口,用于访问存储在对象存储中的数据,包括NAS接口和Web服务接口,并提供至少一个用于通过NAS接口或Web服务接口访问数据的命名空间。 Web服务界面允许通过命名空间访问存储的数据,也可以不使用命名空间(“raw object”访问))。 元数据子系统存储与数据对象相关的用户指定的和/或系统生成的元数据,并允许通过搜索元数据来识别和检索数据对象。 内容管理子系统根据用户指定的策略,根据与数据对象相关联并由元数据子系统跟踪的元数据自主地管理数据对象的生命周期。

    System and method for nearly in-band search indexing
    13.
    发明授权
    System and method for nearly in-band search indexing 有权
    用于几乎带内搜索索引的系统和方法

    公开(公告)号:US07949638B1

    公开(公告)日:2011-05-24

    申请号:US12750137

    申请日:2010-03-30

    CPC classification number: G06F17/30091

    Abstract: A system and method for nearly in-band search indexing. A network switch (or other intermediate network device) is configured to provide port mirroring so that data access requests directed to a storage system are forwarded to both the storage system and to a search appliance. The search appliance collects index information from the received data access requests to update a search index. As the search appliance is nearly in-band, i.e., not directly in-line of the data access request path, no increase of latency occurs for processing data access requests by the storage system.

    Abstract translation: 一种用于近地带内搜索索引的系统和方法。 网络交换机(或其他中间网络设备)被配置为提供端口镜像,使得定向到存储系统的数据访问请求被转发到存储系统和Search Appliance两者。 Search Appliance从接收的数据访问请求收集索引信息,以更新搜索索引。 由于Search Appliance几乎是带内的,即不是数据访问请求路径的直接内联,所以不会增加等待时间来处理存储系统的数据访问请求。

    Facilitating delayed block allocation in a distributed file system
    14.
    发明授权
    Facilitating delayed block allocation in a distributed file system 有权
    促进分布式文件系统中的延迟块分配

    公开(公告)号:US07376679B2

    公开(公告)日:2008-05-20

    申请号:US10286778

    申请日:2002-11-01

    Abstract: One embodiment of the present invention provides a system that facilitates delayed block allocation in a distributed file system. During operation, the system receives a write command at a client, wherein the write command includes a buffer containing data to be written and a file identifier. In response to receiving the write command, the system reserves a set of disk blocks for the file from a virtual pool of disk blocks allocated to the client. The system also transfers the data to be written to the kernel of the client where the data waits to be transferred to the disk.

    Abstract translation: 本发明的一个实施例提供一种促进分布式文件系统中的延迟块分配的系统。 在操作期间,系统在客户机处接收写入命令,其中写入命令包括包含要写入的数据和文件标识符的缓冲器。 响应于接收到写入命令,系统从分配给客户端的磁盘块的虚拟池中预留一组文件的磁盘块。 系统还将要写入数据的数据传输到数据等待传输到磁盘的客户机内核。

    System and method for improving the relevance of search results using data container access patterns
    15.
    发明授权
    System and method for improving the relevance of search results using data container access patterns 有权
    使用数据容器访问模式来提高搜索结果的相关性的系统和方法

    公开(公告)号:US08996487B1

    公开(公告)日:2015-03-31

    申请号:US11590193

    申请日:2006-10-31

    CPC classification number: G06F17/301

    Abstract: A system and method for improving the relevance of search results using data container access patterns. An indexing process tracks data access patterns and updates an access data structure. When executing a search operation, a search process first identifies a set of data containers containing the search terms. The search process then utilizes the access data structure to rank the identified data containers based on the collected data access pattern information.

    Abstract translation: 一种用于使用数据容器访问模式来提高搜索结果的相关性的系统和方法。 索引过程跟踪数据访问模式并更新访问数据结构。 当执行搜索操作时,搜索处理首先识别包含搜索项的一组数据容器。 然后,搜索过程利用访问数据结构基于收集的数据访问模式信息对所识别的数据容器进行排名。

    Software module for using flash memory as a secondary permanent storage device
    16.
    发明授权
    Software module for using flash memory as a secondary permanent storage device 有权
    使用闪存作为辅助永久存储设备的软件模块

    公开(公告)号:US08499132B1

    公开(公告)日:2013-07-30

    申请号:US12030168

    申请日:2008-02-12

    Abstract: Described herein is a flash remapping (FR) layer in a storage operating system for utilizing flash memory as a secondary permanent storage device in a storage system. The FR layer collects particular information (specified by collection parameters) of received access requests for data stored on primary storage devices of the storage system. Based on the collected information and a predetermined access pattern (specified by pattern parameters), the FR layer selects data sets on the primary storage devices to be transferred permanently to flash memory, whereby subsequent access requests to the selected data sets are redirected to flash memory. New parameters may be received by the FR layer (from a user or program) to dynamically reconfigure the functions of the FR layer. The FR layer may be implemented in the operating system without requiring other code of the storage operating system to be modified.

    Abstract translation: 这里描述的是在存储操作系统中的闪存重映射(FR)层,用于在存储系统中利用闪存作为辅助永久存储设备。 FR层收集存储在存储系统的主存储设备上的数据的接收到的访问请求的特定信息(由收集参数指定)。 基于所收集的信息和预定的访问模式(由模式参数指定),FR层选择将主存储设备上的数据集永久地传送到闪存,由此对所选数据集的后续访问请求被重定向到闪存 。 可以由FR层(来自用户或程序)接收新参数以动态地重新配置FR层的功能。 可以在操作系统中实现FR层,而不需要修改存储操作系统的其他代码。

    Distributed deduplication using global chunk data structure and epochs
    18.
    发明授权
    Distributed deduplication using global chunk data structure and epochs 有权
    使用全局块数据结构和时代分布式重复数据消除

    公开(公告)号:US08930648B1

    公开(公告)日:2015-01-06

    申请号:US13479138

    申请日:2012-05-23

    Abstract: Techniques for a data storage cluster and a method for deduplicating data in the data storage cluster in a scalable manner, by (among other things) using an epoch-based global chunk data structure, are disclosed herein. A global chunk data structure for an epoch is distributed and maintained at a plurality of metadata nodes within the data storage cluster. Fingerprints and identifiers of data chunks are written to the cluster after a particular epoch are written to delta chunk data structures stored in different metadata nodes of the cluster. When the data storage cluster advances to the next epoch, the global chunk data structure is updated using the delta chunk data structures. At any given time, data deduplication in the data storage cluster can be conducted based on the global chunk data structure for the current epoch.

    Abstract translation: 本文公开了一种用于数据存储集群的技术和用于使用基于时代的全局块数据结构(以及其他方式)以可扩展方式对数据存储集群中的数据进行重复数据删除的方法。 用于时代的全局块数据结构被分布和维护在数据存储簇内的多个元数据节点上。 在将特定时期写入存储在集群的不同元数据节点中的增量块数据结构之后,将数据块的指纹和标识符写入集群。 当数据存储集群前进到下一个时期时,使用增量块数据结构来更新全局块数据结构。 在任何给定时间,可以基于当前时期的全局块数据结构来进行数据存储集群中的重复数据删除。

    Proxying search requests for files to a search engine
    19.
    发明授权
    Proxying search requests for files to a search engine 有权
    将文件的搜索请求代理到搜索引擎

    公开(公告)号:US08386446B1

    公开(公告)日:2013-02-26

    申请号:US12606076

    申请日:2009-10-26

    CPC classification number: G06F17/30864 H04L67/1097

    Abstract: A method and apparatus for proxying search requests for a storage system and maintaining a central index for performing the search requests is described herein. An index manager on the storage system may initially produce the central index by examining each file in a file system and update the central index thereafter by examining only those files that have changed since the central index was initially produced or last updated. The index manager may receive a changed file list from a differencing layer configured for comparing snapshots of the file system at different time points to produce changed file lists. A search proxy module may receive search requests in a search protocol and proxy the search requests to a search engine by converting the search requests to another search protocol compatible with the search engine. The search engine may then use the central index for performing the search request.

    Abstract translation: 这里描述了用于代理存储系统的搜索请求并维持用于执行搜索请求的中心索引的方法和装置。 存储系统上的索引管理器可以最初通过检查文件系统中的每个文件来生成中心索引,并且随后通过仅检查自中央索引最初生成或上次更新以来已经改变的那些文件来更新中心索引。 索引管理器可以从配置用于在不同时间点比较文件系统的快照的差分层接收更改的文件列表以产生改变的文件列表。 搜索代理模块可以在搜索协议中接收搜索请求,并通过将搜索请求转换成与搜索引擎兼容的另一搜索协议来将搜索请求代理到搜索引擎。 然后,搜索引擎可以使用中心索引来执行搜索请求。

    Search and update of attributes in file systems
    20.
    发明授权
    Search and update of attributes in file systems 有权
    搜索和更新文件系统中的属性

    公开(公告)号:US08296312B1

    公开(公告)日:2012-10-23

    申请号:US12013953

    申请日:2008-01-14

    CPC classification number: G06F17/30109

    Abstract: Example embodiments provide various techniques for fast and efficient search of attributes stored in data structures. The attributes are organized following a hierarchical structure of the file system and, in an example, the attributes are stored in a data structure where the hierarchical structure is maintained. As a result, a search within such data structure may follow one or more paths along the hierarchical structure of the file system. Attributes associated with directories and files outside of the path can be excluded from the search. Example embodiments also provide various techniques for updating signatures associated with the attributes. In an example, updates to the signatures can be made incrementally. For example, signatures can be updated when the attributes change.

    Abstract translation: 示例性实施例提供用于快速和有效地搜索存储在数据结构中的属性的各种技术。 属性按照文件系统的分层结构进行组织,在一个示例中,属性被存储在维护层次结构的数据结构中。 结果,在这种数据结构内的搜索可以沿着文件系统的层次结构遵循一个或多个路径。 与路径之外的目录和文件相关联的属性可以从搜索中排除。 示例性实施例还提供用于更新与属性相关联的签名的各种技术。 在一个例子中,可以逐步地对签名进行更新。 例如,当属性更改时,可以更新签名。

Patent Agency Ranking