Optimizing a query with extrema function using in-memory data summaries on the storage server

    Publication No.: US10528590B2

    Publication date: 2020-01-07

    Application No.: US14726372

    Filing date: 2015-05-29

    Abstract: Techniques for optimizing a query with an extrema function are provided. In main memory, a data summary is maintained for a plurality of extents stored by at least one storage server. The data summary includes an extent minimum value and an extent maximum value for one or more columns. A storage server request is received, from a database server, based on a query with an extrema function applied to a particular column of a particular table. The data summaries for a set of relevant extents are processed by maintaining at least one global extrema value corresponding to the extrema function and, for each relevant extent of the set of relevant extents, determining whether to scan records of the relevant extent based on at least one of the global extrema value and an extent summary value of the data summary of the relevant extent.
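
The pruning idea in this abstract can be sketched for a MAX query as follows. This is a minimal illustration, not the patented implementation: the `Extent` class, the `global_max` function, and the choice to visit extents in descending order of summary maximum are all assumptions made for the example. The summary maximum is treated only as an upper bound on the values actually stored in the extent.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Extent:
    """Hypothetical extent: an in-memory summary plus its stored records."""
    min_value: int      # extent minimum for the column (held in main memory)
    max_value: int      # extent maximum for the column (an upper bound)
    records: List[int]  # column values; reading them stands in for a disk scan


def global_max(extents: List[Extent]) -> Tuple[Optional[int], int]:
    """Return (maximum value, number of extents scanned) for a MAX query."""
    best: Optional[int] = None  # the global extrema value maintained so far
    scanned = 0
    # Assumption: visiting larger summary maxima first lets more extents be skipped.
    for ext in sorted(extents, key=lambda e: e.max_value, reverse=True):
        if best is not None and ext.max_value <= best:
            continue  # the summary proves this extent cannot raise the maximum
        scanned += 1
        local = max(ext.records)  # scan the records of the relevant extent
        if best is None or local > best:
            best = local
    return best, scanned


extents = [
    Extent(1, 10, [1, 5, 10]),
    Extent(2, 8, [2, 8]),
    Extent(20, 30, [20, 30]),
]
result = global_max(extents)
```

In this example only the extent whose summary maximum is 30 has its records read; the other two are skipped on the strength of their summaries alone.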

    Database snapshots
    2.
    Granted patent

    Publication No.: US10387447B2

    Publication date: 2019-08-20

    Application No.: US14823212

    Filing date: 2015-08-11

    Abstract: Techniques herein are for creating a database snapshot by creating a sparse database. A method involves receiving a creation request to create a sparse database. The creation request has an identity of a parent database. The creation request is processed to create a sparse database. The sparse database has the identity of the parent database. The sparse database does not contain data copied from the parent database. A write request to write data into the sparse database is received. The write request is processed by writing the data into the sparse database. The parent database does not receive the data.
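
The behaviour described in this abstract can be sketched in a few lines. This is a hedged illustration under stated assumptions: the `Database` and `SparseDatabase` classes and the block-keyed dictionary are hypothetical, and the read path that falls back to the parent is an assumption, since the abstract only covers creation and writes.

```python
class Database:
    """Hypothetical parent database modelled as a map of block id to data."""

    def __init__(self, blocks=None):
        self.blocks = dict(blocks or {})


class SparseDatabase:
    """Snapshot that records the parent's identity but copies none of its data."""

    def __init__(self, parent):
        self.parent = parent  # identity of the parent database
        self.blocks = {}      # starts empty: nothing is copied at creation

    def write(self, block_id, data):
        # Writes land in the sparse database only; the parent never sees them.
        self.blocks[block_id] = data

    def read(self, block_id):
        # Assumption: blocks not yet written locally are served from the parent.
        if block_id in self.blocks:
            return self.blocks[block_id]
        return self.parent.blocks.get(block_id)


parent = Database({1: "a", 2: "b"})
snapshot = SparseDatabase(parent)
snapshot.write(1, "x")
```

After the write, `snapshot.read(1)` returns the new value while `parent.blocks[1]` is unchanged.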

    Intelligent network resource manager

    Publication No.: US10742560B2

    Publication date: 2020-08-11

    Application No.: US16287521

    Filing date: 2019-02-27

    Abstract: A method and apparatus for intelligent network resource manager for distributed computing systems is provided. A first priority is assigned to a first virtual channel set that includes at least two virtual channels of a plurality of virtual channels associated with a physical communication channel. A second priority is assigned to a second virtual channel set that includes at least one virtual channel of the plurality of virtual channels. The first virtual channel set has more virtual channels than the second virtual channel set. Outbound messages of the first priority are directed to virtual channels of the first virtual channel set. Outbound messages of the second priority are directed to virtual channels of the second virtual channel set. The virtual channels are processed in a round-robin order, where processing includes sending the outbound messages over the physical communication channel.
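
A rough sketch of the channel-set scheme follows. The `ChannelManager` class, the priority names, and the way a message is assigned to a channel within its set (simple rotation) are assumptions for illustration; the abstract only states that messages go to channels of the matching set and that channels are processed round-robin.

```python
from collections import deque
from itertools import cycle


class ChannelManager:
    """Hypothetical manager for virtual channels sharing one physical channel."""

    def __init__(self, channel_sets):
        # channel_sets maps a priority name to the virtual channel ids in its set.
        self.queues = {vc: deque() for ids in channel_sets.values() for vc in ids}
        # Assumption: messages rotate across the channels of their priority's set.
        self.pickers = {prio: cycle(ids) for prio, ids in channel_sets.items()}

    def enqueue(self, priority, message):
        vc = next(self.pickers[priority])
        self.queues[vc].append(message)

    def drain(self):
        """Visit channels round-robin, sending one message per non-empty channel."""
        sent = []
        while any(self.queues.values()):
            for vc in sorted(self.queues):
                if self.queues[vc]:
                    sent.append(self.queues[vc].popleft())
        return sent


manager = ChannelManager({"high": [0, 1, 2], "low": [3]})
for msg in ("l1", "l2", "l3"):
    manager.enqueue("low", msg)
for msg in ("h1", "h2", "h3"):
    manager.enqueue("high", msg)
order = manager.drain()
```

Because the high-priority set owns three of the four channels, it receives three send slots per round-robin pass, so all three high-priority messages leave in the first pass even though they were queued last.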

    INTELLIGENT NETWORK RESOURCE MANAGER
    5.
    Patent application (status: under examination, published)

    Publication No.: US20150089008A1

    Publication date: 2015-03-26

    Application No.: US14488161

    Filing date: 2014-09-16

    CPC classification number: H04L47/24 H04L47/2433 H04L47/2441 H04L67/322

    Abstract: A method and apparatus for intelligent network resource manager for distributed computing systems is provided. A first priority is assigned to a first virtual channel set that includes at least two virtual channels of a plurality of virtual channels associated with a physical communication channel. A second priority is assigned to a second virtual channel set that includes at least one virtual channel of the plurality of virtual channels. The first virtual channel set has more virtual channels than the second virtual channel set. Outbound messages of the first priority are directed to virtual channels of the first virtual channel set. Outbound messages of the second priority are directed to virtual channels of the second virtual channel set. The virtual channels are processed in a round-robin order, where processing includes sending the outbound messages over the physical communication channel.

    STORAGE-SIDE STORAGE REQUEST MANAGEMENT
    6.
    Patent application (status: in force)

    Publication No.: US20130339636A1

    Publication date: 2013-12-19

    Application No.: US13971158

    Filing date: 2013-08-20

    Abstract: Techniques are provided for managing, within a storage system, the sequence in which I/O requests are processed by the storage system based, at least in part, on one or more logical characteristics of the I/O requests. The logical characteristics may include, for example, the identity of the user for whom the I/O request was submitted, the service that submitted the I/O request, the database targeted by the I/O request, an indication of a consumer group to which the I/O request maps, the reason why the I/O request was issued, a priority category of the I/O request, etc. Techniques are also provided for automatically establishing a scheduling policy within a storage system, and for dynamically changing the scheduling policy in response to changes in workload.
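
One way to picture sequencing by logical characteristics is a priority queue keyed on the consumer group. The sketch below is an assumption-laden illustration: the `IORequest` fields, the `IOScheduler` class, and the group names and ranks are all hypothetical, and ordering on the consumer group alone is a simplification of the characteristics listed in the abstract.

```python
import heapq
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class IORequest:
    """Hypothetical I/O request tagged with logical characteristics."""
    user: str
    database: str
    consumer_group: str
    reason: str


class IOScheduler:
    """Orders requests by a per-consumer-group rank (lower rank is served first)."""

    def __init__(self, group_rank):
        self.group_rank = dict(group_rank)  # the scheduling policy
        self._heap = []
        self._seq = count()  # tie-breaker: keeps FIFO order within a rank

    def set_policy(self, group_rank):
        # A changed policy applies to requests submitted from now on.
        self.group_rank = dict(group_rank)

    def submit(self, req):
        rank = self.group_rank.get(req.consumer_group, len(self.group_rank))
        heapq.heappush(self._heap, (rank, next(self._seq), req))

    def next_request(self):
        return heapq.heappop(self._heap)[2] if self._heap else None


scheduler = IOScheduler({"interactive": 0, "batch": 1})
scheduler.submit(IORequest("etl", "sales", "batch", "table scan"))
scheduler.submit(IORequest("alice", "sales", "interactive", "index lookup"))
first = scheduler.next_request()
```

The interactive request is served first even though the batch request arrived earlier. `set_policy` hints at how a policy could be replaced as the workload changes.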

    SELECTIVELY READING DATA FROM CACHE AND PRIMARY STORAGE
    7.
    Patent application (status: in force)

    Publication No.: US20130212332A1

    Publication date: 2013-08-15

    Application No.: US13839251

    Filing date: 2013-03-15

    Abstract: Techniques are provided for using an intermediate cache to provide some of the items involved in a scan operation, while other items involved in the scan operation are provided from primary storage. Techniques are also provided for determining whether to service an I/O request for an item with a copy of the item that resides in the intermediate cache based on factors such as a) an identity of the user for whom the I/O request was submitted, b) an identity of a service that submitted the I/O request, c) an indication of a consumer group to which the I/O request maps, or d) whether the intermediate cache is overloaded. Techniques are also provided for determining whether to store items in an intermediate cache in response to the items being retrieved, based on logical characteristics associated with the requests that retrieve the items.
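
The split of a scan between the intermediate cache and primary storage can be sketched as a per-item decision. Everything named here is hypothetical: `CachePolicy`, `serve_from_cache`, `plan_scan`, the group names, and the use of a pending-read count as the overload signal are assumptions chosen to keep the example small.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CachePolicy:
    """Hypothetical policy: which consumer groups may use the cache, and its load limit."""
    cached_groups: frozenset
    max_pending: int  # the cache counts as overloaded at this many pending reads


def serve_from_cache(item, cache, consumer_group, pending, policy):
    """Decide whether one item of a scan is read from the intermediate cache."""
    if item not in cache:
        return False  # no cached copy exists
    if pending >= policy.max_pending:
        return False  # cache overloaded: fall back to primary storage
    return consumer_group in policy.cached_groups


def plan_scan(items, cache, consumer_group, policy):
    """Split a scan into items served from the cache and items from primary storage."""
    from_cache, from_primary = [], []
    for item in items:
        if serve_from_cache(item, cache, consumer_group, len(from_cache), policy):
            from_cache.append(item)
        else:
            from_primary.append(item)
    return from_cache, from_primary


policy = CachePolicy(cached_groups=frozenset({"oltp"}), max_pending=2)
cache = {"a", "b", "c"}
plan = plan_scan(["a", "b", "c", "d"], cache, "oltp", policy)
```

Here items `a` and `b` come from the cache, `c` is diverted to primary storage because the cache has reached its limit, and `d` was never cached.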

    Storage-Side Scanning on Non-Natively Formatted Data
    10.
    Patent application (status: under examination, published)

    Publication No.: US20150356158A1

    Publication date: 2015-12-10

    Application No.: US14733691

    Filing date: 2015-06-08

    Abstract: A storage system communicatively coupled to a DBMS performs storage-side scanning of data sources that are not stored in the native database storage format of the DBMS. Data sources for external tables are accessible in a storage system referred to herein as a distributed data access system, e.g. a Hadoop Distributed File System. To execute a query that references an external table, a DBMS first generates an execution plan. The distributed data access system supplies the DBMS with information that specifies each portion of the data source, and specifies which data node to use to access the portion. The DBMS sends a request for each portion to the respective data node, the request requesting that the data node generate rows from data in the portion. The request may specify scanning criteria, specifying one or more columns to project and/or filter on. The request may also specify code modules for the data node to execute to generate rows or records and columns.
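
The data-node side of such a request can be sketched as a function that turns a raw portion into rows while applying the scanning criteria. This is only an illustration: `scan_portion` is a hypothetical name, CSV stands in for whatever non-native format the data source uses, and passing the filter as a Python callable is an assumption about how criteria might be expressed.

```python
import csv
import io


def scan_portion(portion_text, project, predicate=None):
    """Hypothetical data-node scan: parse a raw CSV portion, filter, then project."""
    reader = csv.DictReader(io.StringIO(portion_text))
    rows = []
    for record in reader:
        # Filtering and projection happen at the data node, so only matching
        # rows with the requested columns travel back to the DBMS.
        if predicate is None or predicate(record):
            rows.append(tuple(record[col] for col in project))
    return rows


portion = "id,name,qty\n1,bolt,5\n2,nut,12\n"
rows = scan_portion(portion, ["name"], lambda r: int(r["qty"]) > 10)
```

In a fuller design the DBMS would issue one such request per portion to the data node named for that portion and combine the returned rows.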
