Storage-Side Scanning on Non-Natively Formatted Data
    3.
    发明申请
    Storage-Side Scanning on Non-Natively Formatted Data 审中-公开
    非本地格式化数据的存储侧扫描

    公开(公告)号:US20150356158A1

    公开(公告)日:2015-12-10

    申请号:US14733691

    申请日:2015-06-08

    Abstract: A storage system communicatively coupled to a DBMS performs storage-side scanning of data sources that are not stored in the native database storage format of the DBMS. Data sources for external tables are accessible in a storage system referred to herein as a distributed data access system, e.g. a Hadoop Distributed File System. To execute a query that references an external table, a DBMS first generates an execution plan. The distributed data access system supplies the DBMS with information that specifies each portion of the data source, and specifies which data node to use to access the portion. The DBMS sends a request for each portion to the respective data node, the request requesting that the data node generate rows from data in the portion. The request may specify scanning criteria, specifying one or more columns to project and/or filter on. The request may also specify code modules for the data node to execute to generate rows or records and columns.

    Abstract translation: 通信地耦合到DBMS的存储系统对不存储在DBMS的本地数据库存储格式的数据源执行存储侧扫描。 用于外部表的数据源可在本文称为分布式数据访问系统的存储系统中访问,例如, 一个Hadoop分布式文件系统。 要执行引用外部表的查询,DBMS首先生成执行计划。 分布式数据访问系统向DBMS提供指定数据源的每个部分的信息,并指定要用于访问该部分的数据节点。 DBMS向每个数据节点发送每个部分的请求,该请求请求数据节点从该部分中的数据生成行。 请求可以指定扫描条件,指定一个或多个列进行投影和/或过滤。 该请求还可以指定用于数据节点执行的代码模块以生成行或记录和列。

    Transactional query processing in external tables

    公开(公告)号:US12242458B2

    公开(公告)日:2025-03-04

    申请号:US17588844

    申请日:2022-01-31

    Abstract: Consistent External Table Access maintains transactional consistency for queries that access external tables stored in a DBFS. This ability is achieved by bypassing the OS. One or more database processes executing a query that access an external table stored in a DBFS access the database-file table like other database tables in the DBMS that can be accessed to execute a query. Based on metadata stored in the DBMS regarding how an external table is stored in a DBFS, a DBMS is able to marshal database processes that access database-file tables directly to execute a query.

    Caching Large Objects In A Computer System With Mixed Data Warehousing And Online Transaction Processing Workload
    5.
    发明申请
    Caching Large Objects In A Computer System With Mixed Data Warehousing And Online Transaction Processing Workload 审中-公开
    在具有混合数据仓库和在线事务处理工作负载的计算机系统中缓存大对象

    公开(公告)号:US20140095802A1

    公开(公告)日:2014-04-03

    申请号:US13831462

    申请日:2013-03-14

    CPC classification number: G06F12/128 G06F12/126 G06F16/24561

    Abstract: Techniques are provided for managing cached data objects in a mixed workload environment. In an embodiment, a database system receives request to access a target data object. The database system determines whether the request to access the target data object is associated with a first type of workload or a second type of workload. In response to determining that the request is associated with the first type of workload, the target data object replaces a least recently used data object in a cache. In response to determining that the request is associated with the second type of workload, the target data object is cached based on an associated access-level value.

    Abstract translation: 提供了在混合工作负载环境中管理缓存数据对象的技术。 在一个实施例中,数据库系统接收访问目标数据对象的请求。 数据库系统确定访问目标数据对象的请求是否与第一类工作负载或第二类工作负载相关联。 响应于确定该请求与第一类型的工作负载相关联,目标数据对象将替换高速缓存中最近最少使用的数据对象。 响应于确定该请求与第二类型的工作负载相关联,基于相关联的访问级别值来缓存目标数据对象。

Patent Agency Ranking