-
公开(公告)号:US10055472B2
公开(公告)日:2018-08-21
申请号:US14626836
申请日:2015-02-19
发明人: Benoit Dageville , Thierry Cruanes , Marcin Zukowski , Allison Waingold Lee , Philipp Thomas Unterbrunner
CPC分类号: G06F16/27 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/1827 , G06F16/211 , G06F16/221 , G06F16/2365 , G06F16/24532 , G06F16/24545 , G06F16/24552 , G06F16/2456 , G06F16/2471 , G06F16/951 , G06F16/9535 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
-
公开(公告)号:US20170235750A1
公开(公告)日:2017-08-17
申请号:US15582126
申请日:2017-04-28
CPC分类号: G06F17/30575 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F17/30106 , G06F17/302 , G06F17/30292 , G06F17/30315 , G06F17/30371 , G06F17/30445 , G06F17/30463 , G06F17/30466 , G06F17/30469 , G06F17/30477 , G06F17/3048 , G06F17/30498 , G06F17/30545 , G06F17/30598 , G06F17/30864 , G06F17/30867 , G06F17/30914 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: Example data management systems and methods are described. In one implementation, a method identifies multiple files to process based on a received query and identifies multiple execution nodes available to process the multiple files. The method initially creates multiple scansets, each including a portion of the multiple files, and assigns each scanset to one of the execution nodes based on a file assignment model. The multiple scansets are processed by the multiple execution nodes. If the method determines that a particular execution node has finished processing all files in its assigned scanset, an unprocessed file is reassigned from another execution node to the particular execution node.
-
公开(公告)号:US09576039B2
公开(公告)日:2017-02-21
申请号:US14518898
申请日:2014-10-20
CPC分类号: G06F17/30575 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F17/30106 , G06F17/302 , G06F17/30292 , G06F17/30315 , G06F17/30371 , G06F17/30445 , G06F17/30463 , G06F17/30466 , G06F17/30469 , G06F17/30477 , G06F17/3048 , G06F17/30498 , G06F17/30545 , G06F17/30598 , G06F17/30864 , G06F17/30867 , G06F17/30914 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: Example resource provisioning systems and methods are described. In one implementation, an execution platform accesses multiple remote storage devices. The execution platform includes multiple virtual warehouses, each of which includes a cache to store data retrieved from the remote storage devices and a processor that is independent of the remote storage devices. A resource manager is coupled to the execution platform and monitors received data processing requests and resource utilization. The resource manager also determines whether additional virtual warehouses are needed based on the data processing requests and the resource utilization. If additional virtual warehouses are needed, the resource manager provisions a new virtual warehouse.
摘要翻译: 描述示例资源供应系统和方法。 在一个实现中,执行平台访问多个远程存储设备。 执行平台包括多个虚拟仓库,每个虚拟仓库包括用于存储从远程存储设备检索的数据的高速缓存和独立于远程存储设备的处理器。 资源管理器耦合到执行平台,并监视接收到的数据处理请求和资源利用。 资源管理器还基于数据处理请求和资源利用来确定是否需要额外的虚拟仓库。 如果需要额外的虚拟仓库,资源管理器将提供一个新的虚拟仓库。
-
公开(公告)号:US20180018343A1
公开(公告)日:2018-01-18
申请号:US15210536
申请日:2016-07-14
IPC分类号: G06F17/30
CPC分类号: G06F16/162 , G06F16/215 , G06F16/22 , G06F16/24544 , G06F16/9535
摘要: A system, apparatus, and method for processing queries wherein the query includes a request to access or delete data and accessing metadata associated with the set of data, the metadata defining data characteristics of the set of data and identifying at least sets of data that need or not need to be accessed or deleted based on the metadata without accessing the actual data in the set of data; also methods to optimize processing of some operations based on the collected metadata on data.
-
公开(公告)号:US20150234682A1
公开(公告)日:2015-08-20
申请号:US14518898
申请日:2014-10-20
CPC分类号: G06F17/30575 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F17/30106 , G06F17/302 , G06F17/30292 , G06F17/30315 , G06F17/30371 , G06F17/30445 , G06F17/30463 , G06F17/30466 , G06F17/30469 , G06F17/30477 , G06F17/3048 , G06F17/30498 , G06F17/30545 , G06F17/30598 , G06F17/30864 , G06F17/30867 , G06F17/30914 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: Example resource provisioning systems and methods are described. In one implementation, an execution platform accesses multiple remote storage devices. The execution platform includes multiple virtual warehouses, each of which includes a cache to store data retrieved from the remote storage devices and a processor that is independent of the remote storage devices. A resource manager is coupled to the execution platform and monitors received data processing requests and resource utilization. The resource manager also determines whether additional virtual warehouses are needed based on the data processing requests and the resource utilization. If additional virtual warehouses are needed, the resource manager provisions a new virtual warehouse.
摘要翻译: 描述示例资源供应系统和方法。 在一个实现中,执行平台访问多个远程存储设备。 执行平台包括多个虚拟仓库,每个虚拟仓库包括用于存储从远程存储设备检索的数据的高速缓存和独立于远程存储设备的处理器。 资源管理器耦合到执行平台,并监视接收到的数据处理请求和资源利用。 资源管理器还基于数据处理请求和资源利用来确定是否需要额外的虚拟仓库。 如果需要额外的虚拟仓库,资源管理器将提供一个新的虚拟仓库。
-
公开(公告)号:US10366102B2
公开(公告)日:2019-07-30
申请号:US14518884
申请日:2014-10-20
IPC分类号: G06F15/167 , G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
摘要: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
-
公开(公告)号:US20150237137A1
公开(公告)日:2015-08-20
申请号:US14518884
申请日:2014-10-20
CPC分类号: G06F16/27 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/1827 , G06F16/211 , G06F16/221 , G06F16/2365 , G06F16/24532 , G06F16/24545 , G06F16/24552 , G06F16/2456 , G06F16/2471 , G06F16/951 , G06F16/9535 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
摘要翻译: 描述示例资源管理系统和方法。 在一个实现中,资源管理器被配置为管理与多个数据元素相关联的数据处理任务。 执行平台耦合到资源管理器,并且包括被配置为存储从多个远程存储设备检索的数据的多个执行节点。 每个执行节点包括高速缓存和处理器,其中高速缓存和处理器独立于远程存储设备。 元数据管理器被配置为访问与多个数据元素的至少一部分相关联的元数据。
-
公开(公告)号:US20150234922A1
公开(公告)日:2015-08-20
申请号:US14518971
申请日:2014-10-20
CPC分类号: G06F16/27 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/1827 , G06F16/211 , G06F16/221 , G06F16/2365 , G06F16/24532 , G06F16/24545 , G06F16/24552 , G06F16/2456 , G06F16/2471 , G06F16/951 , G06F16/9535 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
摘要翻译: 描述示例缓存系统和方法。 在一个实现中,一种方法识别用于处理查询的多个文件,并将多个文件中的每个文件分配给特定执行节点以执行查询。 每个执行节点确定分布式文件是否存储在执行节点的高速缓存中。 如果执行节点确定文件存储在缓存中,则使用缓存文件处理查询。 如果文件未存储在缓存中,则执行节点从远程存储设备检索文件,将文件存储在执行节点的缓存中,并使用该文件处理查询。
-
公开(公告)号:US20150234894A1
公开(公告)日:2015-08-20
申请号:US14626853
申请日:2015-02-19
IPC分类号: G06F17/30
摘要: A system and method for managing data storage and data access with querying data in a distributed system without buffering the results on intermediate operations in disk storage.
摘要翻译: 用于在分布式系统中用查询数据管理数据存储和数据访问的系统和方法,而不缓冲磁盘存储中的中间操作的结果。
-
公开(公告)号:US20150234688A1
公开(公告)日:2015-08-20
申请号:US14518873
申请日:2014-10-20
CPC分类号: G06F17/30575 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F17/30106 , G06F17/302 , G06F17/30292 , G06F17/30315 , G06F17/30371 , G06F17/30445 , G06F17/30463 , G06F17/30466 , G06F17/30469 , G06F17/30477 , G06F17/3048 , G06F17/30498 , G06F17/30545 , G06F17/30598 , G06F17/30864 , G06F17/30867 , G06F17/30914 , H04L67/1095 , H04L67/1097 , H04L67/2842
摘要: Example data management systems and methods are described. In one implementation, a method identifies multiple files to process based on a received query and identifies multiple execution nodes available to process the multiple files. The method initially creates multiple scansets, each including a portion of the multiple files, and assigns each scanset to one of the execution nodes based on a file assignment model. The multiple scansets are processed by the multiple execution nodes. If the method determines that a particular execution node has finished processing all files in its assigned scanset, an unprocessed file is reassigned from another execution node to the particular execution node.
摘要翻译: 描述示例数据管理系统和方法。 在一个实现中,一种方法基于接收到的查询来识别要处理的多个文件,并且识别可用于处理多个文件的多个执行节点。 该方法最初创建多个scanets,每个都包含多个文件的一部分,并且基于文件分配模型将每个scanet分配给其中一个执行节点。 多个scanets由多个执行节点处理。 如果该方法确定特定执行节点已完成处理其分配的扫描集中的所有文件,则未处理的文件将从另一执行节点重新分配给特定的执行节点。
-
-
-
-
-
-
-
-
-