Systems and methods for proactive caching utilizing OLAP variants
    31.
    发明授权
    Systems and methods for proactive caching utilizing OLAP variants 有权
    使用OLAP变体进行主动缓存的系统和方法

    公开(公告)号:US07269581B2

    公开(公告)日:2007-09-11

    申请号:US10402000

    申请日:2003-03-28

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: The present invention leverages MOLAP performance for ROLAP objects (dimensions, partitions and aggregations) by building, in a background process, a MOLAP equivalent of that object. When the background processing completes, queries are switched from ROLAP queries to MOLAP queries. When changes occur to relevant relational objects (such as tables that define content of OLAP objects), an OLAP object is switched back to a ROLAP mode, and all relevant caches are dropped while, as a background process, a new MOLAP equivalent is created.

    摘要翻译: 本发明通过在后台进程中构建该对象的MOLAP等价物来利用ROLAP对象(维度,分区和聚合)的MOLAP性能。 后台处理完成后,查询将从ROLAP查询切换到MOLAP查询。 当相关关系对象(例如定义OLAP对象的内容的表)发生更改时,OLAP对象将切换回ROLAP模式,并且删除所有相关缓存,而作为后台进程创建新的MOLAP等效项。

    Method, system, and apparatus for exposing workbook ranges as data sources
    32.
    发明申请
    Method, system, and apparatus for exposing workbook ranges as data sources 有权
    用于将工作簿范围暴露为数据源的方法,系统和装置

    公开(公告)号:US20050267853A1

    公开(公告)日:2005-12-01

    申请号:US10858175

    申请日:2004-06-01

    CPC分类号: G06F17/246 G06F17/30592

    摘要: A method, system, and apparatus are provided for exposing and utilizing workbook ranges as server data sources. The system includes a client computer capable of executing a spreadsheet application program for creating a workbook including a range that includes data objects. The workbook may be published to a server computer where the specified data objects are exposed as server data sources. The server computer allows client applications to discover and connect to the data objects contained within the workbook as server data sources.

    摘要翻译: 提供了一种用于将工作簿范围作为服务器数据源进行曝光和利用的方法,系统和装置。 该系统包括能够执行用于创建包括包括数据对象的范围的工作簿的电子表格应用程序的客户端计算机。 工作簿可能会被发布到服务器计算机,其中指定的数据对象作为服务器数据源公开。 服务器计算机允许客户端应用程序发现并连接到工作簿中包含的数据对象作为服务器数据源。

    Processing records in dynamic ranges
    33.
    发明授权
    Processing records in dynamic ranges 有权
    在动态范围内处理记录

    公开(公告)号:US09087094B2

    公开(公告)日:2015-07-21

    申请号:US13092978

    申请日:2011-04-25

    IPC分类号: G06F17/30 G06F17/00

    CPC分类号: G06F17/30454 G06F17/30412

    摘要: A scalable analysis system is described herein that performs common data analysis operations such as distinct counts and data grouping in a more scalable and efficient manner. The system allows distinct counts and data grouping to be applied to large datasets with predictable growth in the cost of the operation. The system dynamically partitions data based on the actual data distribution, which provides both scalability and uncompromised performance. The system sets a budget of available memory or other resources to use for the operation. As the operation progresses, the system determines whether the budget of memory is nearing exhaustion. Upon detecting that the memory used is near the limit, the system dynamically partitions the data. If the system still detects memory pressure, then the system partitions again, until a partition level is identified that fits within the memory budget.

    摘要翻译: 本文描述了可扩展分析系统,其以更可扩展和有效的方式执行诸如不同计数和数据分组之类的共同数据分析操作。 该系统允许将不同的计数和数据分组应用于具有可预测的操作成本增长的大型数据集。 系统根据实际的数据分布动态分割数据,提供了可扩展性和无与伦比的性能。 系统设置可用内存或其他资源的预算用于操作。 随着操作的进行,系统确定存储器的预算是否接近耗尽。 在检测到所使用的内存接近限制时,系统会动态分区数据。 如果系统仍然检测到内存压力,则系统再次分区,直到识别出符合内存预算的分区级别。

    Explaining changes in measures thru data mining
    35.
    发明授权
    Explaining changes in measures thru data mining 有权
    解释数据挖掘措施的变化

    公开(公告)号:US07899776B2

    公开(公告)日:2011-03-01

    申请号:US11772480

    申请日:2007-07-02

    IPC分类号: G06F17/00 G06N5/04

    CPC分类号: G06F17/30592

    摘要: Systems and methodologies for identification of factors that cause significant shifts in transactions in a relational store and/or OLAP environment. Transactions are grouped into significant categories defined across the whole data space, to detect interesting sub spaces transactions. Subsequently, sub spaces that show strong variance between two slices can be selected, followed by grouping the subspaces in sub reports to measure the coverage for each sub report. A final report can then be generated that contains list of sub-reports detected in the previous acts.

    摘要翻译: 用于识别在关系存储和/或OLAP环境中导致事务重大变化的因素的系统和方法。 事务被分组在整个数据空间中定义的重要类别中,以检测有趣的子空间事务。 随后,可以选择显示两个切片之间强差异的子空间,然后在子报告中对子空间进行分组,以测量每个子报告的覆盖范围。 然后可以生成包含先前行为中检测到的子报告列表的最终报告。

    EFFICIENT LARGE-SCALE FILTERING AND/OR SORTING FOR QUERYING OF COLUMN BASED DATA ENCODED STRUCTURES
    36.
    发明申请
    EFFICIENT LARGE-SCALE FILTERING AND/OR SORTING FOR QUERYING OF COLUMN BASED DATA ENCODED STRUCTURES 有权
    有效的大规模过滤和/或分类用于查询基于数据的数据编码结构

    公开(公告)号:US20100088315A1

    公开(公告)日:2010-04-08

    申请号:US12363637

    申请日:2009-01-30

    IPC分类号: G06F17/30

    摘要: The subject disclosure relates to querying of column based data encoded structures enabling efficient query processing over large scale data storage, and more specifically with respect to complex queries implicating filter and/or sort operations for data over a defined window. In this regard, in various embodiments, a method is provided that avoids scenarios involving expensive sorting of a high percentage of, or all, rows, either by not sorting any rows at all, or by sorting only a very small number of rows consistent with or smaller than a number of rows associated with the size of the requested window over the data. In one embodiment, this is achieved by splitting an external query request into two different internal sub-requests, a first one that computes statistics about distribution of rows for any specified WHERE clauses and ORDER BY columns, and a second one that selects only the rows that match the window based on the statistics.

    摘要翻译: 主题公开涉及查询基于列的数据编码结构,其能够在大规模数据存储上进行有效的查询处理,更具体地涉及涉及在定义的窗口上涉及数据的过滤器和/或排序操作的复杂查询。 在这方面,在各种实施例中,提供了一种方法,其避免了通过不对任何行进行排序的方式来避免高百分比或全部行的昂贵排序的情况,或者仅通过仅排列非常小数量的与 或小于与数据上所请求的窗口大小相关联的行数。 在一个实施例中,这是通过将外部查询请求分割成两个不同的内部子请求来实现的,第一个是根据任何指定的WHERE子句和ORDER BY列计算关于行的分布的统计信息,第二个仅选择行 根据统计信息匹配窗口。

    EFFICIENT LARGE-SCALE JOINING FOR QUERYING OF COLUMN BASED DATA ENCODED STRUCTURES
    37.
    发明申请
    EFFICIENT LARGE-SCALE JOINING FOR QUERYING OF COLUMN BASED DATA ENCODED STRUCTURES 审中-公开
    用于查询基于数据的数据编码结构的有效的大规模加工

    公开(公告)号:US20100088309A1

    公开(公告)日:2010-04-08

    申请号:US12335341

    申请日:2008-12-15

    IPC分类号: G06F17/30

    摘要: The subject disclosure relates to querying of column based data encoded structures enabling efficient query processing over large scale data storage, and more specifically, with respect to join operations. Initially, a compact structure is received that represents the data according to a column based organization, and various compression and data packing techniques, already enabling a highly efficient and fast query response in real-time. On top of already fast querying enabled by the compact column oriented structure, a scalable, fast algorithm is provided for query processing in memory, which constructs an auxiliary data structure, also column-oriented, for use in join operations, which further leverages characteristics of in-memory data processing and access, as well as the column-oriented characteristics of the compact data structure.

    摘要翻译: 主题公开涉及对基于列的数据编码结构的查询,其能够在大规模数据存储上进行有效的查询处理,更具体地,涉及连接操作。 最初,接收到一个紧凑的结构,它表示根据基于列的组织的数据,以及各种压缩和数据打包技术,已经实现了高效和快速的查询响应。 在紧凑型列导向结构启用的已经快速查询之上,提供了一种可扩展的快速算法,用于存储器中的查询处理,构建了一个辅助数据结构,也是以列为主,用于连接操作,这进一步利用了 内存数据处理和访问,以及紧凑数据结构的面向列的特性。

    EFFICIENT LARGE-SCALE PROCESSING OF COLUMN BASED DATA ENCODED STRUCTURES
    38.
    发明申请
    EFFICIENT LARGE-SCALE PROCESSING OF COLUMN BASED DATA ENCODED STRUCTURES 有权
    基于列的数据编码结构的有效的大规模处理

    公开(公告)号:US20100030748A1

    公开(公告)日:2010-02-04

    申请号:US12270872

    申请日:2008-11-14

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30492

    摘要: The subject disclosure relates to efficient query processing over large scale data storage. An exemplary process includes retrieving a subset of columns implicated by a query as integer encoded and compressed sequences of values corresponding to different columns of data, defining query processing buckets that span over the subset of columns based on changes of compression type occurring in the integer encoded and compressed sequences of values of the subset of data and processing the query in memory on a bucket by bucket basis and processing the query based on type of current bucket when processing the integer encoded and compressed sequences of values. The column based organization of the data, and the application of a hybrid run length encoding and bit packing technique, enable a highly efficient and speedy query response in real-time.

    摘要翻译: 本公开涉及对大规模数据存储的有效查询处理。 示例性过程包括:将查询所涉及的列的子集作为对应于不同数据列的整数编码和压缩的值序列,基于经整数编码的压缩类型的变化定义跨越列的子集的查询处理桶 以及数据子集的值的压缩序列,并且逐桶地处理存储器中的查询,并且当处理整数编码和压缩的值序列时,基于当前存储桶的类型来处理查询。 数据的基于列的组织以及混合运行长度编码和位打包技术的应用实现了高效和快速的查询响应。

    Method, system, and apparatus for exposing workbooks as data sources
    39.
    发明申请
    Method, system, and apparatus for exposing workbooks as data sources 有权
    用于将工作簿暴露为数据源的方法,系统和装置

    公开(公告)号:US20060024653A1

    公开(公告)日:2006-02-02

    申请号:US10903568

    申请日:2004-07-30

    IPC分类号: G09B3/00

    摘要: A method, system, and apparatus are provided for exposing and utilizing workbooks as server data sources. The system includes a client computer capable of executing a spreadsheet application program for creating a workbook including one or more worksheets. The workbook may be published to a server computer where the contents of the workbook are exposed as a multi-dimensional data source. The server computer allows client applications to discover and connect to the workbook as a multi-dimensional, data source, such as a cube.

    摘要翻译: 提供了一种方法,系统和装置,用于将工作簿作为服务器数据源进行曝光和利用。 该系统包括能够执行用于创建包括一个或多个工作表的工作簿的电子表格应用程序的客户端计算机。 工作簿可以被发布到服务器计算机,其中工作簿的内容作为多维数据源被公开。 服务器计算机允许客户端应用程序作为多维数据源(如多维数据集)发现和连接到工作簿。

    Efficient large-scale processing of column based data encoded structures
    40.
    发明授权
    Efficient large-scale processing of column based data encoded structures 有权
    基于列的数据编码结构的高效大规模处理

    公开(公告)号:US08626725B2

    公开(公告)日:2014-01-07

    申请号:US12270872

    申请日:2008-11-14

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    CPC分类号: G06F17/30492

    摘要: The subject disclosure relates to efficient query processing over large scale data storage. An exemplary process includes retrieving a subset of columns implicated by a query as integer encoded and compressed sequences of values corresponding to different columns of data, defining query processing buckets that span over the subset of columns based on changes of compression type occurring in the integer encoded and compressed sequences of values of the subset of data and processing the query in memory on a bucket by bucket basis and processing the query based on type of current bucket when processing the integer encoded and compressed sequences of values. The column based organization of the data, and the application of a hybrid run length encoding and bit packing technique, enable a highly efficient and speedy query response in real-time.

    摘要翻译: 本公开涉及对大规模数据存储的有效查询处理。 示例性过程包括:将查询所涉及的列的子集作为对应于不同数据列的整数编码和压缩的值序列,基于经整数编码的压缩类型的变化来定义跨越​​列的子集的查询处理桶 以及数据子集的值的压缩序列,并且逐桶地处理存储器中的查询,并且当处理整数编码和压缩的值序列时,基于当前存储桶的类型来处理查询。 数据的基于列的组织以及混合运行长度编码和位打包技术的应用实现了高效和快速的查询响应。