System and method for expressing and calculating a relationship between measures
    1.
    发明授权
    System and method for expressing and calculating a relationship between measures 有权
    用于表达和计算措施之间关系的系统和方法

    公开(公告)号:US07117218B2

    公开(公告)日:2006-10-03

    申请号:US10607087

    申请日:2003-06-26

    IPC分类号: G06F7/00

    摘要: A measure expression may include a relationship between measures defined by an arithmetic operation. A query may request a calculation of the measure expression over a selected range of attributes. The request may be processed by retrieving all rows comprising data within the selected range of attributes for each measure in the expression through a single access to an associated table.

    摘要翻译: 测量表达式可以包括由算术运算定义的度量之间的关系。 查询可以请求在所选范围的属性上计算度量表达式。 该请求可以通过对表达式中的每个度量的所选范围的属性中包含数据的单个访问来检索所有行。

    Relational reporting system and methodology
    4.
    发明申请
    Relational reporting system and methodology 审中-公开
    关系报告制度和方法论

    公开(公告)号:US20060010156A1

    公开(公告)日:2006-01-12

    申请号:US11069314

    申请日:2005-03-01

    IPC分类号: G06F17/00

    CPC分类号: G06F16/24556 G06F16/283

    摘要: A system that facilitates generating a report based upon data within a relational database comprises a mapping component that utilizes mapping functions to associate a multi-dimensional structure with the relational database. A report generator communicates with the multi-dimensional structure to obtain data relating to the relational database and generates a report that includes the obtained data. For example, the mapping component can utilize measure groups to effectuate the association between the multi-dimensional structure and the relational database.

    摘要翻译: 基于关系数据库内的数据来生成报告的系统包括利用映射函数将多维结构与关系数据库相关联的映射组件。 报告生成器与多维结构通信以获得与关系数据库相关的数据,并生成包括所获得的数据的报告。 例如,映射组件可以利用测量组来实现多维结构和关系数据库之间的关联。

    Direct write back systems and methodologies
    7.
    发明申请
    Direct write back systems and methodologies 有权
    直接回写系统和方法

    公开(公告)号:US20060010143A1

    公开(公告)日:2006-01-12

    申请号:US11137232

    申请日:2005-05-25

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30592

    摘要: Provided are systems and methods that facilitate direct write back in a multi-dimensional database. The system includes a delta cache component that receives a user request to change an original cell value and determines a delta value based at least in part upon the changed cell value. Also included is a write back partition component that selectively updates a data cell based at least in part upon the delta value without updating corresponding data cell values. The system and methods allow attributes to be added to any dimension of a cube without affecting the write back data. Adding, modifying or removing a hierarchy has no affect on write back data nor does deleting a dimension that is not referenced by a write back.

    摘要翻译: 提供了便于在多维数据库中直接回写的系统和方法。 该系统包括增量缓存组件,其接收用户请求以改变原始小区值,并且至少部分地基于所改变的小区值来确定增量值。 还包括写回分区组件,其至少部分地基于增量值来选择性地更新数据单元,而不更新对应的数据单元值。 系统和方法允许将属性添加到多维数据集的任何维度,而不会影响回写数据。 添加,修改或删除层次结构对回写数据没有影响,也不会删除未被回写引用的维。

    Efficient column based data encoding for large-scale data storage
    8.
    发明授权
    Efficient column based data encoding for large-scale data storage 有权
    高效的基于列的数据编码用于大规模数据存储

    公开(公告)号:US08452737B2

    公开(公告)日:2013-05-28

    申请号:US13347367

    申请日:2012-01-10

    IPC分类号: G06F17/30

    摘要: The subject disclosure relates to column based data encoding where raw data to be compressed is organized by columns, and then, as first and second layers of reduction of the data size, dictionary encoding and/or value encoding are applied to the data as organized by columns, to create integer sequences that correspond to the columns. Next, a hybrid greedy run length encoding and bit packing compression algorithm further compacts the data according to an analysis of bit savings. Synergy of the hybrid data reduction techniques in concert with the column-based organization, coupled with gains in scanning and querying efficiency owing to the representation of the compact data, results in substantially improved data compression at a fraction of the cost of conventional systems.

    摘要翻译: 本公开涉及基于列的数据编码,其中待压缩的原始数据由列组织,然后作为数据大小的第一和第二层缩减,字典编码和/或值编码被应用于由 列,以创建与列相对应的整数序列。 接下来,混合贪婪跑步长度编码和位打包压缩算法根据比特节省的分析进一步压缩数据。 混合数据简化技术与基于列的组织协调一致,加上由于表示紧凑数据而在扫描和查询效率方面的增益,导致数据压缩大大提高了传统系统成本的一小部分。

    Multidimensional data cubes with high-cardinality attributes
    9.
    发明授权
    Multidimensional data cubes with high-cardinality attributes 有权
    具有高基数属性的多维数据立方体

    公开(公告)号:US08380748B2

    公开(公告)日:2013-02-19

    申请号:US12042674

    申请日:2008-03-05

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30592

    摘要: Computer-readable media, systems, and methods for building a multidimensional data cube having one or more high-cardinality attributes are described. In embodiments, data is extracted from one or more databases. It is determined that one or more instances of the data are fact data and one or more instances of the data are dimension data. Each member of the fact data is one instance of a dimension and each instance of the dimension data includes an attribute for grouping the fact data. Moreover, in embodiments it is determined that one or more instances of the dimension data are high-cardinality attributes. The one or more high-cardinality attributes are processed with fact data and stored in fact tables on a computer storage medium.

    摘要翻译: 描述了用于构建具有一个或多个高基数属性的多维数据立方体的计算机可读介质,系统和方法。 在实施例中,从一个或多个数据库提取数据。 确定数据的一个或多个实例是事实数据,并且数据的一个或多个实例是尺寸数据。 事实数据的每个成员是维度的一个实例,维数据的每个实例包括用于对事实数据进行分组的属性。 此外,在实施例中,确定尺寸数据的一个或多个实例是高基数属性。 一个或多个高基数属性用事实数据处理并存储在计算机存储介质上的事实表中。

    PROCESSING RECORDS IN DYNAMIC RANGES
    10.
    发明申请
    PROCESSING RECORDS IN DYNAMIC RANGES 有权
    在动态范围内处理记录

    公开(公告)号:US20120271845A1

    公开(公告)日:2012-10-25

    申请号:US13092978

    申请日:2011-04-25

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30454 G06F17/30412

    摘要: A scalable analysis system is described herein that performs common data analysis operations such as distinct counts and data grouping in a more scalable and efficient manner. The system allows distinct counts and data grouping to be applied to large datasets with predictable growth in the cost of the operation. The system dynamically partitions data based on the actual data distribution, which provides both scalability and uncompromised performance. The system sets a budget of available memory or other resources to use for the operation. As the operation progresses, the system determines whether the budget of memory is nearing exhaustion. Upon detecting that the memory used is near the limit, the system dynamically partitions the data. If the system still detects memory pressure, then the system partitions again, until a partition level is identified that fits within the memory budget.

    摘要翻译: 本文描述了可扩展分析系统,其以更可扩展和有效的方式执行诸如不同计数和数据分组之类的共同数据分析操作。 该系统允许将不同的计数和数据分组应用于具有可预测的操作成本增长的大型数据集。 系统根据实际的数据分布动态分割数据,提供了可扩展性和无与伦比的性能。 系统设置可用内存或其他资源的预算用于操作。 随着操作的进行,系统确定存储器的预算是否接近耗尽。 在检测到所使用的内存接近限制时,系统会动态分区数据。 如果系统仍然检测到内存压力,则系统再次分区,直到识别出符合内存预算的分区级别。