BLOCK COMPRESSION OF TABLES WITH REPEATED VALUES
    21.
    发明申请
    BLOCK COMPRESSION OF TABLES WITH REPEATED VALUES 有权
    具有重复值的表的块压缩

    公开(公告)号:US20100299316A1

    公开(公告)日:2010-11-25

    申请号:US12848862

    申请日:2010-08-02

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.

    Abstract translation: 方法和装置,包括计算机程序产品,用于块重复值压缩表。 通常,表示压缩数据列的值标识符可以被排序以使重复值连续,并且可以生成块字典。 可以为每个值标识符块生成块字典。 每个块字典可以包括块标识符的列表,其中每个块标识符与值标识符相关联,并且在块中存在每个唯一值的块标识符。 块可以具有标准尺寸,并且块字典可以重用于多个块。

    Automatic reduction of table memory footprint using column cardinality information
    22.
    发明授权
    Automatic reduction of table memory footprint using column cardinality information 有权
    使用列基数信息自动减少表内存占用

    公开(公告)号:US07809771B2

    公开(公告)日:2010-10-05

    申请号:US11732310

    申请日:2007-04-02

    CPC classification number: G06F17/30321 G06F17/30312 Y10S707/99942

    Abstract: In a business system, one or more methods can be used to reduce an amount of redundancy in the storage of data. One implementation includes a method of reducing a memory footprint of a database table having multiple rows and one or more columns, in which each of the one or more columns has a cardinality, and the cardinality is a total number of different values in the rows of each column. The method includes comparing the cardinality with a total number of possible values in the rows of at least one column based on a width of the column. The method also includes reducing the width of the column if the cardinality is less than a threshold based on the total number of possible values in the rows of the column.

    Abstract translation: 在业务系统中,可以使用一种或多种方法来减少数据存储中的冗余量。 一个实施方案包括一种减少具有多个行和一个或多个列的数据库表的存储器占用的方法,其中一个或多个列中的每一个具有基数,并且基数是行中的不同值的总数 每列。 该方法包括基于列的宽度将基数与至少一列的行中的可能值的总数进行比较。 如果基数小于阈值,则该方法还包括基于列的行中的可能值的总数来减小列的宽度。

    Mapping data structures
    23.
    发明授权
    Mapping data structures 有权
    映射数据结构

    公开(公告)号:US07620642B2

    公开(公告)日:2009-11-17

    申请号:US11302855

    申请日:2005-12-13

    CPC classification number: G06F17/30592 Y10S707/99943

    Abstract: A first data storage schema in which a characteristic in a first dimension table is mapped by a first table and a second table can be received and a second data storage schema can be received. The first table maps the characteristic to a first object that include attributes to which time information is irrelevant to data processing activities and the second maps the characteristic to a second object that includes attributes to which time information is relevant to data processing activities. The second data storage schema includes a fact table including at least some facts drawn from the first data storage schema and a second dimension table that includes at least some characteristics drawn from at least one of the first object and the second object.

    Abstract translation: 第一数据存储模式,其中第一维度表中的特征被第一表和第二表映射,并且可以接收第二数据存储模式。 第一表将特征映射到第一对象,其包括时间信息与数据处理活动无关的属性,第二表将特征映射到包括时间信息与数据处理活动相关的属性的第二对象。 第二数据存储模式包括包括从第一数据存储模式中提取的至少一些事实的事实表以及包括从第一对象和第二对象中的至少一个绘制的至少一些特征的第二维度表。

    Fast aggregation of compressed data using full table scans
    24.
    发明授权
    Fast aggregation of compressed data using full table scans 有权
    使用全表扫描快速聚合压缩数据

    公开(公告)号:US07263520B2

    公开(公告)日:2007-08-28

    申请号:US10789426

    申请日:2004-02-27

    Abstract: Methods and apparatus, including computer systems and program products, relating to an information management system and aggregating data by performing table scans. In general, in one aspect, the technique includes receiving a query for a response to a search on a database, loading data from the database into memory, filtering the data based on the query to generate a list of results, buffering at least one key figure corresponding to a result, buffering at least one dimension value corresponding to each key figure, aggregating the dimension values to generate an aggregate key, aggregating key figures corresponding to the same aggregate key to generate one or more aggregate key figures, and displaying the response to the search on a display device. Loading the data may include compressing the data. Filtering the data may be performed blockwise.

    Abstract translation: 涉及信息管理系统的方法和装置,包括计算机系统和程序产品,并通过执行表扫描来聚合数据。 通常,在一个方面,该技术包括接收对数据库上的搜索的响应的查询,将数据从数据库加载到存储器中,基于查询过滤数据以生成结果列表,缓冲至少一个密钥 图形对应于结果,缓冲对应于每个关键字图形的至少一个维度值,聚集维度值以生成聚合密钥,聚合对应于相同聚合密钥的关键数字以生成一个或多个聚合密钥数字,以及显示响应 在显示设备上进行搜索。 加载数据可能包括压缩数据。 数据的过滤可以是块状的。

    Fast aggregation of compressed data using full table scans
    25.
    发明申请
    Fast aggregation of compressed data using full table scans 有权
    使用全表扫描快速聚合压缩数据

    公开(公告)号:US20050192941A1

    公开(公告)日:2005-09-01

    申请号:US10789426

    申请日:2004-02-27

    Abstract: Methods and apparatus, including computer systems and program products, relating to an information management system and aggregating data by performing table scans. In general, in one aspect, the technique includes receiving a query for a response to a search on a database, loading data from the database into memory, filtering the data based on the query to generate a list of results, buffering at least one key figure corresponding to a result, buffering at least one dimension value corresponding to each key figure, aggregating the dimension values to generate an aggregate key, aggregating key figures corresponding to the same aggregate key to generate one or more aggregate key figures, and displaying the response to the search on a display device. Loading the data may include compressing the data. Filtering the data may be performed blockwise.

    Abstract translation: 涉及信息管理系统的方法和装置,包括计算机系统和程序产品,并通过执行表扫描来聚合数据。 通常,在一个方面,该技术包括接收对数据库上的搜索的响应的查询,将数据从数据库加载到存储器中,基于查询过滤数据以生成结果列表,缓冲至少一个密钥 图形对应于结果,缓冲对应于每个关键字图形的至少一个维度值,聚集维度值以生成聚合密钥,聚合对应于相同聚合密钥的关键数字以生成一个或多个聚合密钥数字,以及显示响应 在显示设备上进行搜索。 加载数据可能包括压缩数据。 数据的过滤可以是块状的。

    DUAL DATA STORAGE USING AN IN-MEMORY ARRAY AND AN ON-DISK PAGE STRUCTURE
    26.
    发明申请
    DUAL DATA STORAGE USING AN IN-MEMORY ARRAY AND AN ON-DISK PAGE STRUCTURE 有权
    使用内存阵列和磁盘结构的双数据存储

    公开(公告)号:US20160147457A1

    公开(公告)日:2016-05-26

    申请号:US14553859

    申请日:2014-11-25

    Abstract: The subject matter disclosed herein provides methods for the dual storage of data using an in-memory array and an on-disk page structure. An in-memory array holding a column of data can be maintained. One or more pages can be maintained. Each of the one or more pages can have one or more rows for storing the column of data. Random access can be provided to a subset of the one or more rows by at least loading the subset of rows from the one or more pages to the in-memory array without loading all of the rows from the one or more pages. Related apparatus, systems, techniques, and articles are also described.

    Abstract translation: 本文公开的主题提供了使用存储器内阵列和磁盘页面结构来双重存储数据的方法。 可以维护保存数据列的内存中阵列。 可以维护一个或多个页面。 一个或多个页面中的每一个可以具有用于存储数据列的一行或多行。 可以通过至少将来自一个或多个页面的行的子集加载到存储器内阵列而不从一个或多个页面加载所有行,来将随机访问提供给一行或多行的子集。 还描述了相关装置,系统,技术和制品。

    COMPRESSION OF TABLES BASED ON OCCURRENCE OF VALUES
    27.
    发明申请
    COMPRESSION OF TABLES BASED ON OCCURRENCE OF VALUES 有权
    基于价值观的表格压缩

    公开(公告)号:US20120124019A1

    公开(公告)日:2012-05-17

    申请号:US13356567

    申请日:2012-01-23

    CPC classification number: G06F17/30324 G06F17/30477 H03M7/30

    Abstract: Methods and apparatus, including computer program products, for compression of tables based on occurrence of values. In general, a number representing an amount of occurrences of a frequently occurring value in a group of adjacent rows of a column is generated, a vector representing whether the frequently occurring value exists in a row of the column is generated, and the number and the vector are stored to enable searches of the data represented by the number and the vector. The vector may omit a portion representing the group of adjacent rows. The values may be dictionary-based compression values representing business data such as business objects. The compression may be performed in-memory, in parallel, to improve memory utilization, network bandwidth consumption, and processing performance.

    Abstract translation: 方法和装置,包括计算机程序产品,用于根据值的发生压缩表。 通常,生成表示列的相邻行的频繁发生值的出现次数的数字,生成表示列的行中是否存在频繁出现的值的向量,并且生成数量和 存储矢量以使得能够搜索由数字和矢量表示的数据。 向量可以省略表示相邻行的组的部分。 这些值可以是表示业务数据(例如业务对象)的基于字典的压缩值。 可以并行地在内存中执行压缩,以提高存储器利用率,网络带宽消耗和处理性能。

    Automatic elimination of functional dependencies between columns
    28.
    发明授权
    Automatic elimination of functional dependencies between columns 有权
    自动消除列之间的功能依赖关系

    公开(公告)号:US08150888B2

    公开(公告)日:2012-04-03

    申请号:US12098350

    申请日:2008-04-04

    CPC classification number: G06F17/30595 Y10S707/99942

    Abstract: In business systems, one or more methods can be used to reduce an amount of redundant data. In one implementation, a method to reduce redundancy within a data model in a database, in which the data model is represented by at least one table, includes determining a number of distinct values of partial keys in a table. Each partial key represents at least one row in the table. The method includes reordering one or more columns of the table by cardinality of partial keys, in which the cardinality of a partial key represents a number of distinct values of the partial key. The method further includes determining whether pairs of partial keys are functionally dependent and eliminating one or more columns having functional dependencies from the table.

    Abstract translation: 在业务系统中,可以使用一种或多种方法来减少冗余数据量。 在一个实现中,减少数据库中的数据模型中的冗余的方法,其中数据模型由至少一个表表示,包括确定表中部分密钥的不同值的数量。 每个部分键表示表中的至少一行。 该方法包括通过部分键的基数对表的一个或多个列重新排序,其中部分键的基数表示部分键的不同值的数量。 该方法还包括确定部分密钥对是否在功能上相关并且消除具有来自该表的功能依赖性的一个或多个列。

    COMPRESSION OF TABLES BASED ON OCCURRENCE OF VALUES
    29.
    发明申请
    COMPRESSION OF TABLES BASED ON OCCURRENCE OF VALUES 有权
    基于价值观的表格压缩

    公开(公告)号:US20110313980A1

    公开(公告)日:2011-12-22

    申请号:US13219499

    申请日:2011-08-26

    CPC classification number: G06F17/30324 G06F17/30477 H03M7/30

    Abstract: Methods and apparatus, including computer program products, for compression of tables based on occurrence of values. In general, a number representing an amount of occurrences of a frequently occurring value in a group of adjacent rows of a column is generated, a vector representing whether the frequently occurring value exists in a row of the column is generated, and the number and the vector are stored to enable searches of the data represented by the number and the vector. The vector may omit a portion representing the group of adjacent rows. The values may be dictionary-based compression values representing business data such as business objects. The compression may be performed in-memory, in parallel, to improve memory utilization, network bandwidth consumption, and processing performance.

    Abstract translation: 方法和装置,包括计算机程序产品,用于根据值的发生压缩表。 通常,生成表示列的相邻行的频繁发生值的出现次数的数字,生成表示列的行中是否存在频繁出现的值的向量,并且生成数量和 存储矢量以使得能够搜索由数字和矢量表示的数据。 向量可以省略表示相邻行的组的部分。 这些值可以是表示业务数据(例如业务对象)的基于字典的压缩值。 可以并行地在内存中执行压缩,以提高存储器利用率,网络带宽消耗和处理性能。

Patent Agency Ranking