Compression of tables based on occurrence of values
    1.
    发明授权
    Compression of tables based on occurrence of values 有权
    基于值的发生压缩表

    公开(公告)号:US08768899B2

    公开(公告)日:2014-07-01

    申请号:US13356567

    申请日:2012-01-23

    CPC classification number: G06F17/30324 G06F17/30477 H03M7/30

    Abstract: Methods and apparatus, including computer program products, for compression of tables based on occurrence of values. In general, a number representing an amount of occurrences of a frequently occurring value in a group of adjacent rows of a column is generated, a vector representing whether the frequently occurring value exists in a row of the column is generated, and the number and the vector are stored to enable searches of the data represented by the number and the vector. The vector may omit a portion representing the group of adjacent rows. The values may be dictionary-based compression values representing business data such as business objects. The compression may be performed in-memory, in parallel, to improve memory utilization, network bandwidth consumption, and processing performance.

    Abstract translation: 方法和装置,包括计算机程序产品,用于根据值的发生压缩表。 通常,生成表示列的相邻行的频繁出现值的出现次数的数字,生成表示列的行中是否存在频繁出现的值的向量,并且生成数量和 存储矢量以使得能够搜索由数字和矢量表示的数据。 向量可以省略表示相邻行组的部分。 这些值可以是表示业务数据(例如业务对象)的基于字典的压缩值。 可以并行地在内存中执行压缩,以提高存储器利用率,网络带宽消耗和处理性能。

    Automatic Elimination Of Functional Dependencies Between Columns
    3.
    发明申请
    Automatic Elimination Of Functional Dependencies Between Columns 有权
    自动消除列之间的功能依赖关系

    公开(公告)号:US20080250041A1

    公开(公告)日:2008-10-09

    申请号:US12098350

    申请日:2008-04-04

    CPC classification number: G06F17/30595 Y10S707/99942

    Abstract: In business systems, one or more methods can be used to reduce an amount of redundant data. In one implementation, a method to reduce redundancy within a data model in a database, in which the data model is represented by at least one table, includes determining a number of distinct values of partial keys in a table. Each partial key represents at least one row in the table. The method includes reordering one or more columns of the table by cardinality of partial keys, in which the cardinality of a partial key represents a number of distinct values of the partial key. The method further includes determining whether pairs of partial keys are functionally dependent and eliminating one or more columns having functional dependencies from the table.

    Abstract translation: 在业务系统中,可以使用一种或多种方法来减少冗余数据量。 在一个实现中,减少数据库中的数据模型中的冗余的方法,其中数据模型由至少一个表表示,包括确定表中部分密钥的不同值的数量。 每个部分键表示表中的至少一行。 该方法包括通过部分键的基数对表的一个或多个列重新排序,其中部分键的基数表示部分键的不同值的数量。 该方法还包括确定部分密钥对是否在功能上相关并且消除具有来自该表的功能依赖性的一个或多个列。

    Fast search with very large result set
    4.
    发明授权
    Fast search with very large result set 有权
    快速搜索非常大的结果集

    公开(公告)号:US07337164B2

    公开(公告)日:2008-02-26

    申请号:US10816011

    申请日:2004-03-31

    Abstract: Methods and apparatus, including computer systems and program products, for executing a query on a subset of data, for example, to facilitate a fast search with a very large result set. In one general aspect, a method of executing a query includes receiving a query for execution on data in the data repository; generating an estimate of a number of results of the query; defining a subset of data in the data repository; determining whether to execute the query on the subset of the data; executing the query on the subset of the data to generate a partial set of results if the query is to be executed on the subset of the data, otherwise executing the query on the data repository to generate a complete set of results; and providing query results.

    Abstract translation: 用于执行关于数据子集的查询的方法和装置,包括计算机系统和程序产品,以便于用非常大的结果集进行快速搜索。 在一个一般方面,执行查询的方法包括:接收对数据存储库中的数据执行的查询; 产生查询结果的数量估计值; 定义数据存储库中的数据子集; 确定是否对数据子集执行查询; 如果要在数据的子集上执行查询,则对数据的子集执行查询以生成部分结果集合,否则在数据存储库上执行查询以生成完整的一组结果; 并提供查询结果。

    FAST AGGREGATION OF COMPRESSED DATA USING FULL TABLE SCANS
    5.
    发明申请
    FAST AGGREGATION OF COMPRESSED DATA USING FULL TABLE SCANS 有权
    使用全表扫描快速聚合压缩数据

    公开(公告)号:US20080016322A1

    公开(公告)日:2008-01-17

    申请号:US11780013

    申请日:2007-07-19

    Abstract: Methods and apparatus, including computer systems and program products, relating to an information management system and aggregating data by performing table scans. In general, in one aspect, the technique includes receiving a query for a response to a search on a database, loading data from the database into memory, filtering the data based on the query to generate a list of results, buffering at least one key figure corresponding to a result, buffering at least one dimension value corresponding to each key figure, aggregating the dimension values to generate an aggregate key, aggregating key figures corresponding to the sane aggregate key to generate one or more aggregate key figures, and displaying the response to the search on a display device. Loading the data may include compressing the data. Filtering the data may be performed blockwise.

    Abstract translation: 涉及信息管理系统的方法和装置,包括计算机系统和程序产品,并通过执行表扫描来聚合数据。 通常,在一个方面,该技术包括接收对数据库上的搜索的响应的查询,将数据从数据库加载到存储器中,基于查询过滤数据以生成结果列表,缓冲至少一个密钥 图形对应于结果,缓冲对应于每个关键字图形的至少一个维度值,聚合维度值以生成聚合密钥,聚合对应于合理聚合密钥的关键字数据,以生成一个或多个聚合密钥数字,以及显示响应 在显示设备上进行搜索。 加载数据可能包括压缩数据。 数据的过滤可以是块状的。

    Query plan execution by implementation of plan operations forming a join graph
    6.
    发明申请
    Query plan execution by implementation of plan operations forming a join graph 有权
    通过实现形成连接图的计划操作来查询计划执行

    公开(公告)号:US20070156701A1

    公开(公告)日:2007-07-05

    申请号:US11322626

    申请日:2005-12-30

    Abstract: A system and method for executing a query plan are disclosed. In the system and method, a join graph is generated to represent the query plan. The join graph includes a set of plan operations that are to be executed for implementing the join graph. The query plan is received by a distributed network of a logical index server and one or more selected physical index servers. Each physical index server receives a portion of the plan operations, and determines what plan data is needed to execute the portion of the plan operations. A system and method includes a process for determining what plan data is needed from other physical index servers, or what plan data is needed by other physical index servers.

    Abstract translation: 公开了一种用于执行查询计划的系统和方法。 在系统和方法中,生成连接图以表示查询计划。 连接图包括要执行连接图的一组计划操作。 查询计划由逻辑索引服务器和一个或多个选定的物理索引服务器的分布式网络接收。 每个物理索引服务器接收计划操作的一部分,并确定执行计划操作的一部分所需的计划数据。 系统和方法包括用于从其他物理索引服务器确定需要什么计划数据的过程,或其他物理索引服务器需要哪些计划数据。

    Automatic elimination of functional dependencies between columns
    7.
    发明申请
    Automatic elimination of functional dependencies between columns 有权
    自动消除列之间的功能依赖关系

    公开(公告)号:US20050192961A1

    公开(公告)日:2005-09-01

    申请号:US10789371

    申请日:2004-02-26

    CPC classification number: G06F17/30595 Y10S707/99942

    Abstract: In business systems, one or more methods can be used to reduce an amount of redundant data. In one implementation, a method to reduce redundancy within a data model in a database, in which the data model is represented by at least one table, includes determining a number of distinct values of partial keys in a table. Each partial key represents at least one row in the table. The method includes reordering one or more columns of the table by cardinality of partial keys, in which the cardinality of a partial key represents a number of distinct values of the partial key. The method further includes determining whether pairs of partial keys are functionally dependent and eliminating one or more columns having functional dependencies from the table.

    Abstract translation: 在业务系统中,可以使用一种或多种方法来减少冗余数据量。 在一个实现中,减少数据库中的数据模型中的冗余的方法,其中数据模型由至少一个表表示,包括确定表中部分密钥的不同值的数量。 每个部分键表示表中的至少一行。 该方法包括通过部分键的基数对表的一个或多个列重新排序,其中部分键的基数表示部分键的不同值的数量。 该方法还包括确定部分密钥对是否在功能上相关并且消除具有来自该表的功能依赖性的一个或多个列。

    Block compression of tables with repeated values
    9.
    发明授权
    Block compression of tables with repeated values 有权
    用重复的值块压缩表

    公开(公告)号:US09450605B2

    公开(公告)日:2016-09-20

    申请号:US13674477

    申请日:2012-11-12

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.

    Abstract translation: 方法和装置,包括计算机程序产品,用于块重复值压缩表。 通常,表示压缩数据列的值标识符可以被排序以使重复值连续,并且可以生成块字典。 可以为每个值标识符块生成块字典。 每个块字典可以包括块标识符的列表,其中每个块标识符与值标识符相关联,并且在块中存在每个唯一值的块标识符。 块可以具有标准尺寸,并且块字典可以重用于多个块。

    Block compression of tables with repeated values
    10.
    发明授权
    Block compression of tables with repeated values 有权
    用重复的值块压缩表

    公开(公告)号:US08326810B2

    公开(公告)日:2012-12-04

    申请号:US12848862

    申请日:2010-08-02

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.

    Abstract translation: 方法和装置,包括计算机程序产品,用于块重复值压缩表。 通常,表示压缩数据列的值标识符可以被排序以使重复的值相邻,并且可以生成块字典。 可以为每个值标识符块生成块字典。 每个块字典可以包括块标识符的列表,其中每个块标识符与值标识符相关联,并且在块中存在每个唯一值的块标识符。 块可以具有标准尺寸,并且块字典可以重用于多个块。

Patent Agency Ranking