Query plan execution by implementation of plan operations forming a join graph
    41.
    发明授权
    Query plan execution by implementation of plan operations forming a join graph 有权
    通过实现形成连接图的计划操作来查询计划执行

    公开(公告)号:US07461057B2

    公开(公告)日:2008-12-02

    申请号:US11322626

    申请日:2005-12-30

    Abstract: A system and method for executing a query plan are disclosed. In the system and method, a join graph is generated to represent the query plan. The join graph includes a set of plan operations that are to be executed for implementing the join graph. The query plan is received by a distributed network of a logical index server and one or more selected physical index servers. Each physical index server receives a portion of the plan operations, and determines what plan data is needed to execute the portion of the plan operations. A system and method includes a process for determining what plan data is needed from other physical index servers, or what plan data is needed by other physical index servers.

    Abstract translation: 公开了一种用于执行查询计划的系统和方法。 在系统和方法中,生成连接图以表示查询计划。 连接图包括要执行连接图的一组计划操作。 查询计划由逻辑索引服务器和一个或多个选定的物理索引服务器的分布式网络接收。 每个物理索引服务器接收计划操作的一部分,并确定执行计划操作的一部分所需的计划数据。 系统和方法包括用于从其他物理索引服务器确定需要什么计划数据的过程,或其他物理索引服务器需要哪些计划数据。

    Compression of tables based on occurrence of values
    42.
    发明申请
    Compression of tables based on occurrence of values 有权
    基于值的发生压缩表

    公开(公告)号:US20080294676A1

    公开(公告)日:2008-11-27

    申请号:US11805178

    申请日:2007-05-21

    CPC classification number: G06F17/30324 G06F17/30477 H03M7/30

    Abstract: Methods and apparatus, including computer program products, for compression of tables based on occurrence of values. In general, a number representing an amount of occurrences of a frequently occurring value in a group of adjacent rows of a column is generated, a vector representing whether the frequently occurring value exists in a row of the column is generated, and the number and the vector are stored to enable searches of the data represented by the number and the vector. The vector may omit a portion representing the group of adjacent rows. The values may be dictionary-based compression values representing business data such as business objects. The compression may be performed in-memory, in parallel, to improve memory utilization, network bandwidth consumption, and processing performance.

    Abstract translation: 方法和装置,包括计算机程序产品,用于根据值的发生压缩表。 通常,生成表示列的相邻行的频繁发生值的出现次数的数字,生成表示列的行中是否存在频繁出现的值的向量,并且生成数量和 存储矢量以使得能够搜索由数字和矢量表示的数据。 向量可以省略表示相邻行的组的部分。 这些值可以是表示业务数据(例如业务对象)的基于字典的压缩值。 可以并行地在内存中执行压缩,以提高存储器利用率,网络带宽消耗和处理性能。

    Merging partial query results into a single result
    43.
    发明授权
    Merging partial query results into a single result 有权
    将部分查询结果合并到单个结果中

    公开(公告)号:US07412494B2

    公开(公告)日:2008-08-12

    申请号:US10789812

    申请日:2004-02-27

    CPC classification number: G06F17/30498 G06F17/30545 Y10S707/99933

    Abstract: A method and system for executing an information retrieval query in a multiserver computing environment is disclosed. The method and system employ a technique in which the query is distributed among each of a plurality of partial index servers in the multiserver environment, and a subset of results is calculated for each of the plurality of partial index servers. Then, the subset of results are merged in one logical index server to generate a merged result.

    Abstract translation: 公开了一种用于在多服务器计算环境中执行信息检索查询的方法和系统。 该方法和系统采用这样一种技术,其中查询分布在多服务器环境中的多个部分索引服务器的每个之间,并且针对多个部分索引服务器中的每一个计算结果的子集。 然后,将结果子集合在一个逻辑索引服务器中以生成合并结果。

    Automatic elimination of functional dependencies between columns
    44.
    发明授权
    Automatic elimination of functional dependencies between columns 有权
    自动消除列之间的功能依赖关系

    公开(公告)号:US07373354B2

    公开(公告)日:2008-05-13

    申请号:US10789371

    申请日:2004-02-26

    CPC classification number: G06F17/30595 Y10S707/99942

    Abstract: In business systems, one or more methods can be used to reduce an amount of redundant data. In one implementation, a method to reduce redundancy within a data model in a database, in which the data model is represented by at least one table, includes determining a number of distinct values of partial keys in a table. Each partial key represents at least one row in the table. The method includes reordering one or more columns of the table by cardinality of partial keys, in which the cardinality of a partial key represents a number of distinct values of the partial key. The method further includes determining whether pairs of partial keys are functionally dependent and eliminating one or more columns having functional dependencies from the table.

    Abstract translation: 在业务系统中,可以使用一种或多种方法来减少冗余数据量。 在一个实现中,减少数据库中的数据模型中的冗余的方法,其中数据模型由至少一个表表示,包括确定表中部分密钥的不同值的数量。 每个部分键表示表中的至少一行。 该方法包括通过部分键的基数对表的一个或多个列重新排序,其中部分键的基数表示部分键的不同值的数量。 该方法还包括确定部分密钥对是否在功能上相关并且消除具有来自该表的功能依赖性的一个或多个列。

    Fast Search With Very Large Result Set
    45.
    发明申请
    Fast Search With Very Large Result Set 有权
    快速搜索与非常大的结果集

    公开(公告)号:US20080097989A1

    公开(公告)日:2008-04-24

    申请号:US11960598

    申请日:2007-12-19

    Abstract: Methods and apparatus, including computer systems and program products, for executing a query on a subset of data, for example, to facilitate a fast search with a very large result set. In one general aspect, a method of executing a query includes receiving a query for execution on data in the data repository; generating an estimate of a number of results of the query; defining a subset of data in the data repository; determining whether to execute the query on the subset of the data; executing the query on the subset of the data to generate a partial set of results if the query is to be executed on the subset of the data, otherwise executing the query on the data repository to generate a complete set of results; and providing query results.

    Abstract translation: 用于执行关于数据子集的查询的方法和装置,包括计算机系统和程序产品,以便于用非常大的结果集的快速搜索。 在一个一般方面,执行查询的方法包括:接收对数据存储库中的数据执行的查询; 产生查询结果的数量估计值; 定义数据存储库中的数据子集; 确定是否对数据子集执行查询; 如果要在数据的子集上执行查询,则对数据的子集执行查询以生成部分结果集合,否则在数据存储库上执行查询以生成完整的一组结果; 并提供查询结果。

    Automatic reduction of table memory footprint using column cardinality information
    46.
    发明授权
    Automatic reduction of table memory footprint using column cardinality information 有权
    使用列基数信息自动减少表内存占用

    公开(公告)号:US07269606B2

    公开(公告)日:2007-09-11

    申请号:US10789370

    申请日:2004-02-26

    CPC classification number: G06F17/30321 G06F17/30312 Y10S707/99942

    Abstract: In a business system, one or more methods can be used to reduce an amount of redundancy in the storage of data. One implementation includes a method of reducing a memory footprint of a database table having multiple rows and one or more columns, in which each of the one or more columns has a cardinality, and the cardinality is a total number of different values in the rows of each column. The method includes comparing the cardinality with a total number of possible values in the rows of at least one column based on a width of the column. The method also includes reducing the width of the column if the cardinality is less than a threshold based on the total number of possible values in the rows of the column.

    Abstract translation: 在业务系统中,可以使用一种或多种方法来减少数据存储中的冗余量。 一个实施方案包括一种减少具有多个行和一个或多个列的数据库表的存储器占用的方法,其中一个或多个列中的每一个具有基数,并且基数是行中的不同值的总数 每列。 该方法包括基于列的宽度将基数与至少一列的行中的可能值的总数进行比较。 如果基数小于阈值,则该方法还包括基于列的行中的可能值的总数来减小列的宽度。

    Merging partial query results into a single result
    47.
    发明申请
    Merging partial query results into a single result 有权
    将部分查询结果合并到单个结果中

    公开(公告)号:US20050193088A1

    公开(公告)日:2005-09-01

    申请号:US10789812

    申请日:2004-02-27

    CPC classification number: G06F17/30498 G06F17/30545 Y10S707/99933

    Abstract: A method and system for executing an information retrieval query in a multiserver computing environment is disclosed. The method and system employ a technique in which the query is distributed among each of a plurality of partial index servers in the multiserver environment, and a subset of results is calculated for each of the plurality of partial index servers. Then, the subset of results are merged in one logical index server to generate a merged result.

    Abstract translation: 公开了一种用于在多服务器计算环境中执行信息检索查询的方法和系统。 该方法和系统采用这样一种技术,其中查询分布在多服务器环境中的多个部分索引服务器的每个之间,并且针对多个部分索引服务器中的每一个计算结果的子集。 然后,将结果子集合在一个逻辑索引服务器中以生成合并结果。

    Accelerated query refinement by instant estimation of results
    48.
    发明申请
    Accelerated query refinement by instant estimation of results 审中-公开
    通过对结果的即时估算加速查询细化

    公开(公告)号:US20050192942A1

    公开(公告)日:2005-09-01

    申请号:US10789813

    申请日:2004-02-27

    CPC classification number: G06F16/2462

    Abstract: Methods and apparatus, including computer systems and program products, for processing queries for which a solution requires that an information management system perform logical operations on a data repository. In general, in one aspect, the techniques feature a method of executing queries on a data repository. That method includes receiving a query, adapted for execution on a data set in the data repository; defining a sample of the data set, where the sample is a subset of the data set; executing the query on the sample; generating an estimate of a result of the execution of the query on the sample; and providing the estimate to a user interface. The method may further include defining an Nth sample, such that the Nth sample is larger than an (N−1) th sample, and generating an Nth estimate of the result based on the execution of the query on the Nth sample.

    Abstract translation: 用于处理查询的方法和装置,包括计算机系统和程序产品,解决方案要求信息管理系统对数据存储库执行逻辑操作。 通常,在一个方面,该技术特征在于在数据存储库上执行查询的方法。 该方法包括接收适于在数据存储库中的数据集上执行的查询; 定义数据集的样本,其中样本是数据集的子集; 对样本执行查询; 生成对样本查询执行结果的估计; 并将估计提供给用户界面。 该方法还可以包括定义第N个样本,使得第N个样本大于第(N-1)个样本,并且基于对第N个样本的查询的执行来生成结果的第N个估计。

Patent Agency Ranking