Interdistinct Operator
    2.
    Patent application
    Interdistinct Operator (status: in force)

    Publication No.: US20120011133A1

    Publication date: 2012-01-12

    Application No.: US12982767

    Filing date: 2010-12-30

    IPC classification: G06F17/30 G06F7/00

    Abstract: A computer-implemented system and method for performing distinct operations on multiple tables of shared memory of parallel computing environments are disclosed. A distinct operation is executed on each table of a plurality of tables, each distinct operation eliminating duplicate data from each table, the executing creating a hierarchy of table pairs and distinct results, the distinct results comprising a reduced row set for each table. Duplicates on each reduced row set are detected to complete the distinct operation on the plurality of tables.
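
    The two-phase idea in this abstract (a distinct per table, then duplicate detection across the reduced row sets) can be illustrated with a minimal sketch. It assumes rows are hashable values and that the "hierarchy of table pairs" is a simple pairwise merge tree; the names local_distinct, merge_pair and interdistinct are hypothetical and do not come from the patent.

```python
from typing import Hashable, Iterable, List, Set

Row = Hashable


def local_distinct(table: Iterable[Row]) -> Set[Row]:
    """Eliminate duplicates inside one table (the per-table distinct)."""
    return set(table)


def merge_pair(left: Set[Row], right: Set[Row]) -> Set[Row]:
    """Detect duplicates across two reduced row sets, keeping one copy of each row."""
    return left | right


def interdistinct(tables: List[Iterable[Row]]) -> Set[Row]:
    """Distinct over several tables via a hierarchy of pairwise merges."""
    level = [local_distinct(t) for t in tables]  # phase 1: independent per table
    while len(level) > 1:                        # phase 2: merge pairs, level by level
        merged = [merge_pair(level[i], level[i + 1])
                  for i in range(0, len(level) - 1, 2)]
        if len(level) % 2:                       # an unpaired set moves up unchanged
            merged.append(level[-1])
        level = merged
    return level[0] if level else set()
```

    In a parallel setting, each call to local_distinct and each merge within one level could run on a separate worker, since they touch disjoint inputs; this sketch runs them sequentially.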

    Interdistinct operator
    3.
    Granted patent
    Interdistinct operator (status: in force)

    Publication No.: US09223829B2

    Publication date: 2015-12-29

    Application No.: US12982767

    Filing date: 2010-12-30

    IPC classification: G06F17/30

    Abstract: A computer-implemented system and method for performing distinct operations on multiple tables of shared memory of parallel computing environments are disclosed. A distinct operation is executed on each table of a plurality of tables, each distinct operation eliminating duplicate data from each table, the executing creating a hierarchy of table pairs and distinct results, the distinct results comprising a reduced row set for each table. Duplicates on each reduced row set are detected to complete the distinct operation on the plurality of tables.

    Database Table Re-Partitioning Using Trigger-Based Capture and Replay
    4.
    Patent application
    Database Table Re-Partitioning Using Trigger-Based Capture and Replay (status: under examination, published)

    Publication No.: US20150242400A1

    Publication date: 2015-08-27

    Application No.: US14188543

    Filing date: 2014-02-24

    IPC classification: G06F17/30

    CPC classification: G06F17/30339

    Abstract: Partitioning of a source table of a database to a target table is initiated. Thereafter, a replay table is generated that is populated with triggers for database operations performed on the source table for subsequent replay for the target partitions. Data is later moved (e.g., asynchronously moved, etc.) from the source table to the target table. The database operations are replayed on the target table T subsequent to the moving of the data using the replay table. In addition, the source table is dropped when all of the data has been moved to the target table and there are no operations requiring replay. Related apparatus, systems, techniques and articles are also described.
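
    The capture, move, replay and drop sequence described in this abstract can be sketched as a toy model. Everything here is an assumption made for illustration: dictionaries stand in for tables, a list stands in for the replay table, the write method plays the role of the trigger, and partitions are chosen by key modulo the partition count. The class name Repartitioner is hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Operation = Tuple[str, int, str]  # (kind, key, value); kind is "upsert" or "delete"


class Repartitioner:
    """Toy model: dicts stand in for tables, a list stands in for the replay table."""

    def __init__(self, source: Dict[int, str], num_partitions: int) -> None:
        self.source: Optional[Dict[int, str]] = source
        self.target: List[Dict[int, str]] = [{} for _ in range(num_partitions)]
        self.replay: List[Operation] = []

    def _partition(self, key: int) -> Dict[int, str]:
        # Assumed partitioning rule: key modulo the number of partitions.
        return self.target[key % len(self.target)]

    @staticmethod
    def _apply(table: Dict[int, str], op: Operation) -> None:
        kind, key, value = op
        if kind == "delete":
            table.pop(key, None)
        else:
            table[key] = value

    def write(self, kind: str, key: int, value: str = "") -> None:
        """Run an operation on the source; the 'trigger' also records it for replay."""
        op = (kind, key, value)
        self._apply(self.source, op)
        self.replay.append(op)

    def snapshot(self) -> Dict[int, str]:
        """Copy of the source rows as of now, to be moved later."""
        return dict(self.source)

    def move(self, snapshot: Dict[int, str]) -> None:
        """Copy the snapshot rows into their target partitions."""
        for key, value in snapshot.items():
            self._partition(key)[key] = value

    def replay_all(self) -> None:
        """Replay captured operations on the target, in capture order."""
        for op in self.replay:
            self._apply(self._partition(op[1]), op)
        self.replay.clear()

    def drop_source(self) -> bool:
        """Drop the source only when nothing is left to replay."""
        if self.replay:
            return False
        self.source = None
        return True
```

    A typical sequence would be: take a snapshot, keep accepting writes through write while the snapshot is moved, call replay_all, then call drop_source. Because upserts and deletes are idempotent in this model, replaying an operation that the snapshot already reflects does no harm.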

    Block compression of tables with repeated values
    5.
    Granted patent
    Block compression of tables with repeated values (status: in force)

    Publication No.: US09450605B2

    Publication date: 2016-09-20

    Application No.: US13674477

    Filing date: 2012-11-12

    IPC classification: G06F17/30 H03M7/30

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.
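
    A minimal sketch of the block-dictionary idea follows, under stated assumptions: the column is a plain list of integer value identifiers, blocks have one fixed size, and a block identifier is simply the position of a value identifier inside its block dictionary. The function names block_compress and block_decompress are hypothetical. Sorting reorders the column, so decompression returns the sorted order rather than the original row order.

```python
from typing import Dict, List, Tuple

BlockDict = Tuple[int, ...]      # position (block identifier) -> value identifier
Block = Tuple[int, List[int]]    # (index of its dictionary, block identifiers)


def block_compress(value_ids: List[int],
                   block_size: int) -> Tuple[List[BlockDict], List[Block]]:
    """Sort value IDs, cut them into blocks, and build one dictionary per block."""
    ordered = sorted(value_ids)              # makes repeated values contiguous
    dictionaries: List[BlockDict] = []
    seen: Dict[BlockDict, int] = {}          # lets identical dictionaries be reused
    blocks: List[Block] = []
    for start in range(0, len(ordered), block_size):
        chunk = ordered[start:start + block_size]
        block_dict = tuple(sorted(set(chunk)))   # one entry per unique value in block
        if block_dict not in seen:
            seen[block_dict] = len(dictionaries)
            dictionaries.append(block_dict)
        position = {value: i for i, value in enumerate(block_dict)}
        blocks.append((seen[block_dict], [position[v] for v in chunk]))
    return dictionaries, blocks


def block_decompress(dictionaries: List[BlockDict],
                     blocks: List[Block]) -> List[int]:
    """Expand block identifiers back into value identifiers (sorted order)."""
    return [dictionaries[d][b] for d, ids in blocks for b in ids]
```

    The saving comes from block identifiers needing fewer bits than value identifiers whenever a block holds few unique values; a block with a single unique value needs no per-row bits at all in a bit-packed layout, which this sketch does not model.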

    ADAPTIVE DICTIONARY COMPRESSION/DECOMPRESSION FOR COLUMN-STORE DATABASES
    6.
    Patent application
    ADAPTIVE DICTIONARY COMPRESSION/DECOMPRESSION FOR COLUMN-STORE DATABASES (status: under examination, published)

    Publication No.: US20150178305A1

    Publication date: 2015-06-25

    Application No.: US14139669

    Filing date: 2013-12-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30129 G06F17/30315

    Abstract: Innovations for adaptive compression and decompression for dictionaries of a column-store database can reduce the amount of memory used for columns of the database, allowing a system to keep column data in memory for more columns, while delays for access operations remain acceptable. For example, dictionary compression variants use different compression techniques and implementation options. Some dictionary compression variants provide more aggressive compression (reduced memory consumption) but result in slower run-time performance. Other dictionary compression variants provide less aggressive compression (higher memory consumption) but support faster run-time performance. As another example, a compression manager can automatically select a dictionary compression variant for a given column in a column-store database. For different dictionary compression variants, the compression manager predicts run-time performance and compressed dictionary size, given the values of the column, and selects one of the dictionary compression variants.
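
    The selection step performed by the compression manager can be sketched roughly as follows. The two variants (an uncompressed layout and front coding), the size formulas, the access_cost figures and the rule "smallest predicted size within a cost budget" are all assumptions chosen for illustration; the abstract does not specify them. Sizes are counted in characters as a stand-in for bytes.

```python
from dataclasses import dataclass
from os.path import commonprefix
from typing import Callable, List


def plain_size(values: List[str]) -> int:
    # Uncompressed: every string plus an assumed 4-unit offset entry.
    return sum(len(v) + 4 for v in values)


def front_coded_size(values: List[str]) -> int:
    # Front coding: store only the suffix that differs from the previous sorted
    # string, plus an assumed 2-unit header (prefix length and suffix length).
    total, previous = 0, ""
    for value in sorted(values):
        shared = len(commonprefix([previous, value]))
        total += 2 + len(value) - shared
        previous = value
    return total


@dataclass(frozen=True)
class Variant:
    name: str
    predict_size: Callable[[List[str]], int]
    access_cost: float  # hypothetical relative cost of one dictionary lookup


VARIANTS = [
    Variant("plain", plain_size, access_cost=1.0),
    Variant("front_coded", front_coded_size, access_cost=2.0),
]


def select_variant(values: List[str], max_cost: float) -> Variant:
    """Pick the smallest predicted dictionary among variants within the cost budget."""
    distinct = sorted(set(values))
    allowed = [v for v in VARIANTS if v.access_cost <= max_cost]
    if not allowed:  # nothing fits the budget: fall back to the fastest variant
        allowed = [min(VARIANTS, key=lambda v: v.access_cost)]
    return min(allowed, key=lambda v: v.predict_size(distinct))
```

    With a column of values sharing a long prefix, a generous budget selects front coding, while a tight budget keeps the plain layout. That is the memory-versus-speed trade-off the abstract describes.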

    Generic node including stored script
    7.
    Granted patent
    Generic node including stored script (status: in force)

    Publication No.: US09026525B2

    Publication date: 2015-05-05

    Application No.: US12979182

    Filing date: 2010-12-27

    IPC classification: G06F17/30

    CPC classification: G06F17/30463

    Abstract: Methods and apparatus, including computer program products, are provided for providing for processing calculation plans. In one aspect, there is provided a computer-implemented method. The method may include generating a calculation plan including a plurality of nodes; determining whether at least one of the nodes includes a function node; and compiling the function node into executable code to enable execution of the plurality of nodes including the function node at the database. Related apparatus, systems, methods, and articles are also described.
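
    As a loose illustration of a plan that mixes built-in nodes with a function node carrying a stored script, the sketch below compiles the script before running the plan. It assumes a linear plan (a list of nodes), a script written in Python that defines a function named run, and the hypothetical names Node, compile_function_node and execute_plan; none of these details are given in the abstract.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

Step = Callable[[list], list]


@dataclass
class Node:
    name: str
    operation: Optional[Step] = None  # built-in operator, if any
    script: Optional[str] = None      # stored script: marks a function node


def compile_function_node(node: Node) -> Step:
    """Compile the stored script into executable code and return its run function."""
    namespace: dict = {}
    exec(compile(node.script, f"<{node.name}>", "exec"), namespace)
    return namespace["run"]  # assumed convention: the script defines run(rows)


def execute_plan(plan: List[Node], rows: list) -> list:
    """Run every node in order, compiling function nodes on the way."""
    for node in plan:
        is_function_node = node.script is not None
        step = compile_function_node(node) if is_function_node else node.operation
        rows = step(rows)
    return rows
```

    Running exec on a stored script is only reasonable when the script comes from a trusted source; a real database would compile such scripts in a controlled environment.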

    Many-core algorithms for in-memory column store databases
    8.
    Granted patent
    Many-core algorithms for in-memory column store databases (status: in force)

    Publication No.: US08914353B2

    Publication date: 2014-12-16

    Application No.: US13332189

    Filing date: 2011-12-20

    IPC classification: G07F7/00 G06F17/30

    CPC classification: G06F17/30466 G06F17/30463

    Abstract: A pattern can be identified in at least part of a query whose definition is received in a query request. The identified pattern can be matched with a set of pre-defined patterns, each of which has associated therewith at least one pre-compiled query execution sub-component of a plurality of pre-compiled query execution sub-components retained in a library. A plan for executing the query can be generated, for example by incorporating the pre-compiled query execution sub-component associated with the matched pattern into the plan based on a pseudo code representation of the plan derived from the definition.
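
    The matching of a query against pre-defined patterns, each tied to a ready-made execution sub-component, might look like the sketch below. It assumes that queries arrive as SQL-like text, that patterns are regular expressions, and that ordinary Python functions stand in for pre-compiled sub-components; the pseudo code representation mentioned in the abstract is not modelled. LIBRARY, build_plan and execute are hypothetical names.

```python
import re
from typing import Dict, List, Optional

Rows = List[Dict[str, int]]


def count_rows(rows: Rows, column: Optional[str]) -> int:
    """Sub-component for COUNT(*) queries; the column argument is unused."""
    return len(rows)


def sum_column(rows: Rows, column: Optional[str]) -> int:
    """Sub-component for SUM(column) queries."""
    return sum(row[column] for row in rows)


# Hypothetical library: each pre-defined pattern maps to one sub-component.
LIBRARY = [
    (re.compile(r"SELECT\s+COUNT\(\*\)\s+FROM\s+(?P<table>\w+)", re.I), count_rows),
    (re.compile(r"SELECT\s+SUM\((?P<column>\w+)\)\s+FROM\s+(?P<table>\w+)", re.I),
     sum_column),
]


def build_plan(query: str):
    """Return (sub-component, table, column) for the first matching pattern, or None."""
    for pattern, component in LIBRARY:
        match = pattern.fullmatch(query.strip())
        if match:
            groups = match.groupdict()
            return component, groups["table"], groups.get("column")
    return None


def execute(query: str, tables: Dict[str, Rows]) -> int:
    """Build a plan from the library and run it against in-memory tables."""
    plan = build_plan(query)
    if plan is None:
        raise LookupError("no pre-compiled sub-component matches this query")
    component, table, column = plan
    return component(tables[table], column)
```

    A query that matches no pattern would, in a fuller design, fall back to a general-purpose planner; here it simply raises LookupError.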

    FIXED STRING DICTIONARY
    9.
    Patent application
    FIXED STRING DICTIONARY (status: under examination, published)

    Publication No.: US20140222418A1

    Publication date: 2014-08-07

    Application No.: US13874327

    Filing date: 2013-04-30

    IPC classification: G06F17/27

    Abstract: The subject matter described herein relates to implementation of a dictionary in a column-based, in-memory database where values are not stored directly, rather, for each column, a dictionary is created with all distinct values. For each row, a reference to the corresponding value in the dictionary is stored. In one aspect, data is stored in a memory structure organized in a column store format defined by a plurality of columns and a plurality of rows. A dictionary for each column in the memory structure is generated. The dictionary has distinct values for each column. A reference to the dictionary is generated for each column in the memory structure. The dictionary and the reference to the dictionary are stored in the memory structure.
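
    The per-column dictionary with per-row references can be shown in a few lines. This is a generic dictionary-encoding sketch, assuming a sorted dictionary and integer positions as references; it does not model the fixed-string storage layout that the title refers to. The function names encode_column and decode_column are hypothetical.

```python
from bisect import bisect_left
from typing import List, Tuple


def encode_column(values: List[str]) -> Tuple[List[str], List[int]]:
    """Build a sorted dictionary of distinct values and one reference per row."""
    dictionary = sorted(set(values))
    # Every value is present in the dictionary, so bisect_left finds its exact index.
    references = [bisect_left(dictionary, value) for value in values]
    return dictionary, references


def decode_column(dictionary: List[str], references: List[int]) -> List[str]:
    """Resolve each row's reference back to its value."""
    return [dictionary[ref] for ref in references]
```

    Keeping the dictionary sorted means that a range predicate on the values can be rewritten as a range predicate on the integer references.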

    Block compression of tables with repeated values
    10.
    Granted patent
    Block compression of tables with repeated values (status: in force)

    Publication No.: US08326810B2

    Publication date: 2012-12-04

    Application No.: US12848862

    Filing date: 2010-08-02

    IPC classification: G06F17/30

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.
