STRING AND SUB-STRING SEARCHING USING INVERTED INDEXES
    1.
    Patent application
    STRING AND SUB-STRING SEARCHING USING INVERTED INDEXES (status: active)

    Publication No.: US20120158782A1

    Publication date: 2012-06-21

    Application No.: US12970780

    Filing date: 2010-12-16

    IPC classification: G06F17/30

    Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.
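
The idea in the abstract above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented method: the names `tokenize`, `build_indexes` and `search` are hypothetical, a "term" is assumed to be a run of word characters and a "separator" a run of non-word characters, and only queries made of whole terms and whole separator runs are handled. Any sub-string handling described in the patents is not reproduced here.

```python
import re
from collections import defaultdict

# Assumption: a token is either a run of word characters (a term) or a
# run of non-word characters (a term separator).
TOKEN = re.compile(r"(\w+)|(\W+)")


def tokenize(text):
    """Yield (is_term, token) pairs; terms are lower-cased."""
    for match in TOKEN.finditer(text):
        if match.group(1) is not None:
            yield True, match.group(1).lower()
        else:
            yield False, match.group(2)


def build_indexes(docs):
    """Build one inverted index for terms and a separate one for separators."""
    term_index = defaultdict(set)
    separator_index = defaultdict(set)
    for doc_id, text in docs.items():
        for is_term, token in tokenize(text):
            (term_index if is_term else separator_index)[token].add(doc_id)
    return term_index, separator_index


def search(query, docs, term_index, separator_index):
    """Parse the query into terms and separators, intersect the posting
    sets from the matching index, then verify candidates against the text."""
    candidates = set(docs)
    for is_term, token in tokenize(query):
        index = term_index if is_term else separator_index
        candidates &= index.get(token, set())
    # The indexes carry no positions, so confirm the literal match.
    return sorted(d for d in candidates if query.lower() in docs[d].lower())


docs = {1: "foo-bar baz", 2: "foo bar", 3: "baz-foo"}
term_index, separator_index = build_indexes(docs)
print(search("foo-bar", docs, term_index, separator_index))  # [1]
print(search("foo bar", docs, term_index, separator_index))  # [2]
```

Keeping separators in their own index means a separator such as "-" is stored once with its posting set rather than being repeated inside every indexed phrase, which is one plausible reading of the redundancy claim in the abstract.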

    String and Sub-String Searching Using Inverted Indexes
    2.
    Patent application
    String and Sub-String Searching Using Inverted Indexes (status: active)

    Publication No.: US20130290345A1

    Publication date: 2013-10-31

    Application No.: US13926917

    Filing date: 2013-06-25

    IPC classification: G06F17/30

    Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.

    String and sub-string searching using inverted indexes
    3.
    Granted patent
    String and sub-string searching using inverted indexes (status: active)

    Publication No.: US08805808B2

    Publication date: 2014-08-12

    Application No.: US13926917

    Filing date: 2013-06-25

    IPC classification: G06F17/30

    Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.

    String and sub-string searching using inverted indexes
    4.
    Granted patent
    String and sub-string searching using inverted indexes (status: active)

    Publication No.: US08498972B2

    Publication date: 2013-07-30

    Application No.: US12970780

    Filing date: 2010-12-16

    IPC classification: G06F17/30

    Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.

    Block compression of tables with repeated values
    5.
    Granted patent
    Block compression of tables with repeated values (status: active)

    Publication No.: US09450605B2

    Publication date: 2016-09-20

    Application No.: US13674477

    Filing date: 2012-11-12

    IPC classification: G06F17/30 H03M7/30

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.
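
The block-dictionary scheme in the abstract above might look roughly like the following Python sketch. It is an illustration only, under assumptions not taken from the patent: the function names `block_compress` and `block_decompress` are hypothetical, a block identifier is taken to be the position of a value identifier inside its block dictionary, and a single column is sorted in isolation (in a real table the other columns would have to follow the same row order).

```python
def block_compress(value_ids, block_size):
    """Split value identifiers into fixed-size blocks and replace each one
    with a small block identifier that points into a per-block dictionary.
    Identical block dictionaries are stored once and shared between blocks."""
    dictionaries = []   # each entry: tuple of value ids; position = block id
    dict_lookup = {}    # block dictionary -> its index in `dictionaries`
    blocks = []         # (dictionary index, list of block identifiers)
    for start in range(0, len(value_ids), block_size):
        block = value_ids[start:start + block_size]
        block_dict = tuple(sorted(set(block)))  # one entry per unique value
        dict_no = dict_lookup.setdefault(block_dict, len(dictionaries))
        if dict_no == len(dictionaries):        # first time this dictionary occurs
            dictionaries.append(block_dict)
        local_id = {value_id: i for i, value_id in enumerate(block_dict)}
        blocks.append((dict_no, [local_id[v] for v in block]))
    return dictionaries, blocks


def block_decompress(dictionaries, blocks):
    """Map every block identifier back to its value identifier."""
    return [dictionaries[dict_no][b] for dict_no, ids in blocks for b in ids]


# Sorting makes repeated value identifiers contiguous, so each block
# contains few distinct values and needs only a small dictionary.
column = sorted([7, 3, 7, 7, 3, 9, 9, 9])
dictionaries, blocks = block_compress(column, block_size=4)
print(dictionaries)  # [(3, 7), (7, 9)]
print(blocks)        # [(0, [0, 0, 1, 1]), (1, [0, 1, 1, 1])]
```

With two distinct values per block, each block identifier fits in a single bit, whereas the original value identifiers need enough bits to address the whole column dictionary.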

    ADAPTIVE DICTIONARY COMPRESSION/DECOMPRESSION FOR COLUMN-STORE DATABASES
    6.
    Patent application
    ADAPTIVE DICTIONARY COMPRESSION/DECOMPRESSION FOR COLUMN-STORE DATABASES (status: under examination, published)

    Publication No.: US20150178305A1

    Publication date: 2015-06-25

    Application No.: US14139669

    Filing date: 2013-12-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30129 G06F17/30315

    Abstract: Innovations for adaptive compression and decompression for dictionaries of a column-store database can reduce the amount of memory used for columns of the database, allowing a system to keep column data in memory for more columns, while delays for access operations remain acceptable. For example, dictionary compression variants use different compression techniques and implementation options. Some dictionary compression variants provide more aggressive compression (reduced memory consumption) but result in slower run-time performance. Other dictionary compression variants provide less aggressive compression (higher memory consumption) but support faster run-time performance. As another example, a compression manager can automatically select a dictionary compression variant for a given column in a column-store database. For different dictionary compression variants, the compression manager predicts run-time performance and compressed dictionary size, given the values of the column, and selects one of the dictionary compression variants.
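
The selection step described in the abstract above (predict size and run-time cost per variant, then pick one) can be illustrated with a small Python sketch. Everything concrete here is an assumption made for illustration: the `Variant` class, the two variants ("plain" and "front-coded"), their size estimators, the relative `access_cost` figures, and the rule "smallest predicted size within an access-cost budget" are all hypothetical and are not taken from the patent application.

```python
import os
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Variant:
    name: str
    estimate_size: Callable[[Sequence[str]], int]  # predicted size in bytes
    access_cost: float  # hypothetical relative lookup cost (1.0 = uncompressed)


def plain_size(values):
    """Predicted size when every dictionary string is stored in full."""
    return sum(len(v.encode("utf-8")) for v in values)


def front_coded_size(values):
    """Predicted size when each sorted string stores one byte for the length
    of the prefix shared with its predecessor plus the remaining suffix.
    Assumes shared prefixes are shorter than 256 characters."""
    total, previous = 0, ""
    for value in sorted(values):
        shared = len(os.path.commonprefix([previous, value]))
        total += 1 + len(value[shared:].encode("utf-8"))
        previous = value
    return total


def select_variant(values, variants, max_access_cost):
    """Pick the variant with the smallest predicted size among those whose
    predicted access cost stays within the given budget."""
    allowed = [v for v in variants if v.access_cost <= max_access_cost]
    if not allowed:
        raise ValueError("no variant satisfies the access-cost budget")
    return min(allowed, key=lambda v: v.estimate_size(values))


VARIANTS = [
    Variant("plain", plain_size, access_cost=1.0),
    Variant("front-coded", front_coded_size, access_cost=2.5),
]

column_values = ["customer_001", "customer_002", "customer_003"]
print(select_variant(column_values, VARIANTS, max_access_cost=3.0).name)  # front-coded
print(select_variant(column_values, VARIANTS, max_access_cost=1.0).name)  # plain
```

The budget parameter makes the trade-off from the abstract explicit: a loose budget lets the more aggressive variant win on size, a tight one forces the faster, larger representation.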

    Generic node including stored script
    7.
    Granted patent
    Generic node including stored script (status: active)

    Publication No.: US09026525B2

    Publication date: 2015-05-05

    Application No.: US12979182

    Filing date: 2010-12-27

    IPC classification: G06F17/30

    CPC classification: G06F17/30463

    Abstract: Methods and apparatus, including computer program products, are provided for processing calculation plans. In one aspect, there is provided a computer-implemented method. The method may include generating a calculation plan including a plurality of nodes; determining whether at least one of the nodes includes a function node; and compiling the function node into executable code to enable execution of the plurality of nodes including the function node at the database. Related apparatus, systems, methods, and articles are also described.

    Many-core algorithms for in-memory column store databases
    8.
    Granted patent
    Many-core algorithms for in-memory column store databases (status: active)

    Publication No.: US08914353B2

    Publication date: 2014-12-16

    Application No.: US13332189

    Filing date: 2011-12-20

    IPC classification: G07F7/00 G06F17/30

    CPC classification: G06F17/30466 G06F17/30463

    Abstract: A pattern can be identified in at least part of a query whose definition is received in a query request. The identified pattern can be matched with a set of pre-defined patterns, each of which has associated therewith at least one pre-compiled query execution sub-component of a plurality of pre-compiled query execution sub-components retained in a library. A plan for executing the query can be generated, for example by incorporating the pre-compiled query execution sub-component associated with the matched pattern into the plan based on a pseudo code representation of the plan derived from the definition.

    FIXED STRING DICTIONARY
    9.
    Patent application
    FIXED STRING DICTIONARY (status: under examination, published)

    Publication No.: US20140222418A1

    Publication date: 2014-08-07

    Application No.: US13874327

    Filing date: 2013-04-30

    IPC classification: G06F17/27

    Abstract: The subject matter described herein relates to implementation of a dictionary in a column-based, in-memory database where values are not stored directly; rather, for each column, a dictionary is created with all distinct values. For each row, a reference to the corresponding value in the dictionary is stored. In one aspect, data is stored in a memory structure organized in a column store format defined by a plurality of columns and a plurality of rows. A dictionary for each column in the memory structure is generated. The dictionary has distinct values for each column. A reference to the dictionary is generated for each column in the memory structure. The dictionary and the reference to the dictionary are stored in the memory structure.
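
Dictionary encoding of a column, as outlined in the abstract above, can be shown with a short Python sketch. The function names `encode_column` and `decode_column` are hypothetical, and keeping the dictionary sorted is a choice made for this illustration rather than something the abstract states.

```python
def encode_column(values):
    """Replace each row value with a reference (an index) into a
    dictionary that holds every distinct value of the column once."""
    dictionary = sorted(set(values))  # assumption: dictionary kept sorted
    position = {value: i for i, value in enumerate(dictionary)}
    references = [position[value] for value in values]
    return dictionary, references


def decode_column(dictionary, references):
    """Rebuild the original row values from the references."""
    return [dictionary[ref] for ref in references]


column = ["red", "blue", "red", "green"]
dictionary, references = encode_column(column)
print(dictionary)  # ['blue', 'green', 'red']
print(references)  # [2, 0, 2, 1]
```

Each distinct string is stored once, and the per-row data shrinks to small integers, which is where the memory saving of this layout comes from.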

    Block compression of tables with repeated values
    10.
    Granted patent
    Block compression of tables with repeated values (status: active)

    Publication No.: US08326810B2

    Publication date: 2012-12-04

    Application No.: US12848862

    Filing date: 2010-08-02

    IPC classification: G06F17/30

    Abstract: Methods and apparatus, including computer program products, for block compression of tables with repeated values. In general, value identifiers representing a compressed column of data may be sorted to render repeated values contiguous, and block dictionaries may be generated. A block dictionary may be generated for each block of value identifiers. Each block dictionary may include a list of block identifiers, where each block identifier is associated with a value identifier and there is a block identifier for each unique value in a block. Blocks may have standard sizes and block dictionaries may be reused for multiple blocks.
