Hash join using collaborative parallel filtering in intelligent storage with offloaded bloom filters
    1.
    发明授权
    Hash join using collaborative parallel filtering in intelligent storage with offloaded bloom filters 有权
    使用协同并行过滤在智能存储中使用卸载的布隆过滤器进行散列加入

    公开(公告)号:US08825678B2

    公开(公告)日:2014-09-02

    申请号:US12562984

    申请日:2009-09-18

    IPC分类号: G06F7/00

    摘要: Processing resources at a storage system for a database server are utilized to perform aspects of a join operation that would conventionally be performed by the database server. When requesting a range of data units from a storage system, the database server includes join metadata describing aspects of the join operation for which the data is being requested. The join metadata may be, for instance, a bloom filter. The storage system reads the requested data from disk as normal. However, prior to sending the requested data back to the storage system, the storage system analyzes the raw data based on the join metadata, removing a certain amount of data that is guaranteed to be irrelevant to the join operation. The storage system then returns filtered data to the database server. The database system thereby avoids the unnecessary transfer of certain data between the storage system and the database server.

    摘要翻译: 利用用于数据库服务器的存储系统处理资源来执行通常由数据库服务器执行的连接操作的方面。 当从存储系统请求一系列数据单元时,数据库服务器包括描述正在请求数据的连接操作的方面的连接元数据。 连接元数据可以是例如布隆过滤器。 存储系统正常从磁盘读取所请求的数据。 然而,在将所请求的数据发送回存储系统之前,存储系统基于连接元数据分析原始数据,去除与加入操作无关的一定量的数据。 存储系统然后将过滤的数据返回到数据库服务器。 因此,数据库系统避免了存储系统和数据库服务器之间的某些数据的不必要的传送。

    Cross platform transportable tablespaces
    2.
    发明授权
    Cross platform transportable tablespaces 有权
    跨平台可运输的表空间

    公开(公告)号:US08554806B2

    公开(公告)日:2013-10-08

    申请号:US10966679

    申请日:2004-10-14

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30569

    摘要: Stored data are transported between different systems having different platforms by converting the format of the stored data into a common format either before or after transmitting the stored data from the source database system to the target database system. The common format may be a format that is universal in that the common format is not machine dependent. The stored data may be kept in the common format at the target database system and never converted to another format. Optionally, the stored data may be converted from the common format to a format associated with the target database system, thereby facilitating backward compatibility. A tool may be provided for performing the conversion.

    摘要翻译: 通过将存储的数据从源数据库系统发送到目标数据库系统之前或之后将所存储的数据的格式转换成通用格式,将存储的数据在具有不同平台的不同系统之间传送。 通用格式可以是通用的格式,因为通用格式不依赖于机器。 存储的数据可以在目标数据库系统中保持为通用格式,并且不会转换为另一种格式。 可选地,存储的数据可以从公共格式转换成与目标数据库系统相关联的格式,从而有助于向后兼容。 可以提供用于执行转换的工具。

    DATABASE OPERATION-AWARE STRIPING TECHNIQUE
    3.
    发明申请
    DATABASE OPERATION-AWARE STRIPING TECHNIQUE 有权
    数据库操作技巧

    公开(公告)号:US20110047330A1

    公开(公告)日:2011-02-24

    申请号:US12859184

    申请日:2010-08-18

    IPC分类号: G06F12/00

    摘要: A method, device, and computer readable medium for striping rows of data across logical units of storage with an affinity for columns is provided. Alternately, a method, device, and computer readable medium for striping columns of data across logical units of storage with an affinity for rows is provided. When data of a logical slice is requested, a mapping may provide information for determining which logical unit is likely to store the logical slice. In one embodiment, data is retrieved from logical units that are predicted to store the logical slice. In another embodiment, data is retrieved from several logical units, and the data not mapped to the logical unit is removed from the retrieved data.

    摘要翻译: 提供了一种方法,设备和计算机可读介质,用于跨逻辑存储单元条带化数据行,并且具有对列的亲和性。 或者,提供了一种方法,设备和计算机可读介质,用于以对行进行亲和度的跨存储的逻辑单元条带化数据列。 当请求逻辑片的数据时,映射可以提供用于确定哪个逻辑单元可能存储逻辑片的信息。 在一个实施例中,从预测存储逻辑片的逻辑单元检索数据。 在另一个实施例中,从多个逻辑单元检索数据,并且从所检索的数据中去除未映射到逻辑单元的数据。

    Encoding data to be sorted
    5.
    发明申请

    公开(公告)号:US20060112097A1

    公开(公告)日:2006-05-25

    申请号:US10996742

    申请日:2004-11-24

    IPC分类号: G06F17/30

    摘要: Key conditioning involves the construction of a byte orderable array from values for a possibly multi-field key concatenated key, for comparison by a sort routine. Byte-orderable sort keys are conditioned prior to execution of a sort routine, to facilitate an accurate and/or efficient sort procedure. For example, key conditioning may be applied to values in one or more columns of a database table, where the column(s) are used as keys for sorting rows of the table. Six factors are considered in encoding the byte array, such as whether nulls compare high or low; whether a field is fixed width or variable width; whether a field is guaranteed to be not null; whether a field is ordered ascending or descending; whether a field is the last field of a multi-field sort key or the only field of a sort key; and whether a field is likely to contain a significant number of zero values.

    HASH JOIN USING COLLABORATIVE PARALLEL FILTERING IN INTELLIGENT STORAGE WITH OFFLOADED BLOOM FILTERS
    6.
    发明申请
    HASH JOIN USING COLLABORATIVE PARALLEL FILTERING IN INTELLIGENT STORAGE WITH OFFLOADED BLOOM FILTERS 有权
    使用协作平行过滤在智能存储中与上传的BLOOM过滤器进行HASH加入

    公开(公告)号:US20100082648A1

    公开(公告)日:2010-04-01

    申请号:US12562984

    申请日:2009-09-18

    IPC分类号: G06F17/30

    摘要: Processing resources at a storage system for a database server are utilized to perform aspects of a join operation that would conventionally be performed by the database server. When requesting a range of data units from a storage system, the database server includes join metadata describing aspects of the join operation for which the data is being requested. The join metadata may be, for instance, a bloom filter. The storage system reads the requested data from disk as normal. However, prior to sending the requested data back to the storage system, the storage system analyzes the raw data based on the join metadata, removing a certain amount of data that is guaranteed to be irrelevant to the join operation. The storage system then returns filtered data to the database server. The database system thereby avoids the unnecessary transfer of certain data between the storage system and the database server.

    摘要翻译: 利用用于数据库服务器的存储系统处理资源来执行通常由数据库服务器执行的连接操作的方面。 当从存储系统请求一系列数据单元时,数据库服务器包括描述正在请求数据的连接操作的方面的连接元数据。 连接元数据可以是例如布隆过滤器。 存储系统正常从磁盘读取所请求的数据。 然而,在将所请求的数据发送回存储系统之前,存储系统基于连接元数据分析原始数据,去除与加入操作无关的一定量的数据。 存储系统然后将过滤的数据返回到数据库服务器。 因此,数据库系统避免了存储系统和数据库服务器之间的某些数据的不必要的传送。

    Transportable database
    7.
    发明申请
    Transportable database 有权
    可运输数据库

    公开(公告)号:US20050256908A1

    公开(公告)日:2005-11-17

    申请号:US10966961

    申请日:2004-10-15

    IPC分类号: G06F7/00 G06F17/30

    摘要: A method and apparatus for creating a target database on a target platform based on a source database on a source platform is provided. A target database is created on the target platform based on data, metadata (for example, data in the system tablespace and control files), and external objects (for example, password files and externally stored tables) comprised within the source database. Data and metadata stored in tablespaces of the source database are converted to the format compatible with the target platform. The conversion of the data and metadata may be performed either at the source database or the target database. Redo information and undo information of the source database are not copied from the source database to the target database. Certain files, e.g., a password files, and links to externally stored object may be recreated on the target database.

    摘要翻译: 提供了一种用于在源平台上基于源数据库在目标平台上创建目标数据库的方法和装置。 基于源数据库中包含的数据,元数据(例如,系统表空间和控制文件中的数据)以及外部对象(例如,密码文件和外部存储的表),在目标平台上创建目标数据库。 存储在源数据库的表空间中的数据和元数据将转换为与目标平台兼容的格式。 可以在源数据库或目标数据库执行数据和元数据的转换。 源数据库的重做信息和撤消信息不会从源数据库复制到目标数据库。 可以在目标数据库上重新创建某些文件,例如密码文件和到外部存储对象的链接。

    Fact table storage in a decision support system environment
    8.
    发明申请
    Fact table storage in a decision support system environment 有权
    事实表存储在决策支持系统环境中

    公开(公告)号:US20050004936A1

    公开(公告)日:2005-01-06

    申请号:US10719819

    申请日:2003-11-20

    IPC分类号: G06F17/00 G06F17/30

    摘要: A fact table is set up without any dimension columns. The data in the table is ordered according to an order corresponding to the ordering of the dimensions. The table is divided into segments corresponding to contiguous portions of data. The portions of the table containing non-null values for the measures are placed into the segments. The size of the segments and the location segments may be determined according to the density of discontinuities in the data. An indexed organized table is used to determine the beginning and end of each segment of the table, the locations of the discontinuities in the data, and to facilitate accessing the measures of the table.

    摘要翻译: 事实表设置没有任何维度列。 表中的数据按照与尺寸顺序相对应的顺序进行排序。 该表被分成对应于数据的连续部分的段。 包含度量值的非空值的表的部分被放入段中。 可以根据数据中的不连续密度来确定段和位置段的大小。 索引的有组织的表用于确定表的每个段的开始和结束,数据中的不连续性的位置,以及便于访问表的度量。

    Method for abstract syntax tree building for large-scale data analysis
    9.
    发明授权
    Method for abstract syntax tree building for large-scale data analysis 有权
    用于大规模数据分析的抽象语法树构建方法

    公开(公告)号:US09372846B1

    公开(公告)日:2016-06-21

    申请号:US14547793

    申请日:2014-11-19

    申请人: Dmitry Potapov

    发明人: Evgueni Perkov

    IPC分类号: G06F17/27

    CPC分类号: G06F17/271

    摘要: A method and computer program product for an abstract syntax tree (AST) builder for faster Earley parser. The algorithm for building an AST based on an amplified LRE (Left Recursive Earley parser) technique is provided. The proposed method constructs the AST during the recognition process. The proposed method preserves enough structural information during recognition phase that allows to avoid a mapping step and to eliminate the converter module. The LRE method is used in such a way that the AST processor can produce its output working directly with LRE data structures. This important feature leads to essential performance improvement of the entire parsing process.

    摘要翻译: 用于更快速的Earley解析器的抽象语法树(AST)构建器的方法和计算机程序产品。 提供了基于放大LRE(左递归Earley解析器)技术构建AST的算法。 所提出的方法在识别过程中构建AST。 所提出的方法在识别阶段保留足够的结构信息,允许避免映射步骤并消除转换器模块。 使用LRE方法使得AST处理器可以产生其与LRE数据结构直接工作的输出。 这个重要特征导致整个解析过程的基本性能改进。

    Fact table storage in a decision support system environment
    10.
    发明授权
    Fact table storage in a decision support system environment 有权
    事实表存储在决策支持系统环境中

    公开(公告)号:US07480662B2

    公开(公告)日:2009-01-20

    申请号:US10719819

    申请日:2003-11-20

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: A fact table is set up without any dimension columns. The data in the table is ordered according to an order corresponding to the ordering of the dimensions. The table is divided into segments corresponding to contiguous portions of data. The portions of the table containing non-null values for the measures are placed into the segments. The size of the segments and the location segments may be determined according to the density of discontinuities in the data. An indexed organized table is used to determine the beginning and end of each segment of the table, the locations of the discontinuities in the data, and to facilitate accessing the measures of the table.

    摘要翻译: 事实表设置没有任何维度列。 表中的数据按照与尺寸顺序对应的顺序进行排序。 该表被分成对应于数据的连续部分的段。 包含度量值的非空值的表的部分被放入段中。 可以根据数据中的不连续密度来确定段和位置段的大小。 索引的有组织的表用于确定表的每个段的开始和结束,数据中的不连续性的位置,以及便于访问表的度量。