Table redistribution in distributed databases

    公开(公告)号:US09830346B2

    公开(公告)日:2017-11-28

    申请号:US14090783

    申请日:2013-11-26

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30339

    摘要: Tables in a distributed database can require redistribution, for example to provide improved collocation of tables or table partitions that require joining at a node of multiple nodes across which the distributed database is distributed. Based at least in part on a set of table redistribution parameters, a table redistribution plan can be generated to include redistribution of a table from a first node to a second node. The set of table redistribution parameters can include a grouping parameter indicating at least one other table with which the table should be collocated. The table redistribution plan can be executed to cause the moving of the table from the first node to the second node.

    Interdistinct operator
    2.
    发明授权
    Interdistinct operator 有权
    区分运算符

    公开(公告)号:US09223829B2

    公开(公告)日:2015-12-29

    申请号:US12982767

    申请日:2010-12-30

    IPC分类号: G06F17/30

    摘要: A computer-implemented system and method for performing distinct operations on multiple tables of shared memory of parallel computing environments are disclosed. A distinct operation is executed on each table of a plurality of tables, each distinct operation eliminating duplicate data from each table, the executing creating a hierarchy of table pairs and distinct results, the distinct results comprising a reduced row set for each table. Duplicates on each reduced row set are detected to complete the distinct operation on the plurality of tables.

    摘要翻译: 公开了一种用于在并行计算环境的共享存储器的多个表上执行不同操作的计算机实现的系统和方法。 在多个表的每个表上执行不同的操作,每个不同的操作消除来自每个表的重复数据,执行创建表对和不同结果的层次,不同的结果包括每个表的减少的行集合。 检测每个缩小的行集上的复制以完成多个表上的不同操作。

    Database Table Re-Partitioning Using Trigger-Based Capture and Replay
    3.
    发明申请
    Database Table Re-Partitioning Using Trigger-Based Capture and Replay 审中-公开
    使用基于触发的捕获和重放的数据库表重新分区

    公开(公告)号:US20150242400A1

    公开(公告)日:2015-08-27

    申请号:US14188543

    申请日:2014-02-24

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30339

    摘要: Partitioning of a source table of a database to a target table is initiated. Thereafter, a replay table is generated that is populated with triggers for database operations performed on the source table for subsequent replay for the target partitions. Data is later moved (e.g., asynchronously moved, etc.) from the source table to the target table. The database operations are replayed on the target table T subsequent to the moving of the data using the replay table. In addition, the source table is dropped when all of the data has been moved to the target table and there are no operations requiring replay. Related apparatus, systems, techniques and articles are also described.

    摘要翻译: 启动将数据库的源表分区到目标表。 此后,生成重播表,其中填充有用于在源表上执行的数据库操作的触发器,用于目标分区的后续重放。 数据稍后从源表移动到目标表(例如异步移动等)。 在使用重放表移动数据之后,在目标表T上重播数据库操作。 此外,当所有数据已被移动到目标表并且没有需要重放的操作时,源表被删除。 还描述了相关设备,系统,技术和物品。

    Parallel uniqueness checks for partitioned tables
    4.
    发明授权
    Parallel uniqueness checks for partitioned tables 有权
    分区表的并行唯一性检查

    公开(公告)号:US08812564B2

    公开(公告)日:2014-08-19

    申请号:US13332215

    申请日:2011-12-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30584

    摘要: An insertion of a record into a table that includes a primary key column and a second column that includes a global uniqueness constraint across all of a plurality of data partitions across which the table is split is initiated without checking that a value of the record in the second column is globally unique by contacting other partitions the one partition to which the record is to be added to. The insertion can be processed, at least in part by implementing a write lock on the one partition but without implementing a read lock on the other partitions. The write lock on the one partition can be released after the insertion is completed, after which the validity of the insertion can be verified, for example by examining the other parts and a delta partition corresponding to the table. The insertion can be undone if the insertion was not valid.

    摘要翻译: 将记录插入到包括主键列和第二列的表中,该主列和第二列包括横跨所述表被分割的多个数据分区的全部唯一性约束,而不检查该表中的记录的值 第二列是全局唯一的,通过联系其他分区,将要添加记录的一个分区。 可以至少部分地通过在一个分区上实现写入锁定而不在其他分区上实现读取锁来处理该插入。 插入完成后,可以释放一个分区上的写锁定,之后可以验证插入的有效性,例如通过检查其他部分和对应于该表的增量分区。 如果插入无效,插入可以撤消。

    Split processing paths for a database calculation engine

    公开(公告)号:US10146834B2

    公开(公告)日:2018-12-04

    申请号:US14518593

    申请日:2014-10-20

    IPC分类号: G06F17/30

    摘要: A dynamic split node defined within a calculation model can receive data being operated on by a calculation plan generated based on the calculation model. A partition specification can be applied to one or more reference columns in a table containing at least some of the received data. The applying can cause the table to be split such that a plurality of records in the table are partitioned according to the partition specification. A separate processing path can be set for each partition, and execution of the calculation plan can continue using the separate processing paths, each of which can be assigned to a processing node of a plurality of available processing nodes.

    Replication mechanisms for database environments
    8.
    发明授权
    Replication mechanisms for database environments 有权
    数据库环境的复制机制

    公开(公告)号:US09411866B2

    公开(公告)日:2016-08-09

    申请号:US13719737

    申请日:2012-12-19

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30575

    摘要: Data replication in a database includes identifying a source database system. The source database includes a main index file and a delta log file. To create a replica, one or more symbolic links to the source database system are generated. The symbolic links identify a path to a physical location of the source database. A replica of the source database is generated based on the symbolic links. The replica includes a copy of the main index file and delta log file. Information associated with the replica and the symbolic links is stored in a recovery log. Replica are provided transparently to most database engine components by re-using partitioning infrastructure. Components “see” replica as tables with a single partition; that partition is a local replica.

    摘要翻译: 数据库中的数据复制包括识别源数据库系统。 源数据库包括主索引文件和增量日志文件。 要创建副本,将生成到源数据库系统的一个或多个符号链接。 符号链接标识到源数据库的物理位置的路径。 基于符号链接生成源数据库的副本。 副本包括主索引文件和增量日志文件的副本。 与副本和符号链接相关联的信息存储在恢复日志中。 通过重新使用分区基础架构,对大多数数据库引擎组件透明地提供副本。 组件“将”副本视为具有单个分区的表; 该分区是本地副本。

    Database Table Re-Partitioning Using Two Active Partition Specifications
    9.
    发明申请
    Database Table Re-Partitioning Using Two Active Partition Specifications 审中-公开
    使用两个活动分区规范的数据库表重新分区

    公开(公告)号:US20150242451A1

    公开(公告)日:2015-08-27

    申请号:US14188541

    申请日:2014-02-24

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30584 G06F17/30578

    摘要: Partitioning of source partitions of a table of a database to target partitions is initiated. Thereafter, a transition partition specification is specified that identifies the source partitions and the target partitions. Data is then moved (e.g., asynchronously moved, etc.) from the source partitions to the target partitions. Concurrently with the moving of the data, operates are handled using the transition partition specification. Subsequently, the source partitions are dropped when all of the data has been moved to the target partitions and there are no open transactions accessing the source partitions. Related apparatus, systems, techniques and articles are also described.

    摘要翻译: 启动将数据库表的源分区分区到目标分区。 此后,指定了标识源分区和目标分区的转换分区规范。 然后将数据从源分区移动(例如异步移动等)到目标分区。 同时随着数据的移动,操作使用转换分区规范进行处理。 随后,当所有数据已被移动到目标分区并且没有访问源分区的打开事务时,源分区将被丢弃。 还描述了相关设备,系统,技术和物品。

    Table Creation for Partitioned Tables
    10.
    发明申请
    Table Creation for Partitioned Tables 有权
    分区表创建表

    公开(公告)号:US20140351291A1

    公开(公告)日:2014-11-27

    申请号:US14453470

    申请日:2014-08-06

    IPC分类号: G06F17/30

    摘要: A table creation request pertaining to a table in a database maintained on a multi-node data partitioning landscape that comprises a plurality of processing nodes can specify a number of partitions to be generated. At run time, a currently available number of processing nodes in the multi-node data partitioning landscape can be queried, and this currently available number of processing nodes can be compared with the specified number of partitions to be generated for the created table. The table can be generated with the specified number of partitions such that the generated partitions are located across the plurality of partitions according to a load balancing approach if the number of processing nodes equals the number of partitions to be generated or according to other information in the table request if the number of processing nodes does not equal the specified number of partitions.

    摘要翻译: 与维护在包含多个处理节点的多节点数据划分区域上的数据库中的表相关联的表创建请求可以指定要生成的分区的数量。 在运行时,可以查询多节点数据分区场景中当前可用数量的处理节点,并将此当前可用数量的处理节点与要为创建的表生成的指定数量的分区进行比较。 如果处理节点的数量等于要生成的分区的数量,或者根据其中的其他信息,则可以使用指定数量的分区生成表,使得生成的分区根据负载平衡方法位于多个分区之间 表请求如果处理节点的数量不等于指定的分区数。