Table redistribution in distributed databases

    公开(公告)号:US09830346B2

    公开(公告)日:2017-11-28

    申请号:US14090783

    申请日:2013-11-26

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30339

    摘要: Tables in a distributed database can require redistribution, for example to provide improved collocation of tables or table partitions that require joining at a node of multiple nodes across which the distributed database is distributed. Based at least in part on a set of table redistribution parameters, a table redistribution plan can be generated to include redistribution of a table from a first node to a second node. The set of table redistribution parameters can include a grouping parameter indicating at least one other table with which the table should be collocated. The table redistribution plan can be executed to cause the moving of the table from the first node to the second node.

    TABLE REDISTRIBUTION IN DISTRIBUTED DATABASES
    2.
    发明申请
    TABLE REDISTRIBUTION IN DISTRIBUTED DATABASES 有权
    分布式数据库中的表重新分配

    公开(公告)号:US20150149513A1

    公开(公告)日:2015-05-28

    申请号:US14090783

    申请日:2013-11-26

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30339

    摘要: Tables in a distributed database can require redistribution, for example to provide improved collocation of tables or table partitions that require joining at a node of multiple nodes across which the distributed database is distributed. Based at least in part on a set of table redistribution parameters, a table redistribution plan can be generated to include redistribution of a table from a first node to a second node. The set of table redistribution parameters can include a grouping parameter indicating at least one other table with which the table should be collocated. The table redistribution plan can be executed to cause the moving of the table from the first node to the second node.

    摘要翻译: 分布式数据库中的表可能需要重新分配,例如提供需要在分布式数据库分布的多个节点的节点上加入的表或表分区的改进的并置。 至少部分地基于一组表重新分配参数,可以生成表重新分发计划以包括将表从第一节点重新分配到第二节点。 该表再分配参数的集合可以包括指示至少一个其他表的分组参数,该表应与其并置。 可以执行表再分配计划以使表从第一节点移动到第二节点。

    Split processing paths for a database calculation engine

    公开(公告)号:US10146834B2

    公开(公告)日:2018-12-04

    申请号:US14518593

    申请日:2014-10-20

    IPC分类号: G06F17/30

    摘要: A dynamic split node defined within a calculation model can receive data being operated on by a calculation plan generated based on the calculation model. A partition specification can be applied to one or more reference columns in a table containing at least some of the received data. The applying can cause the table to be split such that a plurality of records in the table are partitioned according to the partition specification. A separate processing path can be set for each partition, and execution of the calculation plan can continue using the separate processing paths, each of which can be assigned to a processing node of a plurality of available processing nodes.

    Interdistinct operator
    5.
    发明授权
    Interdistinct operator 有权
    区分运算符

    公开(公告)号:US09223829B2

    公开(公告)日:2015-12-29

    申请号:US12982767

    申请日:2010-12-30

    IPC分类号: G06F17/30

    摘要: A computer-implemented system and method for performing distinct operations on multiple tables of shared memory of parallel computing environments are disclosed. A distinct operation is executed on each table of a plurality of tables, each distinct operation eliminating duplicate data from each table, the executing creating a hierarchy of table pairs and distinct results, the distinct results comprising a reduced row set for each table. Duplicates on each reduced row set are detected to complete the distinct operation on the plurality of tables.

    摘要翻译: 公开了一种用于在并行计算环境的共享存储器的多个表上执行不同操作的计算机实现的系统和方法。 在多个表的每个表上执行不同的操作,每个不同的操作消除来自每个表的重复数据,执行创建表对和不同结果的层次,不同的结果包括每个表的减少的行集合。 检测每个缩小的行集上的复制以完成多个表上的不同操作。

    Database Table Re-Partitioning Using Trigger-Based Capture and Replay
    6.
    发明申请
    Database Table Re-Partitioning Using Trigger-Based Capture and Replay 审中-公开
    使用基于触发的捕获和重放的数据库表重新分区

    公开(公告)号:US20150242400A1

    公开(公告)日:2015-08-27

    申请号:US14188543

    申请日:2014-02-24

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30339

    摘要: Partitioning of a source table of a database to a target table is initiated. Thereafter, a replay table is generated that is populated with triggers for database operations performed on the source table for subsequent replay for the target partitions. Data is later moved (e.g., asynchronously moved, etc.) from the source table to the target table. The database operations are replayed on the target table T subsequent to the moving of the data using the replay table. In addition, the source table is dropped when all of the data has been moved to the target table and there are no operations requiring replay. Related apparatus, systems, techniques and articles are also described.

    摘要翻译: 启动将数据库的源表分区到目标表。 此后,生成重播表,其中填充有用于在源表上执行的数据库操作的触发器,用于目标分区的后续重放。 数据稍后从源表移动到目标表(例如异步移动等)。 在使用重放表移动数据之后,在目标表T上重播数据库操作。 此外,当所有数据已被移动到目标表并且没有需要重放的操作时,源表被删除。 还描述了相关设备,系统,技术和物品。

    Parallel uniqueness checks for partitioned tables
    7.
    发明授权
    Parallel uniqueness checks for partitioned tables 有权
    分区表的并行唯一性检查

    公开(公告)号:US08812564B2

    公开(公告)日:2014-08-19

    申请号:US13332215

    申请日:2011-12-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30584

    摘要: An insertion of a record into a table that includes a primary key column and a second column that includes a global uniqueness constraint across all of a plurality of data partitions across which the table is split is initiated without checking that a value of the record in the second column is globally unique by contacting other partitions the one partition to which the record is to be added to. The insertion can be processed, at least in part by implementing a write lock on the one partition but without implementing a read lock on the other partitions. The write lock on the one partition can be released after the insertion is completed, after which the validity of the insertion can be verified, for example by examining the other parts and a delta partition corresponding to the table. The insertion can be undone if the insertion was not valid.

    摘要翻译: 将记录插入到包括主键列和第二列的表中,该主列和第二列包括横跨所述表被分割的多个数据分区的全部唯一性约束,而不检查该表中的记录的值 第二列是全局唯一的,通过联系其他分区,将要添加记录的一个分区。 可以至少部分地通过在一个分区上实现写入锁定而不在其他分区上实现读取锁来处理该插入。 插入完成后,可以释放一个分区上的写锁定,之后可以验证插入的有效性,例如通过检查其他部分和对应于该表的增量分区。 如果插入无效,插入可以撤消。

    SPLIT PROCESSING PATHS FOR A DATABASE CALCULATION ENGINE
    8.
    发明申请
    SPLIT PROCESSING PATHS FOR A DATABASE CALCULATION ENGINE 审中-公开
    数据库计算机分割处理程序

    公开(公告)号:US20150039588A1

    公开(公告)日:2015-02-05

    申请号:US14518593

    申请日:2014-10-20

    IPC分类号: G06F17/30

    摘要: A dynamic split node defined within a calculation model can receive data being operated on by a calculation plan generated based on the calculation model. A partition specification can be applied to one or more reference columns in a table containing at least some of the received data. The applying can cause the table to be split such that a plurality of records in the table are partitioned according to the partition specification. A separate processing path can be set for each partition, and execution of the calculation plan can continue using the separate processing paths, each of which can be assigned to a processing node of a plurality of available processing nodes.

    摘要翻译: 在计算模型中定义的动态分割节点可以通过基于计算模型生成的计算计划来接收正在操作的数据。 分区规范可以应用于包含至少一些接收到的数据的表中的一个或多个参考列。 应用可以使表被拆分,使得根据分区规范对表中的多个记录进行分区。 可以为每个分区设置单独的处理路径,并且可以使用单独的处理路径继续执行计算计划,每个处理路径可被分配给多个可用处理节点的处理节点。

    Table creation for partitioned tables
    9.
    发明授权
    Table creation for partitioned tables 有权
    分区表的表创建

    公开(公告)号:US08880565B2

    公开(公告)日:2014-11-04

    申请号:US13336942

    申请日:2011-12-23

    IPC分类号: G06F17/30

    摘要: A table creation request pertaining to a table in a database maintained on a multi-node data partitioning landscape that comprises a plurality of processing nodes can specify a number of partitions to be generated. At run time, a currently available number of processing nodes in the multi-node data partitioning landscape can be queried, and this currently available number of processing nodes can be compared with the specified number of partitions to be generated for the created table. The table can be generated with the specified number of partitions such that the generated partitions are located across the plurality of partitions according to a load balancing approach if the number of processing nodes equals the number of partitions to be generated or according to other information in the table request if the number of processing nodes does not equal the specified number of partitions.

    摘要翻译: 与维护在包含多个处理节点的多节点数据划分区域上的数据库中的表相关联的表创建请求可以指定要生成的分区的数量。 在运行时,可以查询多节点数据分区场景中当前可用数量的处理节点,并将此当前可用数量的处理节点与要为创建的表生成的指定数量的分区进行比较。 如果处理节点的数量等于要生成的分区的数量,或者根据其中的其他信息,则可以使用指定数量的分区生成表,使得生成的分区根据负载平衡方法位于多个分区之间 表请求如果处理节点的数量不等于指定的分区数。

    Split processing paths for a database calculation engine
    10.
    发明授权
    Split processing paths for a database calculation engine 有权
    分割数据库计算引擎的处理路径

    公开(公告)号:US08868594B2

    公开(公告)日:2014-10-21

    申请号:US13336935

    申请日:2011-12-23

    IPC分类号: G06F17/30

    摘要: A dynamic split node defined within a calculation model can receive data being operated on by a calculation plan generated based on the calculation model. A partition specification can be applied to one or more reference columns in a table containing at least some of the received data. The applying can cause the table to be split such that a plurality of records in the table are partitioned according to the partition specification. A separate processing path can be set for each partition, and execution of the calculation plan can continue using the separate processing paths, each of which can be assigned to a processing node of a plurality of available processing nodes.

    摘要翻译: 在计算模型中定义的动态分割节点可以通过基于计算模型生成的计算计划来接收正在操作的数据。 分区规范可以应用于包含至少一些接收到的数据的表中的一个或多个参考列。 应用可以使表被拆分,使得根据分区规范对表中的多个记录进行分区。 可以为每个分区设置单独的处理路径,并且可以使用单独的处理路径继续执行计算计划,每个处理路径可被分配给多个可用处理节点的处理节点。