Managing parallel execution of work granules according to their affinity
    1.
    发明授权
    Managing parallel execution of work granules according to their affinity 有权
    根据工作颗粒的亲和力管理工作颗粒的并行执行

    公开(公告)号:US07386849B2

    公开(公告)日:2008-06-10

    申请号:US10802553

    申请日:2004-03-16

    IPC分类号: G06F9/46

    摘要: A method and apparatus are provided for managing work granules being executed in parallel. A task is evenly divided between a number of work granules. The number of work granules falls between a threshold minimum and a threshold maximum. The threshold minimum and maximum may be configured to balance a variety of efficiency factors affected by the number of work granules, including workload skew and overhead incurred in managing larger number of work granules. Work granules are distributed to processes on nodes according to which of the nodes, if any, may execute the work granule efficiently. A variety of factors may used to determine where a work granule may be performed efficiently, including whether data accessed during the execution of a work granule may be locally accessed by a node.

    摘要翻译: 提供了一种用于管理并行执行的工作颗粒的方法和装置。 任务在一些工作颗粒之间被均匀分配。 工作颗粒的数量落在阈值最小值和阈值最大值之间。 阈值最小值和最大值可以被配置为平衡受工作颗粒数量影响的各种效率因子,包括工作量偏差和管理更大数量的工作颗粒物所产生的开销。 工作颗粒分配到节点上的过程,根据哪个节点(如果有的话)可以有效地执行工作颗粒。 可以使用各种因素来确定工作颗粒可以有效执行的位置,包括在工作颗粒的执行期间访问的数据是否可以由节点本地访问。

    Dynamic performance views with a parallel single cursor model
    2.
    发明授权
    Dynamic performance views with a parallel single cursor model 有权
    具有并行单光标模型的动态性能视图

    公开(公告)号:US07203676B2

    公开(公告)日:2007-04-10

    申请号:US10898300

    申请日:2004-07-23

    IPC分类号: G06F17/30

    摘要: A query coordinator handles a multiple-server dynamic performance query by sending remote query slaves (1) first information for generating a complete plan for the query, and (2) second information for participating in the dynamic performance view portion of the query. If the slaves on the remote servers are unable to use the first information to generate an equivalent query (for example, if they reside in a database server that has closed the database), then the slaves on the remote servers use the second information to participate in the dynamic performance view portion of the query.

    摘要翻译: 查询协调器通过发送远程查询从站来处理多服务器动态性能查询(1)用于生成查询的完整计划的第一信息,以及(2)用于参与查询的动态性能视图部分的第二信息。 如果远程服务器上的从站无法使用第一个信息来生成等效查询(例如,如果它们位于已关闭数据库的数据库服务器中),则远程服务器上的从站将使用第二个信息来参与 在查询的动态性能视图部分。

    Distributing workload between resources used to access data
    3.
    发明授权
    Distributing workload between resources used to access data 有权
    在用于访问数据的资源之间分配工作负载

    公开(公告)号:US06549931B1

    公开(公告)日:2003-04-15

    申请号:US09415033

    申请日:1999-10-07

    IPC分类号: G06F1730

    CPC分类号: G06F9/5066 Y10S707/99952

    摘要: A task is divided into work granules that require access to data objects. The work granules are distributed to slave processes in a manner that causes the data objects to be accessed in a balanced way, such that the difference in the number of slave processes accessing any object is not greater than one. Distributing the work granules in this manner decreases the likelihood that the resources required to access any particular data object will become a bottleneck in performing the task. For each data object in the set of data objects, a work granule list is maintained. The list of each data object identifies work granules requiring access to the data object. A slave process is assigned a work granule selected from a set of work granule lists. To select a work granule for a slave process, an initial list is picked at random. If the quantity of currently-assigned work granules from the selected work granule list is less than or equals a “threshold minimum”, then a work granule from the work granule list is assigned to the slave process. If the quantity of work granules is greater than the threshold minimum, then another work granule list is selected. The threshold minimum may be, for example, the minimum number of currently-assigned work granules from the work granule list.

    摘要翻译: 任务分为需要访问数据对象的工作颗粒。 工作颗粒以以平衡的方式访问数据对象的方式分配给从属进程,从而访问任何对象的从属进程数量的差异不大于1。 以这种方式分配工作颗粒减少访问任何特定数据对象所需的资源将成为执行任务的瓶颈的可能性。 对于数据对象集合中的每个数据对象,维护工作粒子列表。 每个数据对象的列表标识需要访问数据对象的工作颗粒。 从属进程被分配从一组工作颗粒列表中选择的工作颗粒。 要为从属进程选择工作颗粒,将随机挑选一个初始列表。 如果来自所选择的工作颗粒列表的当前分配的工作颗粒的数量小于或等于“阈值最小值”,则将工作颗粒列表中的工作颗粒分配给从属过程。 如果工作颗粒的数量大于阈值最小值,则选择另一个工作颗粒列表。 阈值最小值可以是例如来自工作颗粒列表的当前分配的工作颗粒的最小数量。

    Enabling intra-partition parallelism for partition-based operations
    4.
    发明授权
    Enabling intra-partition parallelism for partition-based operations 有权
    为分区操作启用分区内并行

    公开(公告)号:US06954776B1

    公开(公告)日:2005-10-11

    申请号:US09851065

    申请日:2001-05-07

    IPC分类号: G06F9/46 G06F9/50 G06F15/16

    CPC分类号: G06F9/5066 G06F17/30445

    摘要: Techniques are provided for increasing the degree of parallelism without incurring overhead costs associated with inter-nodal communication for performing parallel operations. One aspect of the invention is to distribute-phase partition-pairs of a parallel partition-wise operation on a pair of objects among the nodes of a database system. The -phase partition-pairs that are distributed to each node are further partitioned to form a new set of-phase partition-pairs. One -phase partition-pair from the set of new-phase partition-pairs is assigned to each slave process that is on a given node. In addition, a target object may be partitioned by applying an appropriate hash function to the tuples of the target object. The parallel operation is performed by broadcasting each tuple from a source table only to the group of slave processes that is working on the static partition to which the tuple is mapped.

    摘要翻译: 提供了用于增加并行程度的技术,而不会产生与用于执行并行操作的节间通信相关联的开销成本。 本发明的一个方面是在数据库系统的节点之间的一对对象上分布并行分区操作的相位分区对。 分配给每个节点的相位分区对进一步分割以形成一组新的相位分区对。 来自一组新阶段分区对的单相分区对被分配给给定节点上的每个从进程。 此外,可以通过对目标对象的元组应用适当的散列函数来对目标对象进行分区。 并行操作是通过将每个元组从源表广播到在该元组被映射到的静态分区上工作的从属进程组来执行的。

    Managing parallel execution of work granules according to their affinity
    5.
    发明授权
    Managing parallel execution of work granules according to their affinity 有权
    按照其亲和力管理工作颗粒的并行执行

    公开(公告)号:US06826753B1

    公开(公告)日:2004-11-30

    申请号:US09415031

    申请日:1999-10-07

    IPC分类号: G06F900

    摘要: A method and apparatus are provided for managing work granules being executed in parallel. A task is evenly divided between a number of work granules. The number of work granules falls between a threshold minimum and a threshold maximum. The threshold minimum and maximum may be configured to balance a variety of efficiency factors affected by the number of work granules, including workload skew and overhead incurred in managing larger number of work granules. Work granules are distributed to processes on nodes according to which of the nodes, if any, may execute the work granule efficiently. A variety of factors may used to determine where a work granule may be performed efficiently, including whether data accessed during the execution of a work granule may be locally accessed by a node.

    摘要翻译: 提供了一种用于管理并行执行的工作颗粒的方法和装置。 任务在一些工作颗粒之间被均匀分配。 工作颗粒的数量落在阈值最小值和阈值最大值之间。 阈值最小值和最大值可以被配置为平衡受工作颗粒数量影响的各种效率因子,包括工作量偏差和管理更大数量的工作颗粒物所产生的开销。 工作颗粒分配到节点上的过程,根据哪个节点(如果有的话)可以有效地执行工作颗粒。 可以使用各种因素来确定工作颗粒可以有效执行的位置,包括在工作颗粒的执行期间访问的数据是否可以由节点本地访问。