Scheduling Flows in a Multi-Platform Cluster Environment
    1.
    Type: Patent application
    Status: Active

    Publication No.: US20120304188A1

    Publication date: 2012-11-29

    Application No.: US13116368

    Filing date: 2011-05-26

    IPC classification: G06F9/46

    Abstract: Techniques for scheduling multiple flows in a multi-platform cluster environment are provided. The techniques include partitioning a cluster into one or more platform containers associated with one or more platforms in the cluster, scheduling one or more flows in each of the one or more platform containers, wherein the one or more flows are created as one or more flow containers, scheduling one or more individual jobs into the one or more flow containers to create a moldable schedule of one or more jobs, flows and platforms, and automatically converting the moldable schedule into a malleable schedule.
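
    The nesting described in this abstract (cluster, then platform containers, then flow containers, then jobs) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the weights, the names, and the proportional largest-remainder split are not taken from the patent. The result gives each job a fixed node count chosen up front, which is the usual meaning of a moldable schedule; the conversion to a malleable schedule, where allocations change while jobs run, is not shown.

```python
from typing import Dict, Tuple

def split(total: int, weights: Dict[str, float]) -> Dict[str, int]:
    """Split `total` whole nodes in proportion to `weights` (largest remainder)."""
    weight_sum = sum(weights.values())
    exact = {k: total * w / weight_sum for k, w in weights.items()}
    alloc = {k: int(v) for k, v in exact.items()}
    leftover = total - sum(alloc.values())
    # Hand any leftover nodes to the entries with the largest fractional parts.
    by_fraction = sorted(exact, key=lambda k: exact[k] - alloc[k], reverse=True)
    for k in by_fraction[:leftover]:
        alloc[k] += 1
    return alloc

# Hypothetical input shape: {platform: (weight, {flow: (weight, {job: weight})})}
Flows = Dict[str, Tuple[float, Dict[str, float]]]

def nested_schedule(nodes: int, platforms: Dict[str, Tuple[float, Flows]]):
    """Partition nodes into platform containers, then flow containers, then jobs."""
    schedule = {}
    platform_nodes = split(nodes, {p: w for p, (w, _) in platforms.items()})
    for p, (_, flows) in platforms.items():
        flow_nodes = split(platform_nodes[p], {f: w for f, (w, _) in flows.items()})
        schedule[p] = {f: split(flow_nodes[f], jobs) for f, (_, jobs) in flows.items()}
    return schedule

example = {
    "mapreduce": (2, {"etl": (1, {"extract": 1, "load": 1}),
                      "report": (1, {"aggregate": 1})}),
    "streams": (1, {"ingest": (1, {"parse": 3, "filter": 1})}),
}
print(nested_schedule(12, example))
```

    With 12 nodes, the hypothetical "mapreduce" platform container receives 8 nodes and "streams" receives 4; each flow container then divides its share among its own jobs.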

    Scheduling Mapreduce Jobs in the Presence of Priority Classes
    3.
    Type: Patent application
    Status: Pending (published)

    Publication No.: US20120304186A1

    Publication date: 2012-11-29

    Application No.: US13116378

    Filing date: 2011-05-26

    IPC classification: G06F9/46, G06F9/50

    CPC classification: G06F9/46, G06F9/4881, G06F9/50

    Abstract: Techniques for scheduling one or more MapReduce jobs in a presence of one or more priority classes are provided. The techniques include obtaining a preferred ordering for one or more MapReduce jobs, wherein the preferred ordering comprises one or more priority classes, prioritizing the one or more priority classes subject to one or more dynamic minimum slot guarantees for each priority class, and iteratively employing a MapReduce scheduler, once per priority class, in priority class order, to optimize performance of the one or more MapReduce jobs.
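
    One way to picture the per-class iteration in this abstract is the sketch below. It is an illustration under stated assumptions, not the patented method: `fair_share` is a hypothetical stand-in for the MapReduce scheduler, and the "dynamic" minimum guarantee is modeled as the smaller of a class's configured minimum and its current demand.

```python
def fair_share(budget, jobs):
    """Stand-in inner scheduler: hand out slots round-robin up to each job's demand."""
    alloc = {job: 0 for job in jobs}
    while budget > 0:
        hungry = [job for job in jobs if alloc[job] < jobs[job]]
        if not hungry:
            break
        for job in hungry:
            if budget == 0:
                break
            alloc[job] += 1
            budget -= 1
    return alloc

def schedule_by_priority(total_slots, classes, inner_scheduler=fair_share):
    """classes: list in priority order of (name, min_slots, {job: demand})."""
    remaining = total_slots
    plan = {}
    for i, (name, _min_slots, jobs) in enumerate(classes):
        # Slots held back so every lower-priority class can still reach its
        # guarantee (assumed here to be capped by that class's total demand).
        reserved = sum(min(m, sum(j.values())) for _, m, j in classes[i + 1:])
        budget = max(remaining - reserved, 0)
        plan[name] = inner_scheduler(budget, jobs)  # one scheduler run per class
        remaining -= sum(plan[name].values())
    return plan

classes = [
    ("production", 4, {"p1": 5, "p2": 3}),
    ("research", 3, {"r1": 6}),
    ("adhoc", 2, {"a1": 1}),
]
print(schedule_by_priority(10, classes))
```

    With 10 slots, the hypothetical "production" class is scheduled first but leaves 4 slots untouched, which lets "research" reach its minimum of 3 and "adhoc" receive the single slot it asks for.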

    Amortizing costs of shared scans
    5.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US08484649B2

    Publication date: 2013-07-09

    Application No.: US12984909

    Filing date: 2011-01-05

    IPC classification: G06F9/46

    CPC classification: G06F9/4843

    Abstract: Techniques for scheduling a plurality of jobs sharing input are provided. The techniques include partitioning one or more input datasets into multiple subcomponents, analyzing a plurality of jobs to determine which of the plurality of jobs require scanning of one or more common subcomponents of the one or more input datasets, and scheduling a plurality of jobs that require scanning of one or more common subcomponents of the one or more input datasets, facilitating a single scanning of the one or more common subcomponents to be used as input by each of the plurality of jobs.
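
    The shared-scan idea in this abstract can be sketched as follows. The block names, job names, and per-block job functions are hypothetical, and the sketch ignores timing and ordering constraints that a real scheduler would have to handle; it only shows each common subcomponent being read once and fed to every job that needs it.

```python
from collections import defaultdict

def group_jobs_by_block(job_inputs):
    """job_inputs: {job: set of block ids}. Returns {block: list of jobs reading it}."""
    readers = defaultdict(list)
    for job in sorted(job_inputs):
        for block in job_inputs[job]:
            readers[block].append(job)
    return dict(readers)

def run_shared_scans(blocks, job_inputs, job_funcs):
    """Scan each block once and pass its records to every job that needs it."""
    scans = 0
    results = {job: [] for job in job_inputs}
    for block, jobs in sorted(group_jobs_by_block(job_inputs).items()):
        records = blocks[block]  # the single read of this block
        scans += 1
        for job in jobs:
            results[job].append(job_funcs[job](records))
    return results, scans

blocks = {"b0": [1, 2, 3], "b1": [4, 5], "b2": [6]}
job_inputs = {"sum": {"b0", "b1"}, "count": {"b1", "b2"}, "max": {"b0", "b1", "b2"}}
job_funcs = {"sum": sum, "count": len, "max": max}

results, scans = run_shared_scans(blocks, job_inputs, job_funcs)
naive_scans = sum(len(needed) for needed in job_inputs.values())
print(results, scans, naive_scans)
```

    In this example the three jobs would perform 7 block reads if run independently, while the shared schedule performs 3.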

    Partitioning operator flow graphs
    7.
    Type: Granted patent
    Status: Active

    Publication No.: US08490072B2

    Publication date: 2013-07-16

    Application No.: US12489805

    Filing date: 2009-06-23

    IPC classification: G06F9/44, G06F9/45

    CPC classification: G06F8/443, G06F8/433, G06F8/44

    Abstract: Techniques for partitioning an operator flow graph are provided. The techniques include receiving source code for a stream processing application, wherein the source code comprises an operator flow graph, wherein the operator flow graph comprises a plurality of operators, receiving profiling data associated with the plurality of operators and one or more processing requirements of the operators, defining a candidate partition as a coalescing of one or more of the operators into one or more sets of processing elements (PEs), using the profiling data to create one or more candidate partitions of the processing elements, using the one or more candidate partitions to choose a desired partitioning of the operator flow graph, and compiling the source code into an executable code based on the desired partitioning.
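
    Coalescing operators into processing elements can be illustrated with a simple greedy heuristic. This is a swapped-in technique chosen for brevity, not the candidate-partition search the abstract describes: it fuses operators along the busiest edges first, as long as the combined CPU cost (standing in for profiling data) fits a per-PE capacity. All names and numbers are hypothetical.

```python
def fuse_operators(cpu, edges, capacity):
    """cpu: {operator: cost}; edges: [(src, dst, rate)].

    Returns a sorted list of operator groups, one group per processing element.
    """
    parent = {op: op for op in cpu}  # union-find forest over operators
    load = dict(cpu)                 # CPU cost of each group, keyed by its root

    def find(op):
        while parent[op] != op:
            parent[op] = parent[parent[op]]  # path halving
            op = parent[op]
        return op

    # Consider the highest-traffic edges first: fusing them saves the most
    # inter-PE communication.
    for src, dst, _rate in sorted(edges, key=lambda e: e[2], reverse=True):
        a, b = find(src), find(dst)
        if a != b and load[a] + load[b] <= capacity:
            parent[b] = a
            load[a] += load[b]

    groups = {}
    for op in cpu:
        groups.setdefault(find(op), []).append(op)
    return sorted(sorted(group) for group in groups.values())

# Costs are in percent of one core, so the capacity of 100 means one full core.
cpu = {"src": 20, "parse": 50, "filter": 30, "join": 60, "sink": 10}
edges = [("src", "parse", 100), ("parse", "filter", 80),
         ("filter", "join", 20), ("join", "sink", 20)]
print(fuse_operators(cpu, edges, capacity=100))
```

    Here "src", "parse", and "filter" fill one PE exactly, so "join" cannot be added to it and is fused with "sink" instead.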

    AMORTIZING COSTS OF SHARED SCANS
    10.
    Type: Patent application
    Status: Lapsed

    Publication No.: US20120174110A1

    Publication date: 2012-07-05

    Application No.: US12984909

    Filing date: 2011-01-05

    IPC classification: G06F9/46

    CPC classification: G06F9/4843

    Abstract: Techniques for scheduling a plurality of jobs sharing input are provided. The techniques include partitioning one or more input datasets into multiple subcomponents, analyzing a plurality of jobs to determine which of the plurality of jobs require scanning of one or more common subcomponents of the one or more input datasets, and scheduling a plurality of jobs that require scanning of one or more common subcomponents of the one or more input datasets, facilitating a single scanning of the one or more common subcomponents to be used as input by each of the plurality of jobs.
