PARTITIONING OPERATOR FLOW GRAPHS
    1.
    发明申请
    PARTITIONING OPERATOR FLOW GRAPHS 有权
    分区操作员流程图

    公开(公告)号:US20100325621A1

    公开(公告)日:2010-12-23

    申请号:US12489805

    申请日:2009-06-23

    IPC分类号: G06F9/45

    CPC分类号: G06F8/443 G06F8/433 G06F8/44

    摘要: Techniques for partitioning an operator flow graph are provided. The techniques include receiving source code for a steam processing application, wherein the source code comprises an operator flow graph, wherein the operator flow graph comprises a plurality of operators, receiving profiling data associated with the plurality of operators and one or more processing requirements of the operators, defining a candidate partition as a coalescing of one or more of the operators into one or more sets of processing elements (PEs), using the profiling data to create one or more candidate partitions of the processing elements, using the one or more candidate partitions to choose a desired partitioning of the operator flow graph, and compiling the source code into an executable code based on the desired partitioning.

    摘要翻译: 提供了划分操作员流程图的技术。 这些技术包括接收蒸汽处理应用的源代码,其中源代码包括操作者流程图,其中操作者流程图包括多个操作者,接收与多个操作者相关联的分析数据和一个或多个处理要求 运营商,使用所述分析数据,使用所述一个或多个候选者来创建一个或多个所述处理元件的候选分区,将候选分区定义为将一个或多个运算符合并为一组或多组处理元素(PE) 分区以选择操作员流图的期望分区,并且基于期望的分区将源代码编译成可执行代码。

    Partitioning operator flow graphs
    2.
    发明授权
    Partitioning operator flow graphs 有权
    分区操作员流程图

    公开(公告)号:US08490072B2

    公开(公告)日:2013-07-16

    申请号:US12489805

    申请日:2009-06-23

    IPC分类号: G06F9/44 G06F9/45

    CPC分类号: G06F8/443 G06F8/433 G06F8/44

    摘要: Techniques for partitioning an operator flow graph are provided. The techniques include receiving source code for a stream processing application, wherein the source code comprises an operator flow graph, wherein the operator flow graph comprises a plurality of operators, receiving profiling data associated with the plurality of operators and one or more processing requirements of the operators, defining a candidate partition as a coalescing of one or more of the operators into one or more sets of processing elements (PEs), using the profiling data to create one or more candidate partitions of the processing elements, using the one or more candidate partitions to choose a desired partitioning of the operator flow graph, and compiling the source code into an executable code based on the desired partitioning.

    摘要翻译: 提供了划分操作员流程图的技术。 所述技术包括接收流处理应用的源代码,其中所述源代码包括操作者流图,其中所述操作者流程图包括多个操作者,接收与所述多个操作者相关联的分析数据以及所述多个操作者的一个或多个处理要求 运营商,使用所述分析数据,使用所述一个或多个候选者来创建一个或多个所述处理元件的候选分区,将候选分区定义为将一个或多个运算符合并为一组或多组处理元素(PE) 分区以选择操作员流图的期望分区,并且基于期望的分区将源代码编译成可执行代码。

    Determining Operator Partitioning Constraint Feasibility
    3.
    发明申请
    Determining Operator Partitioning Constraint Feasibility 有权
    确定操作员分区约束可行性

    公开(公告)号:US20110061060A1

    公开(公告)日:2011-03-10

    申请号:US12556931

    申请日:2009-09-10

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5066

    摘要: Techniques for determining feasibility of a set of one or more operator partitioning constraints are provided. The techniques include receiving one or more sets of operator partitioning constraints, wherein each set of one or more constraints define one or more desired conditions for grouping together of operators into partitions and placing partitions on hosts, wherein each operator is embodied as software that performs a particular function, processing each set of one or more operator partitioning constraints to determine feasibility of each set of one or more operator partitioning constraints, creating and outputting one or more candidate partitions and one or more host placements for each set of feasible partitioning constraints, and creating and outputting a certificate of infeasibility for each set of infeasible partitioning constraints, wherein the certificate of infeasibility outlines one or more reasons for infeasibility.

    摘要翻译: 提供了用于确定一组或多个操作者分区约束的可行性的技术。 这些技术包括接收一组或多组运算符分配约束,其中每组一个或多个约束定义一个或多个期望条件,用于将运算符分组到分区中并在主机上放置分区,其中每个运算符被实现为执行 特定功能,处理每组一个或多个运营商划分约束以确定每组一个或多个运营商分区约束的可行性,为每组可行分区约束创建和输出一个或多个候选分区和一个或多个主机布局,以及 创建和输出每组不可行分区约束的不可行性证书,其中,不可行性证书概述了一个或多个不可行性的原因。

    Amortizing costs of shared scans
    5.
    发明授权
    Amortizing costs of shared scans 失效
    摊销共享扫描成本

    公开(公告)号:US08484649B2

    公开(公告)日:2013-07-09

    申请号:US12984909

    申请日:2011-01-05

    IPC分类号: G06F9/46

    CPC分类号: G06F9/4843

    摘要: Techniques for scheduling a plurality of jobs sharing input are provided. The techniques include partitioning one or more input datasets into multiple subcomponents, analyzing a plurality of jobs to determine which of the plurality of jobs require scanning of one or more common subcomponents of the one or more input datasets, and scheduling a plurality of jobs that require scanning of one or more common subcomponents of the one or more input datasets, facilitating a single scanning of the one or more common subcomponents to be used as input by each of the plurality of jobs.

    摘要翻译: 提供了用于调度多个作业共享输入的技术。 这些技术包括将一个或多个输入数据集划分成多个子组件,分析多个作业以确定多个作业中的哪个作业需要扫描一个或多个输入数据集的一个或多个公共子组件,以及调度需要 扫描所述一个或多个输入数据集中的一个或多个通用子部件,便于对所述一个或多个公共子部件进行单次扫描以用作多个作业中的每个作业的输入。

    AMORTIZING COSTS OF SHARED SCANS
    9.
    发明申请
    AMORTIZING COSTS OF SHARED SCANS 失效
    共享扫描成本

    公开(公告)号:US20120174110A1

    公开(公告)日:2012-07-05

    申请号:US12984909

    申请日:2011-01-05

    IPC分类号: G06F9/46

    CPC分类号: G06F9/4843

    摘要: Techniques for scheduling a plurality of jobs sharing input are provided. The techniques include partitioning one or more input datasets into multiple subcomponents, analyzing a plurality of jobs to determine which of the plurality of jobs require scanning of one or more common subcomponents of the one or more input datasets, and scheduling a plurality of jobs that require scanning of one or more common subcomponents of the one or more input datasets, facilitating a single scanning of the one or more common subcomponents to be used as input by each of the plurality of jobs.

    摘要翻译: 提供了用于调度多个作业共享输入的技术。 这些技术包括将一个或多个输入数据集划分成多个子组件,分析多个作业以确定多个作业中的哪个作业需要扫描一个或多个输入数据集的一个或多个公共子组件,以及调度需要 扫描所述一个或多个输入数据集中的一个或多个通用子部件,便于对所述一个或多个公共子部件进行单次扫描以用作多个作业中的每个作业的输入。

    Scheduling Flows in a Multi-Platform Cluster Environment
    10.
    发明申请
    Scheduling Flows in a Multi-Platform Cluster Environment 有权
    在多平台集群环境中调度流程

    公开(公告)号:US20120304188A1

    公开(公告)日:2012-11-29

    申请号:US13116368

    申请日:2011-05-26

    IPC分类号: G06F9/46

    摘要: Techniques for scheduling multiple flows in a multi-platform cluster environment are provided. The techniques include partitioning a cluster into one or more platform containers associated with one or more platforms in the cluster, scheduling one or more flows in each of the one or more platform containers, wherein the one or more flows are created as one or more flow containers, scheduling one or more individual jobs into the one or more flow containers to create a moldable schedule of one or more jobs, flows and platforms, and automatically converting the moldable schedule into a malleable schedule.

    摘要翻译: 提供了在多平台集群环境中调度多个流的技术。 这些技术包括将集群划分成与集群中的一个或多个平台相关联的一个或多个平台容器,调度一个或多个平台容器中的每个平台容器中的一个或多个流,其中,一个或多个流被创建为一个或多个流 容器,将一个或多个单独作业调度到所述一个或多个流动容器中,以创建一个或多个作业,流动和平台的可模制计划,以及将可模制计划自动转换成可延展的计划。