METHOD, APPARATUS, AND COMPUTER PROGRAM PRODUCT FOR SCHEDULING WORK IN A STREAM-ORIENTED COMPUTER SYSTEM WITH CONFIGURABLE NETWORKS
    2.
    发明申请
    METHOD, APPARATUS, AND COMPUTER PROGRAM PRODUCT FOR SCHEDULING WORK IN A STREAM-ORIENTED COMPUTER SYSTEM WITH CONFIGURABLE NETWORKS 有权
    在具有可配置网络的面向流行的计算机系统中调度工作的方法,装置和计算机程序产品

    公开(公告)号:US20090241123A1

    公开(公告)日:2009-09-24

    申请号:US12052836

    申请日:2008-03-21

    IPC分类号: G06F9/50

    摘要: A method, apparatus, and computer program product for scheduling stream-based applications in a distributed computer system with configurable networks are provided. The method includes choosing, at a highest temporal level, jobs that will run, an optimal template alternative for the jobs that will run, network topology, and candidate processing nodes for processing elements of the optimal template alternative for each running job to maximize importance of work performed by the system. The method further includes making, at a medium temporal level, fractional allocations and re-allocations of the candidate processing elements to the processing nodes in the system to react to changing importance of the work. The method also includes revising, at a lowest temporal level, the fractional allocations and re-allocations on a continual basis to react to burstiness of the work, and to differences between projected and real progress of the work.

    摘要翻译: 提供了一种用于在具有可配置网络的分布式计算机系统中调度基于流的应用的方法,装置和计算机程序产品。 该方法包括在最高时间级别选择将运行的作业,用于将运行的作业的最佳模板替代物,网络拓扑和候选处理节点,用于处理每个正在运行的作业的最佳模板替代物的元素以最大化重要性 系统执行的工作。 该方法还包括在中等时间级别将候选处理元件的分数分配和重新分配到系统中的处理节点,以对工作的不断变化的重要性作出反应。 该方法还包括在最低时间级别修改分数分配和连续重新分配以对工作的突发性作出反应,以及对工作的预测和实际进展之间的差异进行修改。

    Fault tolerance in distributed systems
    4.
    发明授权
    Fault tolerance in distributed systems 失效
    分布式系统容错

    公开(公告)号:US08473783B2

    公开(公告)日:2013-06-25

    申请号:US12942326

    申请日:2010-11-09

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1438

    摘要: Fault tolerance is provided in a distributed system. The complexity of replicas and rollback requests are avoided; instead, a local failure in a component of a distributed system is tolerated. The local failure is tolerated by storing state related to a requested operation on the component, persisting that stored state in a data store, such as a relational database, asynchronously processing the operation request, and if a failure occurs, restarting the component using the stored state from the data store.

    摘要翻译: 容错在分布式系统中提供。 避免复制和回滚请求的复杂性; 相反,容忍分布式系统的组件中的本地故障。 通过将与所请求的操作相关的状态存储在组件上,将数据存储器(例如关系数据库)中的存储状态保持为异步处理操作请求,并且如果发生故障,则可以容忍本地故障,使用所存储的重新启动组件 状态从数据存储。

    Scheduling Flows in a Multi-Platform Cluster Environment
    5.
    发明申请
    Scheduling Flows in a Multi-Platform Cluster Environment 有权
    在多平台集群环境中调度流程

    公开(公告)号:US20120304188A1

    公开(公告)日:2012-11-29

    申请号:US13116368

    申请日:2011-05-26

    IPC分类号: G06F9/46

    摘要: Techniques for scheduling multiple flows in a multi-platform cluster environment are provided. The techniques include partitioning a cluster into one or more platform containers associated with one or more platforms in the cluster, scheduling one or more flows in each of the one or more platform containers, wherein the one or more flows are created as one or more flow containers, scheduling one or more individual jobs into the one or more flow containers to create a moldable schedule of one or more jobs, flows and platforms, and automatically converting the moldable schedule into a malleable schedule.

    摘要翻译: 提供了在多平台集群环境中调度多个流的技术。 这些技术包括将集群划分成与集群中的一个或多个平台相关联的一个或多个平台容器,调度一个或多个平台容器中的每个平台容器中的一个或多个流,其中,一个或多个流被创建为一个或多个流 容器,将一个或多个单独作业调度到所述一个或多个流动容器中,以创建一个或多个作业,流动和平台的可模制计划,以及将可模制计划自动转换成可延展的计划。

    Amortizing costs of shared scans
    6.
    发明授权
    Amortizing costs of shared scans 失效
    摊销共享扫描成本

    公开(公告)号:US08484649B2

    公开(公告)日:2013-07-09

    申请号:US12984909

    申请日:2011-01-05

    IPC分类号: G06F9/46

    CPC分类号: G06F9/4843

    摘要: Techniques for scheduling a plurality of jobs sharing input are provided. The techniques include partitioning one or more input datasets into multiple subcomponents, analyzing a plurality of jobs to determine which of the plurality of jobs require scanning of one or more common subcomponents of the one or more input datasets, and scheduling a plurality of jobs that require scanning of one or more common subcomponents of the one or more input datasets, facilitating a single scanning of the one or more common subcomponents to be used as input by each of the plurality of jobs.

    摘要翻译: 提供了用于调度多个作业共享输入的技术。 这些技术包括将一个或多个输入数据集划分成多个子组件,分析多个作业以确定多个作业中的哪个作业需要扫描一个或多个输入数据集的一个或多个公共子组件,以及调度需要 扫描所述一个或多个输入数据集中的一个或多个通用子部件,便于对所述一个或多个公共子部件进行单次扫描以用作多个作业中的每个作业的输入。

    Scheduling Mapreduce Jobs in the Presence of Priority Classes
    7.
    发明申请
    Scheduling Mapreduce Jobs in the Presence of Priority Classes 审中-公开
    在优先课程中安排Mapreduce工作

    公开(公告)号:US20120304186A1

    公开(公告)日:2012-11-29

    申请号:US13116378

    申请日:2011-05-26

    IPC分类号: G06F9/46 G06F9/50

    CPC分类号: G06F9/46 G06F9/4881 G06F9/50

    摘要: Techniques for scheduling one or more MapReduce jobs in a presence of one or more priority classes are provided. The techniques include obtaining a preferred ordering for one or more MapReduce jobs, wherein the preferred ordering comprises one or more priority classes, prioritizing the one or more priority classes subject to one or more dynamic minimum slot guarantees for each priority class, and iteratively employing a MapReduce scheduler, once per priority class, in priority class order, to optimize performance of the one or more MapReduce jobs.

    摘要翻译: 提供了在存在一个或多个优先级类别的情况下调度一个或多个MapReduce作业的技术。 这些技术包括获得一个或多个MapReduce作业的优选顺序,其中优选顺序包括一个或多个优先级等级,对于每个优先级类别,受限于一个或多个动态最小时隙保证的一个或多个优先级类别进行优先级排序,并迭代地使用 MapReduce调度程序每优先级一次,按优先级顺序排列,以优化一个或多个MapReduce作业的性能。

    FAULT TOLERANCE IN DISTRIBUTED SYSTEMS
    8.
    发明申请
    FAULT TOLERANCE IN DISTRIBUTED SYSTEMS 失效
    分布式系统中的容错

    公开(公告)号:US20120117423A1

    公开(公告)日:2012-05-10

    申请号:US12942326

    申请日:2010-11-09

    IPC分类号: G06F11/07

    CPC分类号: G06F11/1438

    摘要: Fault tolerance is provided in a distributed system. The complexity of replicas and rollback requests are avoided; instead, a local failure in a component of a distributed system is tolerated. The local failure is tolerated by storing state related to a requested operation on the component, persisting that stored state in a data store, such as a relational database, asynchronously processing the operation request, and if a failure occurs, restarting the component using the stored state from the data store.

    摘要翻译: 容错在分布式系统中提供。 避免复制和回滚请求的复杂性; 相反,容忍分布式系统的组件中的本地故障。 通过将与所请求的操作相关的状态存储在组件上,将数据存储器(例如关系数据库)中的存储状态保持为异步处理操作请求,并且如果发生故障,则可以容忍本地故障,使用所存储的重新启动组件 状态从数据存储。

    METHOD AND SYSTEM FOR PREDICTING RESOURCE USAGE OF REUSABLE STREAM PROCESSING ELEMENTS
    9.
    发明申请
    METHOD AND SYSTEM FOR PREDICTING RESOURCE USAGE OF REUSABLE STREAM PROCESSING ELEMENTS 有权
    用于预测资源使用可回收流程处理元素的方法和系统

    公开(公告)号:US20090119238A1

    公开(公告)日:2009-05-07

    申请号:US11935079

    申请日:2007-11-05

    IPC分类号: G06N5/04

    CPC分类号: G06N99/005

    摘要: A method is provided for generating a resource function estimate of resource usage by an instance of a processing element configured to consume zero or more input data streams in a stream processing system having a set of available resources that comprises receiving at least one specified performance metric for the zero or more input data streams and a processing power of the set of available resources, wherein one specified performance metric is stream rate; generating a multi-part signature of executable-specific information for the processing element and a multi-part signature of context-specific information for the instance; accessing a database of resource functions to identify a static resource function corresponding to the executable-specific information and a context-dependent resource function corresponding to the context-specific information; combining the static resource function and the context-dependent resource function to form a composite resource function for the instance; and applying the resource function to the at least one specified performance metric and the processing power to generate the resource function estimate of the at least one specified performance metric for processing by the instance.

    摘要翻译: 提供了一种用于通过被配置为在具有一组可用资源的流处理系统中消耗零个或多个输入数据流的处理元件的实例来生成资源使用的资源功能估计的方法,所述流处理系统包括:一组可用资源,其包括接收至少一个指定的性能度量 零个或多个输入数据流和可用资源集合的处理能力,其中一个指定的性能度量是流速率; 生成用于处理元件的可执行特定信息的多部分签名和该实例的上下文特定信息的多部分签名; 访问资源功能的数据库以识别与所述可执行特定信息相对应的静态资源功能以及与所述上下文特定信息相对应的与上下文相关的资源功能; 结合静态资源功能和上下文相关资源功能,形成实例的复合资源功能; 以及将所述资源功能应用于所述至少一个指定的性能度量和所述处理能力以生成所述至少一个指定的性能度量的所述资源功能估计,以供所述实例处理。