Method and apparatus for estimating a completion time for mapreduce jobs

    公开(公告)号:US09612876B2

    公开(公告)日:2017-04-04

    申请号:US14135114

    申请日:2013-12-19

    CPC classification number: G06F9/5066 G06F2209/5013

    Abstract: A method, non-transitory computer readable medium, and apparatus for estimating a completion time for a MapReduce job are disclosed. For example, the method builds a general MapReduce performance model, computes one or more performance characteristics of each one of one or more benchmark workloads, computes one or more performance characteristics of the MapReduce job in the known processing system, selects a subset of the one or more benchmark workloads that have similar performance characteristics as the one or more performance characteristics of the MapReduce job, targets a cluster of processing nodes in a distributed processing system, computes one or more performance characteristics of the subset of the one or more benchmark workloads in the cluster of processing nodes and estimates the completion time for the MapReduce job.

Patent Agency Ranking