-
公开(公告)号:US09311194B1
公开(公告)日:2016-04-12
申请号:US14229313
申请日:2014-03-28
Applicant: Google Inc.
Inventor: Robert Cypher , Healfdene Goguen , Steven Robert Schirripa , Christian Eric Schrock
CPC classification number: G06F9/5083 , G06F11/1088 , G06F11/2094 , G06F11/3433 , G06F11/3485
Abstract: A method for efficiently using resources (e.g., memory devices) in data centers of a distributed storage system includes identifying high-availability jobs and low-availability jobs that demand usage of resources of the distributed system. The method further includes allocating resource usage to the jobs, determining a first load of the jobs on resources available during a failure event, and determining a second load of the jobs on the resources lost during the failure event. The method includes determining a scaled third load of the jobs on the resources available during the failure event based on the first and second loads and reallocating resource usage assigned to the low-availability jobs to the high-availability jobs during the failure event. The reallocation is associated with the scaled third load of the jobs.
Abstract translation: 用于在分布式存储系统的数据中心中有效地使用资源(例如,存储设备)的方法包括识别需要对分布式系统的资源的使用的高可用性作业和低可用性作业。 该方法还包括将资源使用分配给作业,确定在故障事件期间可用资源上的作业的第一次加载,以及确定在故障事件期间丢失的资源上的作业的第二次加载。 该方法包括基于第一和第二负载确定在故障事件期间可用的资源上的作业的缩放的第三负载,并且在故障事件期间将分配给低可用性作业的资源使用重新分配给高可用性作业。 重新分配与缩放的第三次工作负载相关联。