-
Publication number: US09323615B2
Publication date: 2016-04-26
Application number: US14169322
Filing date: 2014-01-31
Applicant: Google Inc.
Inventor: Robert Cypher , Sean Quinlan , Steven Robert Schirripa , Lidor Carmi , Christian Eric Schrock
CPC classification number: G06F11/2094 , G06F3/0619 , G06F3/0643 , G06F3/067 , G06F11/0727 , G06F11/0793 , G06F11/1076 , G06F2201/805 , G06F2211/1028
Abstract: A method of distributing data in a distributed storage system includes receiving a file into non-transitory memory and dividing the received file into chunks. The chunks are data chunks and non-data chunks. The method also includes grouping one or more of the data chunks and one or more of the non-data chunks in a group. One or more chunks of the group are capable of being reconstructed from other chunks of the group. The method also includes distributing the chunks of the group to storage devices of the distributed storage system based on a hierarchy of the distributed storage system. The hierarchy includes maintenance domains having active and inactive states, and each storage device is associated with a maintenance domain. The chunks of a group are distributed across multiple maintenance domains to maintain the ability to reconstruct chunks of the group when a maintenance domain is in an inactive state.
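The placement idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the round-robin policy, the function names `distribute_group` and `survives_domain_outage`, and the parameter `max_lost_chunks` (how many missing chunks the group's code is assumed to tolerate) are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def distribute_group(chunks, devices_by_domain):
    """Place the chunks of one group round-robin over maintenance domains.

    chunks: list of chunk names (data and non-data chunks of one group).
    devices_by_domain: dict mapping a domain name to its storage devices.
    Returns a dict mapping each chunk to a (domain, device) pair.
    The round-robin policy is an assumption, not taken from the abstract.
    """
    domains = sorted(devices_by_domain)
    used = defaultdict(int)  # number of chunks already placed per domain
    placement = {}
    for i, chunk in enumerate(chunks):
        domain = domains[i % len(domains)]
        devices = devices_by_domain[domain]
        # Cycle through the devices inside the chosen domain.
        device = devices[used[domain] % len(devices)]
        used[domain] += 1
        placement[chunk] = (domain, device)
    return placement


def survives_domain_outage(placement, max_lost_chunks):
    """Check that no single inactive domain hides too many chunks.

    max_lost_chunks is the assumed number of chunks the group can lose
    and still be reconstructed from the remaining chunks.
    """
    per_domain = defaultdict(int)
    for domain, _device in placement.values():
        per_domain[domain] += 1
    return all(count <= max_lost_chunks for count in per_domain.values())


if __name__ == "__main__":
    group = ["D0", "D1", "D2", "D3", "C0", "C1"]  # 4 data + 2 non-data chunks
    racks = {"rack-a": ["a1", "a2"], "rack-b": ["b1", "b2"], "rack-c": ["c1", "c2"]}
    plan = distribute_group(group, racks)
    print(plan)
    print(survives_domain_outage(plan, max_lost_chunks=2))
```

With three domains and six chunks, each domain holds two chunks, so the group stays reconstructable during a single-domain outage only if the assumed code tolerates two missing chunks; spreading the same group over six domains would lower that requirement to one.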
-
Publication number: US09213576B2
Publication date: 2015-12-15
Application number: US14169357
Filing date: 2014-01-31
Applicant: Google Inc.
Inventor: Robert Cypher , Peter Dahl , Steven Robert Schirripa
CPC classification number: G06F9/5011 , G06F9/4881 , G06F9/50 , G06F9/5061 , G06F9/5088 , G06F2209/503
Abstract: A method includes identifying high-availability jobs and low-availability jobs that demand usage of resources of a distributed system. The method includes determining a first quota of the resources available to low-availability jobs as a quantity of the resources available during normal operations, and determining a second quota of the resources available to high-availability jobs as a quantity of the resources available during normal operations minus a quantity of the resources lost due to a tolerated event. The method includes executing the jobs on the distributed system and constraining a total usage of the resources by both the high-availability jobs and the low-availability jobs to the quantity of the resources available during normal operations.
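The quota arithmetic described in the abstract above can be written out as a short Python sketch. The names `Quotas`, `determine_quotas`, and `can_admit`, the use of a single scalar resource, and the admission-check framing are assumptions made for illustration; the abstract itself states only how the two quotas and the total-usage constraint are defined.

```python
from dataclasses import dataclass


@dataclass
class Quotas:
    low_availability: float   # first quota: capacity during normal operations
    high_availability: float  # second quota: normal capacity minus tolerated loss


def determine_quotas(normal_capacity, tolerated_loss):
    """Compute both quotas from the normal capacity and the tolerated loss."""
    if not 0 <= tolerated_loss <= normal_capacity:
        raise ValueError("tolerated_loss must lie between 0 and normal_capacity")
    return Quotas(
        low_availability=normal_capacity,
        high_availability=normal_capacity - tolerated_loss,
    )


def can_admit(kind, demand, high_used, low_used, normal_capacity, quotas):
    """Hypothetical admission check for one new job of the given kind.

    A job is admitted only if its class stays within its own quota and the
    combined usage of both classes stays within the normal capacity.
    """
    if kind == "high":
        new_high, new_low = high_used + demand, low_used
    elif kind == "low":
        new_high, new_low = high_used, low_used + demand
    else:
        raise ValueError("kind must be 'high' or 'low'")
    return (
        new_high <= quotas.high_availability
        and new_low <= quotas.low_availability
        and new_high + new_low <= normal_capacity
    )


if __name__ == "__main__":
    quotas = determine_quotas(normal_capacity=100, tolerated_loss=20)
    print(quotas)
    print(can_admit("high", 50, high_used=40, low_used=0,
                    normal_capacity=100, quotas=quotas))
```

In this sketch, high-availability jobs can never be promised more than what survives the tolerated event, while low-availability jobs may fill whatever normal capacity remains and are the ones that lose resources if the event occurs.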
-