摘要:
A data model represents semantic information associated with objects stored in a file system. The data model includes a first object identifier, a second object identifier and a relation identifier. The first object identifier identifies a first object stored in the file system. The second object identifier identifies a second object stored in the file system, wherein the second object is related to the first object. The relation identifier identifies a relationship between the first object and the second object.
摘要:
A data model represents semantic information associated with objects stored in a file system. The data model includes a first object identifier, a second object identifier and a relation identifier. The first object identifier identifies a first object stored in the file system. The second object identifier identifies a second object stored in the file system, wherein the second object is related to the first object. The relation identifier identifies a relationship between the first object and the second object.
摘要:
An embodiment of a method of controlling access to a computing resource within a shared computing environment begins with a first step of measuring performance parameters for workloads accessing the computing resource to determine a performance parameter vector for the workloads. The method continues with a second step of estimating a controller function for the computing resource by analysis of recent performance parameters and recent throughputs. The controller function comprises a mathematical operation which takes an input and provides an output. In a third step, the method compares the performance parameter vector to a reference performance parameter vector to determine an error parameter. In a fourth step, the method applies the controller function to the error parameter to determine a target throughput for each of the workloads. The method concludes with a fifth step of adjusting access to the computing resource for each work load having a throughput limit different from about the target throughput by reducing or increasing the throughput limit for the workload to about the target throughput.
摘要:
According to one embodiment, a method comprises scheduling, by a weighted proportional-share scheduler, access of competing flows to a computing service in accordance with respective weights assigned to the competing flows. The method further comprises autonomously determining, by a controller, based at least in part on received performance measurement for the flows, values of the respective weights assigned to the competing flows for achieving respective performance goals desired for the competing flows. When determined by the controller that the values of the respective weights assigned to the competing flows should be changed for achieving the performance goals, the controller communicates the changed values of the respective weights to the weighted proportional-share scheduler.
摘要:
According to one embodiment, a method comprises scheduling, by a weighted proportional-share scheduler, access of competing flows to a computing service in accordance with respective weights assigned to the competing flows. The method further comprises autonomously determining, by a controller, based at least in part on received performance measurement for the flows, values of the respective weights assigned to the competing flows for achieving respective performance goals desired for the competing flows. When determined by the controller that the values of the respective weights assigned to the competing flows should be changed for achieving the performance goals, the controller communicates the changed values of the respective weights to the weighted proportional-share scheduler.
摘要:
An embodiment of a method of determining a lower bound for a minimum cost of placing data objects onto nodes of a distributed storage system begins with a first step of assigning a placement of a data object to a node and a time interval which meets a benefit criterion. Assignment of the placement of the data object to the node and the time interval comprises assigning the placement of the data object to a node-interval. The method continues with a second step of continuing to assign additional placements of the data object to other node-intervals which each meet the benefit criterion until a performance reaches a performance threshold. The method performs the first and second steps for each of the data objects. The method concludes with a step of calculating a sum of storage costs and creation costs for the placement and the additional placements of the data objects. According to another embodiment, the data object placed in the first and second steps is chosen on a basis of a triplet of the data object, the node, and the interval which meets the benefit criterion.
摘要:
A method of determining data placement for a distributed storage system with a selection of a heuristic class for placing data objects onto nodes of the distributed storage system. The heuristic class meets a performance requirement and provides a replication cost that is within an allowable limit of a minimum replication cost. The method continues with an instantiation of a data placement heuristic selected from a range of data placement heuristics according to the heuristic class. According to one embodiment, the method concludes with the instantiation of the data placement heuristic. According to another embodiment, the method concludes with an evaluation of a placement of the data objects onto the nodes made according to the data placement heuristic. According to another embodiment, the method continues by iteratively performing the selection of the heuristic class, the instantiation of the data placement heuristic, and the evaluation of the data placement heuristic.
摘要:
An embodiment of a method of selecting a heuristic class for data placement in a distributed storage system begins by forming a general integer program which models the data placement and forming a specific integer program which models a heuristic class for the data placement. The general and specific integer programs each comprising an objective of minimizing a replication cost. The method continues with solving the general integer program which provides a general lower bound for the replication cost and solving the specific integer program which provides a specific lower bound for the replication cost. The method concludes with selecting the heuristic class if a difference between the general lower bound and the specific lower bound is within an allowable amount.
摘要:
According to one embodiment, a method comprises receiving at a scheduler a change to a weight assigned to a consumer. The method further comprises utilizing, by the scheduler, a weighted proportional-share scheduling algorithm to maintain fairness in allocating shares of a resource to competing consumers in accordance with the changed weight. According to another embodiment, a system comprises at least one resource, and a plurality of competing consumers desiring to use the resource(s). A scheduler allocates shares of the resource(s) to the competing consumers according to a weighted proportional-share algorithm. A controller monitors at least one of performance of the competing consumers and utilization of the resource(s), and controls the performance and/or utilization by dynamically changing a scheduler parameter. The scheduler maintains fairness in allocating shares of the resource(s) to the competing consumers in accordance with the dynamically changed scheduler parameter.
摘要:
An embodiment of a method of determining bounds for a minimum cost begins, by solving an integer program using a relaxation of binary variables to determine a lower bound for the minimum cost. The integer program comprises a performance constraint and an objective of minimizing a cost. The binary variables which have values between zero and one comprise a subset. The method rounds up a first binary variable within the subset having a lowest ratio of a cost penalty to a performance reward. The method then rounds down one or more of the binary variables within the subset until no binary variables within the subset may be rounded down without violating the performance constraint. The method iteratively rounds up one of the binary variables within the subset and then rounds down others until no binary variables remain in the subset. The method concludes with determining an upper bound for the minimum cost according to the binary variables having binary values.