Abstract:
In a computer system comprising a plurality of computing devices wherein the plurality of computing devices processes a plurality of tasks and each task has a task type, a method for determining overheads associated with task types comprises the following steps. Overheads are estimated for a plurality of task types. One of the plurality of computing devices is selected to execute one of the plurality of tasks, wherein the selection comprises estimating load on at least a portion of the plurality of computing devices from tasks assigned to at least a portion of the plurality of computing devices and the estimates of overheads of the plurality of task types. One or more of the estimates of overheads of the plurality of task types are varied.
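The selection step above can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed method: the class name `OverheadAwareScheduler`, the least-loaded selection rule, and the multiplicative `vary_estimate` update are all hypothetical choices made for the example.

```python
class OverheadAwareScheduler:
    """Toy scheduler: device load is the sum of per-task-type overhead estimates."""

    def __init__(self, devices, overhead_estimates):
        # overhead_estimates: task type -> estimated overhead (arbitrary units, assumed)
        self.overhead = dict(overhead_estimates)
        # Task types currently assigned to each device
        self.assigned = {device: [] for device in devices}

    def estimated_load(self, device):
        # Load is estimated from the assigned tasks and the current overhead estimates
        return sum(self.overhead[task_type] for task_type in self.assigned[device])

    def select_device(self, task_type):
        # Assumption: pick the device with the lowest estimated load (ties go to the first)
        device = min(self.assigned, key=self.estimated_load)
        self.assigned[device].append(task_type)
        return device

    def complete(self, device, task_type):
        # Remove one finished task of the given type from the device
        self.assigned[device].remove(task_type)

    def vary_estimate(self, task_type, factor):
        # Vary an overhead estimate; later load estimates use the new value
        self.overhead[task_type] *= factor


sched = OverheadAwareScheduler(["a", "b"], {"read": 1.0, "write": 4.0})
first = sched.select_device("write")   # both devices idle, so the first one is chosen
second = sched.select_device("read")   # "a" now carries the write, so "b" is lighter
```

Because load is recomputed from the current estimates on every selection, varying an estimate immediately changes how already-assigned tasks count toward each device's load.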
Abstract:
Systems and methods for admission control to a physical host system are provided herein. One aspect provides for receiving at least one resource request at an admission control component of a distributed computing system, the at least one resource request comprised of at least one system type; processing the at least one resource request utilizing at least one physical host accessible to the distributed computing system; specifying a number of resource request slots to be reserved for at least one system type based on at least one future reservation threshold accessible to the admission control component; and blocking resource requests from entering the system through the admission control component based on a number of available resource request slots and the at least one future reservation threshold. Other embodiments and aspects are also described herein.
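The slot reservation and blocking behavior can be illustrated with a small sketch. It assumes, for simplicity, that the future reservation threshold is expressed directly as a number of slots held back per system type; the names `AdmissionController`, `admit`, and `release` are hypothetical and not taken from the source.

```python
class AdmissionController:
    """Toy admission control with slots reserved per system type."""

    def __init__(self, total_slots, reserved):
        self.total_slots = total_slots
        # reserved: system type -> slots held back for that type (assumed threshold form)
        self.reserved = dict(reserved)
        # in_use: system type -> slots currently occupied
        self.in_use = {}

    def available(self):
        return self.total_slots - sum(self.in_use.values())

    def _unused_reservation(self, system_type):
        # Portion of a type's reservation that it has not yet consumed
        used = self.in_use.get(system_type, 0)
        return max(0, self.reserved.get(system_type, 0) - used)

    def admit(self, system_type):
        # Slots that must stay free for the reservations of other system types
        held_for_others = sum(
            self._unused_reservation(t) for t in self.reserved if t != system_type
        )
        if self.available() - held_for_others <= 0:
            return False  # request is blocked at the admission control component
        self.in_use[system_type] = self.in_use.get(system_type, 0) + 1
        return True

    def release(self, system_type):
        # Free one slot previously granted to this system type
        self.in_use[system_type] -= 1


ctl = AdmissionController(total_slots=4, reserved={"gold": 2})
admitted = [ctl.admit("bronze") for _ in range(3)]  # third bronze request is blocked
```

In this sketch a request is blocked whenever admitting it would eat into slots still reserved for another system type, even if the host has free capacity.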
Abstract:
Techniques are disclosed for load balancing in networks such as those networks handling telephony applications. By way of example, such techniques direct requests associated with calls to servers in a system comprised of a network routing calls between a plurality of callers and at least one receiver wherein a load balancer sends requests associated with calls to a plurality of servers as follows. A request associated with a call, a caller, or a receiver is received, depending on the particular load balancing technique. A server is selected to receive the request. A subsequent request is received. A determination is made whether or not the subsequent request is associated with the call, the caller, or the receiver, depending on the particular load balancing technique. The subsequent request is sent to the server based on determining that the subsequent request is associated with the call, the caller, or the receiver, again depending on the particular load balancing technique.
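The affinity behavior described above can be sketched as follows. The key may be a call ID, caller, or receiver depending on the technique; the class name `AffinityLoadBalancer` and the fewest-active-keys rule for choosing a server for a new key are assumptions made for illustration.

```python
class AffinityLoadBalancer:
    """Toy load balancer that keeps requests with the same key on one server."""

    def __init__(self, servers):
        self.servers = list(servers)
        # Number of active keys (calls, callers, or receivers) per server
        self.active = {server: 0 for server in self.servers}
        # key -> server chosen for the first request carrying that key
        self.route = {}

    def dispatch(self, key):
        server = self.route.get(key)
        if server is None:
            # First request for this key: assumed rule is fewest active keys
            server = min(self.servers, key=lambda s: self.active[s])
            self.route[key] = server
            self.active[server] += 1
        # Subsequent requests with the same key go to the same server
        return server

    def end(self, key):
        # Forget the key once the call (or session) is over
        server = self.route.pop(key, None)
        if server is not None:
            self.active[server] -= 1


lb = AffinityLoadBalancer(["s1", "s2"])
first = lb.dispatch("call-1")
repeat = lb.dispatch("call-1")  # same call, so the same server is returned
```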
Abstract:
A generic queueing network model of a Web services environment is introduced. The behavior of a service is abstracted in three phases: serial, parallel and dormant, thus yielding a Serial Parallel Queueing Network (SPQN) model with a small number of parameters. A method is provided for estimating the parameters of the model, based on stochastic approximation techniques for solving stochastic optimization problems. The parameter estimation method is shown to perform well in a noisy environment, where performance data is obtained through measurements or using approximate model simulations.
Abstract:
An optimization problem models the dynamic placement of applications on servers under two types of simultaneous resource requirements, those that are dependent on the loads placed on the applications and those that are independent. The demand (load) for applications changes over time and the goal is to satisfy all the demand while changing the solution (assignment of applications to servers) as little as possible.
Abstract:
Described herein are methods, systems, apparatuses and products for cost-aware replication of intermediate data in dataflows. An aspect provides receiving at least one measurement indicative of a reliability cost associated with executing a dataflow; computing a degree of replication of at least one intermediate data set in the dataflow based on the reliability cost; and communicating at least one replication factor to at least one component of a system responsible for replication of the at least one intermediate data set in the dataflow; wherein the at least one intermediate data set is replicated according to the replication factor. Other embodiments are disclosed.
Abstract:
Techniques are disclosed for load balancing based on call length in networks such as those networks handling telephony applications. By way of example, one method for directing requests associated with calls to servers in a system comprised of a network routing calls between at least one caller and at least one receiver wherein the network comprises a load balancer sending requests to a plurality of servers comprises the following steps. A first request of a call is received. A server s1 is selected to receive the request based on an estimated duration of the call. Another method for directing requests associated with calls to servers in a system comprised of a network routing calls between at least one caller and at least one receiver wherein the network comprises a load balancer sending requests to a plurality of servers comprises the following steps. Information is maintained regarding load assigned to a plurality of servers. A first request of a call is received. A server s1 is selected to receive the request based on the maintained information. The request is sent to server s1. The information regarding load is updated based on an estimated length of the call.
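The second method above (maintain load information, select a server from it, then update the load using the estimated call length) might look like the following sketch. How the call length is estimated is not specified here, so the estimate is passed in as a parameter; the class name `CallLengthBalancer` and the choice of seconds as the load unit are assumptions.

```python
class CallLengthBalancer:
    """Toy balancer whose per-server load is the sum of estimated call lengths."""

    def __init__(self, servers):
        # Maintained load information: server -> estimated outstanding seconds
        self.load = {server: 0.0 for server in servers}
        # call_id -> (server, estimate), so the load can be released later
        self.calls = {}

    def assign(self, call_id, estimated_seconds):
        # Select the server with the least estimated outstanding work
        server = min(self.load, key=self.load.get)
        # Update the maintained load using the estimated length of the call
        self.load[server] += estimated_seconds
        self.calls[call_id] = (server, estimated_seconds)
        return server

    def finish(self, call_id):
        # Release the load that was charged when the call was assigned
        server, estimated_seconds = self.calls.pop(call_id)
        self.load[server] -= estimated_seconds


balancer = CallLengthBalancer(["s1", "s2"])
long_call = balancer.assign("c1", 300)   # long call lands on s1
short_call = balancer.assign("c2", 30)   # short call lands on s2
```

Weighting by estimated duration means a server holding one long call can be treated as busier than a server holding several short ones, which a simple call count would not capture.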