Abstract:
Method, system, and programs for dynamic control of a processing system having a plurality of tiers. Queue lengths of a plurality of nodes in one of the plurality of tiers are received. A control objective is received from a higher tier. One or more requests from the higher tier are processed by the plurality of nodes in the tier. A control model of the tier is computed based on the received queue lengths. One or more parameters of the control model are adjusted based on the received control objective. At least one control action is determined based on the control model and the control objective.
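The sequence described in this abstract (receive queue lengths, compute a model, adjust it to the objective, pick actions) might be sketched as follows. This is only an illustration under stated assumptions: the abstract does not specify the form of the control model, so the mean-and-spread model, the `max_deviation` objective field, and the `shed_load` / `accept_load` action names are all hypothetical.

```python
from statistics import mean, pstdev


def build_control_model(queue_lengths):
    """Compute a simple model of the tier from per-node queue lengths.

    Assumption: the model is just the mean and population standard
    deviation of the queue lengths, plus a tunable band width.
    """
    values = list(queue_lengths.values())
    return {"mean": mean(values), "std": pstdev(values), "band": 1.0}


def adjust_model(model, objective):
    """Adjust a model parameter based on the higher tier's objective.

    Assumption: the objective carries a hypothetical 'max_deviation'
    field, expressed in standard deviations.
    """
    adjusted = dict(model)
    adjusted["band"] = objective["max_deviation"]
    return adjusted


def determine_actions(model, queue_lengths):
    """Pick a control action for each node outside the tolerated band."""
    upper = model["mean"] + model["band"] * model["std"]
    lower = model["mean"] - model["band"] * model["std"]
    actions = {}
    for node, length in queue_lengths.items():
        if length > upper:
            actions[node] = "shed_load"
        elif length < lower:
            actions[node] = "accept_load"
    return actions
```

A tighter objective (a smaller `max_deviation`) narrows the band, so more nodes receive an action.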
Abstract:
Method, system, and programs for balancing work load in a distributed system. A plurality of resource units in the distributed system are ranked based on a plurality of multi-dimensional load metrics determined from each of the plurality of resource units. The plurality of resource units are divided into a plurality of resource unit groups based on the ranking of the plurality of resource units. At least one pair of resource unit groups is formed. Each of the at least one pair of resource unit groups is assigned a probability of load balancing action. For each pair of resource unit groups, a load balancing action is determined based on the assigned probability of load balancing action.
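The rank, group, pair, and probabilistic-action steps above might look like the following sketch. Several details are assumptions not stated in the abstract: the metrics are collapsed into a single score by averaging, groups are paired most-loaded with least-loaded, the probability is the gap between the groups' mean scores, and an action is a hypothetical (source, destination) pair of unit names.

```python
from dataclasses import dataclass


@dataclass
class ResourceUnit:
    name: str
    metrics: tuple  # e.g. (cpu, memory), each assumed to lie in [0, 1]


def load_score(unit):
    # Assumption: average the multi-dimensional metrics into one score.
    return sum(unit.metrics) / len(unit.metrics)


def rank_units(units):
    """Rank resource units from most to least loaded."""
    return sorted(units, key=load_score, reverse=True)


def split_into_groups(ranked, num_groups):
    """Divide the ranked units into contiguous, near-equal groups."""
    size, extra = divmod(len(ranked), num_groups)
    groups, start = [], 0
    for i in range(num_groups):
        end = start + size + (1 if i < extra else 0)
        groups.append(ranked[start:end])
        start = end
    return groups


def pair_groups(groups):
    # Assumption: pair the i-th most loaded group with the i-th least loaded.
    return [(groups[i], groups[-1 - i]) for i in range(len(groups) // 2)]


def group_mean(group):
    return sum(load_score(u) for u in group) / len(group) if group else 0.0


def assign_probabilities(pairs):
    # Assumption: probability is the load gap within the pair, clamped to [0, 1].
    return [min(1.0, max(0.0, group_mean(h) - group_mean(l))) for h, l in pairs]


def decide_actions(pairs, probabilities, rng):
    """For each pair, move load with the assigned probability.

    rng is any object with a random() method, e.g. random.Random().
    """
    actions = []
    for (heavy, light), p in zip(pairs, probabilities):
        if heavy and light and rng.random() < p:
            # Move work from the most loaded unit to the least loaded one.
            actions.append((heavy[0].name, light[-1].name))
    return actions
```

Acting with a probability rather than deterministically means pairs with a small load gap are rebalanced only occasionally.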
Abstract:
A ProbeSystem for one or more Replicas is disclosed. For each Replica, the ProbeSystem includes a ProbeHead that updates a Head field in a TargetRecord pertaining to the Replica and a ProbeEcho that updates an Echo field in the TargetRecord. The Head field includes a timestamp and an epoch value. The Echo field includes a latency value and an epoch value. The fields are such that, during reporting, the ProbeHead can analyze the TargetRecord for the Replica to determine the latency of one or more paths between the Replica and other Replicas. The ProbeSystem has a peer-to-peer, shared-nothing architecture and applies to a large class of high-replica, distributed, and high-scale cloud storage systems and services.
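A minimal sketch of the record layout described above, assuming that the echo side computes latency as its observation time minus the Head timestamp and copies the Head's epoch, and that a latency is reported only when the two epochs match. These rules, and the function names `probe_head`, `probe_echo`, and `report_latency`, are illustrative assumptions; the abstract names only the fields.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Head:
    timestamp: float
    epoch: int


@dataclass
class Echo:
    latency: float
    epoch: int


@dataclass
class TargetRecord:
    replica: str
    head: Optional[Head] = None
    echo: Optional[Echo] = None


def probe_head(record, now, epoch):
    """ProbeHead side: stamp the Head field with a time and an epoch."""
    record.head = Head(timestamp=now, epoch=epoch)


def probe_echo(record, now):
    """ProbeEcho side: record how long the Head update took to be seen.

    Assumption: latency is the observation time minus the Head timestamp.
    """
    if record.head is None:
        return
    record.echo = Echo(latency=now - record.head.timestamp,
                       epoch=record.head.epoch)


def report_latency(record):
    """Return the latency if the Echo answers the current Head, else None."""
    if record.head is None or record.echo is None:
        return None
    if record.echo.epoch != record.head.epoch:
        return None  # Echo is stale: it belongs to an earlier probe round.
    return record.echo.latency
```

The epoch check keeps a late Echo from an earlier probe round from being reported as the current latency.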