Techniques are provided for determining a capacity allocation in a
multi-tiered computing system. In one aspect of the invention, a
technique for designing capacity allocation for a multi-tiered computing
system, each tier of the computing system having one or more computing
devices (e.g., servers), comprises the following steps/operations. Input
parameters relating to capacity allocation are obtained. A capacity
allocation is computed based on at least a portion of the input
parameters, the capacity allocation being computable such that one or
more end-to-end performance measures are substantially satisfied for
multiple service classes. The capacity allocation computation may
minimize a cost function. The capacity allocation computation may also
satisfy one or more mean delay guarantees for the multiple service
classes, one or more tail distribution guarantees for the multiple
service classes, or both one or more mean delay guarantees and one or
more tail distribution guarantees for the multiple service classes.