4: Introduction to Queueing Theory and Its Applications

     Requests for service very often find some or all of the units associated with some urban service system busy. The performance of these systems on such occasions (i.e., under conditions of some "stress") plays a major role in shaping citizens' perceptions with regard to the level of service offered.
    That some deterioration in the level of service will occur during periods of intensive activity is clear. For instance, a fire alarm may occur at an instant when all fire companies stationed at the nearest fire houses have already been dispatched to another alarm elsewhere in a city. To the extent, then, that the response time to the new fire will be longer under these circumstances (because of the need to dispatch fire engines from remote fire stations), the perceived level of service will be lower. Similarly, a police dispatcher will often postpone service to medium-priority calls for police assistance during periods when nearly all police cars in a district are busy. In doing so, the dispatcher preserves the ability of the police to respond immediately to top priority calls while maintaining some level of police "visibility" through the patrolling activities of non busy cars. Under normal conditions such medium priority calls would have received prompt service.
    Busy period service delays must inevitably occur in the case of services that respond to unpredictable demands whose time and location of occurrence are governed by some type of (known or unknown) probabilistic law. The cost of providing sufficient capacity to avoid all delays under all circumstances would be insupportable. The proper role of analysis, therefore, is to design service systems that achieve an acceptable balance between system operating costs, on the one hand, and the delays suffered by users of that system, on the other. As to where this acceptable balance lies, it all depends on the nature of the service provided. In some cases-for example, in fire or ambulance services-the costs of delays are generally perceived to be very high. To assure a low probability of such delays, it is therefore necessary to design systems that are relatively underutilized and whose servers (e.g., fire engines, ambulances, etc.), experience long periods of idleness. In other instances (e.g., collection of solid refuse or the delivery of mail), delays of a few hours or even of days are not usually catastrophic. This, in turn, permits a high level of utilization for the servers used by the system.
    Queueing theory, the theory of congestion, is the branch of operations research which explores the relationships between demand on a service system and the delays suffered by the users of that system. Since almost all urban service systems can be viewed as queueing systems (as it will become clear in this chapter), queueing theory plays a central role in the analysis of and planning for urban services. This chapter will therefore deal with a review of some important results in queueing theory and with an introduction to the applications of these results to the problems on which this book focuses.