Chapter 3
Chapter 3
Chapter 3
Queuing system
3.1. Introduction
Most systems of interest in a simulation study contain a process in which there is a demand for
services. The system can service entities at a rate which is less than the rate at which entities
arrives. The entities are then said to join waiting line. The line where the entities or customers
wait is generally known as queue. The combination of all entities in system being served and
being waiting for services will be called a queuing system. The general diagram of queuing
system can be shown as a queuing system involves customers arriving at a constant or variable
time rate for service at a service station. Customers can be students waiting for registration in
college, airplane queuing for landing at airfield, or jobs waiting in machines shop. If the
customer after arriving can enter the service center, it is good, otherwise they have to wait for
the service and form a queue i.e. waiting line. They remain in queue till they are provided the
service. Sometimes queue being too long, they will leave the queue and go, it results a loss of
customer. Customers are to be serviced at a constant or variable rate before they leave the
service station.
a) FIFO (First in First out): According to this rule, Service is offered on the basis of arrival time
of customer. The customer who comes first will get the service first. So in other word the
customer who get the service next will be determine on the basis of longest waiting time.
b) Last in First out (LIFO): It is usually abbreviated as LIFO, occurs when service is next
offered to the customer that arrived recently or which have waiting time least. In the crowded
train the passenger getting in or out from the train is an example of LIFO.
c) Service in Random order (SIRO): it means that a random choice is made between all waiting
customers at the time service is offered i.e. a customer is picked up randomly forms the waiting
queue for the service.
d) Shortest processing time First (SPT): it means that the customer with shortest service time
will be chosen first for the service i.e. the shortest service time customer will get the priority in
the selection process.
e) Priority: a special number is assigned to each customer in the waiting line and it is called
priority. Then according to this number, the customer is chosen for service.
We will be frequently using notation for queuing system, called Kendall’s notation, i.e
A/B/c/N/K, where, A, B, c, N, K respectively indicate arrival pattern, service pattern, number of
servers, system capacity, and Calling population (The potential customers to a system is known
as calling pouplation:Finite and infinite)
The symbols used for the probability distribution for inter arrival time, and service time are, D
for deterministic, M for exponential (or Markov) and Ek for Erlang.
If the capacity Y is not specified, it is taken as infinity, and if calling population is not specified,
it is assumed unlimited or infinite
Example
a) M/D/2/5/∞ stands for a queuing system having exponential arrival times, deterministic
service time, 2 servers, capacity of 5 customers, and infinite population.
b) If notation is given as M/D/2 means exponential arrival time, deterministic service time, 2
servers, infinite service capacity, and infinite population.
• First-in, First-out (FIFO): Service is provided on the first come, first served basis.
• Random: Arrivals of customers is completely random but at a certain arrival rate.
• Steady state: The queuing system is at a steady state condition.
The above conditions are very ideal conditions for any queuing system and assumptions are
made to model the situation mathematically. First condition only means irrespective of
customer, one who comes first is attended first and no priority is given to anyone.
Traffic intensity
The ratio of the mean service time to the mean inter arrival time is called traffic intensity.
I.e. u= λ"Ts or u=Ts/Ta
If there is any balking or reneging, not all arriving entities get served. It is necessary therefore to
distinguish between actual arrival rate and the arrival rate of entities that get served.
Here λ" denoted the all arrivals including balking or reneging.
Server utilization
It consists of only the arrival that gets served. It is denoted by and defined as
= λTs= λ/ μ (server utilization for single server).
This is also the average number of customers in the service facility.
Thus probability of finding service counter free is
(1 – ρ)
That is there are zero customers in the service facility.
= = =
= = =
= = =
Example
At the ticket counter of football stadium, people come in queue and purchase tickets. Arrival
rate of customers is 1/min. It takes at the average 20 seconds to purchase the ticket.
(a) If a sport fan arrives 2 minutes before the game starts and if he takes exactly 1.5 minutes to
reach the correct seat after he purchases a ticket, can the sport fan expects to be seated for the
kick-off?
Solution:
(a) A minute is used as unit of time. Since ticket is disbursed in 20 seconds, this means, three
customers enter the stadium per minute, that is service rate is 3 per minute.
Therefore,
λ = 1 arrival/min
μ = 3 arrivals/min
= waiting time in the system=1/( μ- λ)=0.5 minutes
The average time to get the ticket plus the time to reach the correct seat is 2 minutes exactly, so
the sports fan can expect to be seated for the kick-off.
Example2
Customers arrive in a bank according to a Poisson's process with mean inter arrival time of 10
minutes. Customers spend an average of 5 minutes on the single available counter, and leave.
(a) What is the probability that a customer will not have to wait at the counter?
(b)What is the expected number of customers in the bank?
(c) How much time can a customer expect to spend in the bank?
Solution:
We will take an hour as the unit of time. Thus,
λ = 6 customers/hour,
μ = 12 customers/hour.
The customer will not have to wait if there are no customers in the bank. Thus,
P0 = 1 – λ/μ= 1− 6/12 = 0.5
Expected numbers of customers in the bank are given by
= λ /( μ - λ )=6/6=1
Expected time to be spent in the bank is given by
=1/( μ – λ)= 1/(12-6) = 1/6 hour = 10 minutes.
Figure shows a generalization of the simple model we have been discussing for multiple
servers, all sharing a common queue. If an item arrives and at least one server is available, then
the item is immediately dispatched to that server. It is assumed that all servers are identical;
thus, if more than one server is available, it makes no difference which server is chosen for the
item. If all servers are busy, a queue begins to form. As soon as one server becomes free, an
item is dispatched from the queue using the dispatching discipline in force. The key
characteristics typically chosen for the multi-server queue correspond to those for the single-
server queue. That is, we assume an infinite population and an infinite queue size, with a single
infinite queue shared among all servers. Unless otherwise stated, the dispatching discipline is
FIFO. For the multi-server case, if all servers are assumed identical, the selection of a particular
server for a waiting item has no effect on service time.
The total server utilization in case of Multi-server queue for N server system is
Packet Switching
Messages are transmitted through intermediate stages and the route a message
takes depends entirely upon the current load on the system. The route allocation
is dynamic. Each stage requires a random amount of time reflecting the length
of the queue at that stage.
3.8.3 Broadcasting
Radio Communication
Considering the nodes as transmitters/receivers you can treat each as having a
queue for their channels. Without going into great detail of the various systems
used: it is always necessary to consider the fact that to open a channel you must
check to see if the two adjacent channels are also free as interference blocks
transmissions. When the channels are not free it may be necessary to re-allocate
communications that already have channels to make room.
Digital Communication
This is done on the basis of time slots. For a given communication link it could
have several or all slots filled and no interference would take place making
allocation far simpler. The aspect of nodes with queues still applies however.