Professional Documents
Culture Documents
a r t i c l e i n f o a b s t r a c t
Article history: We consider multi-server tandem queues where both stations have a nite buffer and all services times
Received 21 September 2015 are phase-type distributed. Arriving customers enter the rst queueing station if buffer space is avail-
Accepted 16 July 2016
able or get lost otherwise. After completing service in the rst station customers proceed to the second
Available online 21 July 2016
station if buffer space is available, otherwise a server at the rst station is blocked until buffer space
Keywords: becomes available at the second station. We provide an exact computational analysis of various steady-
Queueing state performance measures such as loss and blocking probabilities, expectations and higher moments
Multi-server tandem queue of numbers of customers in the queues and in the whole system by modeling the tandem queue as a
Markovian arrival process level-dependent quasi-birth-and-death process and applying suitable matrix-analytic methods. Numerical
Phase-type service time distributions results are presented for selected representative examples.
Matrix-analytic method
2016 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.ejor.2016.07.035
0377-2217/ 2016 Elsevier B.V. All rights reserved.
188 H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195
the seemingly simple case of a multi-server queue with Poisson vice time distributions and varying parameter values we study the
arrivals, PH distributed service times, and nite buffer by using a loss and blocking probabilities as well as the expected numbers of
reduced state description in which the state of only one server is customers in the two stations of the tandem queueing network.
represented explicitly, while the other servers are accounted for Section 7 concludes the paper and outlines further research direc-
through their rate of completions. In fact, queueing models with tions.
both MAP and PH distributed service times, even for single sta-
tion single-server queues, are typically analyzed by approxima- 2. Tandem queue specication
tions using state space reduction approaches and/or sophisticated
matrix-analytic methods. Different variants of single-server queues The class of the tandem queue we investigate can be specied
with MAP and PH distributed service times have been studied in, in Kendall notation by MAP/PH/c1 /c1 + K1 /PH/c2 /c2 + K2 . The
e.g., Artalejo and Chakravarthy (2006), Krishnamoorthy, Babu, and rst station consists of c1 identical servers and a buffer of nite
Narayanan (2009), Sreenivasan, Chakravarthy, and Krishnamoorthy capacity K1 . The second station consists of c2 identical servers and
(2013). Multi-server queues with MAP and PH distributed service a buffer of nite capacity K2 . Customers arrive at the rst queue-
times are addressed in Chakravarthy (2013) where the MAP/PH/c ing station. If upon arrival the buffer is fully occupied, then the
retrial queue with PH distributed retrials is studied via simu- customer is lost. Customers having completed service in the rst
lation, and in Kim, Dudin, Taramin, and Baek (2013b) where a station proceed to the second station. If upon service completion
MAP/PH/c/c+K queue with two customer classes is analyzed by in the rst station the buffer at the second station is fully occu-
combining specic state space reduction techniques for queues pied, then the customer blocks a server at the rst station until
with PH distributed service times and matrix-analytic methods. buffer space becomes available at the second station.
Only relatively few works have investigated tandem queues The service times at both stations are PH distributed. A prob-
with MAP and PH distributed service times. Gomez-Corral ability distribution on R+ = [0, ) is a (continuous) PH distribu-
(2002) analyzes a tandem queue with MAP and two single-server tion, iff it is the distribution of the time until absorption in a -
stations, innite buffer at the rst station and no buffer at the sec- nite time-homogeneous continuous-time Markov chain (CTMC). For
ond station. Service times at the rst station are PH distributed and such a CTMC with state space {1, . . . , n + 1} and initial distribution
the service times at the second station are general. Van Houdt and = (1 , . . . , n+1 ), where the states 1, . . . , n are transient and the
Alfa (2005) consider a (discrete-time) tandem queue with MAP and state n + 1 is absorbing, the generator matrix A has the form
two single-server stations where the service times at both stations
B b
are PH distributed, the queue at the rst station is innite and the A= , B Rnn , b Rn (1)
queue at the second station is nite. Lian and Liu (2008) deal with 0 0
a tandem queue with MAP, two single-server stations with queues and (1 , . . . , n , B ) is a representation of the PH distribution, in
of innite capacity and exponentially distributed service times at short PH( , B), with = (1 , . . . , n ). The number n of transient
both stations. Baumann and Sandmann (2013) consider a tandem states is called the order of the PH distribution or the number of
queue with MAP, two single-server stations with queues of innite phases. In our model we denote the service time distributions at
capacity and PH distributed service times at both stations where the rst and the second station by PH( , S) and PH( , T), respec-
customers may leave the system after completing service at the tively, where PH( , S) is of order V and PH( , T) is of order W.
rst station. Kim, Dudin, Dudin, and Dudina (2013a) model a call Hence, S = (si j )Vi, j=1 and T = (ti j )W i, j=1
. Besides, we dene the vec-
center as a tandem queue with MAP, multiple servers at both sta- tors s := S1 and t := T 1. The moments of a PH( , B) distributed
tions, exponentially distributed service times and no buffer at the random variable Z are given by E[Z k ] = k! (B )k 1. Hence, the
rst station, PH distributed service times and a nite buffer at the expected service times in our model are S1 1 at the rst sta-
second station. Kim, Dudin, Dudina, and Dudin (2014) investigate tion and T 1 1 at the second station.
a tandem queue with two types of customers, marked MAP, multi- The arrival process is a continuous-time MAP of order U de-
ple servers at both stations, exponentially distributed service times termined by the pair of (U U)-matrices (D0 , D1 ) where D0 =
and no buffer at the rst station, PH distributed service times, one (i j )Ui, j=1 , D1 = (i j )Ui, j=1 with ij 0 for i = j, ij < 0 for i =
nite buffer and one innite buffer at the second station. j, ij 0 for all i, j {1, 2 . . . , U }, D1 = 0, and D0 1 + D1 1 = 0
In this paper, we consider tandem queues with MAP, two multi- such that D = D0 + D1 is the generator matrix of an irreducible
server stations with nite buffers and PH distributed service times CTMC on the state space {1, 2, . . . , U }. This CTMC is commonly
at both stations where at the rst station losses of arriving cus- referred to as environmental process, phase process, or back-
tomers and blocking after service can occur. We compute steady- ground CTMC. When it changes its state from state i to state
state performance measures without introducing any approxima- j = i, with probability i j /(i j + i j ) it triggers an arrival and
tion. For this purpose we model the class of tandem queues under with probability i j /(i j + i j ) it does not. The average arrival
consideration by level-dependent quasi-birth-and death (LDQBD) rate of the MAP is given by = D1 1 where denotes the
processes and use a matrix-analytic method according to Baumann stationary distribution of the environmental process, i.e. D =
and Sandmann (2013) for obtaining the desired steady-state perfor- 0, 1 = 1. The stationary moments of the interarrival time Y are
mance measures without explicitly computing the stationary dis- E[Y k ] = k!1 (D0 )1k 1. In particular, E[Y ] = 1 , and Var[Y ] =
tribution. In the next section, we formally introduce the class of (2 (D0 )1 1 1 )/2 , from which we get the squared coecient
tandem queues considered, in particular we give general formal of variation c2 = 2 (D0 )1 1 1. The squared coecient of cor-
descriptions of the arrival process and the service time distribu- relation is given by = ( (D0 )1 D1 D1 1 1 )/c2 .
0
tions. In Section 3 we describe the modeling of this class of tan-
dem queues as LDQBD processes, where we introduce the structur- 3. Markov chain model
ing of the multi-dimensional state space as well as the state transi-
tions and transition rates. Subsequently, in Section 4 we show how We model the tandem queue as a CTMC (X (t ))t0 =
to express relevant performance measures and in Section 5 we (X1 (t ), . . . , XV +W +6 (t ))t0 whose states are integer-valued vectors
present a matrix-analytic algorithm for the ecient computation of the form
of these performance measures. Numerical results for specic tan-
dem queues of the considered type are presented in Section 6. (n1 , n2 , u, v0 , v1 , . . . , vV , vV +1 , w0 , w1 , . . . , wW )
In particular, for certain choices of the arrival process, the ser- where
H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195 189
Table 1
Transitions in the CTMC model of the tandem queue.
n1 , n2 are the numbers of customers waiting at the rst and Changes of phases are quite clear. The factor vi is explained by
second station, respectively, i.e. the numbers of customers in the fact that there are vi servers operating in phase i. The factor
the buffers, wk is explained analogously.
u is the environmental state of the MAP, With rate u,u , an arrival occurs and the MAP goes to state u .
v0 , w0 are the numbers of free servers at the rst and second For v0 > 0, there is at least one idle server at the rst station. An
station, respectively, arriving customer will head for one of the idle servers and there-
vi , w j for 1 i V, j = 1, . . . , W are the number of servers op- fore decrease the number of idle servers by 1. With probability
erating in phase i at the rst station and the number of servers i , the new service starts in phase i. For v0 = 0 and n1 < K1 , all
operating in phase j at the second station, respectively, servers are busy, and arriving customers are forced to wait, that
vV +1 is the number of blocked servers at the rst station due is they enter the buffer. Hence, the number of waiting customers
to lack of buffer space at the second station. increases by 1. For n1 = K1 , all servers are busy, and there is no
buffer space available. Thus, arriving customers are lost.
The (V + W + 6 )-dimensional state space S NV +W +6 of the
If a server at the rst station operates in phase i, the service
CTMC is given by the set of all integer-valued vectors of this form
completion rate is si . The number of servers operating in phase i
subject to the conditions
is vi . Hence, the rate for service completion of one of the servers
1 u U, operating in phase i is given by vi si .
0 n1 K1 , 0 n2 K2 , If no customer waits at the rst station (n1 = 0), the number
n1 = 0 or v0 = 0, of idle servers increases by 1. Otherwise, the server starts imme-
n2 = 0 or w0 = 0, diately with the next service, the number n1 of waiting customers
vi 0 for 0 i V + 1 and wi 0 for 0 i W, decreases by 1, and the server starts in phase j with probability
v0 + v1 + + vV + vV +1 = c1 and w0 + w1 + + wW = c2 , j . For j = i, i is decreased by 1 and j is increased by 1. If
vV +1 = 0 or n2 = K2 . j = i, then i does not change. For w0 > 0, there is at least one
For computing the number of states, we remark that accord- idle server at the second station. After service completion at the
c +k rst station, customers will head for one of the idle servers at
ing to simple elementary combinatorics there are 1k partitions
the second station and the service starts in phase k with proba-
c1 = 0 + + k with 0 , . . . , k 0. By distinguishing for the rst
bility k . For w0 = 0 and n2 < K2 , all servers at the second sta-
station between n1 = 0 and n1 > 0 and for the second station be-
tion are busy and the number n2 of waiting customers at the sec-
tween n2 = 0 and 0 < n2 < K2 and n2 = K2 , we obtain the size of
ond station is increased by 1. For n2 = K2 , there is no buffer space
the state space
available at the second station. Hence, after service completion at
c1 + V 1 c2 + W 1 the rst station, customers cannot leave the server and will block
|S | = U
V 1 W 1 it. In this case, the number vV +1 of blocked servers is increased
(c + V )(c + W ) c1 + V by 1.
+ (K2 1 )
1 2
If a server at the second station operates in phase k, the service
VW V
completion rate is tk . The number of servers operating in phase
(c1 + V )(c1 + V + 1 ) c2 + W k is wk , and hence, the rate for service completion of one of the
+ + K1
V (V + 1 ) W servers operating in phase k is wk tk .
c1 + V
For n2 = 0, no customer waits at the second station. Hence, the
+ K1 (K2 1 ) + K1
V number of idle servers is increased by 1. Otherwise, the number
of waiting customers decreases by 1 and the new service starts in
c1 + V c2 + W
U K1 K2 . phase with probability . For vV +1 > 0, implying n2 = K2 , ser-
V W vice completion at the second station has impact on the rst sta-
All transitions from state (u, n1 , v0 , v1 , . . . , vV , vV +1 , n2 , w0 , w1 , tion: A customer leaves the system, service for another customer
. . . , wW ) that change this state, their effects, rates, and conditions (waiting at the second station) starts, and thus, there is a buffer
under which the transitions occur are given in Table 1. Note that space available at the second station. vV +1 > 0 implies that there
formally, all transitions in the second (third) block require vi > 0 are customers blocking servers at the rst station after service
(wk > 0). We can omit these conditions here due to the fact that completion. One of these customers will join the buffer at the sec-
the transition rates have the value 0 for vi = 0 or wk = 0. In the ond station so that still n2 = K2 . The corresponding server will be-
following we explain the transitions. come idle if there are no customers waiting at the rst station or
190 H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195
will start a new service, which then starts in phase i with proba- the higher moments. Furthermore, according to (5) and (7) also
bility i . loss probabilities and blocking probabilities can be expressed by
expectations. As we shall see, expressing and computing perfor-
4. Performance measures mance measures via expectations is in many regards much nicer
than doing so via the stationary probabilities.
In order to describe relevant steady-state queueing network An obvious approach to obtaining the performance measures is
performance measures we consider the limit X (t ) X ( ) =: X = by rst computing the stationary probability distribution (or an ap-
(X1 , . . . , XV +W +6 ) and the corresponding limiting probability distri- proximation thereof) and then using the stationary probabilities for
bution that coincides for any ergodic CTMC with the unique sta- calculating the desired expectations according to (2), (3), (4) and
tionary probability distribution. (6). However, in many cases, in particular for our tandem queue
Let (n1 , n2 , u, v0 , v1 , . . . , vV , vV +1 , n2 , w0 , w1 , . . . , wW ) denote with MAP and PH distributed service times, the whole stationary
the stationary probability of the state (n1 , n2 , u, v0 , v1 , . . . , vV , vV +1 , distribution is not relevant. More specically, the stationary prob-
w0 , w1 , . . . , wW ) and u := u1 + u2 + + uU , u = 1, . . . , U. abilities of each state are typically not of practical interest, e.g.,
For notational convenience we dene (n1 , n2 , u, v, w ) := the stationary distribution includes the probabilities that services
(n1 , n2 , u, v0 , v1 , . . . , vV , vV +1 , w0 , w1 , . . . , wW ). The stationary are in specic phases and the like. In fact, the practical interest is
k-th moments of the numbers of customers in the buffers are in marginal probabilities, loss and blocking probabilities, expecta-
E[X1k ] and E X2k , respectively, which are given by tions, higher moments as dened above, and in cumulants such as
variances, skewness, or kurtosis, all of which can be composed of
K1
K2
c1
c1
c2
c2
higher moments. Moreover, from a computational point of view, of
E X1k = nk1 (n1 , n2 , u, v, w ),
course, obtaining the whole stationary distribution requires a lot
n1 =0 n2 =0 u=1 v0 =0 vV +1 =0 w0 =0 wW =0
of storage that can easily exceed the memory resources even of
(2) todays computers.
K1
K2
c1
c1
c2
c2
E X2k = nk2 (n1 , n2 , u, v, w ), 5. Computational solution technique
n1 =0 n2 =0 u=1 v0 =0 vV +1 =0 w0 =0 wW =0
Two key points in our following analysis are:
(3)
where (n1 , n2 , u, v, w ) := 0 for (n1 , n2 , u, v, w )
/ S. Note that 1. We partition the multidimensional state space such that the
these moments in particular include the marginal probabilities (for model can be expressed as a level-dependent quasi-birth-and-
k = 0) and the expectations (for k = 1) of the numbers of cus- death (LDQBD) process, that is, a CTMC with block tridiagonal
tomers in the buffers. Furthermore, cumulants such as variances generator matrix where the matrix structure can be exploited
and skewnesses can be obtained easily from the higher moments. for ecient computational analysis.
The stationary probability that an arriving customer is lost can 2. We compute the expectations that describe the relevant per-
be expressed as the proportion of the rate of arrivals while Xt(1 ) = formance measures by a memory-ecient matrix-analytic al-
K1 and the total arrival rate. Hence, it is given by gorithm without explicitly computing the stationary distribu-
tion or a huge set of stationary probabilities. In this manner
1
K2
c1
c1
c2
c2
we avoid huge storage requirements for stationary probabilities
ploss = (K1 , n2 , u, v, w )u
n2 =0 u=1 v0 =0 vV +1 =0 w0 =0 wW =0
so that it becomes indeed possible to compute the performance
measures eciently without approximation errors.
(4)
5.1. State space partitioning
1
U
u
= E I{X1 =K1 ,X3 =u} u =E I{X1 =K1 ,X3 =u} (5)
u=1 u=1
We partition the multidimensional state space S of the CTMC
according to the number of customers in the system, that is, the
where I{ } denotes an indicator function. Similarly, the probability number of customers in both queues and all servers, which is n1 +
that a customer completing service at the rst station blocks this n2 + c1 v0 + c2 w0 . Denote by N the maximum possible number
station, because there is no free space at the second station, can of customers in the system and for all n {0, 1, . . . , N} by S (n ) the
be expressed as the proportion of the rate of service completions
set of all states in which exactly n customers are in the system.
at the rst station while Xt(2 ) = K2 and the total service completion
Hence, N = c1 + K1 + c2 + K2 and
rate at the rst station. Hence, this probability is given by
S (n ) : = {(n1 , n2 , u, v0 , v1 , . . . , vV , vV +1 , w0 , w1 , . . . , wW ) S : n1
pblock
K1 U
c1 c1 c2 c2 V + n 2 + c1 v0 + c2 w 0 = n }, (8)
n1 =0 v0 =0 . . . vV +1 =0 w0 =0 . . . wW =0 (n1 , K2 , u, v, w ) i=1 si vi
u=1
=
K1 K2 U c1 c1 c2 c2 V which implies S (i ) S ( j ) = for i = j and S = S (0 ) S (1 )
n1 =0 n2 =0 u=1 v0 =0 . . . vV +1 =0 w0 =0 . . . wW =0 (n1 , n2 , u, v, w ) i=1 si vi
S (N ) . We refer to the number of customers in the system in this
(6) context as the level and call S (n ) the nth level set. It is clear from
1 the description of the tandem queue and in particular from the
V
possible transitions of the corresponding CTMC, given in Table 1,
= E I{X2 =K2 } si Xi+4 E si Xi+4 . (7)
that any event in the tandem queue changes the number of cus-
i=1 i=1
tomers in the system (i.e. the level) by at most one and that cor-
An important observation with regard to the eciency of our respondingly any direct state transition in the CTMC is either be-
following analysis is that all these performance measures can be tween two states within the same level set or between two states
expressed conveniently in terms of expectations with respect to in adjacent (with regard to the level number) level sets. Thus, any
the stationary probability distribution, i.e. the probability vector state transition is from a state in S (i ) to a state in S ( j ) , |i j| 1.
containing all stationary probabilities of the states. The mean num- This means, according to the state space partitioning the states can
bers of customers in the queues are obviously expectations, as are be arranged such that the generator matrix of the CTMC is of block
H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195 191
tridiagonal form Since the state space S is discrete (countable) we can inter-
pret f() for any {1, . . . , m} as a column vector, partitioned
Q00 Q01
conformally with the state space partitioning, that is, f ( ) =
Q10 Q11 Q12
( f1() , . . . , fN() )T with fn() = ( fn,(1) , . . . , fn,d
( ) T
) for n = 0, 1, . . . , N,
Q =
Q21 Q22 Q23 (9) n
..
( )
.. .. where fn,i for i = 1, . . . , dn denotes the value of the function f()
. . . applied to the ith state (according to the numbering in lexico-
QN,N1 QNN graphical order) in the nth level set S (n ) . Finally, we use the nota-
di d j tion fn := ( fn(1 ) , . . . , fn(m ) ) Rdn m for n = 0, 1, . . . , N. Similarly, we
where Qi j R , i, j = 0, 1, . . . , N, contains the transition rates
consider the stationary probability distribution of the LDQBD
from states in level set i to states in level set j, and dn := |S (n ) | process partitioned conformally with the state space partitioning,
for all n {0, 1, . . . , N}. A CTMC with a block tridiagonal generator that is, = (0 , 1 , . . . , N ) with n = (n,1 , n,2 , . . . , n,dn ), n =
matrix is called a level-dependent quasi-birth-and-death (LDQBD) 0, 1, . . . , N, where n, i for i = 1, . . . , dn denotes the stationary
process. Hence, with our state space partitioning we have modeled probability of the ith state (according to the numbering in lex-
the multi-server tandem queue under consideration as an LDQBD icographical order) in the nth level set S (n ) . Then, as shown by
process. Although with regard to the block tridiagonality of the Baumann and Sandmann (2013),
generator matrix the numbering of states within the level sets can
be arbitrary, we assume that they are numbered in lexicographical
N
n 1
order. Since all transition rates are properly included in Table 1 we E[ f ] = f = n f n = 0 Rk fn (11)
omit to describe the block matrices Qij explicitly. n=0 n=0 k=0
5.2. Computing performance measures via expectations of the LDQBD = 0 ( f0 + R0 ( f1 + R1 ( f2 + R2 (. . . + RN2 ( fN1 + RN1 fN ))))) (12)
As outlined in Section 4, in principle one could compute per- where R0 , . . . , RN1 are nonnegative matrices dened by
formance measures by rst computing the stationary probability
Rn = Qn,n+1 (Qn+1,n+1 + Rn+1 Qn+2,n+1 )1 , n = 0, . . . , N 2, (13)
distribution of the underlying CTMC and then using it for com-
puting the desired performance measures from it. For our case
1
of an LDQBD process fast and numerically stable algorithms for RN1 = QN1,N QNN . (14)
computing stationary probability distributions exist, see Baumann
These equations can be derived from (backward) block-Gaussian
and Sandmann (2010, 2012); Bright and Taylor (1995), Gaver, Ja-
elimination for block-tridiagonal matrices, see Grassmann
cobs, and Latouche (1984), but they have enormous storage re-
and Heyman (1990) for details: In the reduction step, Qnn
quirements for models where the dimension is as high and the size
is replaced by Un , where Un = Qn,n+1 (Un+1 )1 Qn+1,n . By
of the state space is as huge as for the tandem queue under con-
setting Rn = Qn,n+1 (Un+1 )1 , we obtain (13) and (14). Fur-
sideration. Following our argumentation in Section 4 we instead
apply the memory-ecient method developed by Baumann and
thermore, block-Gaussian elimination guarantees n+1 =
Sandmann (2013) for computing stationary expectations without
n Qn,n+1 (Un+1 )1 = n Rn . Since the recursions for Rn and
explicitly computing the stationary probability distribution, which
n are well-established in the context of matrix-analytic methods
(see references above), we omit further details.
is as fast and numerically stable as the aforementioned algorithms
Note that the matrices Rn appear in numerical techniques for
but provides an enormous complexity reduction in that it saves
computing stationary probability distributions of LDQBD processes
a huge amount of storage. It renders possible to compute per-
where all these matrices must be stored, but when computing
formance measures of such huge models as, e.g., a multi-server
family E[f] according to (12) the term
tandem queue with Markovian arrival process, phase-type service
times, nite buffers, losses, and blocking. In fact, we can even com- Z0 := f0 + R0 ( f1 + R1 ( f2 + R2 (. . . + RN1 ( fN1 + RN1 fN )))) (15)
pute multiple performance measures simultaneously.
Assume we want to compute multiple performance measures can be computed in a Horner-type fashion starting with ZN = fN
simultaneously. Then we express all these performance mea- and repeatedly updating
sures together as one expectation family E[f] of a function f =
( f (1) , . . . , f (m) ) : S Rm where the components f (1) , . . . , f (m) are Zn = fn + Rn Zn+1 (16)
the functions whose expectations are taken to express the desired
for n = N 1, N 2, . . . , 0, so that all Rn , and all Zn are required
performance measures. For instance, if we want to compute the
only once and need not be stored for any further operation. Finally,
expected steady-state numbers of customers in the buffers, the loss
0 required in (12) for determining E[ f ] = 0 Z0 can be computed
probability and the blocking probability simultaneously, then we
as solution x to the equation
choose
x(Q00 + R0 Q10 ) = 0, (17)
U
u
V
V
f = X1 , X2 , I{X1 =K1 ,X3 =u} , si Xi+4 , I{X2 =K2 } si Xi+4 (10)
u=1
i=1 i=1
which is unique up to a multiplicative constant c given by the
normalization condition that the sum of all state probabilities
where the rst three components of family E[f] directly yield the must be one. In order to obtain the normalization constant with-
expected numbers of customers in the rst and second buffer, re- out explicitly computing all stationary probabilities the func-
spectively, and the loss probability. The blocking probability can tion f is extended to f = ( f (0 ) , f (1 ) , . . . , f (m ) ) where f(0) is the
be composed by the fourth and fth component according to for- constant function that takes the value one for all states such
mula (7). Similarly, if we additionally want to compute the steady- that the corresponding column vector representation is f n(0 ) =
state variances of the numbers of customers in the buffers we can (1, 1, . . . , 1 )T Rdn . Then the normalization condition can be ex-
extend the function f by the components X12 and X22 , and com- pressed as E[ f (0 ) ] = 1 and for f a vector Z = (z(0 ) , z(1 ) , . . . , z(m ) ) =
pose the variances as Var[Xi ] = E[Xi2 ] E[Xi ]2 , i = 1, 2. Of course, cE[ f] is obtained. Thus, the expectation of f, that is, the vector con-
the same principle works for arbitrary cumulants, all of which can taining all expectations needed for the performance measures that
be composed by moments of appropriate orders. we consider simultaneously, is given by
192 H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195
In this section, we present numerical results for performance 6.2. Base parameter setting
measures of specic MAP/PH/c1 /c1 + K1 /PH/c2 /c2 + K2 tandem
queues. We emphasize that with the approach presented in the If not indicated otherwise (by the corresponding variation of
previous section, one can conduct detailed studies of any such tan- parameters),
dem queue, but we have to restrict the presentation of numerical the arrival process is a MMPP with U = 2, average arrival rate
results to selected choices of MAPs and PH distributions. In par- = 19 and squared cov 1.2 for the interarrival times.
ticular, we study in detail the expectations of the numbers of cus- the service at each server at the rst station is Erlang dis-
tomers in the queues, loss and blocking probabilities, where we tributed with 2 phases and expected service time 1/7.
vary the buffer sizes as well as the respective parameters of the there are c1 = 3 servers at the rst station.
MAPs and the PH distributions considered. the waiting capacity at the rst station is K1 = 10.
the service at each server at the second station is Erlang dis-
6.1. Specication of the service time distributions and the arrival tributed with 2 phases and expected service time 1/5.
process there are c2 = 4 servers at the second station.
the waiting capacity at the second station is K2 = 10.
For the service times, we consider Erlang and hyperexponen-
tial distributions, which are discussed extensively in, e.g., Morse 6.3. Variation of the arrival rate
(1958), Neuts (1981), so that here it is sucient to summarize the
most important properties. In our rst variation of parameters, we vary the arrival rate
An Erlang distribution with n phases and parameter n is a at the rst station, while keeping the other parameters xed as
PH( , B) distribution with = (1 0 . . . 0 ) and B = (bi j ) where specied above.
H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195 193
As can be seen in Fig. 2, unsurprisingly, with increasing arrival rithm in order to nd, e.g., buffer sizes that yield acceptable block-
rate, the expected numbers of customers in both stations (includ- ing and loss probabilities.
ing those in service) as well as the probabilities of loss and block-
ing increase. However, in particular for the respective probabili- 6.4. Variation of the buffer sizes
ties we observe that the slopes of the functions are quite different.
Due to an increasing arrival rate the probability of a full buffer at While keeping all other parameters xed as specied before, we
the rst station, which causes losses of newly arriving customers, vary the buffer size either at the rst or at the second station.
grows quickly and correspondingly the loss probability does. For As depicted in Fig. 3, for increasing buffer size K1 at the rst
the blocking probability the increase is slower, because blocking station, the expected number of customers in the rst station in-
only occurs if upon service completion at the rst station there is creases, and the probability of loss decreases. Up to some value of
no buffer space available at the second station. This implies that the buffer size K1 at the rst station more customers are served
even if the rst station is highly saturated (i.e. all servers are al- and intended to proceed to the second station, and hence, the
most always busy), it can, of course, not serve faster than at its blocking probability and the expected number of customers at the
overall (multi-server) service rate. Thus, for the departure rate out second station increase. However, similarly as for the variation in
of the rst station there is a maximum and if this is reached then the arrival rate, when the rst station becomes more and more sat-
a further increase in the arrival rate at the rst station still causes urated it cannot serve on a higher rate than its overall service rate
more and more losses but the departure rate out of the rst station such that the curves showing the number of customers at the sec-
does not increase more and more. Therefore, at a certain value of ond station and the blocking probability become atter and at-
the arrival rate the rst station becomes a serious bottleneck im- ter, which means both the number of customers at the second sta-
plying that the loss probability grows further while the blocking tion and the blocking probability approach a respective nite max-
probability remains almost constant. imum.
Clearly, in practice, for instance when dimensioning a system Fig. 4 shows that for increasing buffer size K2 at the second
with multi-server nite-buffer tandem queue structure this effect station, up to some value of K2 the expected number of customers
can be taken into account using numerical studies with our algo- in the second station increases and then approaches some nite
194 H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195
Fig. 5. Performance measures for various values of the squared cov of the interarrival times.
constant value, and the probability of blocking decreases, nally for c2 = 1, we use a Poisson arrival process (that is, exponentially
approaching zero. More precisely, for K2 there is always distributed interarrival times), for c2 < 1, we use PH renewal pro-
enough buffer space available at the second station such that there cesses with Erlang distributed interarrival times, and for c2 > 1, we
is no blocking of servers at the rst station. Besides, the arrival rate use second-order MMPPs as arrival processes. Note that this choice
at the second station cannot exceed the arrival rate at the rst sta- is not unique but our general framework of MAPs is very exible.
tion. Thus, for K2 the second station becomes essentially a For instance, an alternative choice for obtaining interarrival times
stable innite capacity queue such that the expected number of with c2 > 1 would be a PH-type renewal process with hyperexpo-
customers converges to a nite value. As for the rst station, due nentially distributed interarrival times.
to fewer customers blocking the servers there, the expected num- Fig. 5 shows the performance measures considered before, now
ber of customers at the rst station and the loss probability de- as a function of the squared cov of the interrarival times accord-
crease with increasing K2 . For K2 , the rst station behaves ing to the just described choices. Clearly, for increasing cov, the
like a MAP/PH/c1 /c1 + K1 queue without blocking, that is, the ex- arrival process becomes more random, and the probability of loss
pected number of customers in the rst station and the probability at the rst station increases. Up to some value of the (squared) cov
of loss converge to some nite value. c2 the expected number of customers in the rst station increases,
too, but then slowly decreases again. This observation can be ex-
6.5. Variation of coecients of variation plained by the fact that the arrival rate is constant and the loss
probability increases with increasing cov, which implies that fewer
One great advantage of our model is that we can vary arrival customers enter the system. At the second station, for increasing
or service rate and higher moments of the interarrival or ser- c2 , the expected number of customers decreases. Again, this is due
vice times, respectively, separately. We vary the squared cov c2 to the fact that fewer customers enter the system. Hence, it seems
for the interarrival times, keeping all other parameters, in partic- to make sense that the blocking probability should decrease, too.
ular the arrival rate, xed as specied in Section 6.2 by exploit- However, this is not true for very small values of the cov. Any-
ing that MAPs include as special cases many classes of arrival pro- way, as can be seen in the respective curve in the picture on the
cesses with specic interarrival time distributions. In our studies, left hand side of Fig. 5, the impact of the cov on the blocking
H. Baumann, W. Sandmann / European Journal of Operational Research 256 (2017) 187195 195
probability is quite weak, that is the blocking probability remains in a next step the approach may be extended to series of multi-
almost constant and also for increasing cov decreases only very server queues with MAP and PH distributed service times and/or
slowly such that the effect of the cov is negligible. However, we to respective models with innite buffers. In order to cope with
emphasize that the inuence of the cov on the expected number such complex models it is planned to combine the matrix-analytic
of customers (in particularly, in the second station), and on the loss algorithm used in the present paper with state space truncation
probability is quite strong. methods similar to those in Baumann and Sandmann (2015) in or-
der to develop an approximate analysis with a priori error bounds.