On Two-Level State-Dependent Routing Polling Systems with Mixed Service

Motivated by priority differentiation and system efficiency, we consider a single-server two-level polling system with N + 1 queues, consisting of one key queue and N normal queues. The novel contribution of the present paper is that the server polls only active queues, that is, queues with customers waiting. Furthermore, the key queue receives exhaustive service and the normal queues receive 1-limited service under a parallel scheduling scheme. For this model, we derive an expression for the probability generating function of the joint queue length distribution at polling epochs. Based on these results, we derive explicit closed-form expressions for the mean waiting time. Numerical examples show that the theoretical and simulation results agree and that the new system is efficient at both the key queue and the normal queues.


Introduction
In this paper, we study a class of polling systems with $N+1$ queues, consisting of one key queue, $Q_h$, and $N$ normal queues, $Q_1, Q_2, \ldots, Q_N$, which are attended by a single server. Polling systems have attracted extensive attention in recent years because of their wide range of applications in communication networks, production, and transportation. Excellent surveys on the analysis of polling systems and their applications may be found in [1][2][3][4]. However, many studies in the literature assume that the server visits the queues in a fixed, cyclic order. This might not be a realistic assumption: queues might have different priority levels, so that queues with high priority should be visited more frequently than those with lower priority, and queues might sometimes be empty, in which case there is no need to visit them. We therefore study the case where the server visits only active queues, that is, queues with customers. As a consequence, by skipping the empty queues, the server can provide more visit opportunities to the active queues. Furthermore, processing the service period and the switch-over period in parallel allows successive service of two active queues without an intervening switch-over time. To provide priority-differentiated service, the queues are separated into one key queue and $N$ normal queues. A two-level routing order and a mixed service scheme are used to give high priority to the key queue.
It is observed that, in the wide body of literature on polling systems, hardly any studies consider queue state-dependent routing and service priority simultaneously. The reason may be that the analysis of a state-dependent routing polling model is much more complex than that of a cyclic polling model, especially for priority-differentiated models. In particular, waiting time and queue length analyses of two-level priority polling systems can be found in [5][6][7], in which the server visits the queues along a two-level route; that is, the server polls the key queue with the exhaustive scheme after each gated service of a normal queue [5]. This work is extended in [6] by assigning the 1-limited service discipline to the normal queues. More recently, Yang et al. used exhaustive service for the normal queues and gated service for the key queue to ensure fairness, but obtained only first-moment performance measures of the system, namely, the mean queue length at the polling epoch and the mean cycle time [7]. The parallel discipline is used to improve the delay performance in [8]: when the currently polled queue has customers in storage, the server provides service while simultaneously switching to the next queue and begins to serve the successor once it finishes the service of the current one. This scheme can improve polling efficiency in high-traffic cases. However, the parallel mechanism is ineffective when there is no customer in the queue. In low-traffic cases, useless polling of idle queues becomes an obvious liability in the cyclic polling model. Routing that depends on whether a queue is empty or not is helpful for this problem [9]. In this paper, we consider a two-level mixed service polling model in which the key queue is served exhaustively while the normal queues are served under the 1-limited mechanism. Furthermore, the server no longer checks all the stations in a fixed order; only active stations with transfer requirements are served, and the switch-over period and the service period are processed in parallel. This mechanism increases the system utilization and reduces the mean waiting time.
Although the exhaustive service discipline in principle satisfies the branching property, the present model involves the 1-limited service discipline, which does not. Explicit analyses of nonbranching service disciplines are mostly restricted to special settings; for example, [10,11] study two-queue polling systems and [12] studies a symmetric 1-limited model. In this paper, we follow the special setting in [8] and analyze the mean waiting time of the present model under the assumption that the normal queues are symmetric, as will be described in greater detail in Section 2.
Initially, we follow an approach similar to the analysis of [5], which uses a recursive iteration of a functional equation, for the probability generating function (PGF) of the joint queue-length distribution at moments the server starts a visit period.
The main contributions of this paper can be summarized as follows. Firstly, we extend the parallel two-level polling system in [8] by using queue state-dependent routing, in which only active queues with customers are visited by the server. This scheme avoids the overhead induced by idle visits. Secondly, under the assumption of a stable system, we obtain explicit expressions for the PGF of the joint queue length distribution at the polling epochs of the key queue and of the normal queues separately. Thirdly, we obtain an exact closed-form expression for the mean waiting time under the assumption that the normal queues are symmetric.
The rest of the paper is structured as follows. In Section 2, we give a formal description of the polling model that we study and introduce the necessary notation. Based on this, in Section 3, we derive the expressions for the mean waiting time of the present model under the assumption of a stable semisymmetric system (that is, one with symmetric normal queues), taking a functional equation for the PGF of the joint queue length distribution at polling epochs as a starting point. In Section 4, numerical results obtained with the proposed analytical models are shown and their very good agreement with simulation results is discussed. Finally, concluding remarks and directions for future research are given.

Model Description
Consider a discrete-time polling system (the timeline is divided into time slots) consisting of $N+1$ ($N \geq 2$) infinite-buffer queues $Q_1, Q_2, \ldots, Q_N$, and $Q_h$. The single server visits active queues in a two-level state-dependent routing order and serves the customers with a mixed service discipline.
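In the arrival process, type-$i$ ($i = 1, 2, \ldots, N, h$) customers arrive at $Q_i$ according to an independent Poisson arrival process. The generating function of the arrival process in queue $i$ is $A_i(z_i)$, and the variance of the arrival process follows from the derivatives of $A_i(z_i)$ at $z_i = 1$. We propose a two-level server routing that makes the high-priority queue be visited more frequently than the others, and we add a mixed service discipline to ensure the high priority of $Q_h$. The load offered to $Q_i$ is $\rho_i = \lambda_i \beta_i$, the product of the arrival rate and the mean service time, and the total offered load is equal to $\sum_{i=1}^{N} \rho_i + \rho_h$.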
State-Dependent Routing. Queues are partitioned into active queues and idle queues according to their buffer state. Only active queues, with customers waiting in the buffer, are visited by the server in order. Idle queues, with empty buffers, are skipped in the current polling round.
Two-Level Polling. The server visits the queues according to a two-level routing. In the first polling level, the server polls between the high-priority queue $Q_h$ and an active normal queue; in the second level, after each exhaustive service at $Q_h$, one active normal queue is visited in cyclic order. That is, the server alternates between $Q_h$ and the next active normal queue in cyclic order.

In the switch-over process, a parallel mechanism is used. When the server polls an active queue with customers in its buffer, the server provides service and inquires the next active queue simultaneously, and then switches to serve the successor immediately, without any switch-over time, once it has finished the current service. Combined with the state-dependent routing scheme, over the course of a visit period, the server serves the active key queue and normal queues in sequence continuously until the entire system is empty; no switch-over time is consumed in the present model. In particular, we assume that the server consumes one time slot to confirm the system state when the system is entirely empty.
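The routing rule described above can be illustrated with a minimal sketch. It assumes a frozen snapshot of the buffers with no further arrivals, uses 0-based indices for the normal queues and the label "h" for the key queue, and the function names `next_active` and `drain_order` are hypothetical, introduced only for this illustration.

```python
def next_active(lengths, start):
    """Index of the first non-empty normal queue at or after `start`
    in cyclic order, or None if every normal queue is idle."""
    n = len(lengths)
    for k in range(n):
        i = (start + k) % n
        if lengths[i] > 0:
            return i
    return None


def drain_order(normal, key):
    """Visit order until the system is empty, assuming no new arrivals.

    normal: queue lengths of the normal queues (1-limited service)
    key:    queue length of the key queue (exhaustive service)
    """
    normal = list(normal)          # work on a copy
    order, pointer = [], 0
    while True:
        if key > 0:                # key queue is active: serve it exhaustively
            order.append("h")
            key = 0
        i = next_active(normal, pointer)
        if i is None:              # no active normal queue left
            break
        normal[i] -= 1             # 1-limited: serve exactly one customer
        order.append(i)
        pointer = (i + 1) % len(normal)
    return order


print(drain_order([2, 0, 1], 3))
```

In this snapshot the idle normal queue (index 1) is never visited, and the key queue is skipped once it is empty, which is the state-dependent behaviour the text describes.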
Mix-Service Discipline. The exhaustive discipline is specified for the key queue and the 1-limited discipline for the normal queues, so that all customers in the key queue are served in the present server round, while those in the normal queues might need several rounds when there is more than one customer in the buffer. Let $F_h$ denote the duration of the service period for the customers that arrive during an arbitrary time slot in $Q_h$. This service period consists of the services of the ancestral customers arriving during that slot and the services of the offspring of these ancestral customers [13]. The generating function of $F_h$ satisfies a functional equation that has already been derived in [14].

In the remainder of this paper, we are interested in the queue length distributions at the polling epochs of $Q_i$ and $Q_h$. Let $\xi_j(n)$ denote the number of customers present at $Q_j$ at time $t_n$, when the server starts a visit period at $Q_i$, and let $\xi_j(n^*)$ denote the number of customers present at $Q_j$ at time $t_n^*$, when the server starts a visit period at $Q_h$ immediately after the service of $Q_i$. The joint distributions of $\xi_j(n+1)$ and $\xi_j(n^*)$ are represented by the $(N+1)$-dimensional PGFs $G_{i+1}(z_1, \ldots, z_N, z_h)$ and $G_{ih}(z_1, \ldots, z_N, z_h)$.
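As a worked illustration of the ancestor and offspring argument behind the exhaustive service period, the following sketch derives its mean. The notation is assumed for this example only: $\lambda_h$ is the mean number of arrivals per slot at the key queue, $\beta_h$ the mean service time in slots, $\rho_h = \lambda_h \beta_h < 1$, and $\theta_h$ (a hypothetical symbol) the busy period started by a single customer. It is a standard branching argument, not a reproduction of the functional equation in [14].

```latex
% One customer's busy period: its own service, plus one sub-busy period
% for each of the (on average) \lambda_h \beta_h arrivals during that service.
\begin{align}
E[\theta_h] &= \beta_h + \lambda_h \beta_h \, E[\theta_h]
  \quad\Longrightarrow\quad
  E[\theta_h] = \frac{\beta_h}{1 - \rho_h}, \\
% Service period generated by the arrivals of one slot:
% on average \lambda_h ancestors, each starting an independent busy period.
E[F_h] &= \lambda_h \, E[\theta_h] = \frac{\rho_h}{1 - \rho_h}.
\end{align}
```

For instance, with $\lambda_h = 0.1$ and $\beta_h = 1$, the mean service period generated by one slot of arrivals would be $0.1 / 0.9 \approx 0.11$ slots under these assumptions.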
We analyze the system under the stability condition $\sum_{i=1}^{N} \rho_i + \rho_h < 1$ [12]. The normal queues in the present model are served in a 1-limited manner, which does not satisfy the well-known branching property of polling systems. Therefore, in the analysis of the mean waiting time, we assume that the normal queues are symmetric; that is, the normal queues have the same customer arrival rate and service rate.
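The stability condition for the symmetric case can be checked with a small helper. This is a sketch under the assumption that the load of each queue is the product of its arrival rate and mean service time; the function and parameter names are hypothetical.

```python
def offered_load(lam, beta, n_normal, lam_h, beta_h):
    """Total offered load with n_normal symmetric normal queues.

    lam, beta:     arrival rate and mean service time of each normal queue
    lam_h, beta_h: arrival rate and mean service time of the key queue
    """
    return n_normal * lam * beta + lam_h * beta_h


def is_stable(lam, beta, n_normal, lam_h, beta_h):
    """True when the total offered load is strictly below 1."""
    return offered_load(lam, beta, n_normal, lam_h, beta_h) < 1.0


# Example: five normal queues plus the key queue.
print(offered_load(0.1, 1.0, 5, 0.1, 1.0), is_stable(0.1, 1.0, 5, 0.1, 1.0))
```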

Analysis for Steady-State Systems
In this section, we derive explicit expressions for the joint queue length distribution. In Section 3.1, we first obtain expressions for $G_{i+1}(z_1, \ldots, z_N, z_h)$ and $G_{ih}(z_1, \ldots, z_N, z_h)$, the joint queue length PGFs at the polling epochs of $Q_{i+1}$ and $Q_h$. In Section 3.2, these results lead to the first and second moments of the PGFs, from which we obtain expressions for $E[W_i]$ and $E[W_h]$, the mean waiting times of type-$i$ and type-$h$ customers that arrive at an arbitrary point in time.

Joint Queue Length Distribution at Polling Epoch.
Assuming that the server begins the service of $Q_i$ at $t_n$, define a random variable $\xi_j(n)$ as the number of type-$j$ ($j = 1, 2, \ldots, N, h$) customers at time $t_n$. Then the status of the entire polling model at time $t_n$ can be represented as $\{\xi_1(n), \ldots, \xi_N(n), \xi_h(n)\}$. Denote by $\xi_j(n+1)$ the number of type-$j$ customers at $t_{n+1}$, the polling epoch of $Q_{i+1}$. The status of the entire polling model at time $t_{n+1}$ can be represented as $\{\xi_1(n+1), \ldots, \xi_N(n+1), \xi_h(n+1)\}$, while $\xi_j(n^*)$ is the number of type-$j$ customers at time $t_n^*$, at which the server begins providing service to $Q_h$, and the status of the entire polling model at time $t_n^*$ can be represented as $\{\xi_1(n^*), \ldots, \xi_N(n^*), \xi_h(n^*)\}$. Under the necessary and sufficient condition for the stability of the system, $\sum_{i=1}^{N} \rho_i + \rho_h < 1$, the probability distribution is defined as:

The generating functions at $t_n$ and $t_n^*$ are:

According to the proposed mechanism, the system variables satisfy the following equations. When the server begins the service on $Q_{i+1}$ at $t_{n+1}$, we have:

Here $v_h(n)$ is the service time in $Q_h$ and $\eta_j(v_h)$ is the number of arrivals to $Q_j$ during $v_h(n)$.
The server has just finished the service of $Q_h$ in an exhaustive manner and starts polling $Q_{i+1}$ at $t_{n+1}$. Such a functional equation for exhaustive service has already been derived in [12]. Applying these results to our case, we obtain:

The expression can be interpreted as follows. At the start of the visit period at $Q_{i+1}$, the type-$j$ customers are those present at the polling epoch of $Q_h$ plus the new customers that arrived at each queue during the exhaustive service period of $Q_h$, and no type-$h$ customer remains at that moment.
When the server begins the service on $Q_h$ at $t_n^*$, we have:

Here $v_i(n)$ is the service time in $Q_i$, and $\eta_j(v_i)$ is the number of arrivals to $Q_j$ during $v_i(n)$.
In our case, for the normal queues, the server polls only the active queues with customers, in a parallel 1-limited manner. To gain more insight into the state-dependent service discipline, let $L_i$ denote the queue length at the service epoch in an M/G/1 queue with the same arrival process and service-time distribution as $Q_i$. We assume that $k$ customers are waiting in $Q_i$ at the start of the busy period with probability $p_k \in [0, 1)$, $\sum_{k=0}^{\infty} p_k = 1$. Then we can obtain the queue length generating function at the service epoch as:

where $A_i(z_i)$ is the PGF of the arrival process as defined in Section 2. Specifically, the server does not provide service when the queue length is zero, so we assume that $k^*$ customers remain after the end of the busy time in 1-limited service with probability $p^*_k \in [0, 1)$, and $p^*_k = p_{k+1}$ for $k = 0, 1, \ldots$. Consequently, the probability space can be rebuilt as:

With the definition of $A_i(z_i)$, we have:

Applying these results to our case, we obtain:

The expression can be interpreted as follows. At the start of the visit period at $Q_h$, in the case that the preceding queue $Q_i$ is active, one type-$i$ customer has been served by $t_n^*$, and new customers have arrived at each queue during the service period of that type-$i$ customer. The server skips $Q_i$ and moves on to $Q_{i+1}$ when $Q_i$ is empty; in that case, the distribution of the number of customers in the system is represented by the generating function $G_i(\mathbf{z})|_{z_i = 0}$, except when the system is entirely empty, which is represented by the generating function $G_i(\mathbf{z})|_{\mathbf{z} = \mathbf{0}}$. When the system is entirely empty, the server stops providing service for one time slot, during which new customers may arrive, and the number of these customers is represented by the last term of the sum.
$G_{i+1}(\mathbf{z})$ is the PGF of the joint queue length at the polling epoch of $Q_{i+1}$, so we have:

Taking the $k$th derivative with respect to $z_i$ yields:

Setting $z_i = 0$ yields:

Rearranging terms and setting $k = 0$, we have:

Extending this result, we have:

Here $\mathbf{0}$ is the $1 \times (N+1)$ vector of zeros, and $\mathbf{1}_i$ is the $1 \times (N+1)$ vector with 0 in the $i$th position and 1 in all other entries. Define the first derivatives of $G_i(\mathbf{z})$ and $G_{ih}(\mathbf{z})$ at $\mathbf{z} = \mathbf{1}$ as:

Calculating $\sum_{i=1}^{N} g_{i+1}(j)$ yields:

Define the second derivatives of $G_i(\mathbf{z})$ and $G_{ih}(\mathbf{z})$ at $\mathbf{z} = \mathbf{1}$ as:

Substitute (4) and (8) into the above second-derivative formulas.
We assume the $N$ normal queues are symmetric; that is, $\lambda_i = \lambda$, $\beta_i = \beta$, $i = 1, 2, \ldots, N$. Simplifying, we obtain the second derivatives of $G_i(\mathbf{z})$ and $G_{ih}(\mathbf{z})$ at $\mathbf{z} = \mathbf{1}$ as follows:

Remark 1. Although $g_i(j)$ is by definition the first derivative of $G_i(\mathbf{z})$ at $\mathbf{z} = \mathbf{1}$, it contains the second-moment parameters $A''_i(1)$ and $B''_i(1)$. So $g_i(j)$ is a second-moment parameter for the system performance.
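The role of first and second derivatives of a PGF at 1 can be made concrete with a short worked example. The sketch below uses a generic nonnegative integer random variable $X$ with PGF $A(z)$ and, as an assumed illustration, a Poisson number of arrivals per slot with mean $\lambda$; these are textbook identities rather than equations taken from the paper.

```latex
% Moments from derivatives of a PGF A(z) = E[z^X] evaluated at z = 1.
\begin{align}
A'(1)  &= E[X], \qquad
A''(1) = E[X(X-1)], \\
\operatorname{Var}(X) &= A''(1) + A'(1) - \bigl(A'(1)\bigr)^2 .
\end{align}
% Illustration: Poisson arrivals per slot with mean \lambda.
\begin{align}
A(z) = e^{\lambda (z-1)}
\;\Longrightarrow\;
A'(1) = \lambda, \quad
A''(1) = \lambda^2, \quad
\operatorname{Var}(X) = \lambda^2 + \lambda - \lambda^2 = \lambda .
\end{align}
```

This is why a quantity defined through a first derivative of a joint PGF can still depend on second-moment parameters of the arrival and service processes once the functional equations are differentiated.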

Analysis of $E[W_h]$ and $E[W_i]$.
Define $W_h$ and $W_i$ as the waiting times of type-$h$ and type-$i$ customers, that is, the time from the epoch at which a customer arrives at the queue to the time at which it starts to be served. In the present model, high-priority type-$h$ customers receive exhaustive service and normal type-$i$ customers receive 1-limited service. Based on the related work in [14], the mean waiting time $E[W_h]$ of type-$h$ customers and the mean waiting time $E[W_i]$ of type-$i$ customers can be calculated as follows:

Substituting (17) and (22) into (24) yields the mean waiting time of type-$h$ customers, and substituting (17), (22), and (23) into (25) yields the mean waiting time of type-$i$ customers, given as (27).

Numerical Study
In this section, we study the accuracy of the theoretical analysis and compare the mean waiting time of the present model with that of two existing two-level polling models. From Figure 1, we can clearly see, firstly, that the theoretical values and the simulation results coincide. Secondly, as the total offered load grows with the arrival rate, the service time, or the number of queues, the mean waiting time in $Q_i$ increases distinctly, whereas the performance in $Q_h$ is much better: both the queue length and the mean waiting time are much lower than in the normal queues, and the growth in $Q_h$ with the total offered load is much smoother.
It is worth considering whether the state-dependent mechanism improves the performance of the system compared with the existing two-level polling systems. To answer this question, in Figure 2 we compare the present model with a classical two-level system with switch-over time [6], abbreviated as the classical system, and a parallel two-level system [8], abbreviated as the parallel system. The service discipline in the comparisons is 1-limited service for the normal queues and exhaustive service for the key queue. All models use the same test bed, as shown in Table 1; we vary only the working mechanism.
Figure 2 shows the mean waiting time of the normal queues in (a) and the mean waiting time of the key queue in (b). Compared with the two existing systems, the state-dependent system achieves better performance in terms of delay guarantee and stability. Figure 2(a) shows that, under low load, the buffers are empty most of the time; a switch-over time is then still incurred whenever the server switches between $Q_i$ and $Q_h$ in the classical and parallel systems, whereas the empty queues are skipped in the present model. Therefore, customers in the state-dependent system achieve a lower mean waiting time, which is under 20% of that in the other two systems. Under heavy traffic, the server in the classical system cannot provide service during the necessary switch-over time; consequently, the classical system becomes unstable when the arrival rate of $Q_i$ grows beyond 0.06 in this case. The parallel system and the state-dependent system are more stable; in the state-dependent system in particular, the mean waiting time of the normal customers is less than 50% of that in the parallel system. Comparing Figures 2(a) and 2(b), we conclude that, for all three two-level models, the mean waiting time of the customers in the key queue is significantly lower than that in the normal queues, and, as illustrated in Figure 2(b), the mean waiting time of type-$h$ customers in the state-dependent system is lower than in the other two systems.

Conclusion
Compared with the existing literature, the contribution of the present paper is twofold. One of the most striking differences is that the queues are partitioned into active queues and idle queues according to their buffer state, and only active queues, with customers waiting in the buffer, are visited by the server in a two-level order. As illustrated in the numerical examples, both type-$i$ customers in the normal queues and type-$h$ customers in the key queue achieve better delay performance than those in systems without queue-state differentiation. Another notable contribution of the paper is that we obtain exact closed-form expressions for the mean waiting time of customers in the normal queues and in the key queue, under the assumption that the normal queues are symmetric. The unknowns in these equations are all first moments of random variables and, thus, no correlation terms are required.

Figure 1: Theoretical and simulation values of $E[W_h]$ and $E[W_i]$ for different values of the total offered load. (a) The total offered load increases with the number of normal queues. (b) The total offered load increases with the arrival rate of $Q_h$. (c) The total offered load increases with the arrival rate of $Q_i$. (d) The total offered load increases with the service time of $Q_h$. (e) The total offered load increases with the service time of $Q_i$.

Figure 2: Comparison of the mean waiting time among the classical two-level system [6], the parallel two-level system [8], and the state-dependent two-level system. (a) Theoretical values of $E[W_i]$ as the arrival rate in $Q_i$ grows. (b) Theoretical values of $E[W_h]$ as the arrival rate in $Q_i$ grows.
Consider a model with $N + 1$ queues, with one high-priority queue $Q_h$ and $N$ normal queues $Q_i$ ($i = 1, \ldots, N$), defined as follows: the service times of all customers are exponentially distributed with mean $\beta$ in $Q_i$ and $\beta_h$ in $Q_h$. The arrival processes are Poisson processes with rate $\lambda$ in $Q_i$ and $\lambda_h$ in $Q_h$. The relevant parameter values are listed in Table 1, in which $\{a : b : c\}$ means that the parameter is varied between $a$ and $c$ in steps of $b$.
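A simulation of the kind used for such comparisons can be sketched as follows. This is a minimal continuous-time sketch, not the authors' simulator: it assumes Poisson arrivals, exponential service times, symmetric normal queues, no switch-over time, and, as a simplification of the one-slot idle check, an empty system that simply waits for the next arrival. The function name `simulate` and all parameter names are hypothetical.

```python
import random
from collections import deque


def simulate(n_normal, lam, beta, lam_h, beta_h, horizon, seed=0):
    """Return (mean wait in key queue, mean wait in normal queues)."""
    rng = random.Random(seed)
    rates = [lam] * n_normal + [lam_h]      # last index is the key queue
    means = [beta] * n_normal + [beta_h]    # mean service times
    key = n_normal
    queues = [deque() for _ in rates]       # stored values are arrival times
    next_arrival = [rng.expovariate(r) for r in rates]
    waits = [[] for _ in rates]
    t, pointer = 0.0, 0

    def admit(now):
        # Move every arrival that has occurred up to `now` into its queue.
        for q, r in enumerate(rates):
            while next_arrival[q] <= now:
                queues[q].append(next_arrival[q])
                next_arrival[q] += rng.expovariate(r)

    def serve(q):
        # Serve the head-of-line customer of queue q and advance the clock.
        nonlocal t
        waits[q].append(t - queues[q].popleft())
        t += rng.expovariate(1.0 / means[q])
        admit(t)

    while t < horizon:
        admit(t)
        served = False
        while queues[key]:                  # exhaustive service of the key queue
            serve(key)
            served = True
        for k in range(n_normal):           # next active normal queue, cyclic
            q = (pointer + k) % n_normal
            if queues[q]:
                serve(q)                    # 1-limited: one customer only
                pointer = (q + 1) % n_normal
                served = True
                break
        if not served:                      # system empty: wait for an arrival
            t = min(next_arrival)

    def mean(xs):
        return sum(xs) / len(xs) if xs else 0.0

    normal_waits = [w for q in range(n_normal) for w in waits[q]]
    return mean(waits[key]), mean(normal_waits)


w_h, w_i = simulate(5, 0.1, 1.0, 0.1, 1.0, horizon=20000.0, seed=1)
print(f"key queue: {w_h:.3f}, normal queues: {w_i:.3f}")
```

The parameter values in the usage line are illustrative and are not taken from Table 1. Estimates from a single run are noisy, so several seeds or a longer horizon would be needed before comparing them with closed-form values.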

Table 1: Test bed used to compare the mean waiting time.