1.Introduction
Priority disciplines are an important scheme for service systems to differentiate their services for different classes of customers. For this reason, many telecommunication, computer, and production systems employ various priority disciplines. Priority queueing models are typically utilized to analyze the performance of systems under various priority disciplines, and there have been a number of studies on priority queueing models [9, 10, 12, 21, 22, 24].
There are two fundamental classical priority disciplines [23] : One is the preemptive priority discipline, under which customers with high priority can interrupt the service of customers with lower priority, and the other is the nonpreemptive priority discipline, under which, once the service of customers with low priority has been started, it is not interrupted even though there are customers with high priority arriving in the system during the service. However, these two priority disciplines have their own drawbacks. Under the preemptive priority discipline, the quality of service (QoS) for customers with low priority may be severely degraded especially when preemption frequently occurs and the service interrupted has to be completely repeated. On the other hand, under the nonpreemptive priority discipline, the QoS for customers with high priority may be severely degraded especially when the service time of customers with low priority is relatively long and significantly varies in length.
In order to overcome these drawbacks of the classical priority disciplines, several hybrid-type priority disciplines have been introduced [1~4, 6~8, 11, 13~16, 18~20, 22~24]. See [13~16] for a brief review on these hybrid-type priority disciplines. The (N, n)-preemptive priority discipline is one of these hybrid-type priority discipline [14]. Under this discipline, the decision on whether or not to preempt the service of customers with low priority depends on the number of customers with high priority present in the system. Once the service of customers with low priority has started, it is preempted only when the number of customers with high priority reaches or exceeds a certain threshold N, N ≥ 1; the interrupted service is restored when the number of customers with high priority shrinks to another certain threshold n, 0 ≤ n ≤ N - 1. The advantage of the (N, n)-preemptive priority discipline is that it is easier than other hybrid-type priority disciplines to control the QoS for customers with high priority in a certain bound, regardless of the arrival and service characteristics of customers with low priority [14].
This paper extends Kim’s result [14] to a different variant of the (N, n) preemptive priority discipline. While Kim [14] assumes that the interrupted service is retained and resumed when the service is available again for the preempted lowpriority customer (which will be called the (N, n)-preemptive resume (PR) priority discipline) or it is repeated with the identical service time (which will be called the (N, n)-preemptive repeat-identical (PRI) priority discipline), this paper assumes that the interrupted service of the preempted low-class customer is repeated with a different service time from the same service distribution, which will be called the (N, n)- preemptive repeat-different (PRD) priority discipline). The (N, n)-PRD discipline is a natural extension of the classical preemptive repeat-different discipline and its variant [6, 17, 25] to the (N, n)-preemptive discipline. There are two contributions which extended Kim’s original (N, n)-preemptive models [18, 19] to discrete-time queueing models. However, these contributions only considered the models with geometric service times, where there are no differences between (N, n)-PR, PRD, and PRI disciplines due to the so-called memoryless property of the service time distribution. Here, we consider the models with general service-time distributions, and the types of the way of restoring interrupted services have different impact on the system overall performance.
This paper is organized as follows : In Section 2, we define the mathematical model considered in this paper; In Section 3, we analyze the effective service time structure of a low-priority customer. In Section 4, we derive the queuelength distributions for high-priority and low-priority customers. In Section 5, we present some numerical examples by using the result obtained in Sections 3 and 4. In Section 6, we conclude the paper with some comments.
2.Model
We consider the following M/G/1 queueing model under the (N, n)-preemptive repeat-different preemptive discipline : There are two classes of customers and class-i, i = 1,2, customers arrive at the system according to a Poisson process with rate λi. The total arrival rate λ is defined as λ = λ1 + λ2. The service times Si of class-i customers are independent and identically distributed, and follow an arbitrary general distribution. Let Si (x) denote the distribution function of Si, and denote the Laplace-Stieltjes transform (LST) of Si (x) .
Under the (N, n)-PRD priority discipline, class-1 customers have priority over class-2 customers. If there are no preempted class-2 customers in the system, one of class-1 customers (if any) is first selected for the next service just after the service being process is completed. If there are no class-1 customers at the service completion time, one of class-2 customers (if any) can be selected for the next service. While the service of a class-2 customer is being processed, class-1 customers can preempt that service only when the number of class-1 customers reaches or exceeds a certain threshold N, N ≥ 1. Then, the interrupted service of the preempted class-2 customer is completely restarted from the beginning with a different service time resampled from the same service time distribution S2 (x) , when the number of class-1 customers shrinks to another threshold n,0 ≤ n ≤N - 1. Note that, when N = 1 and n = 0, the (N, n)-PRD priority discipline become identical to the classical preemptive repeat-different priority discipline. Also, when N = ∞, the (N, n)-PRD prio rity discipline become identical to the classical nonpreemptive priority discipline.
Let . We assume that the system is stable for class-1 customers. That is, ρ1 < 1. Let also ρ = ρ1 + ρ2. Note that ρ < 1 is not the stability condition for class-2 customers because the service of class-2 customers can be repeated more than once. We assume that the system can be either stable or unstable for class-2 customers. We will assume the stability of the system for class-2 customers only when we derive the queue-length distribution of class-2 customers.
3.Analysis of Service Time Structure
The overall approach here is the same as that in [14]. Thus, the common derivation in both [14] and this study will not be repeated here, except when it is needed for the overall understanding of the system considered here. As with many studies on priority queueing models [5~7, 13, 14, 16], we employ a classical method in which the effective service time of a low-class customer is analyzed and the result obtained is plugged into a kind of M/G/1 vacation queueing models to derive the queue-length distributions of high- and low-class customers, respectively (see [23] for a brief review on the queue-length distribution of vacation queues). In this approach, three types of the effective service time of a class-2 customer are defined : the gross service time G of a class-2 customer is defined as the total time spent by the server for this class-2 customers (including all the service repetitions (if any)); the completion time C is defined as the time interval from the first service start time of the class-2 customer until the completion of that service; the occupation time R is defined as the time interval from the first service start time of the class-2 customer until the server is available for the next class-2 customer (if any).
We further divide the class-1 customers who arrived during G into two groups : some class-1 customers who arrive during G will preempt the service of the class-2 customer and complete their service before G ends. We will call those class-1 customers GP customers. Some other class-1 customers who arrive during G will not preempt the service of the class-2 customer and begin their service after G ends. We will call those class-1 customers GN customers. Let aGP and aGN denote the number of GP and GN customers during G, respectively. Let aG denote the number of class-1 customers who arrive during G. Thus, aG = aGP + aGN . We also define the following joint transform of G, aGP and aGN :(1)
If we let C*(θ) and R*(θ ) denote the LSTs of C and R , then we have from [14] :
and
where B*(θ ) is the LST of the standard M/G/1 busy period with S1 as the corresponding service time and λ1 as the corresponding arrival rate, and B*(θ ) is expressed as the following well-known equation (see [23]) :(4)
Under the (N, n)-PRD discipline, each time the number of class-1 customers in the system reaches N during the service time of a class-2 customer, the service of the class-2 customer is preempted. This preempted service will be completely repeated at the next service attempt with a different service time resampled from the same service-time distribution S2 (x) , as soon as the number of class-1 customers shrinks to n.
Hence, if the service time S2 of a class-2 customer at its first service attempt is shorter than the Erlang random variable EN with parameters N and λ1, the service of the class-2 customer will be completed at this service attempt. Otherwise, it will be preempted and repeated with a different service time resampled from the same distribution, when the number of class-1 customers shrinks to n. If the service time S2 of a class-2 customer at each service attempt except the first service attempt is shorter than the Erlang random variable EN-n with parameters (N - n) and λ1, the service of the class-2 customer will be completed at this service attempt. Otherwise, it will be preempted again. It is because, at the beginning of each service attempt except the first service attempt, there are already n class-1 customers in the system.
In order to take into consideration this dependency of the service completion of a class-2 customer on S2, EN and EN-n , we first define as the number of class-1 customers who arrive during S2 at an arbitrary service attempt. We next define the following two joint transforms for r = 1, 2, ⋯ :(5)
and
where Er denotes the Erlang random variable with parameters r and λ1. By integrating by parts we have
Thus we have(6)
which also yields(7)
To derive the joint transform , we also define to be the remaining gross service time at the beginning of the second service attempt of a class-2 customer, provided the service of the class-2 customer has not completed at its first service attempt. Let denote the numbers of class-1 customers who arrive during and preempt (do not preempt) the service of the class-2 customer. The corresponding joint transform is defined as
Since the service order among class-1 customers does not affect the distribution of G, aGP and aGN , we will assume the Last-In-First-Out (LIFO) service order for class-1 customers in our derivation of the joint transformation . We now consider the following two events that can occur at the first service attempt of a class-2 customer. (Note that the service time of a class-2 customer is resampled from the same distribution function S2 (x) at each service attempt) :
If ≥ N (i.e., EN < S2) : In this case, at least N class-1 customers arrive during the first service attempt, and the service of the class-2 customer gets preempted immediately after the Nth class-1 customer arrives at the system. Since we assume the LIFO service order for class-1 customers, only the last N - n class-1 customers are served before the preempted class-2 customer is restored again for its service. Also, the remaining n class-1 customers are not serviced until the service of the class-2 customer is completed. It is because, whenever the queue length of class-1 customers shrinks to n, the service of the class-2 customer is restored, and we assumed the LIFO service order for class-1 customers. Thus, we have
If = r, 0 ≤ r ≤N - 1 : In this case, no preemption occurs and the service of the class-2 customer is completed at the first service attempt. Thus, all the r class-1 customers who have arrived during the first service attempt are GN customers, and we have
Combining the two cases above, we have
In order to derive the term , we consider the following two events at the second service attempt in a manner similar to the first service attempt.
If ≥ N - n (i.e., EN-n < S2) : In this case, at least N - n class-1 customers arrive during the second service attempt, and the service of the class-2 customer gets preempted immediately after the (N - n)th class-1 customer arrives at the system and it makes N class-1 customers in the system. (Note that there are already n customers at the beginning of the second service attempt.) Since we assume the LIFO service order for class-1 customers, all the (N - n) class-1 customers who arrive during the second service attempt are served before the preempted class-2 customer is restored again for its service. Also, the probability structure at the third service attempt is the same as that at the second service attempt, except that the two service attempts have been already tried, because the arrival process is a Poisson process and the service time is resampled from the identical distribution. Thus, we have
If , 0 ≤ r ≤N - n - 1 : In this case, no preemption occurs and the service of the class-customer is completed at the second service attempt. Thus, all the r class-1 customers who have arrived during the second service attempt are GN customers, and we have
Combining the two cases, we have
which yields
From (2), (3) and (10), we have
and
Differentiating (10) with respect to θ , z , and w , respectively, and letting θ = 0, z = 1, and w = 1 gives
Differentiating (11) and (12) with respect to θ and letting θ = 0 gives
and
In a similar way, expressions for E [C2], E [R2], and E [R3], which are needed for the moments of the class-2 queue length, can be calculated by taking the appropriate derivatives of the respective PGFs as well. (The expressions are omitted because they are too elaborate, but we can derive them using a computer algebra system such as Mathematica or Maxima. In order to show this, we demonstrate some figures of the second moment of the class-2 queue length in Section 5).
4.Queue-Length Distributions
Let ρG denote the total utilization factor of the server for class-2 customers. Then, it is expressed as from Little's formula. Let ρT denote the total utilization factor for both class-1 and class-2 customers. Then it is expressed as ρT = ρ1 + ρG . Since the stability of the system is only assumed for class-1 customers (i.e., ρ1 < 1), ρT may equal or exceed 1, and in that case, the system is unstable for class-2 customers.
Since the (N, n)-PRD priority queue has the same structure of the general cycle as those in the other (N, n)-preemptive priority queue (see [14]), we have the same PGF form of the queue-length distributions of both classes of customers, except that the distributions of G, C, and R are different. Thus, from [14], the PGF of the queue-length distribution of class-1 customers is expressed as :
where
and
We now derive the moments of the queue length of class-1 customers explicitly, which was omitted in [14]. If we let L1 and be the first and second moments of the queue length of class-1 customers in the system in a steady state, we have from (13)
where
and
Similarly, if we assume that ρT < 1, then, from [14], the PGF of the queue-length distribution of class-2 customers is expressed as :
We also derive the moments of the queue length of class-2 customers explicitly, which was also omitted in [14]. From (14), we have
and
Even though we omit the proof, Lemma 1 and Theorem 2 in [14] on the upper bounds of L1 and also hold for the (N, n)-PRD queueing model. Refer to the last two paragraphs in [14] for an intuitive explanation of why the same upper bounds hold for different types of (N, n)-preemptive priority queues.
5.Numerical Examples
In this section, we present a numerical example of the first and second moments of the queue lengths of class-1 and class-2 customers. <Figure 1> shows how the moments of the queue lengths are influenced as the upper threshold N changes, when the lower threshold n = 0, for three different squared coefficients of variation (SCVs) of S2 (which is denoted by h in <Figure 1>). Note that, if N = 1,n = 0, the (N, n)-PRD queue is identical to the classical PRD queue, and if N = ∞, the (N, n)-PRD queue is identical to the classical nonpreemptive queue. In <Figure 1>, we can see that the shapes of the first and second moments of the class-1 queue length in the (N, n)-PRD priority queue are very similar to those in other (N, n)-preemptive queues in [14]. Actually, the upper bounds of the moment of the class-1 queue length in this (N, n)-PRD example are exactly the same as those in the previous (N, n)-preemptive example in [14] because we use the identical arrival processes and the service time distributions for the both examples, in order to make it easy to compare the effect of N for different (N, n)-preemptive disciplines. There is a single remarkable difference between the shapes of the moments of the class-1 queue lengths between the (N, n)-PRD queue and the other (N, n)-preemptive queues : While, for the other (N, n)-preemptive priority queues, the smaller the SCV is (i.e., the relative variance of the service time of class-2 customers is lower), the smaller are the first and second moments of the class-2 queue lengths regardless of the value of N , for the (N, n)-PRD priority queue, this tendency holds only for a sufficiently large N . For the (N, n)-PRD priority discipline, for several small values of N , the smaller the SCV is, the larger are the first and second moments of the class-2 queue length. The reason for this is that, for the case of a large variance of the service time of a class-2 customer, a long service time of a class-2 customer can have a chance to be replaced with a relatively shorter service time by preemption of class-1 customers, and this can positively affect the performance measures of class-2 customers.
In <Figure 2>, the first and second moments of the class-1 and class-2 queue lengths are shown as functions of the threshold n, given N - n = 1, in the (N, n)-PRD priority queue, for three different SCVs (denoted by h in the figure) of the service time of a class-2 customer.
While, for a sufficiently large n, the shapes of the first and second moments of the class-1 queue length are very similar to the other (N, n)-preemptive priority queue in [14], for a small n, the shapes are different. Interestingly, for the cases of h = 2 and 3, as n increases, the first and second moments of the class-1 queue length first increase, and then decrease, and finally converge to the first and second moment in the corresponding nonpreemptive queue. These tendencies are also different from the shapes of the first and second moments of the class-1 queue length in <Figure 1>, where the threshold N increase with n = 0.
The reason of these rising and fall shapes of the first and second moments of the class-1 queue length in the (N, n)-PRD priority queue is that, for the case of N - n = 1, as n increases, class-1 customers have less chance to preemption, and even if they preempt the service of a class-2 customer, the service of the class-2 customer has to be completely restarted, which is a very severely negative effect on the class-1 customers’ performance. However, when n becomes too high, hence N also high, class-1 customers almost never preempt class-2 customers, which avoids the repetition of services of class-2 customers, therefore a better performance for class-1 customers.
In <Figure 3>, the first and second moments of the class-1 and and class-2 queue lengths are shown as functions of the threshold n and the difference of N and n in the (N, n)-PRD priority queue, for the case of h = 2. For a given n, as N - n increases, the first and second moments of the class-1 queue length increase, while the first and second moments of the class-2 queue length decrease. Also, for a given N - n, as n increases, the first and second moments of the class-1 queue length increase, while the first and second moments of the class-2 queue length decrease.
6.Conclusion
In this paper, we derived the queue-length distributions of high-class and low-class customers in the (N, n)-preemptive repeat-different queueing model, and presented a numerical example for this queueing model. As shown in the numerical example, the first and second moments of the class-1 queue length are bounded by the upper bounds, regardless of the characteristics of the service time of class-1 customers and the preemptive mode, as in the other (N, n)-preemptive priority queue in [14]. This property helps system engineers design such service systems that guarantee the mean and variance of delay for primary users under a certain bounds, when preempted services have to be restarted with another service time resampled from the same service time distribution. The property may be very useful, especially when the stochastic characteristic of low-priority customers easily varies or it cannot be easily determined.