1. Introduction
In 2017, U.S. Federal Communications Commission (FCC) dismantled the net neutrality regulations that prohibited Internet service providers (ISP) from blocking websites or charging for higher-quality service or certain content. This means that ISPs are given free rein to deliver service at their own discretion, that is, to discriminate contents by blocking, throttling and prioritization etc.
There are many important static and dynamic issues regarding net neutrality. First of all, it is an important issue whether repealing net neutrality and giving priorities to some content providers may help alleviate delays in content transmission due to heavy traffic. From a dynamic perspective, it is also important to examine whether ISPs will have a stronger incentive to improve the transmission quality by increasing the network capacity.
For years, queuing models have been adopted to address congestion problems in areas of priority pricing and network neutrality, since Choi and Kim [2] and Cheng et al.[1]. It is well known that queuing models give a good approximation for the arrival process in communication networks where many service requests compete transmission under limited bandwidth. Among many queuing models, M/M/1 models have found the widest range of application in most areas. In M/M/1 system, Poisson arrivals with rate λ are assumed. That is, interarrival times between two adjacent arrivals are exponentially distributed with mean 1/λ. This feature well suits the random nature inherent in the consumer requests in communications networks. The system has one server, service times of which are exponentially distributed with mean 1/μ. The term μ is thus the service rate, or the average number of services performed in unit time. Choi and Kim [2] interpret it as bandwidth in communications networks.1)
By using an M/M/1 model, Choi and Kim [2] derive the waiting time elegantly and beautifully and established the invariance result that given a fixed network capacity, the average waiting times are identical regardless of net neutrality. They also showed that the ISP’s incentive to invest may be weaker under prioritization, because an increase in the network capacity reduces the relative value of prioritized contents. They call this the rent extraction effect.
In this short paper, we argue that the waiting times of content transmission that Choi and Kim [2] derived by using the M/M/1 queuing model under the non-preemptive priority rule are miscalculated. We will provide corrected waiting times in the M/M/1 queuing model in the prioritization case. We also show that this correction does not alter their static and dynamic results on the delay time and the incentive to invest in the network capacity qualitatively.
2. Model
We closely follow the model of Choi and Kim [2]. End users are uniformly distributed over [0, 1]. So, the mass of end users is normalized to one. CP1 and CP2 are located in x = 0 and x = 1 respectively. Each consumer request contents from one of the content providers and gets some valuation υ. The unit transportation cost is t.
A monopolistic Internet service provider (ISP) has n servers with identical capacity.
Under net neutrality, both the interarrival time of content requests and the service time of each server of the ISP are assumed to follow exponential distributions with λ and μ. That is, the mean of the time between content requests is 1/λ and the mean of the service time is 1/μ. As usual, we assume that μ > λ to avoid the possibility that the waiting time will explode. Under no net neutrality, let λ1 and λ2 be the rate parameters of the exponential distributions that the interarrival times of content requests from CP1 and CP2 follow respectively.
Our main interest in this paper is how the repeal of net neutrality could affect the average waiting time (transmission time) of data that end users request. To compute the waiting times under two different regimes (net neutrality vs. no net neutrality), we borrow some established results in queuing theory.
3. Preliminaries
We consider a system in which customers arrive at rate λ according to a Poisson process and a server serves customers one at a time from the front of the queue on a first-come, first-served basis. Service times of each server have an exponential distribution with rate parameter μ.
It is useful to define server utilization by ρ = λ/μ to characterize waiting times. It has an interpretation as the probability that a server is occupied at arbitrary point in time. We assume that 0 < ρ < 1. Otherwise, the number of customers in the system, and in turn the average waiting time will eventually explode.
No Priority The average number of customers in the system, denoted by L , is calculated as , where pk is the probability that there are k customers in the system at arbitrary point in time. It is well known for M/M/1 that {pk} is a geometric sequence with the common ratio ρ,2) and that the resulting sum is
The average waiting time in system, denoted by W, is obtained using Little’s Law, L = λW, as
Note that W is the average total time spent in the system.3) Since a customer has to wait not only in queue but also for his own service time, the average total waiting time consists of two components: the average time in queue, Wq , and the average time in service, Ws. That is, W = Wq + Ws . From this relation, we can compute W alternatively as:
The first term, L/μ, is computed by using the PASTA (Poisson Arrivals See Time Averages) property saying that an arrival (arriving customer) from a Poisson process sees L customers on average, each of which is expected to have service time with mean 1/μ.
Since Ws = 1/μ, we can obtain the average time in queue as
Little’s Law holds in any queuing system, and particularly, it applies to systems within a system. Since queue (waiting line) itself is one subsystem and a server is another, Little’s Law could be applied to each one, as well as the whole system.
The intuition underlying behind Little’s Law is quite clear. It simply states that the average number of customers in a system or a subsystem (total quantity) must be equal to the average arrival rate of the customers (speed), multiplied by the average time that a customer spends in that system (total time). It always holds regardless of the interarrival time distribution or service time distribution, nor does it depend upon the number of servers in the system or queuing discipline within the system. For example, if we apply Little’s Law to the waiting time (queue) only, we can find the average number of customers in queue as
This implies that we can also find the average number of customers who are served as
Priority Now, we consider an M/M/1 system serving two different types of customers, type 1 and type 2. Type 1 and type 2 customers arrive according to independent Poisson processes with rate λ1 and λ2 respectively. We assume that ρ1 + ρ2 < 1, where ρi = λ1/μ, i.e., the occupation rate by type i customers. Type 1 customers are prioritized over type 2 customers.
First, we assume the non-preemptive priority system Choi and Kim [2]. Under this priority system, even prioritized type 1 customers are not allowed to interrupt the service of unprioritized type 2 customers.
The average waiting time for prioritized type 1 customers is computed as
where Li is the average number of type i customers in the system. The first term is the waiting time for the services for type 1 customers to be completed, i.e., the service time type 1 customers in the system. This is again due to the PASTA property. The second term comes from the fact that when an arriving type 1 customer finds a type 2 customer in service, he has to wait until the service of the type 2 customer is completed. According to the PASTA property, the probability that he finds a type 2 customer in service is equal to the fraction of time the server spends on type 2 customer, ρ2. The third term is the average service time of his own. Thus, we can rewrite (7) as
where and .
Equation (7), together with Little’s Law given by L1 = λ1W1, leads to
Equation (10) implies that the waiting time of prioritized customers must depend on λ2. This is due to the second term of (7). Intuitively, a newly arriving customer must wait in the front of the queue while a type 2 customer is in service (if any), even if the arriver is prioritized, under the non-pre emptive priority rule. Note that λ2 does not enter equation (2) of Choi and Kim [2] that assumes the non-preemptive priority rule.
For the waiting time of unprioritized type 2 customers, we need the following relation:
which can be similarly derived as (1). Then, from (9) and (11), we obtain
Therefore, by applying Little’s Law we obtain the average waiting time of unprioritized customers as
Equation (13) can be rewritten as
where for any λ1, λ2 > 0. This implies that W2 > W1 for any λ1, λ2 > 0.
On the other hand, in the preemptive priority rule under which a newly arriving type 1 customer is allowed to interrupt the service time of an unprioritized type 2 customer, the computation of the average waiting time for type 1 customers is relatively simple, because an arriving type 1 customer does not need to worry about the possibility that type 2 customers are in queue or in service. Therefore, the situation is the same for a type 1 customer as if there were only type 1 customers in the system. This leads to
Then, by using (11), we get
By applying Little’s Law, we obtain
where λ = λ1 + λ2.
While the waiting time of the prioritized customers depends on λ2 in the non-preemptive priority rule, because they cannot interrupt the services for unprioritized customers, the waiting time for the prioritized customers does not depend on λ2 in the preemptive priority rule, because they can always interrupt the service for type 2 customers and regard them as nonexistent null players.
The upshot is that under the non-preemptive priority rule which Choi and Kim [2] assumed, their result on the waiting times is not correct, while their result is correct if the preemptive priority rule is assumed.
4. The Effects of Prioritization on Waiting Times
Under net neutrality, all contents from either CP are equally treated by ISP. So, the average waiting time of contents from either CP is identically .
A consumer chooses to request contents from a CP that yields a lower total cost defined by the sum of waiting time and the transportation cost. Let xN be the location of the consumer who is indifferent between two CPs. Then, it is clear that , because the waiting times of CP1 and CP2 are the same.
If net neutrality is repealed, ISP can grant priority to either CP. We will consider both priority rules, the preemptive priority rule and the non-preemptive priority rule.
Let xPP be the borderline consumer under the preemptive priority rule. Since xPP is the ratio of users who request contents from CP1, the total amounts of content requests from CP1 and CP2 are λ1 = λxPP, and λ2 = λ(1 - xPP) respectively. Then xPP is determined by
where
This leads to
It is clear that , because .
If we denote the average waiting time under the preemptive priority rule by WPP , we have . Then, the invariance result of Choi and Kim [2] follows.
Proposition 1. (Choi and Kim [2]). WPP =W .
It says that the invariance result holds in the case of the preemptive priority rule.
Now, consider the non-preemptive priority rule. Let xNP be the borderline consumer under the non-preemptive priority rule. Then, the total amounts of content requests from CP1 and CP2 are λ1 = λxNP and λ2 = λ(1 - xNP) respectively. Similarly, xNP is determined by
where
Similarly, the average waiting time under the non-preemptive priority rule is . Then, we have
Proposition 2.WNP =W .
This proposition says that the invariance result also holds in the case of the non-preemptive priority rule.
It is not surprising that the invariance result holds in both priority regimes, because the repeal of net neutrality simply affects the order of serving content requests as long as the total amount of traffic remains the same.4)
5. The Effect on the Investment in the Network Capacity
Choi and Kim [2] showed that the ISP’s incentive to invest may be weaker in the case of prioritization under the preemptive priority rule,5) because an increase in the network capacity reduces the relative value of prioritized contents. They call this the rent extraction effect.6)
Proposition 3.In M/M/1 queuing model,under both the preemptive priority rule and the non-preemptive priority rule.
This proposition says that as ISP invests more in network capacity, the quality difference between prioritized contents and unprioritized contents becomes smaller, i.e., the relative value of prioritized contents becomes lower. This implies that ISP will have less incentive to invest in the network capacity.
This result was first established by Choi and Kim [2] only for the case of the preemptive priority rule. In Appendix, we provide a proof for the case of the non-preemptive priority rule as well. Therefore, Proposition 3 complements their result in the sense that it extends the result to the case of the non-preemptive priority rule.
6. Conclusion
In this paper, we showed that the invariance result that Choi and Kim [2] obtained holds both under the preemptive rule and under the non-preemptive rule. We also showed that the rent extraction effect of an investment in the network capacity that Choi and Kim [2] identified is valid both under the preemptive rule and under the non-preemptive rule. We believe that it will be worthwhile to extend this model to the case that there are several servers by using the M/M/n queuing model to provide an analysis for a more natural interpretation of the network capacity as the number of servers.