Server pooling models for separate and bounded queues

Hila Hindy Ling; Ran Etgar; Hillel Bar-Gera

doi:10.1016/j.heliyon.2024.e25344

2024 Feb 1;10(5):e25344. doi: 10.1016/j.heliyon.2024.e25344

PMCID: PMC10907676 PMID: 38434330

Abstract

The focus of this paper is on queueing models with arrivals from several sources, where the number of waiting spaces for each arrival source is finite. The main distinction from prior studies is that in our models waiting spaces are separate. Within this context, we examine the benefit of pooling, regardless of whether arrival rates are equal or different. We present a Quasi-Birth-and-Death (QBD) model to address the general case, with simplified versions tailored to specific scenarios. One practical application of the proposed models can be found in the loading and unloading processes at container terminals.

We define a measure for stochasticity-related inefficiency, denoted relative interaction delay (RID), and analyze its behavior for the case of a single waiting space for each source. We show analytically that in the base model, the RID approximation is inversely proportional to the number of pooled queues. Numerical evaluations show an added benefit of pooling when arrival rates differ, observing a linear enhancement that is notably more pronounced.

Keywords: Pooling, QBD, Separate waiting queues, Container terminals, Arrival rates, Service rates

1. Introduction

A common question in queuing theory is whether to pool the queues such that the servers can serve all the customers, or handle a system with separate queues such that each group of servers serves its own queue (e.g. Refs. [1–7]).

The basic model for server pooling, denoted M/M/C, assumes Markov arrival and service processes, C independent servers with identical service rates, and unlimited queue length [8,9]. This model consists of one queue with a given average arrival rate, denoted by $λ$ .

The effect of queue pooling in this context can be analyzed by the classic comparison of C queueing systems of type M/M/1 to a single M/M/C queueing system. It is well known that such pooling reduces the variability of the workload between the servers, and thus improves all service performance measures, such as the average waiting times. In particular, Ding [10] showed that M/M/C pooling can achieve a shorter task response time (average waiting time in the system), which improves cloud computing energy efficiency. He found that on average the M/M/20 task response time was 55.7 % better than that of twenty M/M/1 queues when the servers in both cases are homogeneous with the same service rates. Furthermore, the M/M/20 average task response time was 68.7 % better than that of twenty M/M/1 queues with heterogeneous servers and varying service rates. In general, Ding shows that the average waiting time in C systems of M/M/1 queues is always inferior to the average waiting time in the M/M/C model when C > 1.

A prevalent variation of the M/M/C model presumes a limited capacity, N ≥ C, and is denoted as M/M/C/N. In this model, should customers arrive when N customers are already present in the system (either receiving service or waiting), they are unable to join the queue and must depart. An especially notable instance of these finite capacity models occurs when $N = C$ , meaning there are no waiting spots available for arrivals during times when all servers are occupied. Such scenarios are referred to as pure "loss" models and are commonly utilized to assess service systems where having customers wait to commence service is deemed either impractical or highly undesirable [11].

In the case of $N = C$ , there is no queue and therefore the waiting time is zero. The performance in this case can be measured by the effective arrival rate, $λ_{eff}$ , or equivalently by its inverse, the average output time, $AOT = \frac{1}{λ_{eff}}$ . Our focus is on the case of limited queues and thus the main performance measure would be the AOT.

Queueing models often assume similar arrival rates $λ$ and similar service rates $μ$ . Fibich et al. [12] examined a queue comprising C heterogeneous servers, with distinct service rates denoted as $μ_{1}, \dots, μ_{C}$ . They argue that since certain conditions of differentiability and symmetry hold, an averaging principle applies, and therefore the effect of the magnitude of service rate variability on system performance should be quadratic. We are interested in the effect of pooling J queues while considering heterogeneous arrival rates $λ_{1}, λ_{2}, \dots, λ_{J}$ . This system satisfies the differentiability and symmetry conditions, and therefore the averaging principle theorem of Fibich et al. [12] holds as well.

In an M/M/C/N queue there are $K = N - C$ waiting spaces. When pooling J queues of this type, it is natural to assume joint waiting spaces. Under this assumption, the pooled system can also be modeled by M/M/C/N, with $C_{pool} = J ∙ C$ servers, $K_{pool} = J ∙ K$ waiting spaces, and arrival rate of $λ_{pool} = J ∙ λ$ . The M/M/C/N model can also handle the case of different arrival rates by setting $λ_{pool} = \sum_{j = 1}^{J} λ_{j}$ .

The main goal of our research is to analyze the influence of pooling when the waiting spaces remain separate. We consider both the case with similar arrival rates and the case with different arrival rates. We define a measure for stochasticity-related inefficiency, denoted relative interaction delay (RID), and show that in the base model, the RID approximation is inversely proportional to the number of pooled queues.

While we consider different arrival rates, we focus on models with identical service rates. However, these models can be expanded to accommodate varying service rates per queue, as discussed and illustrated in Appendix A.

Separate waiting spaces may not be the general case, yet they can be found in various systems, such as container terminals, in the unloading and loading processes. Details about this test case are presented in section 2. To clarify the idea of separate waiting spaces, consider the following hypothetical scenario.

In the system, there are three distinct types of customers, and each customer type is allocated its own specific set of waiting spaces. If a customer of type I arrives at the system when all the servers and all type I waiting spaces are occupied, then, even if there are available waiting spaces for customers of type II or III, the customer will have to leave the system without getting a service. Servers can still work as one pool. When a server completes a previous service, it will be assigned to serve one of the waiting customers (if there are any, otherwise it will remain idle). The choice between customer types is made with the same probability without priorities. The choice between customers of the same type is based on first-come-first-served.

To summarize, the queuing models we will discuss here can be classified by several attributes: 1. With or without pool; 2. Finite/unlimited waiting spaces; 3. Joint/separate waiting spaces; and 4. Same/different arrival rates. The types of model needed for each of these cases are listed in Table 1.

Table 1.

Queuing models for different scenarios.

Scenario	Pool	Waiting spaces	Number of waiting spaces	Arrival rates	Model
1	with/without	joint	infinite	same/different	M/M/C
2	with/without	joint	finite	same/different	M/M/C/N
3	with/without	separate	infinite	same/different	M/M/C
4	without	separate	finite	same/different	M/M/C/N
5	with	separate	0	same/different	M/M/C/N
6	with	separate	1	same	BD
7	with	separate	1	different	QBD
8	with	separate	2+	same	QBD
9	with	separate	2+	different	QBD


As discussed above, scenarios 1–5 have been considered in the literature. Specifically, scenario 5 describes a special case of pooling in which $N = C$ , meaning there are no waiting spaces at all ($K = 0$). Therefore, specifying whether waiting spaces are joint or separate is redundant. This case is listed in Table 1 only for the sake of completeness. Analysis of scenarios 6–9 is a key component of the contribution of the current paper, and the specific models needed for this purpose will be presented in section 3.

The rest of this paper is structured in the following manner: Section 2 presents a test case of a system with separate waiting spaces. In Section 3 we present the methodology of our research. Next, in Section 4, we present illustrative numeric results. Conclusions and recommendations for future research are provided in Section 5.

2. Practical application (test case)

The models discussed in this paper are suitable, among other things, when pooling at container terminals is considered. Container terminals play a crucial role as intermodal junctions within the worldwide transportation network. Effective management of container operations at these terminals is vital for minimizing transportation expenses and maintaining adherence to shipping timetables [13]. A primary goal in port operation, therefore, is to minimize the vessel turn-around time, i.e., the total makespan of the loading and unloading processes summed together. Loading and unloading involve quay cranes (QC), yard trucks (YT) and in some cases yard cranes (YC) and are thus multi-stage processes.

Carlo et al. [14] discussed the primary challenges involved in transport operations in container terminals and distinguished among three principal decision-making challenges: (1) choosing the appropriate type of vehicle, (2) deciding on the number of vehicles needed, and (3) planning the routes and dispatching the vehicles.

The type of yard truck vehicle can have a substantial effect on the analysis of other decisions. Yard trucks can be with or without lifting capabilities, automated or manually driven, and vary in other ways. If yard trucks do not have lifting capabilities, then in the unloading process, for example, when the QC picks up a container from a vessel and there is no available YT, the QC must wait while holding the container and cannot continue bringing containers from the vessel. When automated lifting vehicles (ALVs) are used as YTs, they can pick containers from a buffer near the QC. Still, such buffers are typically limited in space, and therefore, when the buffer is full, the crane can hold one extra container, but after that, the arrival of new containers must stop. In either case, the number of waiting spaces near each crane is limited and separated from waiting spaces near other cranes, thus corresponding to scenarios 6–9 in Table 1. QC rates may be the same, or similar, but more often the rates vary due to cargo type (e.g. empty or full containers), QC operator skills (if not automated), etc. Therefore, the case of different arrival rates (scenarios 7 & 9) is more relevant in this context than the case of equal arrival rates (scenarios 6 & 8).

To clarify the connection between the queueing models discussed in this paper, and the joint operation of QCs and YTs, note that the operation of each tool can be described as a sequence of active cycles and idle intervals. Cycles for both QC and YTs begin at the same moment: during unloading when a container is released on a YT and during loading when a container is picked up from a YT. The end of every QC (YT) cycle is considered a QC (YT) event. Following a QC (YT) event, the QC (YT) may begin a new active cycle, or it may enter an idle interval.

In the model presented for scenarios 6 and 7 from Table 1, every QC event (i.e., the end of a QC cycle) is considered as a “birth” or as a new arrival to the system, and every YT event (i.e., the end of a YT cycle) is considered as a “death”. Following this interpretation, YTs are the servers, and YT “jobs” are the customers. Note that in the last part of the job in the unloading process and the first part of the job in the loading process, the YT does not carry a container, and thus, the container itself cannot be considered as the customer of the service.

It should be noted that other studies of stochastic analytic models for container terminals have used different interpretations of “customers” and “servers,” as summarized in Table 2. The above interpretation is relevant to our situation, as we concentrate on the interaction between quay cranes and yard trucks, whether they are pooled or not. In section 4 we present illustrative numeric results for the case of $K = 1$ . This case corresponds to regular YTs, where a QC can hold up to one job (and stay idle) if all the servers (YTs) are occupied.

Table 2.

Interpretations of “customers” and “servers” in queueing models for port operations.

Citation	Customers	Servers
Dhingra et al. [15]	Automatic guided vehicles	Automated stacking cranes, QC, travel times
Zhang et al. [16]	Truck	Handling operations (virtual service process)
Easa [17]	Berths	Tugs
This paper	YT “jobs”	YTs


3. Methodology

As described in section 1, we consider a system with multiple queues, where each queue belongs to a different type of entity and has its own waiting spaces. Consider J queues with a total of C servers, each working at an average rate of $μ$ , where each queue has a different average arrival rate $λ_{1}, λ_{2}, \dots, λ_{J}$ and a separate waiting area that can hold up to K entities. We assume that inter-arrival times and service times follow exponential distributions, so the system is memoryless and can be described as a Markov process. When each queue operates independently, its behavior can be described by the M/M/C/N model.
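As a minimal sketch of the independent (non-pooled) reference case, the following Python function computes the steady-state distribution of a single M/M/C/N queue with $N = C + K$ and returns its effective arrival rate. The function and argument names are illustrative assumptions and do not come from the paper.

```python
def mmcn_effective_rate(lam, mu, C, K):
    """Effective arrival rate of one M/M/C/N queue with N = C + K.

    lam: arrival rate, mu: service rate per server,
    C: number of servers, K: number of waiting spaces.
    """
    N = C + K
    # Unnormalised birth-death weights: w[n] = w[n-1] * lam / (min(n, C) * mu)
    weights = [1.0]
    for n in range(1, N + 1):
        weights.append(weights[-1] * lam / (min(n, C) * mu))
    p_full = weights[-1] / sum(weights)   # probability that an arrival is blocked
    return lam * (1.0 - p_full)


# One server, one waiting space, equal arrival and service rates.
lam_eff = mmcn_effective_rate(lam=1.0, mu=1.0, C=1, K=1)
print(lam_eff, 1.0 / lam_eff)   # effective arrival rate and the AOT of this queue
```

With $λ = μ$, one server and one waiting space, the three states are equally likely, so one third of the arrivals are blocked and the effective arrival rate is $\frac{2}{3} λ$.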

If all servers are pooled, but waiting areas remain separate, the model has two types of states. The first type of state describes the system with idle servers. Each state of this type is identified by the number of entities in the system, $0 \leq n < C$ , which is equal to the number of active servers. The transition rate from state $n$ to state $n + 1$ equals the sum of all the arrival rates ( $\sum_{j = 1}^{J} λ_{j}$ ). The transition rate from state $n$ to state $n - 1$ depends on the number of active servers, and equals $n ∙ μ$ .

The second type of state describes the system when all servers are active. In this case, each state is identified by the number of entities in each queue, $0 \leq b_{j} \leq K$ . Hence, there are ${(K + 1)}^{J}$ states of the second type, and $G = C + {(K + 1)}^{J}$ possible states in total. The maximum quantity of entities present in the system is $N = C + K ∙ J$ . This number is obtained when all the servers are active, and the waiting spaces of all queues are full.

To illustrate the state space, consider an example of four queues (a, b, c, d) and five servers working as a pool, with one waiting space in each queue $(J = 4, C = 5, K = 1)$ . Fig. 1 presents the state diagram for this scenario. The first five states ('0' to '4') are the states with idle servers. The sixth state ('5') is the state in which all the servers are active and all waiting spaces are empty. Consider next the states with 6 entities in the system, meaning all the servers are active and one queue is idle because its waiting space is full. The idle queue can be any one of the four queues, so there are in fact four such states (6a, 6b, 6c, 6d), where the number '6' indicates the number of entities in the system and the letter indicates the idle queue. A 'birth' means a new arrival to the system, which can occur only in one of the other queues. Following a 'birth' there would be 7 entities in the system and 2 fully occupied queues (a and b, a and c, a and d, b and c, b and d, or c and d). Another 'birth' leads to 8 entities in the system, 5 of them in service and 3 in waiting spaces, so the possible states are 8abc, 8abd, 8acd or 8bcd. The last state ('9') indicates a full system in which all waiting spaces are full and all the servers are busy, so there are 4 + 5 entities in the system.

Fig. 1 — State diagram for four queues with different arrival rates, one waiting space in each queue and total of five servers $(J = 4, C = 5, K = 1)$ .
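The state space of this example can be enumerated with a few lines of Python. This is only an illustrative sketch. The helper name `enumerate_states` and the tuple representation of a state are assumptions made here, not notation from the paper.

```python
from collections import Counter
from itertools import product

def enumerate_states(J, C, K):
    """List all states as (n, b): n entities in the system, b occupancy per waiting area.

    States with idle servers carry b = None. States with all servers busy
    carry a tuple b of length J with 0 <= b[j] <= K.
    """
    idle = [(n, None) for n in range(C)]
    busy = [(C + sum(b), b) for b in product(range(K + 1), repeat=J)]
    return idle + busy


states = enumerate_states(J=4, C=5, K=1)
per_level = Counter(n for n, _ in states)
print(len(states))                # total number of states, C + (K + 1)**J
print(sorted(per_level.items()))  # number of states for each value of n
```

For $(J = 4, C = 5, K = 1)$ the count by level matches the description above: one state for each of $n = 0, \dots, 5$, four states with 6 entities, six with 7, four with 8 and one full state with 9.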

In the general case, we define two related indices n and m such that each state can be represented by the pair $(n, m)$ :

n: the number of entities in the system, $n = 0, 1, 2, \dots, C + K ∙ J$ .

m: a combination of the waiting status of all areas, $m = 0, \dots, {(K + 1)}^{J} - 1$ .

When $n \leq C$ (all waiting spaces are empty), m takes the value zero, and when $n > C$ , m takes one of the values $1, \dots, {(K + 1)}^{J} - 1$ . Let $b_{j} (m)$ be the jth digit of m, when represented in base $(K + 1)$ , i.e. $m = \sum_{j = 1}^{J} b_{j} (m) ∙ {(K + 1)}^{j - 1}$ . We consider $b_{j} (m)$ as describing the number of occupied waiting spaces in queue j. For example, state 6b in Fig. 1 would be described as $(6, 2_{dec} = 0010_{bin})$ , meaning that there are 6 entities in the system and there is one entity in the second queue. This entity denotes a full queue because $K = 1$ . In this case the digits are binary: '0' for an empty queue and '1' for a full queue. Note that when $m > 0$ , $n = C + \sum_{j = 1}^{J} b_{j} (m)$ .
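The base-$(K + 1)$ encoding can be illustrated with a short Python sketch. The function names are hypothetical. The digits are returned least significant first, so that list position $j - 1$ holds $b_{j} (m)$.

```python
def waiting_digits(m, J, K):
    """Return [b_1(m), ..., b_J(m)], the digits of m in base K + 1."""
    digits = []
    for _ in range(J):
        m, d = divmod(m, K + 1)
        digits.append(d)
    return digits


def entities_in_system(m, J, K, C):
    """n = C + sum of occupied waiting spaces (valid when all servers are busy)."""
    return C + sum(waiting_digits(m, J, K))


# State 6b of Fig. 1: m = 2, i.e. only the second queue holds a waiting entity.
print(waiting_digits(2, J=4, K=1))           # [0, 1, 0, 0]
print(entities_in_system(2, J=4, K=1, C=5))  # 6
```

For $m = 0$ the second function returns C, which covers only the state in which all servers are busy and no entity waits. The states with idle servers also have $m = 0$ and are distinguished by n.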

A birth is represented by a single-digit increase by 1, while all the other digits remain unchanged, and a death is represented by a single-digit decrease by 1, while all the other digits remain unchanged.

We define $w_{j} (m)$ as an indicator describing whether waiting area j is active (not empty):

$w_{j} (m) = 1$ if $b_{j} (m) > 0$ , and $w_{j} (m) = 0$ otherwise. (1)

From eq. (1), W(m) can be defined as the number of active waiting areas:

$W (m) = \sum_{j = 1}^{J} w_{j} (m)$ (2)

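When $m > 0$ all servers are busy. From eq. (2), when a server becomes available, the transition will be to a state with

$\hat{m} \in S^{-} (m) = \{ \hat{m} \mid \exists j : b_{j} (\hat{m}) = b_{j} (m) - 1 \geq 0 ; b_{j^{'}} (\hat{m}) = b_{j^{'}} (m) \forall j^{'} \neq j \}$ (3)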

There are $W (m) = | S^{-} (m) |$ such possible transitions. Subsequently, we will also use a corresponding set of non-empty waiting areas $J^{-} (m) = \{ j \mid b_{j} (m) > 0 \}$ .

The sum of these transition rates is the rate at which a server becomes available, i.e. $C ∙ μ$ . According to our assumption, the probability of choosing each one of these transitions is $\frac{1}{W (m)}$ . Therefore, the rate of the transition from state $(n, m)$ to state $(n - 1, \hat{m})$ , where $\hat{m} \in S^{-} (m)$ , equals $C ∙ μ ∙ \frac{1}{W (m)}$ .

The arrival rate of entities of type j is $λ_{j}$ $(j = 1, 2, \dots, J)$ , and this is the transition rate from state $(n, m)$ to state $(n + 1, \hat{m})$ where

$b_{j} (\hat{m}) = b_{j} (m) + 1 \leq K ; b_{j^{'}} (\hat{m}) = b_{j^{'}} (m) \forall j^{'} \neq j$ (4)

Subsequently, we will also use a corresponding set of non-full waiting areas:

$J^{+} (m) = \{ j \mid b_{j} (m) < K \}$ .

The transition matrix of the system's states is represented in Table 3.

Table 3.

State transition rates of pooling queues with different arrival rates (all other transition rates are zero).

From state	To state	Conditions	Rate
$n, 0$	$n + 1, 0$	$0 \leq n < C$	$\sum_{j = 1}^{J} λ_{j}$
$n, 0$	$n - 1, 0$	$0 < n \leq C$	$n μ$
$n, m$	$n + 1, \hat{m}$	$0 \leq m < {(K + 1)}^{J}$ ; $0 <$ $\hat{m} \leq {(K + 1)}^{J}$ $j \in J^{+} (m);$ $b_{j} (\hat{m}) = b_{j} (m) + 1 \leq$ K; $b_{j^{'}} (\hat{m}) = b_{j^{'}} (m) \forall j^{'} \neq j$	$λ_{j}$
$n, m$	$n - 1, \hat{m}$	$0 < m \leq {(K + 1)}^{J}$ ; $0 \leq \hat{m} < {(K + 1)}^{J}$ $j \in J^{-} (m); b_{j} (\hat{m}) = b_{j} (m) - 1 \geq 0; b_{j^{'}} (\hat{m}) = b_{j^{'}} (m) \forall j^{'} \neq j$	$\frac{C μ}{W (m)}$


The steady-state probability of state $(n, m)$ is denoted by $P_{n, m}$ . Denote by P a diagonal matrix of dimension $G \times G$ , with all state probabilities. Denote by STR the square matrix of state transition rates, as given in Table 3. Let $J = {(1, \dots, 1)}^{t}$ be a vector of ones of the same dimension. Using eqs. (3) and (4), we can calculate $P_{n, m}$ from the following equations:

Equation 5.

(5)

Eq. (5) enables the creation of Table 3.

Once we have the steady-state probabilities, based on eqs. (6), (7), (8), $λ_{eff}$ can be calculated, which is the actual arrival rate, while considering all the states in which a birth occurs, meaning a new entity arrives in the system.

Equation 6.

(6)

Equation 7.

(7)

Equation 8.

(8)

We are considering systems where the focus is queue output, and therefore, a key performance measure is the average output time (AOT) per entity per queue. It can be computed by dividing the number of queues by $λ_{eff}$ , as depicted in eq. (9).

$AOT = \frac{J}{λ_{eff}}$ (9)
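The computation of the steady-state probabilities, $λ_{eff}$ and AOT can be sketched in pure Python as follows. The sketch builds the transition rates listed in Table 3 and solves the global balance equations together with the normalization condition. The function names, the tuple-based state representation and the use of Gauss-Jordan elimination are assumptions made for illustration and are not necessarily the solution method used by the authors.

```python
from itertools import product

def gauss_solve(A, rhs):
    """Solve A x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    M = [row[:] + [r] for row, r in zip(A, rhs)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def pooled_qbd(lams, mu, C, K):
    """Return (lambda_eff, AOT) for pooled servers with separate waiting areas."""
    J = len(lams)
    busy = list(product(range(K + 1), repeat=J))       # occupancy of each waiting area
    states = [("idle", n) for n in range(C)] + busy    # G = C + (K + 1)**J states
    index = {s: i for i, s in enumerate(states)}
    G = len(states)
    Q = [[0.0] * G for _ in range(G)]                  # Q[i][k]: rate from state i to k

    empty = (0,) * J
    for n in range(C):                                 # states with idle servers
        i = index[("idle", n)]
        up = ("idle", n + 1) if n + 1 < C else empty
        Q[i][index[up]] = sum(lams)
        if n > 0:
            Q[i][index[("idle", n - 1)]] = n * mu
    for b in busy:                                     # states with all servers busy
        i = index[b]
        for j in range(J):
            if b[j] < K:                               # arrival to a non-full area
                Q[i][index[b[:j] + (b[j] + 1,) + b[j + 1:]]] = lams[j]
        active = [j for j in range(J) if b[j] > 0]
        if not active:                                 # nobody waits: a server goes idle
            Q[i][index[("idle", C - 1)]] = C * mu
        for j in active:                               # each active area equally likely
            Q[i][index[b[:j] + (b[j] - 1,) + b[j + 1:]]] = C * mu / len(active)

    # Balance equations (inflow minus outflow = 0); the last one is replaced
    # by the normalization condition sum(P) = 1.
    A = [[Q[k][i] for k in range(G)] for i in range(G)]
    for i in range(G):
        A[i][i] -= sum(Q[i])
    A[-1] = [1.0] * G
    P = gauss_solve(A, [0.0] * (G - 1) + [1.0])

    lam_eff = 0.0
    for s, p in zip(states, P):
        if s[0] == "idle":
            lam_eff += p * sum(lams)
        else:
            lam_eff += p * sum(l for l, bj in zip(lams, s) if bj < K)
    return lam_eff, J / lam_eff


lam_eff, aot = pooled_qbd([20.0, 40.0], mu=30.0, C=2, K=1)
print(lam_eff, aot)
```

The example call uses two pooled queues with arrival rates of 20 and 40 units per hour, two servers with a service rate of 30 each, and one waiting space per queue. Dense elimination is adequate for small state spaces only, since the number of states grows as ${(K + 1)}^{J}$ .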

From here on we shall assume that each queue has $C_{1}$ servers, and hence there are $C = J ∙ C_{1}$ servers in total. We assumed that each queue may have a different arrival rate, $λ_{j}$ , and denote the average arrival rate by $λ = \frac{\sum_{j = 1}^{J} λ_{j}}{J}$ . Considering that $λ_{eff} \leq \sum_{j = 1}^{J} λ_{j} = J ∙ λ$ and $λ_{eff} \leq C ∙ μ$ , we get a lower bound for AOT (eq. (10)):

$AOT = \frac{J}{λ_{eff}} \geq \max \left( \frac{1}{λ}, \frac{1}{C_{1} ∙ μ} \right) = LB$ (10)

We define θ as the ratio of arrival rates to service rates, thereby capturing the demand-to-supply ratio (eq. (11)):

$θ = \frac{λ}{C_{1} ∙ μ}$ (11)

Therefore, the lower bound (eq. (12)) is:

$LB = \frac{\max (1, θ)}{λ}$ (12)

If the system were deterministic, the lower bound (12) would prevail. In particular, when $θ = 1$ , i.e. $λ = C_{1} ∙ μ$ , arrivals and departures are synchronized and there are no delays in the system. When variances exist, AOT may be higher than LB. These additional delays are due to the stochastic interaction between the arrival process and the service process. Our focus is on the gap between AOT and LB. To capture this gap, we define the relative interaction delay (RID) as the relative difference between the AOT and its LB, as depicted in eqs. (13) and (14):

$RID = \frac{AOT - LB}{LB}$ (13)

Equation 14.

(14)

Pooling does not change the available resources, and therefore does not change LB. The change in RID is in this case the main contribution of pooling. In the next section we illustrate the influence of various factors on RID, and use these results to gain insights into the benefits of pooling.

The model presented here for general K corresponds to scenario 9 in Table 1. Scenario 8 is a special case with equal arrival rates, but the same model is needed. Scenario 7 is also a special case, with $K = 1$ . It requires a QBD model of a similar structure, although some minor technical simplifications can be implemented. Scenario 6 can be treated as a special case of scenario 9, but since arrival rates are the same and $K = 1$ , the following BD model is sufficient. In this scenario, as long as there are fewer entities than servers ( $n < C$ ), a new arrival, or birth, occurs at the rate of $J ∙ λ$ . When n, the number of entities in the system, is at least the total number of servers, C, new arrivals have to wait (at most one in each queue), and arrivals can occur only in queues with available waiting space. Thus, the arrival rate is $(N - n) ∙ λ$ . A full system means that all the servers are busy and all the spaces of all types of entities are occupied, so there is a maximum of $N = J + C$ entities in the system.

Fig. 2 and Table 4 present the Markov chain and the system transition rates matrix for this model.

Table 4.

State transition rates of BD model for pool with similar arrival rates and separate waiting spaces.

From state	To state	Condition	Rate
$n$	$n + 1$	$0 \leq n < C$	$J λ$
$n$	$n + 1$	$C \leq n \leq N - 1$	$(N - n) λ$
$n$	$n - 1$	$0 < n \leq C$	$n μ$
$n$	$n - 1$	$C < n \leq N$	$C μ$


$λ_{eff}$ and AOT are calculated as follows:

$λ_{eff} = \sum_{n = 0}^{C - 1} J λ ∙ P (n) + \sum_{n = C}^{N - 1} (N - n) λ ∙ P (n)$

$AOT = \frac{J}{λ_{eff}}$

This model may seem similar to the standard M/M/C/N model, but it is not fully equivalent, because arrival rates are state-dependent. The ability to represent scenario 6 by a birth-and-death model, which is simpler to analyze than the more general QBD model, allows for a more rigorous analysis of its properties. Specifically, the effectiveness of pooling can be proven to hold in general. The details of the proof are presented in Appendix B.
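The BD model of Table 4 is simple enough to evaluate with the standard product-form solution of a birth-and-death chain. The sketch below is an illustration under stated assumptions: the function name is hypothetical, and the RID is computed as the relative gap between AOT and the lower bound $\frac{\max (1, θ)}{λ}$ , which follows the verbal definitions in this section.

```python
def bd_pooled_rid(J, C1, lam, mu):
    """RID of J pooled queues, C1 servers per queue, one waiting space per queue."""
    C = J * C1
    N = C + J

    def birth(n):
        # Table 4: all J sources arrive while servers are idle,
        # afterwards only the N - n queues with a free waiting space.
        return J * lam if n < C else (N - n) * lam

    def death(n):
        return min(n, C) * mu

    # Product-form weights of a birth-and-death chain.
    weights = [1.0]
    for n in range(1, N + 1):
        weights.append(weights[-1] * birth(n - 1) / death(n))
    total = sum(weights)
    P = [w / total for w in weights]

    lam_eff = sum(birth(n) * P[n] for n in range(N))
    aot = J / lam_eff
    theta = lam / (C1 * mu)
    lb = max(1.0, theta) / lam
    return (aot - lb) / lb


# theta = 1 in all three calls (lam = C1 * mu).
print(bd_pooled_rid(J=1, C1=1, lam=1.0, mu=1.0))   # single queue, one server
print(bd_pooled_rid(J=2, C1=1, lam=1.0, mu=1.0))   # two pooled queues
print(bd_pooled_rid(J=1, C1=2, lam=1.0, mu=0.5))   # single queue, two servers
```

With $θ = 1$ these three calls give 0.5, 0.333 and 0.4 respectively, which agrees with the RID values quoted for the same configurations in the numeric results.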

4. Illustrative numeric results

To better understand the effects that various factors can have on pooling queues it is useful to consider specific numerical examples. We start the evaluation with the reference case, where the queues are not pooled and the arrival rates for the queues are similar. Next, we present the model's results when the queues are pooled, both when the arrival rates are similar, and when they differ. At the end of this section, we present a comparison between the pooling and non-pooling policies. The analysis focuses on the case in which each queue can hold up to 1 entity at most $(K = 1)$ .

4.1. Non-pooling with similar arrival rates

Fig. 3 shows how AOT changes as a function of $θ$ in the range from $\frac{1}{3}$ to $3$ for various numbers of servers per queue: $C_{1} = 1, 2, 4, 6, 8, 10$ . For the sake of this illustration, the average arrival rate $(λ)$ is 30 units per hour. For every combination of parameters, the value of $AOT (λ, μ, C_{1})$ as defined in section 3 is computed by using the equations presented there. Note that when $λ$ and $C_{1}$ are given, $θ$ can be determined from $μ$ and vice versa.

As observed from Fig. 3, for a given number of servers per queue $C_{1}$ , AOT increases as a function of $θ$ , exhibiting a progressively steeper slope. Two lower bounds are observed (explained in section 3): The first one is dictated by the arrival rate, as $AOT \geq \frac{1}{λ}$ and is represented in Fig. 3 by the solid horizontal line. The second one is dictated by the service rate, $AOT \geq \frac{1}{C_{1} ∙ μ} = \frac{θ}{λ}$ and represented in Fig. 3 by the solid diagonal line, which is linear in $θ$ , and crosses the first lower bound when $θ = 1$ . When $θ ≪ 1$ , i.e., towards the left side of the graph, performance is dominated by the arrival rate. When $θ ≫ 1$ , i.e., towards the right side of the graph, performance is dominated by the service rate. In all cases, the more limiting bound governs.

As indicated previously, pooling cannot affect the lower bound, but it can affect AOT. Therefore, we focus our attention on the gap between AOT and the lower bound, which is represented by the RID, as defined in section 3. Fig. 4 describes the RID as a function of $θ$ and $C_{1}$ .

Fig. 4 shows clearly that for a given number of servers, the maximum value of RID is obtained when $θ = 1$ and that RID values decrease as θ deviates further from 1. Fig. 4 also better depicts the quantitative effect of the number of servers on the RID for any specific value of $θ$ , for example, the reduction in RID from 50 % to 40 % and 23.6 % when $θ = 1$ as the number of servers per queue increases from 1 to 2, and 8, respectively.

4.2. Pooling with similar arrival rates

In total, 312 scenarios were examined for the case in which the queues are pooled and the arrival rates to the queues are the same. In addition to the parameters we used in section 4.1, we picked 4 values for the number of pooled queues, J: 1, 2, 4 and 8. Note that $J = 1$ describes the case in which the queues are not pooled and is used as a reference for the other scenarios. Also, note that the number of servers per queue, $C_{1}$ , dictates the total number of servers in the system. For example, when there are 6 servers per queue $(C_{1} = 6)$ in a pool of 4 queues $(J = 4)$ , the total number of servers is $C = 24$ .

Examples for the RID values obtained in the examined scenarios are shown in Fig. 5.

From Fig. 5 it can be observed that the general pattern of RID as a function of $θ$ , as discussed above, is similar in all the curves, pooled or not pooled. Additional conclusions from Fig. 5 are: i) For a given $C_{1}$ , RID is a monotonically decreasing function of the number of pooled queues, thus demonstrating the benefit of pooling. For example, in the case of $C_{1} = 6$ , pooling 8 queues reduces RID from 0.265 to 0.086. ii) The largest improvement in RID values is obtained in the transition from the non-pooling option $(J = 1)$ to pooling of two queues $(J = 2)$ . iii) The pooling advantage is more significant when $C_{1}$ is low. For example, when $C_{1} = 2$ , pooling 8 queues reduces RID from 0.4 to 0.126.

4.3. Pooling with different arrival rates

When each queue has a different arrival rate, we analyze the effect of the variance between the arrival rates on system metrics. We selected arrival rates in three dispersion ranges, such that the average of each range is 30 units per hour, and the values within the range are evenly spaced. For example, the values of $λ$ in the range between 20 and 40 when two queues are being pooled are 20 and 40, while in a pool of four queues, the values of $λ$ are 20, 26.67, 33.33 and 40. Note that due to the small number of elements, the manner of dispersion of two items cannot be equivalent to the manner of dispersion of four or eight items. In particular, enlarging the pool lowers the variance between the values of $λ$ . The selected ranges for rates are: $Range = 10, 20, 40$ $(25 \leq λ \leq 35, 20 \leq λ \leq 40, 10 \leq λ \leq 50)$ . The results obtained for these three ranges were compared to the case in which the arrival rates are the same and equal to the average rate of the other ranges $(λ = 30, Range = 0)$ .
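The construction of evenly spaced arrival rates, and the way their variance shrinks as the pool grows, can be sketched as follows. The function name `spread_rates` is a hypothetical helper, and the population variance is used here only as one possible dispersion measure.

```python
from statistics import pvariance

def spread_rates(mean, width, J):
    """J evenly spaced arrival rates centred on `mean`, spanning `width` in total."""
    if J == 1:
        return [mean]
    low = mean - width / 2.0
    step = width / (J - 1)
    return [low + i * step for i in range(J)]


for J in (2, 4, 8):
    rates = spread_rates(30.0, 20.0, J)
    print(J, [round(r, 2) for r in rates], round(pvariance(rates), 2))
```

For $Range = 20$ this reproduces the values 20 and 40 for two queues and 20, 26.67, 33.33 and 40 for four queues. The variance falls as J grows even though the range is fixed.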

In total, 72 scenarios were examined for the case in which the queues are pooled and the arrival rates of the queues are different. In all of them the demand-to-supply ratio, $θ$ , is one. Fig. 6 presents the values of RID obtained in different scenarios in each dispersion range.

Fig. 6 shows that in all of the scenarios, the values of RID increase as the dispersion between arrival rates increases. For each J, increasing $C_{1}$ lowers the influence of the dispersion between arrival rates on RID values. For each $C_{1}$ , RID decreases as J increases. In all of the scenarios RID decreases as $C_{1}$ increases and the most significant improvement in RID is in the transition from $C_{1} = 1$ to $C_{1} = 2$ .

As stated in section 1, based on Fibich et al. [12], since the conditions of differentiability and symmetry hold, the effect of $λ$ -range on RID should be quadratic. Indeed, in our results the correlations between the range squared and the RID were all above 99.6 %. Due to the quadratic relationship, it is sufficient to consider only one range, as representative of all other ranges.

4.4. Pooling vs. non-pooling policies with different arrival rates

Fig. 7 summarizes the main results differently, to further clarify the impacts of pooling. The coordinates of each point in the graph present the RID value obtained in the non-pooling policy (the horizontal axis) and in the pooling policy (the vertical axis) when all the arrival rates are the same. Each point represents a different combination of the number of queues (symbol and color) and the number of servers per queue ( $C_{1}$ ). Note that the non-pooling RID (horizontal axis) is not influenced by the number of queues, and therefore each “column” of points can be associated with a specific value of $C_{1}$ , which is indicated at the bottom of the figure. An explanation for the nearly linear trend in the points of each sequence in this curve is provided in Appendix B.

The impact of variability in arrival rates is shown by “error bars”, i.e. the horizontal and vertical lines stemming from each point (for $J > 1$ ). The horizontal line from each point to the right represents the RID increase for $Range = 20$ in the non-pooling policy. The vertical line from each point upwards represents the RID increase for $Range = 40$ in the pooling policy.

For example, the point (0.5,0.33) in the graph means that for $θ = 1$ , $Range = 0$ , $C_{1} = 1$ and $J = 2$ the RID is 0.5 in the non-pooling policy and 0.333 in the pooling policy, i.e. a ratio of 1.5. Increasing the range to 20 raises the RID value to 0.56 under the non-pooling policy, and increasing the range to 40 raises the RID value to 0.36 under the pooling policy.

Note: given the quadratic impact of range on RID increase (as discussed above), modifying vertical error bars to reflect $Range = 20$ would make them 4 times smaller, and thus barely observable. Similarly, modifying horizontal error bars to reflect $Range = 40$ would make them 4 times larger, thus cluttering the figure and making it illegible. This choice of different ranges is necessary for figure clarity, but it does not permit a proper direct comparison of the impact of arrival rate variability between the pooling and non-pooling cases. Such a comparison will be presented later using Fig. 8.

Fig. 8 — RID increase ratio as a function of $C_{1}$ and J, $Range = 20$ , $θ = 1$ .

Fig. 7 clearly shows that without pooling the impact of variability in arrival rates ( $Range = 20$ ) is quite substantial compared to the RID without variability. For example, in the case of $J = 2$ , $C_{1} = 10$ , the RID of 0.214 when arrival rates are the same increases by 0.08 to 0.294 when $Range = 20$ . With pooling, the relative impact of arrival rate variability is substantially smaller, even for $Range = 40$ . For example, in the case of $J = 2$ , $C_{1} = 1$ , the RID of 0.333 when arrival rates are the same increases by 0.026 to 0.36 when $Range = 40$ .

In addition to the impact of arrival rate variability, Fig. 7 also summarizes the impact of pooling when arrival rates are the same. The points associated with a given number of queues follow straight lines, reflecting constant ratios of pooling to non-pooling RID of 1, 0.66, 0.45 and 0.31 for J = 1, 2, 4, and 8 queues respectively.

Finally, we point out that the enlargement in RID values due to the differences in the arrival rates depends on the number of queues, both in the pooling and in the non-pooling policies. For the same value of $C_{1}$ , both the horizontal line and the vertical line are longer as J decreases (except for one queue, obviously). We conjecture that this trend under the non-pooling policy stems from the differences between the distributions of arrival rates, and especially from the differences between their variances (under the same range) as a function of the number of queues.

As mentioned above, Fig. 7 does not permit a proper direct comparison of the impact of arrival rate variability between the pooling and non-pooling cases. To enable such comparison, Fig. 8 describes the RID increase ratio at $Range = 20$ as a function of the number of servers per queue. For example, for $J = 2$ and $C_{1} = 1$ the RID value in the non-pooling policy increases from 0.5 to 0.559 and in the pooling policy it increases from 0.333 to 0.339, so the RID increase ratio is $\frac{0.559 - 0.5}{0.339 - 0.333} = 9.83$ . This value is the lowest ratio that we obtained by changing policies from the non-pooling policy to the pooling policy. The pooling benefit increases with $C_{1}$ and with J. For example, for eight queues and ten servers the RID increase ratio is 108.68. As Fig. 8 shows, for a given number of queues, the increase ratio is linear in the number of servers per queue ( $R^{2} = 0.9997)$ , according to the following calculations:

$\mathrm{RIDIncreaseRatio} (J = 2) = 2.469 + 6.278 C_{1}$

$\mathrm{RIDIncreaseRatio} (J = 4) = 4.083 + 7.819 C_{1}$

$\mathrm{RIDIncreaseRatio} (J = 8) = 6.089 + 10.17 C_{1}$

While we do not have a theoretical explanation for these linear trends, the match is quite striking. The main conclusion from Fig. 8 is that the impact of the variation between the arrival rates is much larger under the non-pooling policy than under the pooling policy. Put another way, when arrival rates are different the benefit of pooling is substantially higher.
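The arithmetic behind the RID increase ratio can be reproduced from the numbers quoted in the text. The following Python sketch is illustrative only: the function and variable names are hypothetical, and the inputs are the rounded RID values and fitted coefficients reported above, so the outputs inherit that rounding.

```python
# Worked arithmetic for the RID increase ratio, using only values quoted in the text.

def rid_increase_ratio(non_pool_base, non_pool_var, pool_base, pool_var):
    """RID increase without pooling divided by RID increase with pooling."""
    return (non_pool_var - non_pool_base) / (pool_var - pool_base)

# Fitted lines reported in the text: ratio = intercept + slope * C1, keyed by J.
LINEAR_FITS = {2: (2.469, 6.278), 4: (4.083, 7.819), 8: (6.089, 10.17)}

def fitted_ratio(J, C1):
    """Evaluate the reported linear fit for J queues and C1 servers per queue."""
    intercept, slope = LINEAR_FITS[J]
    return intercept + slope * C1

# J = 2, C1 = 1: non-pooling RID 0.5 -> 0.559, pooling RID 0.333 -> 0.339.
example = rid_increase_ratio(0.5, 0.559, 0.333, 0.339)
print(f"J=2, C1=1 (from rounded RIDs): {example:.2f}")
print(f"J=8, C1=10 (from fitted line): {fitted_ratio(8, 10):.2f}")
```

The fitted value for eight queues and ten servers per queue is close to, but not identical with, the directly computed ratio of 108.68 quoted above, as expected from a regression line.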

4. Conclusions and future work

In this paper we examined the contribution of pooling queues with finite waiting spaces while maintaining separated waiting areas for each queue. We compared the non-pooling policy to the pooling policy both when the arrival rates are similar and when they differ. We mainly focused on a special case that involved only a single waiting space in each queue. This type of system finds practical application in container terminals, particularly in the processes of unloading and loading containers.

The main operational measure we examined is the relative interaction delay (RID), which is defined as the gap between the average output time and its lower bound. We found that when the arrival rates are similar, for a given number of servers: the RID is a monotonically decreasing function of the number of pooled queues; the largest improvement in RID values is obtained in the transition from the non-pooling option to pooling of two queues; and the pooling advantage is more significant when the number of servers is low.

When there are different arrival rates, the values of RID increase as the dispersion between arrival rates increases. For each number of queues, increasing the number of servers per queue lowers the influence of the dispersion between arrival rates on RID values. For each number of servers per queue, RID decreases as the number of queues increases. RID decreases as the number of servers per queue increases and the most significant improvement in RID is in the transition from a single server to two servers per queue.

Comparing the two options, we found that without pooling the impact of variability in arrival rates is quite substantial compared to the RID without variability. In addition, the enlargement in RID values due to the differences in the arrival rates depends on the number of queues, both in the pooling and in the non-pooling policies. The pooling benefit increases with the number of servers per queue and with the number of queues.

While this paper offers valuable practical insights, its contribution could be further extended through additional research, which would shed light on a wider range of scenarios. In the context of this paper, we assumed that servers select active queues randomly, without favoring any particular entity type. An alternative avenue for future research could involve exploring different assumptions, where servers consider various characteristics – such as importance, size, and more – when choosing which queue to serve.

Another promising direction for expanding this research involves relaxing the assumption of exponentially distributed service times. Although this assumption has its advantages, it does not cover all possible cases. Therefore, it is important to investigate how different service time distributions might impact the conclusions presented here.

CRediT authorship contribution statement

Hila Hindy Ling: Conceptualization, Formal analysis, Methodology, Software, Writing – original draft, Writing – review & editing. Ran Etgar: Methodology, Supervision, Writing – review & editing. Hillel Bar-Gera: Conceptualization, Formal analysis, Methodology, Supervision, Writing – original draft, Writing – review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Contributor Information

Hila Hindy Ling, Email: hilal@ruppin.ac.il.

Ran Etgar, Email: ranetgar@ruppin.ac.il.

Hillel Bar-Gera, Email: bargera@bgu.ac.il.

Appendix A. Accommodating different service rates

In the models presented throughout this paper, we assumed that the service time distribution, and in particular the average service rate, is the same for all queues. In practice, arrival times and service times may be affected by many factors. Some of the factors are related to the specific item. For example, in the context of container terminals, a specific container may be empty or full. Item-specific factors may contribute to the distribution of arrival and service times within each queue, and they may also cause differences between queues. In the context of container terminals, item-specific attributes may affect the operations of quay cranes (i.e., arrival rates in our model) or yard trucks (i.e., service rates in our model). Other factors, like the experience of a human crane operator, or the maintenance level of the quay crane, may influence the quay crane process and hence lead to different arrival rates, even if the properties of items within each queue are similar. Factors associated with the quay crane and its operator are not likely to affect the yard trucks, i.e., the service rates of different queues. Therefore, at least in the context of container terminals, differences between the average service rates (yard truck operations) of separate queues (different cranes) are often sufficiently modest, so it is appropriate to rely on the models we analyzed so far. In other cases, whether related to container terminals or not, an extension of the proposed models to address different service rates may be useful.

The main implication of the extension for different service rates is a substantial increase in the number of states in the QBD model. Even if the capacity of each queue is limited to one, the total number of states is:

$\sum_{n = 0}^{C} \binom{J + n - 1}{n} + \sum_{i = 1}^{J} \binom{J + C - 1}{C} \cdot \binom{J}{i}$ ,

where the first sum considers states with $n \leq C$ entities in the system, and the second sum considers states with $i \leq J$ queues where the capacity of one is utilized.
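As a quick check of the state-count formula, the sketch below evaluates it in Python. The function name `qbd_state_count` is an illustrative choice and not part of the original model code. For $J = 2$ and $C = 2$ it returns 15, which equals the number of distinct states that appear in Table 5.

```python
from math import comb

def qbd_state_count(J, C):
    """Number of states for J queues, C pooled servers and one waiting space per queue."""
    # States with n <= C entities: multisets of size n drawn from J entity classes.
    not_full = sum(comb(J + n - 1, n) for n in range(C + 1))
    # States with all C servers busy and i >= 1 occupied waiting spaces.
    full = sum(comb(J + C - 1, C) * comb(J, i) for i in range(1, J + 1))
    return not_full + full

for J, C in [(2, 2), (4, 4), (8, 8)]:
    print(f"J={J}, C={C}: {qbd_state_count(J, C)} states")
```

The count grows quickly with $J$ and $C$, which illustrates why the extension to different service rates is substantially more demanding than the base model.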

To further illustrate the implications of such an extension, the remainder of this appendix focuses on the case of two queues with one server in each queue ( $C_{1} = 1$ ), and a maximum queue capacity of one in each queue ( $K = 1$ ). Similar to other models in this paper, we assume Poisson processes, with arrival rates of $λ_{a}$ and $λ_{b}$ , and service rates of $μ_{a}$ and $μ_{b}$ . Possible state transitions are shown in Fig. 9, and their rates are listed in Table 5. RID can be derived following similar procedures to equations (5)–(14), with a few minor modifications.

Fig. 9 — States diagram for 2 classes of customers, 2 servers and maximal capacity of 1 in each queue

Table 5.

Transition matrix for 2 classes of customers, 2 servers and maximal capacity of 1 in each queue.

Rate	From state -> To state
$λ_{a}$	0-1a; 1a-2aa; 2aa-3aaa; 1b-2ab; 2ab-3aba; 2bb-3bba; 3aab-4aaab; 3abb-4abab; 3bbb-4bbab
$λ_{b}$	0-1b; 1b-2bb; 2bb-3bbb; 1a-2ab; 2ab-3abb; 2aa-3aab; 3bba-4bbab; 3aba-4abab; 3aaa-4aaab
$μ_{a}$	1a-0; 2ab-1b; 3abb-2bb; 3aba-2ab; 4aaab-3aaa; 4aaab-3aab
$μ_{b}$	1b-0; 2ab-1a; 3abb-2aa; 3abb-2ab; 4bbab-3bbb; 4bbab-3bba
$2 μ_{a}$	2aa-1a; 3aaa-2aa; 3aab-2ab
$2 μ_{b}$	2bb-1b; 3bbb-2bb; 3bba-2ab
$\frac{μ_{a} + μ_{b}}{2}$	4abab-3aba; 4abab-3abb


The main differences in the derivations are:

1. The lower bound is:

Equation 15. (15)

2. The potential state-dependent arrival rates implied by Table 5 are:

Equation 16. (16)

3. The effective arrival rate is:

Equation 17. (17)

where $S$ is the set of all possible states.

16 scenarios were examined for this case of two queues, each with its own arrival rate and service rate. The selected ranges for arrival rates were $Range = 10, 20, 40$ (respectively $λ_{a} = 25, λ_{b} = 35$ ; $λ_{a} = 20, λ_{b} = 40$ ; and $λ_{a} = 10, λ_{b} = 50$ ). The results obtained for these three ranges were compared to the case in which the arrival rates are the same and equal to the average rate of the other ranges $(λ_{a} = λ_{b} = 30, Range = 0)$ . Four service-rate ratios between the two queues were selected: 1:1, 2:1, 3:1 and 4:1. The service rate values were then set so that the supply to demand ratio, $θ$ , equals one in all the scenarios. The RID values obtained in the 16 scenarios are presented in Fig. 10.

As observed in Fig. 10, the simple instance where the ratio is 1 (implying identical service rates) aligns with the outcomes illustrated in Fig. 6. The greater the ratio (signifying a disparity in service rates), the more pronounced the departure of the Average Output Time (AOT) from the lower bound. Similarly, heightened divergence from the lower bound occurs as the discrepancy between arrival rates increases.

Appendix B. Formal analysis of pooling efficiency

One of the key findings that is shown consistently in the numerical analysis is that pooling enhances efficiency. The purpose of this appendix is to provide a formal mathematical analysis of this finding under one simplifying assumption, namely that the capacity in each queue is one (K = 1), i.e. that we can use the BD model of scenario 6. Recall that in scenario 6 the number of states is $C + J$ , where $C = C_{1} J$ .

Using node balance equations:

Equation 18.

(18)

Equation 19.

(19)

We find the connection between each state probability and the probability of the “critical” state where all servers are busy but there is still no item in the queue:

Equation 20.

(20)

Equation 21.

(21)

We can rewrite (20–21) as $P_{C - i} = P_{C} θ^{- i} f_{C, i}$ and $P_{C + j} = P_{C} θ^{j} f_{J, j}$ , by denoting:

$f_{N, k} = \frac{N!}{N^{k} (N - k)!}$ (22)

Lemma 1

For a given $1 \leq k \leq N$ , the sequence $f_{N, k}$ is non-decreasing in $N$ , and $0 < f_{N, k} \leq 1$ ; for $k \geq 2$ it is strictly increasing in $N$ and $f_{N, k} < 1$ .

Proof: for $1 \leq k \leq N - 1$ we can rewrite the term as $f_{N, k} = \frac{N!}{N^{k} (N - k)!} = \prod_{j = 1}^{k - 1} \frac{N - j}{N}$ . For $k = 1$ the product is empty, so $f_{N, 1} = 1$ for every $N$ . For a given $1 \leq j \leq N - 1$ , each factor $\frac{N - j}{N}$ is increasing with $N$ , and $0 < \frac{N - j}{N} < 1$ . Therefore, for $k \geq 2$ these properties are maintained in the product $f_{N, k}$ . For $k = N$ we get $f_{N, k} = \frac{N!}{N^{N}}$ and we also have that $0 \leq \frac{N!}{N^{N}} \leq \frac{(N + 1)!}{{(N + 1)}^{N}} \leq 1$ .

Comment: for $k = 0$ , clearly $f_{N, k} = 1$ , is constant for any $N$ .
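The properties in Lemma 1 are easy to confirm numerically. The sketch below is a minimal check using exact rational arithmetic and the closed form $f_{N, k} = \frac{N!}{N^{k} (N - k)!}$ from the proof; the function names and the tested range of $N$ are arbitrary illustrative choices. Because $f_{N, 1} = 1$ for every $N$ , the bounds are checked as $0 < f_{N, k} \leq 1$ , with strict inequalities only for $k \geq 2$ .

```python
from fractions import Fraction
from math import prod

def f(N, k):
    """Exact f_{N,k} = N! / (N^k (N-k)!), written as a product of k factors (N-j)/N."""
    return prod((Fraction(N - j, N) for j in range(k)), start=Fraction(1))

def check_lemma_1(max_n=30):
    """Check bounds and monotonicity in N for all 1 <= k <= N <= max_n."""
    for N in range(1, max_n + 1):
        for k in range(1, N + 1):
            value = f(N, k)
            if not (0 < value <= 1):
                return False
            if f(N + 1, k) < value:  # non-decreasing in N
                return False
            if k >= 2 and not (f(N + 1, k) > value and value < 1):
                return False  # strict for k >= 2
    return True

print("Lemma 1 holds on the tested range:", check_lemma_1())
print("f(4, 2) =", f(4, 2), " f(5, 0) =", f(5, 0))
```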

To find the critical state probability, $P_{C}$ , we use the fact that state probabilities add up to one,

$\sum_{1 \leq i \leq C} P_{C} θ^{- i} f_{C, i} + P_{C} + \sum_{1 \leq j \leq J} P_{C} θ^{j} f_{J, j} = 1$ (23)

This expression can be rewritten as $P_{C} (F (C, θ^{- 1}) + 1 + F (J, θ)) = 1$ by denoting:

$F (N, α) = \sum_{1 \leq k \leq N} f_{N, k} α^{k}$ (24)

Hence $P_{C} = {(F (C, θ^{- 1}) + 1 + F (J, θ))}^{- 1}$ .

Lemma 2

For a given value of $α > 0$ , the function $F (N, α)$ has the following properties:

a. $F (N, α)$ is increasing with $N$ .

b. If $α < 1$ then $F (N, α) \leq \frac{α}{1 - α}$ .

c. For a given $N$ , $\lim_{α \to 0} F (N, α) = 0$ .

d. For a given $N$ , $\lim_{α \to \infty} F (N, α) = \infty$ .

Proof.

a. Compare the polynomials $F (N + 1, α)$ and $F (N, α)$ . The coefficients of $α^{k}$ for $1 \leq k \leq N$ in these two functions are $f_{N + 1, k}$ and $f_{N, k}$ respectively. By Lemma 1 we know that $f_{N + 1, k} \geq f_{N, k}$ . In addition, the coefficient of $α^{N + 1}$ in the first polynomial is $f_{N + 1, N + 1} > 0$ while in the second polynomial it is zero.

b. $F (N, α) = \sum_{1 \leq k \leq N} f_{N, k} α^{k} = \sum_{1 \leq k \leq N} \frac{N!}{N^{k} (N - k)!} α^{k} \leq \sum_{1 \leq k \leq ∞} α^{k} = \frac{α}{1 - α}$ .

c. For any $1 \leq k \leq N$ , $\lim_{α \to 0} α^{k} = 0$ , and the $f_{N, k}$ are constants. Hence, $\lim_{α \to 0} F (N, α) = 0$ .

d. For any $1 \leq k \leq N$ , $\lim_{α \to \infty} α^{k} = \infty$ , and the $f_{N, k}$ are positive constants. Hence, $\lim_{α \to \infty} F (N, α) = \infty$ .

QED.
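To make the role of $F (N, α)$ concrete, the following sketch evaluates the polynomial $F (N, α) = \sum_{1 \leq k \leq N} f_{N, k} α^{k}$ used in the proof of property b, checks properties a and b of Lemma 2 on a small grid, and computes the critical state probability from the closed form given above. It is a floating-point illustration under those formulas only; the function names and the tested grid are hypothetical choices.

```python
from math import prod

def f(N, k):
    """f_{N,k} = N! / (N^k (N-k)!), computed as a product of k factors (N-j)/N."""
    return prod((N - j) / N for j in range(k))

def F(N, alpha):
    """F(N, alpha) = sum over 1 <= k <= N of f_{N,k} * alpha^k."""
    return sum(f(N, k) * alpha ** k for k in range(1, N + 1))

def critical_state_probability(J, C, theta):
    """Probability that all C servers are busy and all J waiting spaces are empty."""
    return 1.0 / (F(C, 1.0 / theta) + 1.0 + F(J, theta))

def check_lemma_2(max_n=25, alphas=(0.2, 0.5, 0.9, 1.0, 2.0)):
    """Numerically check properties a and b of Lemma 2 on a small grid."""
    for alpha in alphas:
        for N in range(1, max_n + 1):
            if not F(N + 1, alpha) > F(N, alpha):  # property a
                return False
            if alpha < 1 and not F(N, alpha) <= alpha / (1 - alpha):  # property b
                return False
    return True

print("Properties a and b hold on the grid:", check_lemma_2())
print("F(5, 1e-6) =", F(5, 1e-6), " F(5, 1e6) =", F(5, 1e6))  # properties c and d
print("Critical state probability for J=2, C=2, theta=1:",
      critical_state_probability(2, 2, 1.0))
```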

Next, to find the effective arrival rate, we multiply each state probability by the total arrival rate in that state.

$Equation 25.$ (25)

$Equation 26.$ (26)

This expression can be rewritten as $λ_{eff} = P_{C} λ J [F (C, θ^{- 1}) + 1 + \hat{F} (J, θ)]$ , by denoting:

$\hat{F} (N, α) = \sum_{1 \leq k \leq N} f_{N, k} \frac{N - k}{N} α^{k}$ (27)

Lemma 3

$\hat{F} (N, α) = α^{- 1} F (N, α) - 1$ and equivalently $F (N, α) = α (1 + \hat{F} (N, α))$ .

Proof:

$Equation 28.$ (28)

$Equation 29.$ (29)

$Equation 30.$ (30)

$Equation 31.$ (31)

$Equation 32.$ (32)

QED.

Lemma 4

$F (N, α) - \hat{F} (N, α) = 1 - (α^{- 1} - 1) F (N, α)$ converges to zero as $α$ approaches zero.

Proof: $F (N, α) - \hat{F} (N, α) = \sum_{1 \leq k \leq N} f_{N, k} \frac{k}{N} α^{k}$ , and for any $1 \leq k \leq N$ , $\lim_{α \to 0} α^{k} = 0$ .
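Lemmas 3 and 4 can be checked numerically. The sketch below assumes that $\hat{F} (N, α) = \sum_{1 \leq k \leq N} f_{N, k} \frac{N - k}{N} α^{k}$ , which is the form implied by the difference $\sum_{1 \leq k \leq N} f_{N, k} \frac{k}{N} α^{k}$ in the proof of Lemma 4; this form, the function names and the tested grid are assumptions made for illustration.

```python
from math import isclose, prod

def f(N, k):
    """f_{N,k} = N! / (N^k (N-k)!), computed as a product of k factors (N-j)/N."""
    return prod((N - j) / N for j in range(k))

def F(N, alpha):
    """F(N, alpha) = sum over 1 <= k <= N of f_{N,k} * alpha^k."""
    return sum(f(N, k) * alpha ** k for k in range(1, N + 1))

def F_hat(N, alpha):
    """Assumed form of F-hat: each term of F weighted by (N - k) / N."""
    return sum(f(N, k) * (N - k) / N * alpha ** k for k in range(1, N + 1))

def check_lemmas_3_and_4(max_n=12, alphas=(0.1, 0.5, 1.0, 2.0)):
    """Check the identities of Lemmas 3 and 4 on a small grid."""
    for N in range(1, max_n + 1):
        for alpha in alphas:
            # Lemma 3: F_hat = F / alpha - 1
            if not isclose(F_hat(N, alpha), F(N, alpha) / alpha - 1,
                           rel_tol=1e-9, abs_tol=1e-9):
                return False
            # Lemma 4: F - F_hat = 1 - (1/alpha - 1) * F
            if not isclose(F(N, alpha) - F_hat(N, alpha),
                           1 - (1 / alpha - 1) * F(N, alpha),
                           rel_tol=1e-9, abs_tol=1e-9):
                return False
    return True

print("Identities hold on the grid:", check_lemmas_3_and_4())
for alpha in (1e-1, 1e-3, 1e-6):
    # The difference shrinks towards zero as alpha approaches zero (Lemma 4).
    print(alpha, F(3, alpha) - F_hat(3, alpha))
```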

Based on Lemma 3, the effective arrival rate can be written as

$Equation 33.$ (33)

$Equation 34.$ (34)

$Equation 35.$ (35)

$Equation 36.$ (36)

$Equation 37.$ (37)

Lemma 5

pooling improves efficiency, in other words for a given load $θ$ and servers per queue $C_{1}$ , the value of RID decreases with the number of pooled queues $J$ .

Proof: the total number of servers $C = J C_{1}$ increases with the number of pooled queues $J$ , and therefore by Lemma 2 both $F (J, θ)$ and $F (C, θ^{- 1})$ are increasing with $J$ . As a result, for both ranges of $θ$ , the RID formula in eq. (37) has a decreasing numerator and an increasing denominator.

The claim in Lemma 5 about the effectiveness of pooling is qualitative. Quantitative estimates for the effect of pooling can be derived in two special cases: when comparing $J = 1$ with $J = 2$ with $C_{1} = 1$ ; and when the load is balanced $θ = 1$ .

In the first special case, since $F (1, α) = α$ , and $F (2, α) = α + 0.5 α^{2}$ ,

$Equation 38.$ (38)

$Equation 39.$ (39)

We see that, with or without pooling, RID is increasing while $θ \leq 1$ and decreasing while $θ \geq 1$ , with a peak at $θ = 1$ . The pooling improvement ratio is:

$Equation 40.$ (40)

The ratio is always above one, so pooling is indeed effective for any value of $θ$ , but the improvement is monotonically decreasing, approaching infinity as $θ$ approaches zero, and vanishing (ratio of one) when $θ$ approaches infinity.

To evaluate the second special case of a balanced load, we use an approximation for $F (N, 1)$ , which is presented next.

$Equation 41.$ (41)

Now let $F_{a} (N) = \frac{N! e^{N}}{N^{N}}$ , and $F_{b} (N) = \sum_{0 \leq i < N} \frac{N^{i} e^{- N}}{i!}$ , thus $F (N, 1) = F_{a} (N) F_{b} (N)$ . By the Stirling approximation $F_{a} (N) \approx {(2 π N)}^{0.5}$ . For a Poisson random variable with mean $N$ , i.e. $Y \sim Pois (N)$ , the term $F_{b} (N) = P (Y < N)$ is the probability of $Y$ being less than its mean. By the central limit theorem, as $N$ increases the Poisson distribution can be approximated by the normal distribution, and therefore (as is quite well known) $\lim_{N \to \infty} F_{b} (N) = 0.5$ . Since $P (Y \leq N) = F_{b} (N) + 1 / F_{a} (N)$ , the sequence $F_{b} (N) + 0.5 / F_{a} (N)$ converges much more quickly to 0.5. Therefore, a slightly better approximation is $F_{b} (N) \approx 0.5 - \frac{0.5}{F_{a} (N)} \approx 0.5 - \frac{1}{2} {(2 π N)}^{- 0.5}$ . A numerical evaluation of the error in this approximation suggests that an even better approximation might be $F_{b} (N) \approx 0.5 - \frac{1}{3} {(2 π N)}^{- 0.5}$ , with a maximal absolute difference of 1E-4 in the range of $1 \leq N \leq 100$ . Using the latter approximation leads to $F (N, 1) \approx {(\frac{π N}{2})}^{0.5} - \frac{1}{3}$ . In the range of $1 \leq N \leq 100$ this approximation is slightly too low, with an error term that decreases monotonically from 0.08 to 0.01. Substituting the simpler approximation $F (N, 1) \approx {(\frac{π N}{2})}^{0.5}$ in the RID for $θ = 1$ ,

$Equation 42.$ (42)

we get that:

$Equation 43.$ (43)

Meaning that the approximated RID is inversely proportional to the square root of the number of pooled queues. This observation explains the almost perfectly linear trends that can be observed in Fig. 7.
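The balanced-load case can be explored numerically. The sketch below assumes that for $θ = 1$ the RID takes the form $\frac{1}{F (C, 1) + F (J, 1)}$ , obtained by combining the expression for the effective arrival rate with Lemma 3; this assumed form reproduces the values 0.5, 0.333 and 0.214 quoted in the discussion of Fig. 7. The function names are illustrative, and the approximation simply substitutes $F (N, 1) \approx {(\frac{π N}{2})}^{0.5}$ .

```python
from math import pi, prod, sqrt

def f(N, k):
    """f_{N,k} = N! / (N^k (N-k)!), computed as a product of k factors (N-j)/N."""
    return prod((N - j) / N for j in range(k))

def F_balanced(N):
    """Exact F(N, 1) = sum over 1 <= k <= N of f_{N,k}."""
    return sum(f(N, k) for k in range(1, N + 1))

def rid_balanced(J, C1):
    """Assumed balanced-load RID: 1 / (F(C, 1) + F(J, 1)) with C = J * C1."""
    return 1.0 / (F_balanced(J * C1) + F_balanced(J))

def rid_balanced_approx(J, C1):
    """Same expression with F(N, 1) replaced by sqrt(pi * N / 2)."""
    return sqrt(2.0 / (pi * J)) / (1.0 + sqrt(C1))

# Error of the refined approximation sqrt(pi*N/2) - 1/3 at both ends of the range.
for N in (1, 100):
    print(N, F_balanced(N) - (sqrt(pi * N / 2) - 1 / 3))

# Pooling-to-non-pooling RID ratio for C1 = 1, next to the 1/sqrt(J) prediction.
base = rid_balanced(1, 1)
for J in (1, 2, 4, 8):
    print(J, round(rid_balanced(J, 1) / base, 3), round(J ** -0.5, 3))
```

Under these assumptions the exact ratios for $J = 2, 4, 8$ land close to the 0.66, 0.45 and 0.31 read from Fig. 7, while the $1 / \sqrt{J}$ prediction is somewhat higher because the simpler approximation drops the $- \frac{1}{3}$ term.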

In addition to the effect of pooling, the above derivations also cover the analysis of the effect of the number of servers per queue, $C_{1}$ , under the assumption that the load $θ$ remains the same, that is when we compare several slow servers to one fast server. In the remainder of this appendix, we examine the effect of $θ$ on RID. Using eq. (37) we can show that the RID is always positive (by property b of Lemma 2). The RID converges to zero as $θ$ approaches either zero or infinity, since in both cases the numerator converges to zero (by Lemma 4) and the denominator to infinity (by property d of Lemma 2). Lemma 6 shows that for given values of $J$ and $C$ , the RID peaks at $θ = 1$ , increasing monotonically up to the peak, and decreasing monotonically from there on.

Lemma 6

RID is monotonically increasing when $0 \leq θ \leq 1$ and monotonically decreasing when $θ \geq 1$ .

Proof:

Using the notations

$Equation 44.$ (44)

$Equation 45.$ (45)

The RID equation can be rewritten as

$Equation 46.$ (46)

To prove Lemma 6, we need to show that $R (N, M, α)$ is increasing with $α \in [0, 1]$ for any pair of values, $N$ and $M$ . The numerator in $\frac{\partial}{\partial α} R (N, M, α)$ can be written as the sum of three terms:

$Equation 47.$ (47)

$Equation 48.$ (48)

$Equation 49.$ (49)

We will show that all three are positive, and therefore $\frac{\partial}{\partial α} R (N, M, α) > 0$ . Both $F (N, α) = \sum_{1 \leq k \leq N} f_{N, k} α^{k}$ and $Δ (N, α) = \sum_{1 \leq k \leq N} f_{N, k} \frac{k}{N} α^{k}$ are always positive and increasing with $α \geq 0$ . Therefore, $\frac{\partial}{\partial α} Δ (N, α) > 0$ and $\frac{\partial}{\partial α} F (M, α^{- 1}) > 0$ , and thus $A_{1} > 0$ and $A_{2} > 0$ . It remains to show that $A_{3} > 0$ or equivalently that $α^{2} A_{3} > 0$ .

$Equation 50.$ (50)

$Equation 51.$ (51)

$Equation 52.$ (52)

$Equation 53.$ (53)

Substituting

$Equation 54.$ (54)

We get the first component of $A_{3}$ :

$Equation 55.$ (55)

The second component of $A_{3}$ is:

$Equation 56.$ (56)

And the difference between them simplifies to:

$Equation 57.$ (57)

Recall that:

$Equation 58.$ (58)

By substitution we get:

$Equation 59.$ (59)

$Equation 60.$ (60)

Since

$Equation 61.$ (61)

We see that:

$Equation 62.$ (62)

To show that $α^{2} D_{3} > 0$ we show that all the coefficients of $α^{k}$ are non-negative, and some are positive. Recall that $N \cdot Δ (N, α) = \sum_{1 \leq k \leq N} k f_{N, k} α^{k}$ , and

$Equation 63.$ (63)

Therefore, for $N < k \leq 2 N$ , the coefficient of $α^{k}$ is clearly positive. The coefficient of $α^{k}$ for $1 \leq k \leq N$ is:

$Equation 64.$ (64)

To show that this coefficient is positive, the final step in the proof is obtained by showing that $f_{N, k_{1}} \cdot f_{N, k_{2}} \geq f_{N, k_{1} + k_{2}}$ . Indeed:

$Equation 65.$ (65)

QED.
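The inequality $f_{N, k_{1}} \cdot f_{N, k_{2}} \geq f_{N, k_{1} + k_{2}}$ used in the final step can be verified exhaustively for small $N$ . The sketch below uses exact rational arithmetic and assumes the inequality is needed for all $k_{1}, k_{2} \geq 1$ with $k_{1} + k_{2} \leq N$ ; the function names and this index range are illustrative assumptions.

```python
from fractions import Fraction
from math import prod

def f(N, k):
    """Exact f_{N,k} = N! / (N^k (N-k)!), written as a product of k factors (N-j)/N."""
    return prod((Fraction(N - j, N) for j in range(k)), start=Fraction(1))

def check_final_inequality(max_n=20):
    """Check f(N,k1) * f(N,k2) >= f(N,k1+k2) for all k1, k2 >= 1 with k1 + k2 <= N."""
    for N in range(2, max_n + 1):
        for k1 in range(1, N):
            for k2 in range(1, N - k1 + 1):
                if f(N, k1) * f(N, k2) < f(N, k1 + k2):
                    return False
    return True

print("Inequality holds on the tested range:", check_final_inequality())
print("Example, N=3:", f(3, 1) * f(3, 2), ">=", f(3, 3))
```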

References

1. Smith D.R., Whitt W. Resource sharing for efficiency in traffic systems. Bell System Technical Journal. 1981;60(1):39–55.
2. Mandelbaum A., Reiman M.I. On pooling in queueing networks. Manag. Sci. 1998;44(7):971–981.
3. Gans N., Koole G., Mandelbaum A. Telephone call centers: tutorial, review, and research prospects. Manuf. Serv. Oper. Manag. 2003;5(2):79–141.
4. Jouini O., Dallery Y., Nait-Abdallah R. Analysis of the impact of team-based organizations in call center management. Manag. Sci. 2008;54(2):400–414.
5. Ata B., Van Mieghem J.A. The value of partial resource pooling: should a service network be integrated or product-focused? Manag. Sci. 2009;55(1):115–131.
6. Song H., Tucker T.L., Murrell K.L. The diseconomies of queue pooling: an empirical investigation of emergency department length of stay. Manag. Sci. 2015;61(12):3032–3053.
7. Armony M., Roels G., Song H. Pooling queues with strategic servers: the effects of customer ownership. Oper. Res. 2021;69(1):13–29.
8. Morse P.M., Feshbach H. Methods of theoretical physics. Am. J. Phys. 1954;22(6):410–413.
9. Tiwari S.K., Gupta V.K., Joshi T.N. M/M/S queueing theory model to solve waiting line and to minimize estimated total cost. Int. J. Sci. Res. 2016;5(5):1901–1904.
10. Ding D., Fan X., Zhao Y., Kang K., Yin Q., Zeng J. Q-learning based dynamic task scheduling for energy-efficient cloud computing. Future Generat. Comput. Syst. 2020;108:361–371.
11. Green L., Yih Y. Columbia University; New York: 2011. Handbook of Healthcare Delivery Systems.
12. Fibich G., Gavious A., Solan E. Averaging principle for second-order approximation of heterogeneous models with homogeneous models. Proc. Natl. Acad. Sci. USA. 2012;109(48):19545–19550. doi: 10.1073/pnas.1206867109.
13. Zhang C., Liu J., Wan Y.W., Murty K.G., Linn R.J. Storage space allocation in container terminals. Transp. Res. Part B Methodol. 2003;37(10):883–903.
14. Carlo H.J., Vis I.F., Roodbergen K.J. Transport operations in container terminals: literature overview, trends, research directions and classification scheme. Eur. J. Oper. Res. 2014;236(1):1–13.
15. Dhingra V., Roy D., de Koster R.B. A cooperative quay crane-based stochastic model to estimate vessel handling time. Flex. Serv. Manuf. J. 2017;29:97–124.
16. Zhang X., Zeng Q., Sheu J.-B. Modeling the productivity and stability of a terminal operation system with quay crane double cycling. Transport. Res. E Logist. Transport. Rev. 2019;122:181–197.
17. Easa S.M. Approximate queueing models for analyzing harbor terminal operations. Transp. Res. Part B Methodol. 1987;21(4):269–286.
