Skip to main content
Springer logoLink to Springer
. 2021 Jun 9;83(9):2803–2832. doi: 10.1007/s00453-021-00841-8

Scheduling in the Random-Order Model

Susanne Albers 1, Maximilian Janke 1,
PMCID: PMC8550002  PMID: 34720298

Abstract

Makespan minimization on identical machines is a fundamental problem in online scheduling. The goal is to assign a sequence of jobs to m identical parallel machines so as to minimize the maximum completion time of any job. Already in the 1960s, Graham showed that Greedy is (2-1/m)-competitive. The best deterministic online algorithm currently known achieves a competitive ratio of 1.9201. No deterministic online strategy can obtain a competitiveness smaller than 1.88. In this paper, we study online makespan minimization in the popular random-order model, where the jobs of a given input arrive as a random permutation. It is known that Greedy does not attain a competitive factor asymptotically smaller than 2 in this setting. We present the first improved performance guarantees. Specifically, we develop a deterministic online algorithm that achieves a competitive ratio of 1.8478. The result relies on a new analysis approach. We identify a set of properties that a random permutation of the input jobs satisfies with high probability. Then we conduct a worst-case analysis of our algorithm, for the respective class of permutations. The analysis implies that the stated competitiveness holds not only in expectation but with high probability. Moreover, it provides mathematical evidence that job sequences leading to higher performance ratios are extremely rare, pathological inputs. We complement the results by lower bounds, for the random-order model. We show that no deterministic online algorithm can achieve a competitive ratio smaller than 4/3. Moreover, no deterministic online algorithm can attain a competitiveness smaller than 3/2 with high probability.

Keywords: Scheduling, Makespan minimization, Online algorithm, Competitive analysis, Lower bound, Random-order

Introduction

We study one of the most basic scheduling problems. Consider a sequence of jobs J=J1,,Jn that has to be assigned to m identical parallel machines. Each job Jt has an individual processing time pt, 1tn. Preemption of jobs is not allowed. The goal is to minimize the makespan, i.e. the maximum completion time of any job in the constructed schedule. Both the offline and online variants of this problem have been studied extensively, see e.g. [4, 11, 14, 19, 21, 34] and references therein.

We focus on the online setting, where jobs arrive one by one. Whenever a job Jt is presented, its processing time pt is revealed. The job has to be scheduled immediately on one of the machines without knowledge of any future jobs Js, with s>t. Given a job sequence J, let A(J) denote the makespan of an online algorithm A on J. Let OPT(J) be the optimum makespan. A deterministic online algorithm A is c-competitive if A(J)c·OPT(J) holds for all J [39]. The best competitive ratio that can be achieved by deterministic online algorithms is in the range [1.88, 1.9201], see 14, 35. No randomized online algorithm is known that beats deterministic ones, for general m.

In this paper we investigate online makespan minimization in the random-order model. Here an input instance/job sequence is chosen by an adversary. Then a random permutation of the input elements/jobs arrives. The random-order model was considered by Dynkin [10] and Lindley [29] for the secretary problem. Over the last years the framework has received quite some research interest and many further problems have been studied. These include generalized secretary problems [2, 3, 13, 28, 29], the knapsack problem [2, 28], bin packing [26], facility location [31], matching problems [17, 22, 30], packing LPs [27] and convex optimization [20].

We present an in-depth study of online makespan minimization in the random-order model. As a main contribution we devise a new deterministic online algorithm that achieves a competitive ratio of 1.8478. After almost 20 years this is the first progress for the pure online setting, where an algorithm does not resort to extra resources in handling a job sequence.

Previous Work

We review the most important results relevant to our work and first address the standard setting where an online algorithm must schedule an arbitrary, worst-case job sequence. Graham in 1966 showed that the famous Greedy algorithm, which assigns each job to a least loaded machine, is (2-1m)-competitive. Using new deterministic strategies the competitiveness was improved in a series of papers. Galambos and Woeginger [15] gave an algorithm with a competitive ratio of (2-1m-ϵm), where ϵm tends to 0 as m. Bartal et al. [4] devised a 1.986-competitive algorithm. The bound was improved to 1.945 [23] and 1.923 [1]. Fleischer and Wahl [14] presented an algorithm that attains a competitive ratio of 1.9201 as m. Chen et al. [7] gave an algorithm whose competitiveness is at most 1+ε times the best possible factor, but no explicit bound was provided. Lower bounds on the competitive ratio of deterministic online algorithms were shown in [1, 5, 12, 18, 35, 36]. For general m, the bound was raised from 1.707 [12] to 1.837 [5] and 1.854 [18]. Rudin [35] showed that no deterministic strategy has a competitiveness smaller than 1.88.

For randomized online algorithms, there is a significant gap between the best known upper and lower bounds. For m=2 machines, Bartal et al. [4] presented an algorithm that achieves an optimal competitive ratio of 4/3. To date, there exists no randomized algorithm whose competitiveness is smaller than the deterministic lower bound, for general m. The best known lower bound on the performance of randomized online algorithms tends to e/(e-1)1.581 as m [6, 38].

Recent research on makespan minimization has examined settings where an online algorithm is given extra resources when processing a job sequence. Specifically, an algorithm might have a buffer to reorder the incoming job sequence [11, 25] or is allowed to migrate jobs [37]. Alternatively, an algorithm has information on the job sequence [8, 9, 24, 25], e.g. it might know the total processing time of the jobs or even the optimum makespan.

In the random-order model only one result is known for makespan minimization on identical machines. Osborn and Torng [33] showed that Greedy does not achieve a competitive ratio smaller than 2 as m. Recently Molinaro [32] studied online load balancing with the objective to minimize the lp-norm of the machine loads. He considers a general scenario with machine-dependent job processing times, which are bounded by 1. For makespan minimization he presents an algorithm that, in the worst case, is O(logm/ε)-competitive and, in the random-order model, has an expected makespan of (1+ε)OPT(J)+O(logm/ε), for any ε(0,1]. Göbel et al. [16] consider a scheduling problem on one machine where the goal is to minimize the average weighted completion time of all jobs. Under random-order arrival, their competitive ratio is logarithmic in n, the number of jobs, for the general problem and constant if all jobs have processing time 1.

Our Contribution

We investigate online makespan minimization in the random-order model, a sensible and widely adopted input model to study algorithms beyond the worst case. Specifically, we develop a new deterministic algorithm that achieves a competitive ratio of 1.8478 as m. This is the first improved performance guarantee in the random-order model. The competitiveness is substantially below the best known ratio of 1.9201 in the worst-case setting and also below the corresponding lower bound of 1.88 in that framework.

A new feature of our algorithm is that it schedules an incoming job on one of three candidate machines in order to maintain a certain load profile. The best strategies in the worst-case setting use two possible machines, and it is not clear how to take advantage of additional machines in that framework. The choice of our third, extra machine is quite flexible: An incoming job is placed either on a least loaded, a heavily loaded or—as a new option—on an intermediate machine. The latter one is the (h+1)st least loaded machine, where h may be any integer with hω(1) and ho(m).

When assigning a job to a machine different from the least loaded one, an algorithm has to ensure that the resulting makespan does not exceed c times the optimum makespan, for the targeted competitive ratio c. All previous strategies in the literature lower bound the optimum makespan by the current average load on the machines. Our new algorithm works with a refined lower bound that incorporates the processing times of the largest jobs seen so far. The lower bound is obvious but has not been employed by previous algorithms.

The analysis of our algorithm proceeds in two steps. First we define a class of stable job sequences. These are sequences that reveal information on the largest jobs as processing volume is scheduled. More precisely, once a certain fraction of the total processing volume t=1npt has arrived, one has a good estimate on the hth largest job and has encountered a certain number of the m+1 largest jobs in the input. The exact parameters have to be chosen carefully.

We prove that with high probability, a random permutation of a given input of jobs is stable. We then conduct a worst-case analysis of our algorithm on stable sequences. Using their properties, we show that if the algorithm generates a flat schedule, like Greedy, and can be hurt by a huge job, then the input must contain many large jobs so that the optimum makespan is also high. A new ingredient in the worst-case analysis is the processing time of the hth largest job in the input. We will relate it to machine load in the schedule and to the processing time of the (m+1)st largest job; twice the latter value is a lower bound on the optimum makespan.

The analysis implies that the competitive ratio of 1.8478 holds with high probability. Input sequences leading to higher performance ratios are extremely rare. We believe that our analysis approach might be fruitful in the study of other problems in the random-order model: Identify properties that a random permutation of the input elements satisfies with high probability. Then perform a worst-case analysis.

Finally in this paper we devise lower bounds for the random-order model. We prove that no deterministic online algorithm achieves a competitive ratio smaller than 4/3. Moreover, if a deterministic online algorithm is c-competitive with high probability, then c3/2.

Strong Competitiveness in the Random-Order Model

We define competitiveness in the random-order model and introduce a stronger measure of competitiveness that implies high-probability bounds. Recall that traditionally a deterministic online algorithm A is c-competitive if A(J)c·OPT(J) holds for all job sequences J=J1,,Jn. We will refer to this worst-case model also as the adversarial model.

In the random-order model a job sequence J=J1,,Jn is given, which may be specified by an adversary. (Alternatively, a set of jobs could be specified.) Then a random permutation of the jobs arrives. We define the expected cost / makespan of a deterministic online algorithm. Let Sn be the permutation group of the integers from 1 to n, which we consider a probability space under the uniform distribution, i.e. each permutation in Sn is chosen with probability 1/n!. Given σSn, let Jσ=Jσ(1),,Jσ(n) be the job sequence permuted by σ. The expected makespan of A on J in the random-order model is Arom(J)=EσSn[A(Jσ)]=1n!σSnA(Jσ). The algorithm A is c-competitive in the random-order model if Arom(J)c·OPT(J) holds for all job sequences J.

We next define the notion of a deterministic online algorithm A being nearly c-competitive. The second condition in the following definition requires that the probability of A not meeting the desired performance ratio must be arbitrarily small as m grows and a random permutation of a given job sequence arrives. The subsequent Lemma 1 states that a nearly c-competitive algorithm is c-competitive in the random-order model.

Definition 1

A deterministic online algorithm A is called nearly c-competitive if the following two conditions hold.

  • Algorithm A achieves a constant competitive ratio in the adversarial model.

  • For every ε>0, there exists an m(ε) such that for all machine numbers mm(ε) and all job sequences J there holds PσSn[A(Jσ)(c+ε)OPT(J)]ε.

Lemma 1

If a deterministic online algorithm is nearly c-competitive, then it is c-competitive in the random-order model as m.

Proof

Let C be the constant such that A is C-competitive in the adversarial model. We may assume that C>c. Given 0<δC-c, we show that there exists an m(δ) such that, for all mm(δ), we have Arom(J)(c+δ)OPT(J) for every job sequences J. Let ε=δ/(C-c+1). Since A is nearly c-competitive, there exists an m(ε) such that, for all mm(ε) and all inputs J, there holds Pε(J)=PσSn[A(Jσ)(c+ε)OPT(J)]ε. Set m(δ)=m(ε). We obtain

Arom(J)=EσSn[A(Jσ)](1-Pε(J))(c+ε)OPT(J)+Pε(J)·C·OPT(J)((1-ε)(c+ε)+εC)OPT(J)(c+ε(C-c+1))OPT(J)=(c+δ)OPT(J).

Description of the New Algorithm

The deficiency of Greedy is that it tends to generate a flat, balanced schedule in which all the machines have approximately the same load. An incoming large job can then enforce a high makespan relative to the optimum one. It is thus crucial to try to avoid flat schedules and maintain steep schedules that exhibit a certain load imbalance among the machines.

However, in general, this is futile. Consider a sequence of m identical jobs with a processing time of, say, Pm+1 (referring to the size of the (m+1)st largest job in an input). Any online algorithm that is better than 2-competitive must schedule these m jobs on separate machines, obtaining the flattest schedule possible. An incoming even larger job of processing time pmax will now enforce a makespan of Pm+1+pmax. Observe that OPTmax{2Pm+1,pmax} since there must be one machine containing two jobs. In particular Pm+1+pmax1.5OPT. Hence sensible online algorithms do not perform badly on this sequence.

This example summarizes the quintessential strategy of online algorithms that are good on all sequences: Ensure that in order to create a schedule that is very flat, i.e. such that all machines have high load λ, the adversary must present m jobs that all are large relative to λ. In order to exploit this very flat schedule and cause a high makespan the adversary needs to follow up with yet another large job. But with these m+1 jobs, the optimum scheduler runs into the same problem as in the example: Of the m+1 large jobs, two have to be scheduled on the same machine. Thus the optimum makespan is high, compensating to the high makespan of the algorithm.

Effectively realizing the aforementioned strategy is highly non-trivial. In fact it is the central challenge in previous works on adversarial makespan minimization that improve upon Greedy [1, 4, 14, 15, 23]. These works gave us clear notions of how to avoid flat schedules, which form the basis for our approaches. Instead of simply rehashing these ideas, we want to outline next how we profit from random-order arrival in particular.

How Random-Order Arrival Helps

The first idea to profit from random-order arrival addresses the lower bound on OPT sophisticated online algorithms need. In the literature only the current average load has been considered, but under random-order arrival another bound comes to mind: The largest job seen so far. In order for an algorithm to perform badly, a large job needs to come close to the end of the sequence. Under random-order arrival, it is equally likely for such a job to arrive similarly close to the beginning of the sequence. In this case, the algorithm knows a better lower bound for OPT. The main technical tool will be our Load Lemma, which allows us to relate what a job sequence should reveal early from an analysis perspective to the actual fraction of jobs scheduled. This idea does not work for worst-case orders since they tend to order jobs by increasing processing times.

Recall that the general challenge of our later analysis will be to establish that there had to be m large jobs once the schedule gets very flat. In classical analyses, which consider worst-case orders, these jobs appear with increasing density towards the end of the sequence. In random orders this is unlikely, which can be exploited by the algorithm.

The third idea improves upon the first idea. Suppose, that we were to modify our algorithm such that it could handle one very large job arriving close to the end of the sequence. In fact, assume that it could only perform badly when confronted with h very large jobs. We can then disregard any sequence which contains fewer such jobs. Recall that the first idea requires one very large job to arrive sufficiently close to the beginning. Now, as h grows, the probability of the latter event grows as well and approaches 1. This will not only improve our competitive ratio tremendously, it also allows us to adhere to the stronger notion of nearly competitiveness introduced in Sect. 2. Let us discuss how such a modification is possible: The first step is to design our algorithm in a way that it is reluctant to use the h least loaded machines. Intuitively, if the algorithm tries to retain machines of small load it will require very large jobs to fill them. In order to force these filling jobs to actually be large enough, our algorithm needs to use a very high lower bound for OPT. In fact, here it uses another lower bound for the optimum makespan, 2Pm+1t, twice the (m+1)st largest job seen so far at time t. Common analysis techniques can only make predictions about Pm+1t at the very end of the sequence. It requires very subtle use of the random-order model to work around this.

Formal Definition

Formally our algorithm ALG is nearly c-competitive, where c is the unique real root of the polynomial Q[x]=4x3-14x2+16x-7, i.e.

c=7+28-3873+28+38736<1.8478.

Given J, ALG schedules a job sequence/permutation Jσ=Jσ(1),,Jσ(n) that must be scheduled in this order. Throughout the scheduling process ALG always maintains a list of the machines sorted in non-increasing order of current load. At any time the load of a machine is the sum of the processing times of the jobs already assigned to it. After ALG has processed the first t-1 jobs Jσ(1),,Jσ(t-1), we also say at time t, let M1t-1,,Mmt-1 be any ordering of the m machines according to non-increasing load. More specifically, let ljt-1 denote the load of machine Mjt-1. Then l1t-1lmt-1 and l1t-1 is the makespan of the current schedule.

ALG places each incoming job Jσ(t), 1tn, on one of three candidate machines. The choice of one machine, having an intermediate load, is flexible. Let h=h(m) be an integer with h(m)ω(1) and h(m)o(m). We could use e.g. h(m)=m3 or h(m)=logm. Let1

i=(2c-3)m+h0.6956m.

ALG will assign the incoming job to the machine with the smallest load, the (h+1)st smallest load or the ith largest load.

When scheduling a job on a machine that is different from the least loaded one, an algorithm has to ensure that the resulting makespan does not exceed c times the optimum makespan, where c is the desired competitiveness. All previous algorithms lower bound the optimum makespan by the current average machine load. Algorithm ALG works with a refined lower bound that incorporates the processing time of the largest job and twice the processing time of the (m+1)st largest job seen so far. These lower bounds on the optimum makespan are immediate but have not been used in earlier strategies.

Formally, for j=1,,m, let Ljt be the average load of the m-j+1 least loaded machines Mjt,,Mmt, i.e. Ljt=1m-j+1r=jmlrt. We let Lt=L1t=1ms=1tps be the average load of all the machines. For any j=1,,n, let Pjt be the processing time of the jth largest job among the first t jobs Jσ(1),,Jσ(t) in Jσ. If t<j, we set Pjt=0. We let pmaxt=P1t be the processing time of the largest job among the first t jobs in Jσ. Finally, let L=Ln, Pj=Pjn and pmax=pmaxn.

The value Ot=max{Lt,pmaxt,2Pm+1t} is a common lower bound on the optimum makespan for the first t jobs and hence OPT(J), see Proposition 1 in the next section. Note that immediately before Jσ(t) is scheduled, ALG can compute Lt and hence Ot because Lt is 1/m times the total processing time of the jobs that have arrived so far.

We next characterize load imbalance. Let

k=2i-m(4c-7)m0.3912m

and

α=2(c-1)2c-32.7376.

The schedule at time t is the one immediately before Jσ(t) has to be assigned. The schedule is flat if lkt-1<αLi+1t-1, i.e. if lkt-1, the load of the kth most loaded machine, does not exceed Li+1t-1=1m-ir=i+1mlrt-1, the average load of the m-i least loaded machines, by a factor of at least α . Otherwise the schedule is steep. Job Jσ(t) is scheduled flatly (steeply) if the schedule at time t is flat (steep).

ALG handles each incoming job Jσ(t), with processing time pσ(t), as follows. If the schedule at time t is steep, the job is placed on the least loaded machine Mmt-1. On the other hand, if the schedule is flat, the machines Mit-1, Mm-ht-1 and Mmt-1 are probed in this order. If lit-1+pσ(t)c·Ot, then the new machine load on Mit-1 will not violate the desired competitiveness. The job is placed on this machine Mit-1. Otherwise, if the latter inequality is violated, ALG checks if a placement on Mm-ht-1 is safe, i.e. if lm-ht-1+pσ(t)c·Ot. If this is the case, the job is put on Mm-ht-1. Otherwise, Jσ(t) is finally scheduled on the least loaded machine Mmt-1. A pseudo-code description of ALG is given below in Algorithm 1. The job assignment rules are also illustrated in Figs. 1 and 2. graphic file with name 453_2021_841_Figa_HTML.jpg

Fig. 1.

Fig. 1

A steep schedule. ALG only considers the least loaded machine

Fig. 2.

Fig. 2

A flat schedule. The three machines considered by ALG are marked for h=2

In the next section we will prove the following theorem, Theorem 1, which uses the notion from Sect. 2. Lemma 1 then immediately gives the main result, Corollary 1.

Theorem 1

ALG is nearly c-competitive, with c<1.8478 defined as above.

Corollary 1

ALG is c-competitive in the random-order model as m.

Analysis of the Algorithm

Analysis Basics

We present some results for the adversarial model so that we can focus on the true random-order analysis of ALG in the next sections. First, recall the three common lower bounds used for online makespan minimization.

Proposition 1

For any J, there holds OPT(J)max{L,pmax,2Pm+1}. In particular, O1O2OnOPT(J).

Proof

The optimum makespan OPT(J) cannot be smaller than the average machine load L for the input, even if all the jobs are distributed evenly among the m machines. Moreover, the job with the largest processing time pmax must be scheduled non-preemptively on one of the machines in an optimal schedule. Thus OPT(J)pmax. Finally, among the m+1 largest jobs of the input, two must be placed on the same machine in an optimal solution. Hence OPT(J)2Pm+1.

For any job sequence J=J1,,Jn, let R(J)=min{Lpmax,pmaxL}. Intuitively, this measures the complexity of J.

Proposition 2

There holds Alg(J)<=max{1+R(J),c}OPT(J) for any J=J1,,Jn.

Proof

Let J=J1,,Jn be an arbitrary job sequence and let Jt be the job that defines ALG’s makespan. If the makespan exceeds c·OPT(J), then it exceeds c·Ot. Thus ALG placed Jt on machine Mmt-1, cf. lines 4 and 5 of the algorithm. This machine was a least loaded one, having a load of at most L. Hence ALG(J)L+ptL+pmaxL+pmaxmax{L,pmax}·OPT(J)=(1+R(J))·OPT(J).

Since R(J)1 we immediately obtain the following result, which ensures that ALG satisfies the first condition of a nearly c-competitive algorithm, see Definition 1.

Corollary 2

ALG is 2-competitive in the adversarial model.

We next identify a class of plain job sequences that we do not need to consider in the random-order analysis because ALG’s makespan is upper bounded by c times the optimum on these inputs.

Definition 2

A job sequence J=J1,,Jn is called plain if nm or if R(J)c-1. Otherwise it is called proper.

Let J=J1,,Jn be any job sequence that is processed/scheduled in this order. Observe that if it contains at most m jobs, i.e. nm, and ALG cannot place a job Jt on machines Mit-1 or Mm-ht-1 because the resulting load would exceed c·Ot, then the job is placed on an empty machine. Using Proposition 2 we derive the following fact.

Lemma 2

There holds ALG(J)c·OPT(J) for any plain job sequenceJ=J1,,Jn.

If a job sequence J is plain (proper), then every permutation of it is. Hence, given Lemma 2, we may concentrate on proper job sequences in the remainder of the analysis. We finally state a fact that relates to the second condition of a nearly c-competitive algorithm, see again Definition 1.

Lemma 3

Let J=J1,,Jn be any job sequence that is scheduled in this order and let Jt be a job that causes ALG’s makespan to exceed (c+ε)OPT(J), for some ϵ0. Then both the load of ALG’s least loaded machine at the time of the assignment as well as pt exceed (c-1+ε)OPT(J).

Proof

ALG places Jt on machine Mmt-1, which is a least loaded machine when the assignment is done. If lmt-1 or pt were upper bounded by (c-1+ε)OPT(J), then the resulting load would be lmt-1+pt(c-1+ε)OPT(J)+max{L,pt}(c-1+ε)OPT(J)+OPT(J)=(c+ε)OPT(J).

Stable Job Sequences

We define the class of stable job sequences. These sequences are robust in that they will admit an adversarial analysis of ALG. Intuitively, the sequences reveal information on the largest jobs when a significant fraction of the total processing volume t=1npt has been scheduled. More precisely, one gets an estimate on the processing time of the hth largest job in the entire sequence and encounters a relevant number of the m+1 largest jobs. If a job sequence is unstable, large jobs occur towards the very end of the sequence and can cause a high makespan relative to the optimum one.

We will show that ALG is adversarially (c+ε)-competitive on stable sequences, for any given ε>0. Therefore, the definition of stable sequences is formulated for a fixed ε>0. Given J, let Jσ=Jσ(1),,Jσ(n) be any permutation of the jobs. Furthermore, for every jn and in particular j{h,m+1}, the set of the j largest jobs is a fixed set of cardinality j such that no job outside this set has a strictly larger processing time than any job inside the set.

Definition 3

A job sequence Jσ=Jσ(1),,Jσ(n) is stable if the following conditions hold.

  • There holds n>m.

  • Once Lt(c-1)imL, there holds pmaxtPh.

  • For every ji, the sequence ending once we have Lt(jm+ε2)L contains at least j+h+2 many of the m+1 largest jobs in J.

  • Consider the sequence ending right before either (a) Ltim(c-1)εL holds or (b) one of the hth largest jobs of J arrives; this sequence contains at least h+1 many of the m+1 largest jobs in J.

Otherwise the job sequence is unstable.

Given ε>0 and m, let Pε(m) be the infimum, over all proper job sequences J, that a random permutation of J is stable, i.e.

Pε(m)=infJproperPσSn[Jσisstable].

As the main result of this section we will prove that this probability tends to 1 as m.

Main Lemma 1

For every ε>0, there holds limmPε(m)=1.

The Main Lemma 1 implies that for any ε>0 there exists an m(ε) such that, for all mm(ε) and all J, there holds PσSn[Jσisstable]1-ε. In Sect. 4.3 we will show that ALG is (c+ε)-competitive on stable job sequences. This implies PσSn[ALG(Jσ)(c+ε)OPT(J)]ε on proper sequences. By Lemma 2 this probability is 0 on plain sequences. We obtain the following corollary to Main Lemma 1.

Corollary 3

If ALG is adversarially (c+ε)-competitive on stable sequences, for every ε>0 and mm(ε) sufficiently large, then it is nearly c-competitive.

In the remainder of this section we describe how to establish Main Lemma 1. We need some notation. In Sect. 3 the value Ljt was defined with respect to a fixed job sequence that was clear from the context. We adopt the notation Ljt[Jσ] to make this dependence visible. We adopt the same notation for the variables L, Pjt, Pj, pmaxt and pmax. For a fixed input J and variable σSn, we use the simplified notation Ljt[σ]=Ljt[Jσ]. Again, we use the same notation for the variables Pjt and pmaxt.

At the heart of the proof of Main Lemma 1 is the Load Lemma. Observe that after t time steps in a random permutation of an input J, each job has arrived with probability t/n. Thus the expected total processing time of the jobs seen so far is t/n·s=1nps. Equivalently, in expectation Lt equals t/n·L. The Load Lemma proves that this relation holds with high probability. We set t=φn.

Load Lemma

Given any ε>0 and φ(0,1], there exists an m(ε,φ) such that for all mm(ε,φ) and all proper sequences J, there holds

PσSnLφn[Jσ]φL[Jσ]-1εε.

Proof

Let us fix a proper job sequence J. We use the shorthand L^[σ]=L^[Jσ]=Lφn[Jσ] and L=L[J].

Let δ=φε2. We will first treat the case that we have pmax[J]=1 and every job size in J is of the form (1+δ)-j, for some j0. Note that we have in particular c-1L1c-1 because we are working with a proper sequence. For j0 let hj denote the number of jobs Jt of size (1+δ)-j and, given σSn, let hjσ denote the number of such jobs Jt that additionally satisfy σ(t)φn, i.e. they are among the φn first jobs in the sequence Jσ. We now have

L=1mj=0(1+δ)-jhjandL^[σ]=1mj=0(1+δ)-jhjσ.

The random variables hjσ are hypergeometrically distributed, i.e. we sample φn jobs from the set of all n jobs and count the number of times we get one of the hj many jobs of processing time (1+δ)-j. Hence, we know that the random variable hjσ has mean

E[hjσ]=φnnhjφhj

and variance

Var[hjσ]=hjn-hjφnn-φnn2(n-1).

In particular,

Var[hjσ]hj(1+δ)jmL(1+δ)jmc-1.

By Chebyshev’s inequality we have

P[hjσ-φhj(1+δ)3j/4m3/4](1+δ)-3j/2Var[hjσ]m-3/2(1+δ)-j/2m-1/2c-1.

In particular, by the Union Bound, with probability

P(m)=1-j=0(1+δ)-j/2m-1/2c-1=1-m-1/21-1+δ(c-1)=1-Om-1/2

we have for all j,

hjσ-φhj<(1+δ)3j/4m3/4.

We conclude that the following holds:

L^[σ]-φL=1mj=0(1+δ)-jhjσ-φmj=0(1+δ)-jhjj=0(1+δ)-jhjσ-hj·φm<j=0(1+δ)-j/4m-1/4=m-1/41-(1+δ)-1/4.

In particular, with probability P(m), we have

L^[σ]φL-1=L^[σ]-φLφLm-1/4φ(c-1)1-(1+δ)-1/4=Om-1/4.

Hence, if we choose m large enough we can ensure that

PL^[σ]φL-1>ε21-P(m)ε. 1

So far we have assumed that pmax[J]=1 and every job in J has a processing time of (1+δ)-j, for some j0. Now we drop these assumptions. Given an arbitrary sequence J with 0<pmax[J]1, let J denote the sequence obtained from J by first dividing every job processing time by pmax[J] and rounding every job size down to the next power of (1+δ)-1. We have proven that inequality (1) holds for J. The values L and L^[σ] only change by a factor lying in the interval [pmax,(1+δ)pmax) when passing over from J to J. This implies that

L^[Jσ]φL[J]-L^[Jσ]φL[J]δL^[Jσ]φL[J].

Since L^[Jσ]L[J] we obtain

L^[Jσ]φL[J]-L^[Jσ]φL[J]δφ=ε2.

Combining this with inequality (1) for J (and the triangle inequality), we obtain

PL^[Jσ]φL[J]-1>εPL^[Jσ]φL[J]-1>ε2ε.

Thus the lemma follows.

We note that the Load Lemma does not hold for general sequences. A counterexample is a job sequence in which one job carries all the load, while all the other jobs have a negligible processing time. The proof of the Load Lemma relies on a lower bound of R(J), which is c-1 for proper sequences.

We present two consequences of the Load Lemma that will allow us to prove that stable sequences reveal information on the largest jobs when a certain processing volume has been scheduled. Consider a proper J. Given Jσ=Jσ(1),,Jσ(n) and φ>0, let N(φ)[Jσ] be the number of jobs Jσ(t) that are among the m+1 largest jobs in J and such that LtφL.

Lemma 4

Let ε>0 and φ(0,1]. Then there holds

limminfJproperPσSnN(φ+ε)[Jσ]φm+h+2=1.

Proof

Fix any proper job sequence J. For any Jσ, let N(φ+ε)[σ]=N(φ+ε)[Jσ]. Furthermore, let N~φ+ε2[σ] denote the number of the m+1 largest jobs of J that appear among the first φ+ε2n jobs in Jσ. Then we derive by the inclusion-exclusion principle:

PσSnN(φ+ε)[σ]φm+h+2PσSnN~φ+ε2[σ]φm+h+2andLφ+ε2n[σ]<(φ+ε)LPσSnN~φ+ε2[σ]φm+h+2+PσSnLφ+ε2n[σ]<(φ+ε)L-1.

By the Load Lemma the second summand can be lower bounded for every proper sequence J by a term approaching 1 as m. Hence it suffices to verify that this is also possible for the term

PσSnN~φ+ε2[σ]φm+h+2.

We will upper bound the probability of the opposite event by a term approaching 0 for m. The random variable N~φ+ε2[σ] is hypergeometrically distributed and therefore has expected value

E=φ+ε2nn(m+1)φ+25ε(m+1).

Recall that for proper sequences n>m holds. For the second inequality we require m and hence in also n to be large enough such that 1nε10 holds. Again, the variable N~φ+ε2[σ] is hypergeometrically distributed and its variance is thus

V=φ+ε2nn-φ+ε2n(m+1)(n-m-1)n2(n-1)m+1.

Note that we have for m large enough:

φm+h+21+15εφ(m+1)E-εφm+15V.

Hence, using Chebyshev’s inequality, we have

PσSnN~φ+ε2[σ]<φm+h+2PσSnE-N~φ+ε2[σ]>εφm+15V25ε2φ2(m+1)

and this term vanishes as m.

Lemma 5

Let ε>0 and φ(0,1]. Then there holds

limminfJproperPσSnφ~φN(φ~+ε)[Jσ]φ~m+h+2=1.

Proof

Let us fix any proper sequence J and set

Λ=1-ε2jjN,φ1-ε2j

which is a finite set whose size only depends on ε and φ. Given φ~φ, let u(φ~) be the smallest element in Λ greater or equal to φ~. Then

φ~u(φ~)φ~+ε2

and if we have

N(φ~+ε)[Jσ]<φ~m+h+2

there holds

Nu(φ~)+ε2[Jσ]<φ~m+h+2u(φ~)m+h+2.

In particular, in order to prove the lemma it suffices to verify that

limminfJproperPσSnφ~ΛNφ~+ε2[Jσ]φ~m+h+2=1.

The latter is a consequence of applying Lemma 4 to all φ~Λ and the Union Bound.

We can now conclude the main lemma of this section:

Proof of Main Lemma 1

A proper job sequence is stable if the following four properties hold.

  • Once Lt(c-1)im·L we have pmaxtPh.

  • For every ji the sequence ending once we have Ltjm+ε2L contains at least j+h+2 of the m+1 largest jobs.

  • The sequence ending right before Ltim(c-1)εL holds contains at least h+1 of the m+1 largest jobs.

  • The sequence ending right before the first of the h largest jobs contains at least h+1 of the m+1 largest jobs.

By the Union Bound we may consider each property separately and prove that it holds with a probability that tends to 1 as m.

Let φ=(c-1)im and choose ε>0. By the Load Lemma, for mm(ε,φ), after t=φn jobs of a proper job sequence Jσ have been scheduled, there holds Lt(c-1)im·L with probability at least 1-ε. Observe that φ is a fixed problem parameter so that m(ε,φ) is determined by ε. The probability of any particular job being among the first t jobs in Jσ is φn/n. Thus pmaxtPh holds with probability at least 1-(1-φn/n)h. Since Jσ is proper, we have n>m. Furthermore, h=h(m)ω(1). Therefore, the probability that the first property holds tends to 1 as m.

The second property is a consequence of Lemma 5 with φ=im. The third property follows from Lemma 4. We need to choose the ε in the statement of the lemma to be im(c-1)ε. Finally we examine the last property. In Jσ we focus on the positions of the m+1 largest jobs. Consider any of the h largest jobs. The probability that it is preceded by less than h+1 of the m+1 largest jobs is (h+1)/(m+1). Thus the probability of the fourth property not to hold is at most h(h+1)/(m+1). Since ho(m), the latter expression tends to 0 as m.

An Adversarial Analysis

In this section we prove the following main result.

Main Lemma 2

For every ε>0 and mm(ε) sufficiently large, ALG is adversarially (c+ε)-competitive on stable job sequences.

Consider a fixed ε>0. Given Corollary 2, we may assume that 0<ε<2-c. Suppose that there was a stable job sequence Jσ such that ALG(Jσ)>(c+ε)OPT(Jσ). We will derive a contradiction, given that m is large. In order to simplify notation, in the following let J=Jσ be the stable job sequence violating the performance ratio of c+ε. Let J=J1,,Jn and OPT=OPT(J).

Let Jn be the first job that causes ALG to have a makespan greater than (c+ε)OPT and let b0=lmn-1 be the load of the least loaded machine Mmn-1 right before Jn is scheduled on it. The makespan after Jn is scheduled, called the critical makespan, is at most b0+pnb0+OPT. In particular b0>(c-1+ε)OPT as well as pn>(c-1+ε)OPT, see Lemma 3. Let

λstart=c-11+2c(2-c)0.5426andλend=12(c-1+ε)0.5898.

There holds λstart<λend. The critical makespan of ALG is bounded by b0+OPT<(1+1c-1+ε)b0=(c+ε)b0c-1+ε=(c+ε)2λendb0. Since ALG does not achieve a performance ratio of c+ε on J we have

Pm+1OPT/2<λendb0. 2

Our main goal is to derive a contradiction to this inequality.

The impact of the variable Ph:

A new, crucial aspect in the analysis of ALG is Ph, the processing time of the hth largest job in the sequence J. Initially, when the processing of J starts, we have no information on Ph and can only infer Pm+1λstartb0. The second property in the definition of stable job sequences ensures that pmaxtPh once the load ratio Lt/L is sufficiently large. Note that ALG then also works with this estimate because PhpmaxtOt. This will allow us to evaluate the processing time of flatly scheduled jobs. In order prove that Pm+1 is large, we will relate Pm+1 and Ph, i.e. we will lower bound Pm+1 in terms of Ph and vice versa. Using the relation we can then conclude Pm+1λendb0. In the analysis we repeatedly use the properties of stable job sequences and will explicitly point to it when this is the case.

We next make the relationship between Ph and Pm+1 precise. Given 0<λ, let f(λ)=2cλ-1 and given w>0, let g(w)=(c(2c-3)-1)w+4-2c0.2854·w+0.3044. We set gb(λ)=gλbb and fb(w)=fwbb, for any b>0. Then we will lower bound Pm+1 by gb0(Ph) and Ph by fb0(Pm+1). We state two technical propositions.

Proposition 3

For λ>λstart, we have g(f(λ))>λ.

Proof

Consider the function

F(λ)=g(f(λ))-λ=(c(2c-3)-1)(2cλ-1)+4-2c-λ=(4c3-6c2-2c-1)λ-2c2+c+50.05446·λ+0.01900.

The function F is linear and strictly increasing in λ. Hence for the proposition to hold it suffices to verify that F(λstart)0. We can now compute that F(λstart)0.04865>0.

Proposition 4

For 0<ε1, we have g(1-ε)>λend.

Note that the following proof determines the choice of our competitive ratio c, which was chosen minimal such that Q[c]=4c3-14c2+16c-70.

Proof

We calculate that

g(1-ε)-λend=(c(2c-3)-1)(1-ε)+4-2c-12(c-1+ε)=2(c-1+ε)(2c2-5c+3-(2c2-3c-1)ε)-12(c-1+ε)=4c3-14c2+16c-7+(4-2c)ε-2(2c2-3c-1)ε22(c-1+ε).

Recall that Q[c]=4c3-14c2+16c-7=0. For 0<ε1 we have

(4-2c)ε-(2c2-3c-1)ε20.3044·ε-0.2854·ε2>0.

Thus we see that g(1-ε)-λend>0 and can conclude the lemma.

Analyzing Large Jobs Towards Lower Bounding Ph and Pm+1

Let b>(c-1+ε)OPT be a value such that immediately before Jn is scheduled at least m-h machines have a load of at least b. Note that b=b0 satisfies this condition but we will be interested in larger values of b as well. We call a machine b-full once its load is at least b; we call a job J a b-filling job if it causes the machine it is scheduled on to become b-full. We number the b-filling jobs according to their order of arrival J(1),J(2), and let t(j) denote the time of arrival of the jth filling job J(j).

Recall that our main goal is to show that Pm+1λendb0 holds. To this end we will prove that the b0-filling jobs have a processing time of at least λendb0. As there are m such jobs, the bound on Pm+1 follows by observing that Jn arrives after all b0-filling jobs are scheduled and that its processing time exceeds λendb0 as well. In fact, since OPTb0, we have

pn>(c-1)OPT>0.847·OPT>λendb00.5898·b0. 3

We remark that different to previous analyses in the literature we do not solely rely on lower bounding the processing time of filling jobs. By using the third property of stable job sequences, we can relate load and the size of the (m+1)st largest job at specific points in the time horizon, formally this is done later in Lemma 8.

In the following we regard b as fixed and omit it from the terms filling job and full. Let λ=max{λstartb,min{gbPh,λendb}}. We call a job large if it has a processing time of at least λ. Let t~=t(m-h) be the time when the (m-h)th filling job arrived. The remainder of this section is devoted to showing the following important Lemma 6. Some of the underlying lemmas, but not all of them, hold if mm(ε) is sufficiently large. We will make the dependence clear.

Lemma 6

At least one of the following statements holds:

  • All filling jobs are large.

  • If mm(ε), there holds Pm+1t~λ=max{λstartb,min{gbPh,λendb}}, i.e. there are at least m+1 large jobs once the (m-h)-th filling job is scheduled.

Before we prove the lemma we derive two important implications towards a lower bound of Pm+1.

Corollary 4

We have Pm+1λ=max{λstartb0,min{gb0Ph,λendb0}}.

Proof

Apply the previous lemma, taking into account that bb0, and use that there are m many b0-filling jobs followed by Jn. The latter has size at least λ by inequality (3).

We also want to lower bound the processing time of the (m+1)st largest job at time t~. However, at that time only m-h filling jobs have arrived. The next lemma ensures that, if additionally Ph is not too large, this is not a problem.

Corollary 5

If Ph(1-ε)b and mm(ε), the second statement in Lemma 6holds, i.e. Pm+1t~λ=max{λstartb,min{gbPh,λendb}}.

The proof of the lemma makes use of the fourth property of stable job sequences. In particular we would not expect such a result to hold in the adversarial model.

Proof

We will show that the first statement in Lemma 6 implies the second one if Ph(1-ε)b holds. In order to conclude the second statement it suffices to verify that at least m+1 jobs of processing time λ have arrived until time t~. By the first statement we know that there were m-h large filling jobs coming before time t~. Hence it is enough to verify that h+1 large jobs arrive (strictly) before the first filling job J.

To show that there are h+1 jobs with a processing time of at least Pm+1 before the first filling job J, we use the last property of stable job sequences. If J is among the h largest jobs, we are done immediately by the condition. Else J had size at most Ph(1-ε)b. Assume J=Jt was scheduled on the machine Mjt-1, for j{i,m-h,m}, and let l=ljt-1 be its load before J was scheduled. Because J is a filling job we have

lb-Phεbε(c-1)OPT.

In particular, before J was scheduled, the average load at that time was at least

jlmilmεim(c-1)OPT.

Again, by the last property of stable job sequences, at least h+1 jobs of processing time at least Pm+1 were scheduled before this was the case.

We introduce late and early filling jobs. We later need a certain condition to hold, namely the ones stated in Lemma 8, in order to show that the early filling jobs are large. We show that if this condition is not met, the fact that the given job sequence is stable ensures that Pmt~λ.

Let s be chosen maximal such that the sth filling job is scheduled steeply. If si, then set s=i+1 instead. We call all filling jobs J(j) with j>i that are scheduled flatly late filling jobs. All other filling jobs are called early filling jobs. In particular the job J(s+1) and the filling jobs afterwards are late filling jobs. The following proposition implies that the fillings jobs after J(m-h), if they exist, are all late, i.e. scheduled flatly.

Proposition 5

We have sm-h if mm(ε).

Proof of Proposition 5

Let h~<h and t=t(m-h~) be the time the (m-h~)th filling job J arrived. We need to see that J was scheduled flatly. Assume that was not the case. We know that for jm-h~ we have ljt-1b>(c-1+ε)OPT. In particular we have

Li+1t-1=1m-ij=i+1mljt-1>m-i-h-1m-i(c-1+ε)OPT(c-1)OPT.

For the last inequality we need to choose m large enough. If the schedule was steep at time t, then we had for every jk

ljt-1lkt-1α(c-1)OPT=2(c-1)22c-3OPT.

But then the average load at time t-1 would be:

Lt-1=1mj=1mljt-1>k2(c-1)22c-3+(m-h-k)(c-1)mOPT((4c-7)m+2h)2(c-1)22c-3+(m-(4c-7)m-h)(c-1)mOPT1.3247·OPT.

For the second inequality we need to observe that we have k(4c-7)m+2h and that the previous term decreases if we decrease k. One also can check that the second last term is minimized if h=0.

But now we have shown Lt-1>OPT, which is a contradiction. Hence the schedule could not have been steep at time t-1.

We need a technical lemma. For any time t, let L¯st=1m-h-s+1j=sm-hljt be the average load on the machines numbered s to m-h.

Lemma 7

If L¯st(s)-1α-1b holds and mm(ε) then Lt(s)-1>sm+ε2·L.

This lemma comes down to a mere computation. While being simple at its core, we have to account for various small error terms. These arise in three ways. Some are inherent to the properties of stable sequences. Others arise from the rounding involved in the definition of certain numbers, i in particular. Finally, the small number h introduces such an error. While all these errors turn out to be negligible, rigorously showing so is technical. Note that the following proof also determines our choice of the value i. For larger values the proof would not hold.

Proof of Lemma 7

Let t=t(s)-1. We have ljtb for js-1 as the first s-1 machines are full. Considering the load on the machines numbered up to m-h we obtain

Li+1tj=i+1s-1ljt+(m-h-s+1)L¯stm-i(s-i-1)b+(m-h-s+1)α-1bm-iα-1b+s-i-1m-i(1-α-1)b-hm-iα-1b=α-1b+s-i-1m-ib2(c-1)-hm-iα-1b.

If s>i+1, the schedule was steep at time t=t(s)-1 and hence

lktαLi+1t>b+s-i-1m-iα·b2(c-1)-h·bm-i.

Since lktlitb, the previous inequality holds for s=i+1, too, no matter whether J(s)=J(i+1) was scheduled flatly or steeply. We hence get, for all si+1,

Ltklkt+(s-k-1)ls-1t+(m-h-s+1)L¯stm>kb+s-i-1m-i·α·b2(c-1)-h·bm-i+(s-k-1)b+(m-h-s+1)b-b2(c-1)m=1+12(c-1)kms-i-1m-i·α-m-s+1mb-k·hm(m-i)+h·α-1mb.

In the above difference, we first examine the first term, which is minimized if s=i+1. With this setting it is still lower bounded by

1-m-i2(c-1)mb>1-2(2-c)2(c-1)b0.8205·b>3b4.

In the second term of the above difference km-i=2i-mm-i is increasing in i, where i(2c-3)m+h+1. We choose m large enough such that

km-i(4c-7)m+2(h+1)2(2-c)m-(h+1)1.5.

There holds α-1<0.5. Thus the second term in the difference is upper bounded by 2hbm.

Recall that b>(c-1+ε)OPT. Furthermore, 0<ε<2-c such that c-1+ε<1. Therefore, we obtain

Lt>c-1+12kms-i-1m-i·α-m-im+s-i+1m+34ε-2hmOPT.

In the previous term we intentionally highlighted three variables. It is easy to check that if we decrease these variables, the term decreases, too. We do this by setting k=(4c-7)m and i=(2c-3)m (while ignoring the non-highlighted occurrences of i). We also assume that m is large enough such that ε43h+2m. Then the previous lower bound on Lt can be brought to the following form:

Lt>2c-3+124c-72(2-c)·α+1s-i-1m+h+2m+ε2OPT

Using that i+1m<2c-3+h+2m and evaluating the term in front of s-i-1m we get

Lt>i+1m+1.0666·s-i-1m+ε2OPT>sm+ε2OPT.

The lemma follows by noting that OPTL.

Lemma 8

If the late filling jobs are large, L¯st(s)-1α-1b and mm(ε), we have Pm+1t~λ.

Proof

Assume that the conditions of the lemma hold. By Lemma 7 we have Lt(s)-1>sm+ε2·L. By the third property of stable sequences, at most m+1-(s+h+2)=m-s-h-1 of the largest m+1 jobs appear in the sequence starting after time t(s)-1. However, this sequence contains m-h-s late filling jobs. Thus there exists a late filling job that is not among the m+1 largest jobs. As it has a processing time of at least λ, by the assumption of the lemma, Pm+1λ holds.

Now consider the m+1 largest jobs of the entire sequence that arrive before J(s) as well as the jobs J(s+1),,J(m-h). There are at least s+h+2 of the former and m-h-s of the latter. Thus we have found a set of at least m+1 jobs arriving before (or at) time t~=t(m-h). Moreover, we argued that all these jobs have a processing time of at least λ. Hence Pm+1t~λ holds true.

We are ready to evaluate the processing time of filling jobs to prove Lemma 6, which we will do in the following two lemmas.

Lemma 9

Any late filling job’s processing time exceeds max{λstartb,gb(Ph)}.

Proof

Let ji+1 such that J(j) was scheduled flatly. Set t=t(j)-1 and l=lit. Because at least i machines were full, we have have Ltb·im(c-1)imOPT(c-1)imL. Hence by Definition 3 we have pmaxtPh.

Let λ~=max{λstartb,gb(Ph)}. We need to show that J(j) has a processing time strictly greater than λ~. If we have lm-ht<b-λ~, then this was the case because J(j) increased the load of some machine from a value smaller than b-λ~ to b. Hence let us assume that we have lm-htb-λ~. In particular we have

Lt(j-1)l+(m-j-h+1)(b-λ~)m.

By the definition of a late filling job, J(j) was scheduled flatly. In particular, it would have been scheduled on machine Mit (which was not the case) if any of the following two inequalities did not hold:

  • pt+l>cpmaxtcPh

  • pt+l>cLt

If lcPh-λ~ held true, we get pt>λ~ from the first inequality. Thus we only need to treat the case that l>cPh-λ~ held true. We also know that we have lb, because the ith machine is full. Hence we may assume that

lmax{b,cPh-λ~}.

In order to derive the lemma we need to prove that pt-λ~>0 holds. Using the second inequality we get

pt-λ~>cLt-l-λ~c(j-1)l+(m-j-h+1)(b-λ~)m-l-λ~.

Using that (2c-3)m+h<i<j-1 and b-λ~<bl hold, the previous term does not increase if we replace j-1 by (2c-3)m+h. The resulting term is

pt-λ~>c((2c-3)m+h)l+(m-(2c-3)m-2h+1)(b-λ~)m-l-λ~.

Now let us observe that we have lb2(b-λstartb)2(b-λ~). Hence the previous term is minimized if we set h=0. We get

pt-λ~>c[(2c-3)l+(1-(2c-3))(b-λ~)]-l-λ.

As c(2c-3)-10.2584>0 the above term does not increase if we replace l by either value: b or cPh-λ~.

If we have λ~=λstartb, we choose l=b and get

pt-λstartb>c[(2c-3)b+(1-(2c-3))(b-λstartb)]-b-λstartb=(c-1)b-(1+2c(2-c))λstartb=(c-1)b-(c-1)b=0.

The third equality uses the definition of λstart. The lemma follows if λ~=λstartb.

Otherwise, if λ~=gb(Ph), we choose l=cPh-λ~ and get

pt-λ~>c[(2c-3)(cPh-λ~)+(1-(2c-3))(b-λ~)]-(cPh-λ~)-λ~=(c2(2c-3)-c)Ph+(c(4-2c))b-cgb(Ph)=0.

Here the last equality follows from the definition of gb. The lemma follows in the case λ~=gb(Ph).

Lemma 10

If L¯st(s)-1<α-1b holds, the early filling jobs have a processing time of at least λendb.

Before proving Lemma 10 let us observe the following, strengthening its condition.

Lemma 11

We have

Li+1t(i+1)-1Li+2t(i+2)-1Lst(s)-1.
Proof

Let i+1j<s. It suffices to verify that

Ljt(j)-1Lj+1t(j)Lj+1t(j+1)-1.

The second inequality is obvious because for every r the loads lrt can only increase as t increases. For the first inequality we note that by definition the job J(j) was scheduled steeply and hence on a least loaded machine. This machine became full. Thus it is not among the m-j least loaded machines at time t(j). In particular Lj+1t(j), the average over the m-j smallest loads at time t(j), is also the average of the m-j+1 smallest loads excluding the smallest load at time t(j)-1. Therefore it cannot be less than Ljt(j)-1.

Proof of Lemma 10

Let i<js such that J(j) was an early filling job. By Lemma 11 we have Ljt(j)-1Lst(s)-1<α-1b=b-b2(c-1)<b-λendb. By definition J(j) was scheduled on a least loaded machine Mmt(j)-1 which had load less than Ljt(j)-1<b-λendb before and at least b afterwards because it became full. In particular J(j) had size λendb.

For k<ji the job J(j) is scheduled steeply because we have by Lemma 11

lkt(j)-1b>αLst(s)-1αLi+1t(i+1)-1αLi+1t(j)-1.

Thus for k<ji the job J(j) is scheduled on the least loaded machine Mmt(j)-1, whose load lmt(j)-1 is bounded by

lmt(j)-1Li+1t(j)-1Lst(s)-1<α-1b=b-b2(c-1)<b-λendb.

Hence the job J(j) had a size of at least λendb. We also observe that we have

lit(k)-1li+1t(k+1)-1li+(m-i)t(k+(m-i))-1=lmt(i)-1<b-λendb.

In particular for 1jk any filling job J(j) filled a machine with a load of at most max{lmt(k),lit(k)}=lit(k)<b-λendb. Hence it had a size of at least λendb.

We now conclude the main lemma of this subsection, Lemma 6.

Proof of Lemma 6

By Lemma 9, all late filling jobs are large. We distinguish two cases depending on whether or not L¯st(s)-1<α-1b holds. If it does, all filling jobs are large by Lemma 10 and the first statement in Lemma 6 holds. Otherwise, the second statement in Lemma 6 holds by Lemma 8.

Lower Bounding Ph and Pm+1

In this section we establish the following relations on Ph and Pm+1.

Lemma 12

There holds Ph>(1-ε)b0 or Pm+1λendb0 if mm(ε).

For the proof we need a way to lower bound the processing time of a job Jt depending on Pm+1t:

Lemma 13

Let Jt be any job scheduled flatly on the least loaded machine and let b=lm-ht-1 be the load of the (h+1)-th least loaded machine. Then Jt has a processing time of at least fb(Pm+1t).

Proof

From the fact that Jt was not scheduled on the (h+1)th least loaded machine Mm-ht we derive that pt>c·Ot-bc·Pm+1t-b=fb(Pm+1t) holds.

Proof of Lemma 12

Assume for a contradiction that we had Ph(1-ε)b0. Let J=Jt be the smallest among the h last b0-filling jobs. Then J has a processing time pPh. We want to derive a contradiction to that. Let b1=lm-ht-1 be the load of the (m-h)th machine right before J was scheduled. Because this machine was b0-full at that time we know that b1b0>(c-1+ε)OPT holds and it makes sense to consider b1-filling jobs. Let t~ be the time the (m-h)th b1-filling job arrived. By Lemma 6 we have Pm+1t~λ=max{λstartb1,min{gb1Ph,λendb1}}.

If we have λ=λendb1λendb0 we have already proven Pm+1λendb0 and the lemma follows. So we are left to treat the case that we have Pm+1t~λ=max{λstartb1,gb1Ph}.

Now we can derive the following contradiction:

Pm+1t~gb1Phgb1pgb1fb1Pm+1t~=gfPm+1t~b1b1>Pm+1t~.

For the second inequality, we use the monotonicity of gb1(-). The third inequality follows from Lemma 13 and the last one from Proposition 3.

Establishing Main Lemma 2.

Let mm(ε) be sufficiently large. The machine number m(ε) is determined by the proofs of Proposition 5 and Lemma 7, and then carries over to the subsequent lemmas. Let us assume for a contradiction sake that there was a stable sequence J such that ALG(J)>(c+ε)OPT(J). As argued in the beginning of Sect. 4.3, see (2), it suffices to show that Pm+1λendb0. If this was not the case, we would have Ph(1-ε)b0 by Lemma 12. In particular by Proposition 4 we had gb0Ph=g(1-ε)b0>λendb0. But now Lemma 4 shows that Pm+1max{λstartb0,min{gb0Ph,λendb0}}=λendb0.

We conclude, by Corollary 3, that ALG is nearly c-competitive.

Lower Bounds

We present lower bounds on the competitive ratio of any deterministic online algorithm in the random-order model. Theorem 3 implies that if a deterministic online algorithm is c-competitive with high probability as m, then c3/2.

Theorem 2

Let A be a deterministic online algorithm that is c-competitive in the random-order model. Then c4/3 if m8.

Theorem 3

Let A be a deterministic online algorithm that is nearly c-competitive. Then c3/2.

A basic family of inputs are job sequences that consist of jobs having an identical processing time of, say, 1. We first analyze them and then use the insight to derive our lower bounds. Let m2 be arbitrary. For any deterministic online algorithm A, let r(Am) be the maximum number in N{} such that A handles a sequence consisting of r(A,m)·m jobs with an identical processing time of 1 by scheduling each job on a least loaded machine.

Lemma 14

Let m2 be arbitrary. For every deterministic online algorithm A, there exists a job sequence J such that Arom(J)(1+1r(A,m)+1)OPT(J). We use the convention that 1+1=0.

Proof

For r(A,m)= there is nothing to show. For r(A)<, consider the sequence J consisting of (r(A,m)+1)·m identical jobs, each having a processing time of 1. It suffices to analyze the algorithm adversarially as all permutations of the job sequence are identical. After having handled the first r(A,m)·m jobs, the algorithm A has a schedule in which every machine has load of r(Am). By the maximality of r(Am), the algorithm A schedules one of the following m jobs on a machine that is not a least loaded one. The resulting makespan is r(A,m)+2. The lemma follows since the optimal makespan is r(A,m)+1.

Proof of Theorem 2

Let m8 be arbitrary. Consider any deterministic online algorithm A. If r(A,m)2, then, by Lemma 14, there exists a sequence J such that Arom(J)43·OPT(J). Therefore, we may assume that r(A,m)3. Consider the input sequence J consisting of 4m-4 identical small jobs of processing time 1 and one large job of processing time 4. Obviously OPT(J)=4.

Let i be the number of small jobs preceding the large job in Jσ. The random variable i takes any (integer) value between 0 and 4m-4 with probability 14m-3. Since r(A,m)3 the least loaded machine has load of at least l=im when the large job arrives. Thus A(Jσ)l+4. The load l takes the values 0, 1 and 2 with probability m4m-3 and the value 3 with probability m-34m-3. Hence the expected makespan of algorithm A is at least

Arom(J)m4m-3·(0+1+2)+m-34m-3·3+4=6m-94m-3+4>163=43OPT(J).

For the last inequality we use that m8.

Proof of Theorem 3

Let m2 be arbitrary and let A be any deterministic online algorithm. If r(A,m)=0, then consider the sequence J consisting of m jobs with a processing time of 1 each. On every permutation of J algorithm A has a makespan of 2, while the optimum makespan is 1. If r(A,m)1, then consider the sequence J consisting of 2m-2 small jobs having a processing time of 1 and one large job with a processing time of 2. Obviously OPT(J)=2. If the permuted sequence starts with m small jobs, the least loaded machine has load 1 once the large job arrives. Under such permutations A(Jσ)3=32·OPT(J) holds true. The probability of this happening is m-12m-1. The probability approaches 12 and in particular does not vanish, for m. Thus, if A is nearly c-competitive, then c3/2.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Footnotes

1

Note that besides rounding also hides a small term in o(m).

A preliminary version of this paper has appeared in the 47th International Colloqium on Automata, Languages and Programming (ICALP), 2020. Work supported by the European Research Council, Grant Agreement No. 691672, project APEG.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Susanne Albers, Email: albers@in.tum.de.

Maximilian Janke, Email: maximilian@janke.tech.

References

  • 1.Albers S. Better bounds for online scheduling. SIAM J. Comput. 1999;29(2):459–473. doi: 10.1137/S0097539797324874. [DOI] [Google Scholar]
  • 2.Babaioff, M., Immorlica, N., Kempe, D., Kleinberg, R.: A knapsack secretary problem with applications. In: Proceedings of 10th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems (APPROX). Springer, pp. 16–28 (2007)
  • 3.Babaioff M, Immorlica N, Kempe D, Kleinberg R. Matroid secretary problems. J. ACM. 2018;65(6):1–26. doi: 10.1145/3212512. [DOI] [Google Scholar]
  • 4.Bartal, Y., Fiat, A., Karloff, H., Vohra, R.: New algorithms for an ancient scheduling problem. In: Proceedings of 24th ACM Symposium on Theory of Computing (STOC), pp. 51–58 (1992)
  • 5.Bartal Y, Karloff H, Rabani Y. A better lower bound for on-line scheduling. Inf. Process. Lett. 1994;50(3):113–116. doi: 10.1016/0020-0190(94)00026-3. [DOI] [Google Scholar]
  • 6.Chen B, van Vliet A, Woeginger G. A lower bound for randomized on-line scheduling algorithms. Inf. Process. Lett. 1994;51(5):219–222. doi: 10.1016/0020-0190(94)00110-3. [DOI] [Google Scholar]
  • 7.Chen L, Ye D, Zhang G. Approximating the optimal algorithm for online scheduling problems via dynamic programming. Asia Pac. J. Oper. Res. 2015;32(01):1540011. doi: 10.1142/S0217595915400114. [DOI] [Google Scholar]
  • 8.Cheng T, Kellerer H, Kotov V. Semi-on-line multiprocessor scheduling with given total processing time. Theor. Comput. Sci. 2005;337(1–3):134–146. doi: 10.1016/j.tcs.2004.11.018. [DOI] [Google Scholar]
  • 9.Dohrau, J.: Online makespan scheduling with sublinear advice. In: 41st International Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM). Springer, pp. 177–188 (2015)
  • 10.Dynkin E. The optimum choice of the instant for stopping a Markov process. Sov. Math. 1963;4:627–629. [Google Scholar]
  • 11.Englert, M., Özmen, D., Westermann, M.: The power of reordering for online minimum makespan scheduling. In: Proceedings of 49th 676 IEEE Annual Symposium on Foundations of Computer Science (FOCS). IEEE, pp. 603–612 (2008)
  • 12.Faigle U, Kern W, Turán G. On the performance of on-line algorithms for partition problems. Acta Cybern. 1989;9(2):107–119. [Google Scholar]
  • 13.Feldman, M., Svensson, O., Zenklusen, R.: A simple O (log log (rank))-competitive algorithm for the matroid secretary problem. In: Proceedings of 26th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM, pp. 1189–1201 (2014)
  • 14.Fleischer R, Wahl M. On-line scheduling revisited. J. Sched. 2000;3(6):343–353. doi: 10.1002/1099-1425(200011/12)3:6&#x0003c;343::AID-JOS54&#x0003e;3.0.CO;2-2. [DOI] [Google Scholar]
  • 15.Galambos G, Woeginger G. An on-line scheduling heuristic with better worst-case ratio than Graham’s list scheduling. SIAM J. Comput. 1993;22(2):349–355. doi: 10.1137/0222026. [DOI] [Google Scholar]
  • 16.Göbel, O., Kesselheim, T., Tönnis, A.: Online appointment scheduling in the random order model. In: Algorithms-ESA 2015. Springer, pp. 680–692 (2015)
  • 17.Goel G, Mehta A. Online budgeted matching in random input models with applications to Adwords. SODA. 2008;8:982–991. [Google Scholar]
  • 18.Gormley, T., Reingold, N., Torng, E., Westbrook, J.: Generating adversaries for request-answer games. In: Proceedings of the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 564–565 (2000)
  • 19.Graham R. Bounds for certain multiprocessing anomalies. Bell Syst. Tech. J. 1966;45(9):1563–1581. doi: 10.1002/j.1538-7305.1966.tb01709.x. [DOI] [Google Scholar]
  • 20.Gupta, A., Mehta, R., Molinaro, M.: Maximizing Profit with Convex Costs in the Random-Order Model. arXiv preprint arXiv:1804.08172 (2018)
  • 21.Hochbaum D, Shmoys D. Using dual approximation algorithms for scheduling problems theoretical and practical results. J. ACM. 1987;34(1):144–162. doi: 10.1145/7531.7535. [DOI] [Google Scholar]
  • 22.Karande, C., Mehta, A., Tripathi, P.: Online bipartite matching with unknown distributions. In: Proceedings of the Forty-Third Annual ACM Symposium on Theory of Computing, pp. 587–596 (2011)
  • 23.Karger D, Phillips S, Torng E. A better algorithm for an ancient scheduling problem. J. Algorithms. 1996;20(2):400–430. doi: 10.1006/jagm.1996.0019. [DOI] [Google Scholar]
  • 24.Kellerer H, Kotov V. An efficient algorithm for bin stretching. Oper. Res. Lett. 2013;41(4):343–346. doi: 10.1016/j.orl.2013.03.005. [DOI] [Google Scholar]
  • 25.Kellerer H, Kotov V, Speranza MG, Tuza Z. Semi on-line algorithms for the partition problem. Oper. Res. Lett. 1997;21(5):235–242. doi: 10.1016/S0167-6377(98)00005-4. [DOI] [Google Scholar]
  • 26.Kenyon C. Best-fit bin-packing with random order. SODA. 1996;96:359–364. [Google Scholar]
  • 27.Kesselheim, T., Tönnis, A., Radke, K., Vöcking, B.: Primal beats dual on online packing LPs in the random-order model. In: Proceedings of the Forty-Sixth Annual ACM Symposium on Theory of Computing, pp. 303–312 (2014)
  • 28.Kleinberg R. A multiple-choice secretary algorithm with applications to online auctions. SODA. 2005;5:630–631. [Google Scholar]
  • 29.Lachish, O.: O (log log rank) competitive ratio for the matroid secretary problem. In: 2014 IEEE 55th Annual Symposium on Foundations of Computer Science. IEEE, pp. 326–335 (2014)
  • 30.Mahdian, M., Yan, Q.: Online bipartite matching with random arrivals: an approach based on strongly factor-revealing lps. In: Proceedings of the Forty-Third Annual ACM Symposium on Theory of Computing, pp. 597–606 (2011)
  • 31.Meyerson, A.: Online facility location. In: Proceedings 42nd IEEE Symposium on Foundations of Computer Science. IEEE, pp. 426–431 (2001)
  • 32.Molinaro, M.: Online and random-order load balancing simultaneously. In: Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, pp. 1638–1650 (2017)
  • 33.Osborn C, Torng E. List’s worst-average-case or WAC ratio. J. Sched. 2008;11(3):213–215. doi: 10.1007/s10951-007-0019-7. [DOI] [Google Scholar]
  • 34.Pruhs, K., Sgall, J., Torng, E.: Online scheduling. (2004)
  • 35.Rudin, J., III.: Improved bounds for the on-line scheduling problem (2001)
  • 36.Rudin, J., III., Chandrasekaran, R.: Improved bounds for the online scheduling problem. SIAM J. Comput. 32(3), 717–735 (2003)
  • 37.Sanders P, Sivadasan N, Skutella M. Online scheduling with bounded migration. Math. Oper. Res. 2009;34(2):481–498. doi: 10.1287/moor.1090.0381. [DOI] [Google Scholar]
  • 38.Sgall J. A lower bound for randomized on-line multiprocessor scheduling. Inf. Process. Lett. 1997;63(1):51–55. doi: 10.1016/S0020-0190(97)00093-8. [DOI] [Google Scholar]
  • 39.Sleator D, Tarjan R. Amortized efficiency of list update and paging rules. Commun. ACM. 1985;28(2):202–208. doi: 10.1145/2786.2793. [DOI] [Google Scholar]

Articles from Algorithmica are provided here courtesy of Springer

RESOURCES