Abstract
We study kill-and-restart and preemptive strategies for the fundamental scheduling problem of minimizing the sum of weighted completion times on a single machine in the non-clairvoyant setting. First, we show a lower bound of 3 for any deterministic non-clairvoyant kill-and-restart strategy. Then, we give for any \(b > 1\) a tight analysis for the natural b-scaling kill-and-restart strategy as well as for a randomized variant of it. In particular, we show a competitive ratio of \((1+3\sqrt{3})\approx 6.196\) for the deterministic and of \(\approx 3.032\) for the randomized strategy, by making use of the largest eigenvalue of a Toeplitz matrix. In addition, we show that the preemptive Weighted Shortest Elapsed Time First (WSETF) rule is 2-competitive when jobs are released online, matching the lower bound for the unit weight case with trivial release dates for any non-clairvoyant algorithm. Using this result as well as the competitiveness of round-robin for multiple machines, we prove performance guarantees smaller than 10 for adaptations of the b-scaling strategy to online release dates and to unweighted jobs on identical parallel machines.
1 Introduction
Minimizing the total weighted completion time on a single processor is one of the most fundamental problems in the field of machine scheduling. The input consists of n jobs with processing times \(p_1,\ldots ,p_n\) and weights \(w_1,\dotsc ,w_n\), and the task is to sequence them in such a way that the sum of weighted completion times \(\sum _{j=1}^n w_j C_j\) is minimized. We denote this problem as \(1\,||\,\sum w_jC_j\). Smith [2] showed in the 50’s that the optimal schedule is obtained by the Weighted Shortest Processing Time first (\(\text {WSPT}\)) rule, i.e., jobs are sequenced in non-decreasing order of the ratio of their processing time to their weight.
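As a small illustration, the following Python sketch applies the WSPT rule to a toy instance (the job data are arbitrary) and evaluates the objective \(\sum _j w_j C_j\).

```python
# Smith's WSPT rule on a single machine (toy data).
def wspt_cost(p, w):
    """Sequence jobs non-decreasingly by p_j / w_j and return sum of w_j * C_j."""
    order = sorted(range(len(p)), key=lambda j: p[j] / w[j])
    t, cost = 0.0, 0.0
    for j in order:
        t += p[j]          # completion time C_j of job j
        cost += w[j] * t
    return cost

p = [3.0, 1.0, 2.0]        # processing times (arbitrary example)
w = [1.0, 2.0, 1.0]        # weights (arbitrary example)
print(wspt_cost(p, w))     # minimum total weighted completion time
```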
Reality does not always provide all information beforehand. Around 30 years ago, the non-clairvoyant model, in which the processing time of any job becomes known only upon its completion, was introduced for several scheduling problems [3,4,5]. It is easy to see that no non-preemptive non-clairvoyant algorithm can be constant-competitive for the unweighted variant \(1\,||\,\sum C_j\). In their seminal work, Motwani et al. [5] proved for this problem that allowing preemption breaks the non-constant barrier. Specifically, they showed that the natural round-robin algorithm is 2-competitive, matching a lower bound for all non-clairvoyant algorithms. This opened up a new research direction, leading to constant-competitive preemptive non-clairvoyant algorithms in much more general settings, like weighted jobs [6], multiple machines [7,8,9], precedence constraints [10, 11], and non-trivial release dates. When jobs are released over time, they are assumed to be unknown before their arrivals (online scheduling). No lower bound better than 2 is known for this case, whereas the best known upper bound before this work was 3, see e.g. [12].
But there is a downside of the preemptive paradigm as it uses an unlimited number of interruptions at no cost and has a huge memory requirement to maintain the ability to resume all interrupted jobs. Therefore, we continue by studying the natural class of kill-and-restart strategies that—inspired by computer processes—can abort the execution of a job (kill), but when processed again later, the job has to be re-executed from the beginning (restart). It can be considered as an intermediate category of algorithms between preemptive and non-preemptive ones, as on one hand jobs may be interrupted, and on the other hand when jobs are completed, they have been processed as a whole. Hence, by removing all aborted executions one obtains a non-preemptive schedule. Although this class of algorithms has already been investigated since the 90’s [4], to the best of our knowledge, the competitive ratio of non-clairvoyant kill-and-restart strategies for the total completion time objective has never been studied.
Our contribution We start by strengthening the preemptive lower bound of 2 for the kill-and-restart model.
Theorem 2
For \(1\,||\,\sum w_jC_j\), no deterministic non-clairvoyant kill-and-restart strategy can achieve a competitive ratio smaller than \(3-\frac{2}{n+1}\) on instances with \(n\ge 3\) jobs, even if every job j has processing time \(p_j \ge 1\).
The main part of this work is devoted to the b-scaling strategy \({\mathfrak {D}_{b}}\) that repeatedly probes each unfinished job for the time of an integer power of \(b>1\) multiplied by its weight. For \(1\,||\,\sum w_jC_j\) it is easy to see that \(\mathfrak {D}_{2}\) is 8-competitive by comparing its schedule to the weighted round-robin schedule for a modified instance and using the 2-competitiveness due to Kim and Chwa [6]. Using a novel and involved analysis we determine the exact competitive ratio of \({\mathfrak {D}_{b}}\).
Theorem 6
For \(b>1\), \({\mathfrak {D}_{b}}\) is \(\bigl ( 1+\frac{2b^{\nicefrac {3}{2}}}{b-1} \bigr )\)-competitive for \(1\,||\,\sum w_jC_j\). This ratio is minimized for \(b=3\), yielding a performance guarantee of \(1+3\sqrt{3}\approx 6.196\).
Theorem 7
For every \(b > 1\), there exists a sequence of instances \((\varvec{p}_L)_{L \in \mathbb {N}}\) for \(1\,||\,\sum w_jC_j\) such that
\(\frac{{\mathfrak {D}_{b}}(\varvec{p}_L)}{\text {OPT}(\varvec{p}_L)} \xrightarrow {L\rightarrow \infty } 1+\frac{2b^{\nicefrac {3}{2}}}{b-1}.\)
Our main technique is to reduce the problem of finding the competitive ratio of \({\mathfrak {D}_{b}}\) to the computation of the largest eigenvalue of a tridiagonal Toeplitz matrix. Subsequently, we obtain a significantly better exact competitive ratio for a randomized version of the b-scaling strategy, denoted by \({\mathfrak {R}_b}\), that permutes the jobs uniformly at random and chooses a random offset drawn from a log-uniform distribution.
Theorem 10
For every \(b>1\), \({\mathfrak {R}_b}\) is \(\frac{2b+\sqrt{b}-1}{\sqrt{b} \ln b}\)-competitive for \(1\,||\,\sum w_jC_j\). This ratio is minimized for \(b\approx 8.16\), yielding a performance guarantee smaller than 3.032.
Theorem 11
For all \(b>1\), there exists a sequence of instances \((\varvec{p}_L)_{L\in \mathbb {N}}\) for \(1\,||\,\sum w_jC_j\) such that
\(\frac{{\mathfrak {R}_b}(\varvec{p}_L)}{\text {OPT}(\varvec{p}_L)} \xrightarrow {L\rightarrow \infty } \frac{2b+\sqrt{b}-1}{\sqrt{b}\ln b}.\)
The analysis basically mimics that of the deterministic strategy, but it is necessary to group the jobs whose Smith ratio falls in the ith interval of the form \((b^{i/K},b^{(i+1)/K}]\), where K is a large natural number. This approach leads to the computation of the largest eigenvalue of a banded symmetric Toeplitz matrix of bandwidth \(2K-1\), and the result is obtained by letting \(K\rightarrow \infty \).
We then study more general scheduling environments. For the online problem, in which jobs are released over time, denoted by \(1\,|\,r_j,\,\text {pmtn}\,|\,\sum w_jC_j\), we close the gap for the best competitive ratio of preemptive algorithms by analyzing the Weighted Shortest Elapsed Time First rule, short \(\text {WSETF}\). This policy runs at every point in time the job(s) with minimum ratio of the processing time experienced so far (elapsed time) to the weight.
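The following Python sketch approximates WSETF by a simple discrete-time simulation: in each small time step the machine capacity is shared, proportionally to the weights, among the released unfinished jobs with minimum elapsed-time-to-weight ratio. The step size and the toy data are arbitrary, and the discretization only approximates the idealized processor-sharing rule.

```python
# Approximate discrete-time simulation of WSETF with release dates (a sketch).
def wsetf_cost(p, w, r, dt=1e-3):
    n = len(p)
    elapsed = [0.0] * n
    done = [False] * n
    completion = [0.0] * n
    t = 0.0
    while not all(done):
        avail = [j for j in range(n) if not done[j] and r[j] <= t + 1e-12]
        if not avail:
            t = min(r[j] for j in range(n) if not done[j])  # jump to next release
            continue
        m = min(elapsed[j] / w[j] for j in avail)
        front = [j for j in avail if elapsed[j] / w[j] <= m + 1e-9]
        W = sum(w[j] for j in front)
        for j in front:                  # share the step proportionally to the weights
            elapsed[j] += dt * w[j] / W
        t += dt
        for j in front:
            if elapsed[j] >= p[j] - 1e-12:
                done[j] = True
                completion[j] = t        # accurate only up to the step size dt
    return sum(w[j] * completion[j] for j in range(n))

# Example with made-up data: three jobs, one released later.
print(wsetf_cost(p=[2.0, 1.0, 1.5], w=[1.0, 2.0, 1.0], r=[0.0, 0.0, 1.0]))
```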
Theorem 12
\(\text {WSETF}\) is 2-competitive for \(1\,|\,r_j,\,\text {pmtn}\,|\,\sum w_jC_j\).
Theorem 12 generalizes the known 2-competitiveness for trivial release dates shown by Kim and Chwa [6]. It also matches the performance guarantee of the best known stochastic online scheduling policy \(\text {F-GIPP}\) [13], a generalization of the Gittins index priority policy [14, 15], for the stochastic variant of our problem where the probability distributions of the processing times are given at the release dates and the expected objective value is to be minimized. Our improvement upon the analysis of this policy, applied to a single machine, is threefold: First, our strategy does not require any information about the distributions of the processing times, second, we compare to the clairvoyant optimum, while \(\text {F-GIPP}\) is compared to the optimal non-anticipatory policy, and third, \(\text {WSETF}\) is more intuitive and easier to implement in applications than the \(\text {F-GIPP}\) policy.
Using Theorem 12, we then give an upper bound on the competitive ratio of a generalized version of \({\mathfrak {D}_{b}}\) that also handles jobs arriving over time by never interrupting a probe.
Theorem 19
\({\mathfrak {D}_{b}}\) is \(\frac{2b^4}{2b^2-3b+1}\)-competitive for \(1\,|\,r_j\,|\,\sum w_jC_j\). This ratio is minimized for \(b=\frac{9+\sqrt{17}}{8}\), yielding a performance guarantee of \(\frac{107+51\sqrt{17}}{32} \approx 9.915\).
Finally, we also analyze the unweighted problem on multiple identical parallel machines.
Theorem 23
\({\mathfrak {D}_{b}}\) is \(\frac{3b^2-b}{b-1}\)-competitive for \(P\,||\,\sum C_j\). This ratio is minimized for \(b=\frac{3+\sqrt{6}}{3}\), yielding a performance guarantee of \(5+2\sqrt{6} \approx 9.899\).
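The stated parameter choices can be checked numerically; the following Python snippet (with arbitrarily chosen search intervals) minimizes both ratio functions and compares the minimizers and values with the closed forms above.

```python
# Numerical check of the parameter choices in Theorems 19 and 23
# (search intervals chosen arbitrarily).
from math import sqrt
from scipy.optimize import minimize_scalar

g19 = lambda b: 2 * b ** 4 / (2 * b ** 2 - 3 * b + 1)   # Theorem 19
g23 = lambda b: (3 * b ** 2 - b) / (b - 1)              # Theorem 23

b19 = minimize_scalar(g19, bounds=(1.01, 5.0), method="bounded").x
b23 = minimize_scalar(g23, bounds=(1.01, 5.0), method="bounded").x
print(b19, (9 + sqrt(17)) / 8, g19(b19), (107 + 51 * sqrt(17)) / 32)   # ~1.640, ~9.915
print(b23, (3 + sqrt(6)) / 3, g23(b23), 5 + 2 * sqrt(6))               # ~1.816, ~9.899
```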
Related work
Non-preemptive scheduling. The beginnings of the field of machine scheduling date back to the work of Smith [2], who investigated the problem of non-preemptively minimizing the sum of weighted completion times on a single machine. Its optimal schedule is obtained by sequencing the jobs in non-decreasing order of their processing time to weight ratio \(\nicefrac {p_j}{w_j}\) (Smith’s rule). When all jobs have unit weights, one obtains the Shortest Processing Time first (SPT) rule. This can be generalized to the identical parallel machine setting, where list scheduling [16] according to SPT is optimal [17] for unit-weight jobs. However, the problem of scheduling jobs released over time on a single machine is strongly NP-hard [18] (even for unit weights), and Chekuri and Khanna developed a polynomial-time approximation scheme (PTAS) for it [19]. When jobs arrive online, no deterministic algorithm can be better than 2-competitive [20], and this ratio is achieved by a delayed variant of Smith’s rule [21].
In the non-clairvoyant setting it is well known that no (randomized) non-preemptive algorithm is constant-competitive (see Proposition 1). A less pessimistic model is the stochastic model, where the distributions of the random processing times \(P_j\) are given and one is interested in non-anticipatory (measurable) policies [22]. For some classes of distributions, this information allows obtaining constant expected competitive ratios [23] for parallel identical machines. However, policies minimizing the expected competitive ratio do not need to minimize the expected objective value—the classic measure in stochastic optimization. For this criterion, Rothkopf [24] showed that for the single machine case the optimality of Smith’s rule can be transferred to the Weighted Shortest Expected Processing Time rule, in which jobs are sorted in non-decreasing order of \(\nicefrac {\mathbb {E}[P_j]}{w_j}\). In order to deal with the stochastic counterparts of the NP-hard problems mentioned above, Möhring et al. [25] introduced approximative scheduling policies, whose expected objective value is compared to the expected objective value of an optimal non-anticipatory policy. While there are constant-competitive policies for stochastic online scheduling on a single machine [26], the performance guarantees of all known approximative policies for multiple machines depend on either the maximum coefficient of variation [27] or the number of jobs and machines [28], even for unit-weight jobs released at time 0.
Preemptive scheduling. For the clairvoyant offline model, allowing preemption only helps in the presence of non-trivial release dates [29]. In this case, the optimal preemptive schedule may be a factor of \(\text {e}/(\text {e} - 1)\) better than the best non-preemptive one [30]. Finding an optimal preemptive schedule is still strongly NP-hard [31], and there is a PTAS adapted to this problem [19]. For jobs arriving online Sitters [32] developed a 1.566-competitive deterministic algorithm, and Epstein and van Stee [33] proved a lower bound of 1.073.
When the job lengths are uncertain, allowing preemption becomes much more crucial. Motwani et al. [5] showed that the simple (non-clairvoyant) round-robin procedure has a competitive ratio of 2 for minimizing the total completion time on identical machines. This gives the same share of machine time to each job in rounds of infinitesimally small time slices. For weighted jobs, the Weighted Round-Robin (\(\text {WRR}\)) rule (also known as generalized processor sharing (GPS) [34, 35]), which always distributes the available machine capacity to the jobs proportionally to their weights, was shown to be 2-competitive on a single machine by Kim and Chwa [6], and the same competitive ratio is achieved by a generalization for multiple identical machines [7]. Similar time sharing algorithms were also developed in the context of non-clairvoyant online scheduling, where jobs arrive over time and are not known before their release dates. Here one can distinguish between minimizing the total (weighted) completion time and the total (weighted) flow time. The \(\text {WRR}\) rule can be generalized in two natural ways in this setting: Either the machine capacity is still allocated based only on the weights or based on the weighted elapsed times, resulting in the \(\text {WSETF}\) rule, mentioned above. It is easy to see that both are 3-competitive, see e.g. [12]. On the other hand, there exist examples showing that the first option is not 2-competitive for total weighted completion time. For the total weighted flow time objective constant competitiveness is unattainable [5]. Apart from work on non-constant competitive ratios [36], the problem has been primarily studied in the resource augmentation model [37], where the machine used by the algorithm runs \(1+\varepsilon \) times faster. Kim and Chwa and Bansal and Dhamdhere [38] independently proved that \(\text {WSETF}\) is \((1+\varepsilon )\)-speed \((1+\nicefrac 1 \varepsilon )\)-competitive for weighted flow time on a single machine. By running this algorithm on the original-speed machine, the completion times increase by a factor of \(1+\nicefrac 1 \varepsilon \), so that one obtains a \((1+\varepsilon )(1+\nicefrac 1 \varepsilon )\)-competitive algorithm for the total weighted completion time [39], which yields a ratio of 4 for \(\varepsilon = 1\). The proofs of Kim and Chwa and Bansal and Dhamdhere both proceed by showing that at any time \(t \ge 0\) the total weight of unfinished jobs in the \(\text {WSETF}\) schedule is at most a factor of \((1+\nicefrac 1 \varepsilon )\) larger than the unfinished weight in the optimal schedule. The lower-bound example of Motwani et al. (many equal small jobs released at time 0) demonstrates that with such an approach no better bound than 4 is achievable. Consequently, a completely different technique is needed to prove Theorem 12. For the much more general setting of unrelated machines Im et al. [9] established a \((1+\varepsilon )\)-speed \(\mathcal {O}(\nicefrac {1}{\varepsilon ^2})\)-competitive algorithm. Motwani et al. also considered the model in which the number of allowed preemptions is limited, for which they devised algorithms that resemble the kill-and-restart algorithms presented in this paper.
As mentioned above, for the stochastic model for minimizing the expected total weighted completion time, the Gittins index policy is optimal for a single machine with trivial release dates [14, 15], and Megow and Vredeveld [13] established a 2-competitive online policy for multiple machines and arbitrary release dates.
Kill-and-restart scheduling. The kill-and-restart model was introduced by Shmoys et al. [4] in the context of makespan minimization. For the total completion time objective we are not aware of any work on kill-and-restart strategies in the non-clairvoyant model. However, in the clairvoyant online model, kill-and-restart algorithms have been considered by Vestjens [40] and Epstein and van Stee [33], who gave lower bounds that are larger than the lower bounds for preemptive algorithms but much smaller than the known lower bounds for non-preemptive online algorithms, suggesting that allowing restarts may help in the online model. The proof of this fact was given several years later by van Stee and La Poutré [41], who achieved a deterministic competitive ratio of 3/2 for minimizing the total completion time on a single machine, beating even the lower bound of \(\text {e}/(\text {e} - 1) \approx 1.582\) for any randomized non-preemptive online algorithm [42]. In the non-clairvoyant setting, considered in this work, we observe a much larger benefit from allowing restarts, reducing the competitive ratio from \(\Omega (n)\) to a constant.
Further related work. In the end, all aborted probes served only the purpose of collecting information about the unknown processing times of the jobs. Kill-and-restart strategies can thus be regarded as online algorithms for non-preemptive scheduling with the possibility to invest time in order to obtain some information. In that sense, the considered model resembles that of explorable uncertainty [43,44,45]. In order to allow for any reasonable competitiveness results, it must be ensured in both models that testing/probing provides some benefit to the algorithm other than information gain. In the explorable uncertainty model, this is achieved by the assumption that testing can shorten the actual processing times, while in our model the probing time replaces the processing time if the probe was long enough.
Scheduling on a single machine under the kill-and-restart model shares many similarities with optimal search problems, in which a number of agents are placed in some environment and must either find some target or meet each other as quickly as possible. A problem that received a lot of attention is the so-called w-lanes cow-path problem, in which an agent (the cow) is initially placed at the crossing of w roads, and must find a goal (a grazing field) located at some unknown distance on one of the w roads [46]. For the case \(w=2\), deterministic and randomized search strategies were given that achieve the optimal competitive ratio of 9 [47] and approximately 4.5911 [47, 48], respectively. This work has been extended by Kao et al. [49], who give optimal deterministic and randomized algorithms for all \(w\in \mathbb {N}\). The single-machine scheduling problem with kill-and-restart strategies can in fact be viewed in this framework: There are now \(n=w\) goals, and the jth goal is located at some unknown distance \(p_j\) on the jth road. The agent can move at unit speed on any of the roads, and has the ability to teleport back to the origin at any point in time, which represents the action of aborting a job. The objective is to minimize the sum of times at which each goal is found.
2 Preliminaries
We consider the machine scheduling problem of minimizing the weighted sum of completion times on a single machine (\(1\,||\,\sum w_jC_j\)). Formally, we consider instances \(I = (\varvec{p},\varvec{w})\) consisting of a vector of processing times \(\varvec{p} = (p_j)_{j=1}^n\) and a vector of weights \(\varvec{w} = (w_j)_{j=1}^n\).
If the jobs are indexed in WSPT order, i.e., ordered non-decreasingly by their Smith ratios \(p_j/w_j\), then sequencing them in this order yields an optimal schedule. We denote this (clairvoyant) schedule by \(\text {OPT}(I)\). By slight abuse of notation, we also denote the objective value of an optimal schedule by \(\text {OPT}(I)\). In particular, its cost is \( \text {OPT}(I) = \sum _{j=1}^n w_j\sum _{k=1}^j p_{k} = \sum _{j=1}^n p_{j}\sum _{k=j}^n w_k \).
The focus of our work lies in the analysis of non-clairvoyant strategies. We call a strategy non-clairvoyant if it does not use information on the processing time \(p_j\) of a job j before j has been completed. A deterministic strategy \(\mathfrak {D}\) is said to be c-competitive if, for all instances \(I = (\varvec{p}, \varvec{w})\), \(\mathfrak {D} (I) \le c \cdot \text {OPT}(I)\), where \(\mathfrak {D} (I)\) denotes the cost of the strategy for instance I. The competitive ratio of \(\mathfrak {D}\) is defined as the infimum over all c such that \(\mathfrak {D}\) is c-competitive. For a randomized strategy \(\mathfrak {R}\), the cost for instance I is a random variable \(X_I:\Omega \rightarrow \mathbb {R}_{\ge 0}\) that associates an outcome \(\omega \) of the strategy’s sample space to the realized cost, and we denote by \(\mathfrak {R}(I){:}{=}\mathbb {E}[X_I]\) the expected cost of the randomized strategy for instance I. We say that \(\mathfrak {R}\) is c-competitive if for all instances \(I = (\varvec{p}, \varvec{w})\), \(\mathfrak {R}(I)\le c \cdot \text {OPT}(I)\). It is well known that for our problem no non-preemptive strategy can achieve a constant competitive ratio.
Proposition 1
No randomized non-preemptive non-clairvoyant strategy has a constant competitive ratio for \(1\,||\,\sum C_j\).
Proof
By Yao’s principle [50] it suffices to construct a randomized instance for which any deterministic strategy has non-constant competitive ratio. To this end, we consider the instance with n jobs where \(p_1=\cdots =p_{n-1}=1\) and \(p_n=n^2\) and randomize uniformly over all permutations of the jobs. Clearly, an optimal clairvoyant strategy sequences the jobs in any realization in SPT order and hence, we have \( \text {OPT}=\sum _{j=1}^n (n-j+1)p_j=\frac{1}{2}n(n-1)+n-1+n^2=\mathcal {O}(n^2). \)
The schedule of any deterministic strategy can be represented as a permutation of the jobs as idling only increases the objective value. Hence, for any permutation we obtain an expected cost of at least
\(\sum _{k=1}^{n} \frac{1}{n}\,(n-k+1)\,n^2 = \frac{n+1}{2}\, n^2 = \Omega (n)\cdot \text {OPT},\)
where we used the fact that in a uniformly distributed permutation, the probability that the long job appears in each position \(k\in [n]\) is \(\frac{1}{n}\). \(\square \)
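The following Python snippet evaluates this construction numerically: it averages the cost of a fixed non-preemptive order over all positions of the long job and compares it with the SPT optimum; as argued above, the ratio grows linearly in n.

```python
# Expected cost of a fixed non-preemptive order on the instance of Proposition 1,
# averaged over the position of the long job, compared with the SPT optimum.
def expected_alg(n):
    long_job = n * n
    total = 0.0
    for k in range(1, n + 1):            # position of the long job (uniform over [n])
        t, cost = 0.0, 0.0
        for pos in range(1, n + 1):
            t += long_job if pos == k else 1.0
            cost += t                    # unit weights: add the completion time
        total += cost / n
    return total

def opt(n):
    # SPT: the n-1 unit jobs first, then the long job
    return n * (n - 1) // 2 + (n - 1) + n * n

for n in (10, 100, 1000):
    print(n, expected_alg(n) / opt(n))   # ratio grows linearly in n
```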
Kill-and-restart strategies Due to this negative result, we study non-clairvoyant kill-and-restart strategies for \(1\,||\,\sum w_jC_j\) that may abort the processing of a job, but when it is processed again later, it has to be executed from the beginning. In order to define such strategies, we first introduce a state and action space as well as a transition function modeling the kill-and-restart setting. Then, we can describe kill-and-restart strategies as functions mapping states to actions.
Formally, we consider the state space \(\mathcal {S}{:}{=}\mathbb {R}\times 2^{[n]} \times \mathbb {R}^n\). A state \((\theta , U, \varvec{\mu }) \in \mathcal {S}\) consists of the current time \(\theta \), the set of unfinished jobs U at \(\theta \), and a vector \(\varvec{\mu }\) of lower bounds on the processing times learned from past probes, such that \(p_j\ge \mu _j\) for all jobs j. For every state \(s = (\theta , U, \varvec{\mu }) \in \mathcal {S}\), there is a set of possible kill-and-restart actions \(\mathcal {A}(s)\), where an action \(a = \bigl ((t_i,j_i,\tau _i)\bigr )_{i \in \mathcal {I}} \in \mathcal {A}(s)\) is a family of probes \((t_i,j_i,\tau _i)\) such that the intervals \( (t_i, t_i+\tau _i)\), \(i \in \mathcal {I}\), are disjoint and contained in \(\mathbb {R}_{> \theta }\) and \(j_i \in U\) for all \(i \in \mathcal I\). We denote by \(\mathcal {A}= \bigcup _{s \in \mathcal {S}} \mathcal {A}(s)\) the set of all actions in all states. Additionally, we define a transition function \(T_I :\mathcal {S}\times \mathcal {A}\rightarrow \mathcal {S}\) depending on the instance I. This function transforms any state \(s = (\theta , U, \varvec{\mu })\) and action \(a = \big ((t_i, j_i, \tau _i)\big )_{i \in \mathcal {I}} \in \mathcal A(s)\) into a new state \(s' = (\theta ', U', \varvec{\mu }')\) as follows. First, we identify the probe indexed with \(i^* {:}{=}{\text {argmin}} \big \{ t_i + p_{j_i} \mid i \in \mathcal {I} \text { with } \tau _i \ge p_{j_i} \big \}\), which is the first probe in a that leads to the completion of some job. Then, the lower bounds \(\varvec{\mu }'\) of the new state \(s'\) are defined by \(\mu _j' {:}{=}\max \big \{ \mu _j, \max \{ \min \{ \tau _i, p_j \} \mid i \in \mathcal {I}: t_i \le t_{i^*}, j_i = j \} \big \}\), i.e., the lower bounds are updated to the new probing times or the processing times if appropriate. Further, the job completing in probe \(i^*\) is removed from the set of unfinished jobs by setting \(U'{:}{=}U{\setminus } \{j_{i^*}\}\), and the time is updated to \(\theta ' {:}{=}t_{i^*} + p_{j_{i^*}}\). Finally, we define a kill-and-restart strategy as a function \(\Pi :\mathcal {S}\rightarrow \mathcal {A}\) with \(\Pi (s) \in \mathcal {A}(s)\) for all \(s \in \mathcal {S}\). Note that a kill-and-restart strategy is non-clairvoyant by definition as it only has access to the lower bounds on the processing times, while the actual processing time is only revealed to the strategy upon completion of a job.
However, observe that such strategies may not be implementable, e.g., on a Turing machine, as the above definition allows for an infinite number of probes in a bounded time range. On the other hand, a deterministic kill-and-restart strategy without infinitesimal probing cannot be constant-competitive. To see this, consider an arbitrary algorithm ALG, and assume without loss of generality that the first job it probes is the first job presented in the input. Denote by \(t>0\) the first probing time, and, for \(\varepsilon < 1\), consider the instance \(I_\varepsilon =\big ((t, \varepsilon t,\ldots ,\varepsilon t),(1,\ldots ,1)\big )\) with n unit weight jobs. By construction, ALG processes the first job without aborting it, so \(ALG\ge nt + n(n-1)/2\cdot t \varepsilon = nt+\mathcal {O}(\varepsilon )\). On the other hand \(\text {OPT}\) schedules the jobs in SPT order, yielding \(\text {OPT}=n(n-1)/2\cdot t \varepsilon + (n-1)t\varepsilon +t=t+\mathcal {O}(\varepsilon )\). Hence, \(ALG/\text {OPT}\) approaches n as \(\varepsilon \rightarrow 0\). This subtlety is in fact inherent to all scheduling problems with unknown processing times or search problems with unknown distances. Of course, this can be avoided if a lower bound on the processing times \(p_j\) is given, e.g. \(p_j \ge 1\) for all \(j \in [n]\). In this case, the strategies analyzed in this paper can be turned into implementable ones. Throughout this paper, however, we do not want to make this assumption and hence analyze strategies with infinitesimal probing. This way of describing strategies for search problems is commonly used in the literature (see, e.g. [47]) and simplifies the exposition of our analysis since some constants resulting from finite geometric series vanish when infinitesimal probing is introduced so that the geometric series are infinite.
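The following small computation illustrates the instance \(I_\varepsilon \) from the argument above for a fixed n; the values of n and t are arbitrary.

```python
# Illustration of the instance I_eps: without infinitesimal probing,
# ALG/OPT approaches n as eps -> 0 (n and t chosen arbitrarily).
def ratio(n, t, eps):
    alg = n * t + n * (n - 1) / 2 * t * eps                    # first job runs to completion
    opt = n * (n - 1) / 2 * t * eps + (n - 1) * t * eps + t    # SPT schedule
    return alg / opt

for eps in (1e-1, 1e-3, 1e-6):
    print(eps, ratio(n=10, t=1.0, eps=eps))   # tends to n = 10
```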
We denote by \(Y_j^{\mathfrak {S}}(I, t)\) the total time for which the machine has been busy processing job j until time t in the schedule constructed by the strategy \(\mathfrak {S}\) on the instance I.
3 Lower bound for deterministic strategies
Theorem 2
For \(1\,||\,\sum w_jC_j\), no deterministic non-clairvoyant kill-and-restart strategy can achieve a competitive ratio smaller than \(3-\frac{2}{n+1}\) on instances with \(n\ge 3\) jobs, even if every job j has processing time \(p_j \ge 1\).
Proof
Let \(\varepsilon \in \bigl (\frac{2}{n+1},1\bigr ]\) and define \(T {:}{=}\frac{(2-\varepsilon )(n^2+n)}{\varepsilon (n+1)-2}\). Consider an arbitrary deterministic kill-and-restart strategy \(\mathfrak {D}\) with the initially chosen family of probes \((t_i,j_i,\tau _i)_{i\in \mathcal {I}}\). Let \(Y_j(\theta ){:}{=}\sum _{i\in \mathcal {I}: t_i< \theta , j_{i}=j } \min \{\tau _i,\theta -t_i\}\) be the total probing time assigned by \(\mathfrak {D}\) to job j up to time \(\theta .\) We define an instance \(I{:}{=}(\varvec{p},\varvec{1})\) by distinguishing two cases on the first job \(j_0\) planned to be probed at or after time T. Note that such a job exists, as otherwise \(\mathfrak {D}\) does not complete all jobs if processing times are long enough.
If \(j_0\) is probed for a finite amount of time, we denote by \(t\ge T\) the end of its probing time. Then, define \(p_j:=1+Y_j(t)\) for all \(j\in [n]\). Clearly, no job finishes before t when \(\mathfrak {D}\) runs the instance I, hence \(\mathfrak {D}(I) \ge nt + \text {OPT}(I).\) On the other hand, it is well known that \(\text {OPT}(I) \le \frac{n+1}{2} \cdot \sum _{j=1}^n p_j\), which is the expected objective value when scheduling non-preemptively in a random order, thus, \(\text {OPT}\le \frac{n+1}{2} (t+n)\). Therefore, we have \( \frac{\mathfrak {D}(I)}{\text {OPT}(I)} \ge 1+\frac{2nt}{(t+n)(n+1)} \ge 1+\frac{2nT}{(T+n)(n+1)} =3-\varepsilon \).
If \(j_0\) is probed for \(\tau =\infty \), i.e., it is processed non-preemptively until its completion, then for each job \(j\ne j_0\) we set \(p_j {:}{=}1+Y_j(T)\). Denote by \(\text {OPT}'\) the optimal \(\text {SPT}\) cost for jobs \([n] \setminus \{ j_{0} \}\), and set \(p_{j_0}:= 10\cdot \text {OPT}'\). As \(j_0\) is the first job to complete in I, we clearly have \(\mathfrak {D}(I) \ge n\cdot p_{j_0}= 10n \cdot \text {OPT}'\). On the other hand, \(\text {OPT}\) processes \(j_0\) last, so \(\text {OPT}(I)=\text {OPT}' + \sum _{j\ne j_0} p_j + p_{j_0} \le (1+1+10) \cdot \text {OPT}'\). This implies \(\frac{\mathfrak {D}(I)}{\text {OPT}(I)}\ge \frac{10n}{12}\ge 3-\frac{2}{n+1}\), where the last inequality holds for all \(n\ge 3\). \(\square \)
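The algebra behind the choice of T can be checked numerically; the following snippet (with arbitrary admissible pairs \((n,\varepsilon )\)) confirms that the bound \(1+\frac{2nT}{(T+n)(n+1)}\) equals \(3-\varepsilon \).

```python
# Check that T is chosen so that 1 + 2nT/((T+n)(n+1)) = 3 - eps
# (sample pairs (n, eps) with eps in (2/(n+1), 1]).
def bound(n, eps):
    T = (2 - eps) * (n ** 2 + n) / (eps * (n + 1) - 2)
    return 1 + 2 * n * T / ((T + n) * (n + 1))

for n, eps in ((3, 1.0), (10, 0.5), (100, 0.1)):
    print(n, eps, bound(n, eps), 3 - eps)   # the two values coincide
```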
4 The b-scaling strategy
Let us now introduce the b-scaling algorithm \({\mathfrak {D}_{b}}\), which is the basis for most results in this paper. The idea of this algorithm is simple and quite natural: it proceeds by rounds \(q\in \mathbb {Z}\). In round \(q\), every non-completed job j is probed (once) for \(w_j b^q\) time units in some prescribed order, where \(b>1\) is a constant. To execute \({\mathfrak {D}_{b}}\), we can store for each job its rank at time t, i.e., the largest \(q\) such that it was probed for \(w_jb^{q-1}\) until t. At the end of each probe, \({\mathfrak {D}_{b}}\) probes the unfinished job j with minimum rank \(q\) (breaking ties by smallest index) for time \(w_jb^{q}\).
We also introduce a randomized variant of the algorithm. Randomization occurs in two places: First the jobs are reordered according to a random permutation \(\Sigma \) at the beginning of the algorithm. Second, we replace the probing time \(w_j b^q\) of the \(q\)th round with \(w_j b^{q+\Xi }\) for some random offset \(\Xi \in [0,1]\). Algorithm 1 gives the pseudocode of this strategy when it starts from round \(q_{0}\in \mathbb {Z}\), in which case it is denoted by \({\mathfrak {S}_{b}}^{\sigma ,\xi ,q_{0}}\). The kill-and-restart strategy \({\mathfrak {S}_{b}}^{\sigma ,\xi }\) studied in this paper can actually be seen as the limit of \({\mathfrak {S}_{b}}^{\sigma ,\xi ,q_{0}}\) when \(q_{0}\rightarrow -\infty \), and is described formally below. The deterministic b-scaling algorithm \({\mathfrak {D}_{b}}\) is obtained by setting \(\sigma ={\text {id}}\) (the identity permutation) and \(\xi =0\), while the randomized variant \({\mathfrak {R}_b}\) is obtained for a permutation \(\Sigma \) drawn uniformly at random from \(\mathcal {S}_n\) and a random uniform offset \(\Xi \sim \mathcal {U}([0,1])\), i.e., \({\mathfrak {D}_{b}}={\mathfrak {S}_{b}}^{{\text {id}},0}\) and \({\mathfrak {R}_b}={\mathfrak {S}_{b}}^{\Sigma ,\Xi }\).
As for \(\text {OPT}\), by slight abuse of notation, we denote by \({\mathfrak {D}_{b}}(I)\) and \({\mathfrak {R}_b}(I)\) the schedule for instance I computed by \({\mathfrak {D}_{b}}\) and \({\mathfrak {R}_b}\), respectively, as well as its cost. We drop the dependence on I whenever the instance is clear from context.
While \({\mathfrak {S}_{b}}^{\text {id},0,q_{0}}\) can easily be implemented, it is not possible to implement the limit strategy \({\mathfrak {D}_{b}}\), for example, on a Turing machine, since at a time arbitrarily close to 0 it has probed each job an infinite number of times.
The b-scaling algorithm starting from a fixed round \(q_0\) is fully described by Algorithm 1. In practice, if a lower bound on the processing times is known (e.g. \(p_j \ge 1\)), it is enough to consider this implementable algorithm. We study the more general strategy \({\mathfrak {D}_{b}}\) for \(q_0 \rightarrow - \infty \) that we formally define in Appendix 8.
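The following Python sketch simulates the implementable variant \({\mathfrak {S}_{b}}^{\text {id},0,q_0}\) for a finite starting round \(q_0\) on a toy instance. It is a simplified stand-in for Algorithm 1: the processing times are only consulted to decide when a probe completes a job, mimicking non-clairvoyance, and all names and data are illustrative.

```python
# Simulation of the b-scaling strategy started from a fixed round q0 (toy data).
def b_scaling_cost(p, w, b=3.0, q0=-20):
    unfinished = list(range(len(p)))
    t, cost, q = 0.0, 0.0, q0
    while unfinished:
        for j in list(unfinished):       # round q: probe each unfinished job once
            probe = w[j] * b ** q
            if p[j] <= probe:            # probe long enough: job j completes
                t += p[j]
                cost += w[j] * t
                unfinished.remove(j)
            else:                        # probe too short: kill, retry in a later round
                t += probe
        q += 1
    return cost

p = [5.0, 1.0, 9.0]                      # arbitrary example instance
w = [1.0, 1.0, 2.0]
opt, t = 0.0, 0.0
for j in sorted(range(len(p)), key=lambda j: p[j] / w[j]):
    t += p[j]; opt += w[j] * t           # WSPT optimum for comparison
print(b_scaling_cost(p, w, b=3.0), opt)
```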
4.1 Tight analysis of the deterministic b-scaling strategy
In this section, we compute tight bounds for the competitive ratio of \({\mathfrak {D}_{b}}\) for \(1\,||\,\sum w_jC_j\). For the analysis, we need some additional definitions. We denote by \(s_j{:}{=}\frac{p_j}{w_j}\) the Smith ratio of job \(j\in [n]\). Further, we define \(D_{jk} {:}{=}Y_j^{{\mathfrak {D}_{b}}}(C_k^{{\mathfrak {D}_{b}}})\) as the amount of time spent probing job j before the completion of job k. For all \(j,k\in [n]\) we define the weighted mutual delay \(\Delta _{jk}\) by \(\Delta _{jk}{:}{=}w_j D_{jj}\) if \(j=k\) and \(\Delta _{jk}{:}{=}w_k D_{jk} + w_j D_{kj}\) if \(j\ne k\). Thus, it holds that
\({\mathfrak {D}_{b}}(I)=\sum _{k=1}^n w_k C_k^{{\mathfrak {D}_{b}}}=\sum _{1\le j\le k\le n} \Delta _{jk}.\)
Lemmas 3 to 5 constitute preparations for Theorem 6, establishing the upper bound on the competitive ratio. Afterwards, the tightness is proven in Theorem 7. The first step towards the upper bound is to provide an overestimator of \(\Delta _{jk}\) that is piecewise linear in \((s_j, s_k)\).
Lemma 3
Define the function \(F:\{ (s,s')\in \mathbb {R}_{>0}^2 \mid s\le s' \}\rightarrow \mathbb {R}\) by
\(F(s,s') {:}{=}\begin{cases} \frac{2b^{\lfloor \log _b(s) \rfloor +1}}{b-1}+s' & \text {if } \lfloor \log _b(s) \rfloor = \lfloor \log _b(s') \rfloor ,\\ \frac{2b^{\lfloor \log _b(s) \rfloor +1}}{b-1}+b^{\lfloor \log _b(s) \rfloor +1}+s & \text {if } \lfloor \log _b(s) \rfloor < \lfloor \log _b(s') \rfloor . \end{cases}\)
Then F is non-decreasing in both arguments. Moreover, for all \(j,k\in [n]\) such that \(s_j\le s_k\), it holds that \(\Delta _{jk}\le w_j w_k\ F(s_j, s_k)\).
Proof
Let first \(s' > 0\) be fixed, and let \(r' {:}{=}\lfloor \log _b(s') \rfloor \). Then the function \(F(\cdot , s') :(0,s'] \rightarrow \mathbb {R}\) is obviously non-decreasing on \((0,b^{r'})\) and on \([b^{r'}, s']\). To see that it is also non-decreasing around the breakpoint \(b^{r'}\), we take the limit

Now let \(s > 0\) be fixed, and let \(r {:}{=}\lfloor \log _b(s) \rfloor \). Then the function \(F(s, \cdot ) :[s, \infty ) \rightarrow \mathbb {R}\) is clearly non-decreasing on \([s, b^{r+1})\) and on \([b^{r+1}, \infty )\). At the breakpoint we have

so that it is globally non-decreasing.
For all \(j\in [n]\), let \(q_j {:}{=}\lceil \log _b(s_j) \rceil \), so that \(b^{q_j-1} < s_j \le b^{q_j}\). We have \(D_{jj}=\sum _{i=-\infty }^{q_j-1} w_j b^i + p_j = w_j(\frac{b^{q_j}}{b-1}+s_j)\), so it holds \(\Delta _{jj} = w_j D_{jj} = w_j^2(\frac{b^{q_j}}{b-1}+s_j) \le w_j^2 F(s_j,s_j)\), where we have used the fact that \(q_j=\lceil \log _b(s_j) \rceil \le \lfloor \log _b(s_j) \rfloor +1\).
Now, let \(j\ne k\) such that \(s_j \le s_k\). We first assume that jobs j and k complete in the same round, i.e., \(\lceil \log _b(s_j)\rceil =\lceil \log _b(s_k)\rceil \). If job k is executed first in this round, then we have \(D_{jk}=\sum _{i=-\infty }^{q_j-1} w_j b^i = w_j\frac{b^{q_j}}{b-1}\) and \(D_{kj}=\sum _{i=-\infty }^{q_j-1} w_k b^i+p_k = w_k \frac{b^{q_j}}{b-1}+p_k\), which gives
\(\Delta _{jk} = w_j w_k\Bigl (\frac{2b^{q_j}}{b-1} + s_k\Bigr ) \le w_j w_k\Bigl (\frac{2b^{\lfloor \log _b(s_j)\rfloor +1}}{b-1} + s_k\Bigr ).\qquad (1)\)
Similarly, if job j is completed first, we have \(D_{jk}=w_j \frac{b^{q_j}}{b-1}+p_j\) and \(D_{kj}=w_k\frac{b^{q_j}}{b-1}\), so we obtain \(\Delta _{jk} = w_j w_k(\frac{2b^{q_j}}{b-1} + s_j) \le w_j w_k(\frac{2b^{\lfloor \log _b(s_j)\rfloor + 1}}{b-1} + s_k)\), i.e., the bound (1) is still valid. If \(\lfloor \log _b(s_j) \rfloor = \lfloor \log _b(s_k) \rfloor \), then the right-hand side equals \(w_j w_k F(s_j, s_k)\). Otherwise, \(s_k = b^{\lfloor \log _b(s_j) \rfloor + 1}\), so that

Now, assume that job k is completed in a later round than job j, i.e., \(\lceil \log _b(s_j)\rceil <\lceil \log _b(s_k)\rceil \). Then, \(D_{jk}=w_j \frac{b^{q_j}}{b-1}+p_j\) and \(D_{kj}\le w_k \frac{b^{q_j}}{b-1}+w_k b^{q_j}\), where the inequality is tight whenever job k is probed before job j in the round where j is completed. Thus,
\(\Delta _{jk} \le w_j w_k\Bigl (\frac{2b^{q_j}}{b-1} + b^{q_j} + s_j\Bigr ).\)
If \(\lfloor \log _b(s_j) \rfloor < \lfloor \log _b(s_k) \rfloor \), then the right-hand side can be bounded by \(w_j w_k F(s_j, s_k)\), using that \(q_j \le \lfloor \log _b(s_j) \rfloor + 1\). Otherwise, \(s_j = b^{q_j}< s_k < b^{q_j+1}\), so that

\(\square \)
Summing the bounds of the previous lemma yields
\({\mathfrak {D}_{b}}(\varvec{p},\varvec{w}) \le \sum _{1\le j\le k\le n} w_j w_k\, F\bigl (\min (s_j,s_k),\max (s_j,s_k)\bigr ) {=}{:}U(\varvec{p},\varvec{w}).\qquad (2)\)
We next prove a lemma showing that for bounding the ratio \(U/\text {OPT}\) we can restrict to instances in which all Smith ratios are integer powers of b.
Lemma 4
For any instance \((\varvec{p},\varvec{w})\), there exists another instance \((\varvec{p}',\varvec{w})\) with \(p_j'=w_j b^{q_j}\) for some \(q_j\in \mathbb {N}_0\), for all \(j\in [n]\), such that
\(\frac{U(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})}\le \frac{U(\varvec{p}',\varvec{w})}{\text {OPT}(\varvec{p}',\varvec{w})}.\)
Proof
Let \(\rho _{\min } = \min _{j \in [n]} \lfloor \log _b(s_j) \rfloor \). Then in the instance \((b^{-\rho _{\min }} \varvec{p}, \varvec{w})\) all jobs have Smith ratio \(\ge 1\), and it holds that \(U(b^{-\rho _{\min }} \varvec{p}, \varvec{w}) = b^{-\rho _{\min }} \cdot U(\varvec{p}, \varvec{w})\). Clearly, we also have \(\text {OPT}(b^{-\rho _{\min }} \varvec{p}, \varvec{w}) = b^{-\rho _{\min }} \cdot \text {OPT}(\varvec{p}, \varvec{w})\). Therefore, without loss of generality, we assume that \(s_j \ge 1\) for all \(j \in [n]\). Let \(S {:}{=}\bigl \{ \log _b(s_j) \mid j\in [n]\bigr \}{\setminus } \mathbb {Z}\). If \(S=\emptyset \), we are done. Otherwise, let \(q {:}{=}\min (S) > 0\) and \(I{:}{=}\{ j\in [n] \mid s_j = b^q \}\). We will either decrease the Smith ratio of each job \(j\in I\) to \(b^{\lfloor q \rfloor }\) or increase them to \(b^{q'}\), where \(q' {:}{=}\min \bigl (S \cup \{\lceil q \rceil \} {\setminus } \{q\}\bigr )\), so that the cardinality of S is decreased by 1, and repeat this operation until each Smith ratio is an integer power of b. For \(\delta \in \mathbb {R}\), define \(p_j(\delta ) {:}{=}p_j+ \delta \cdot w_j \mathbb {1}_I(j)\), so the Smith ratio of job j in the instance \((\varvec{p}(\delta ),\varvec{w})\) is \(s_j(\delta )=s_j + \delta \) if \(j\in I\) and \(s_j(\delta )=s_j\) otherwise. Let \(\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt}{:}{=}b^{\lfloor q \rfloor }-b^q<0\) and \(\bar{\delta }{:}{=}b^{q'}-b^q>0\). Since an optimal schedule follows the \(\text {WSPT}\) rule, it is easy to see that the function \(\delta \mapsto \text {OPT}(\varvec{p}(\delta ),\varvec{w})\) is linear in the interval \([\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt},\bar{\delta })\), as the order of the Smith ratios remains unchanged for all \(\delta \) in this interval. For the same reason, and because for all j, k the Smith ratios \(s_j(\delta ),s_k(\delta )\) remain in the same piece of the piecewise linear function \((s_j,s_k) \mapsto F(\min (s_j,s_k), \max (s_j,s_k))\) for all \(\delta \in [\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt},\bar{\delta })\), the function \(\delta \mapsto U(\varvec{p}(\delta ),\varvec{w})\) is also linear over \([\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt},\bar{\delta })\). As a result, the function
\(h :\delta \mapsto \frac{U(\varvec{p}(\delta ),\varvec{w})}{\text {OPT}(\varvec{p}(\delta ),\varvec{w})}\)
is a quotient of linear functions and thus monotone over \([\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt},\bar{\delta })\). Indeed, \(\text {OPT}(\varvec{p}(\delta ),\varvec{w})>0\) for all \(\delta \ge \underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt}\), so h has no pole in this interval. We can thus distinguish two cases: if the function h is non-increasing, we let \(\delta '=\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt}\), so we have \(h(\delta ')\ge h(0)\), which means that we can decrease the Smith ratio of each job \(j\in I\) to \(s_j(\delta ')=s_j+\underline{\delta }\hspace{-1.66656pt}\hspace{1.66656pt}=b^q+b^{\lfloor q\rfloor }-b^q=b^{\lfloor q\rfloor } \ge 1\) without decreasing the bound \(U/\text {OPT}\) on the competitive ratio. Otherwise, the function h is non-decreasing, hence \(h(0)\le \lim _{\delta \nearrow \bar{\delta }} h(\delta )\le h(\bar{\delta })\), where the last inequality comes from the fact that F is non-decreasing. So in this case we set \(\delta '=\bar{\delta }\) and we can increase the Smith ratio of each job \(j\in I\) to \(b^{q'}\) without decreasing the bound on the competitive ratio; if \(q'=\lceil q \rceil \), it means that we round up these Smith ratios to the next integer power of b, otherwise it is \(q'=\min (S\setminus \{q\})\), so we cluster together a larger group of jobs with a Smith ratio \(s_j(\bar{\delta })=b^{q'}\) that is not an integer power of b.
In all cases, the number of distinct non-integer values of \(\log _b(p_j(\delta ')/w_j)\) is decremented by one compared to the original instance, while the bound on the competitive ratio is only larger:
\(\frac{U(\varvec{p}(\delta '),\varvec{w})}{\text {OPT}(\varvec{p}(\delta '),\varvec{w})} \ge \frac{U(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})}.\)
Repeating this construction until all Smith ratios are integer powers of b yields the desired result. \(\square \)
The next lemma gives a handy upper bound for the competitive ratio of \({\mathfrak {D}_{b}}\) relying on the ratio of two quadratic forms. For \(L\in \mathbb {N}_0\) define the symmetric \(((L+1) \times (L+1))\)-matrices \(\varvec{A}_L {:}{=}\big ( \frac{1}{2} b^{\min (\ell ,m)} \mathbb {1}_{\{\ell \ne m\}}\big )_{0\le \ell ,m \le L}\) and \(\varvec{B}_L {:}{=}\big ( \frac{1}{2} b^{\min (\ell ,m)}\big )_{0\le \ell ,m \le L}\).
Lemma 5
For any instance \((\varvec{p},\varvec{w})\) there exists an integer L and a vector \(\varvec{x}\in \mathbb {R}^{\{0,\dotsc ,L\}}\) such that
\(\frac{{\mathfrak {D}_{b}}(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})} \le 1+\frac{2b}{b-1} + b\cdot \frac{\varvec{x}^\top \varvec{A}_L \varvec{x}}{\varvec{x}^\top \varvec{B}_L \varvec{x}}.\)
Proof
Consider an arbitrary instance \((\varvec{p},\varvec{w})\). It follows from (2) that \(\frac{{\mathfrak {D}_{b}}(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})}\le \frac{U(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})} \). By Lemma 4, we construct an instance \((\varvec{p}',\varvec{w})\) in which each Smith ratio is a non-negative integer power of b, and such that \(\frac{U(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})}\le \frac{U(\varvec{p}',\varvec{w})}{\text {OPT}(\varvec{p}',\varvec{w})} \) holds. In the remainder of this proof, we relabel the jobs so that \(\frac{p_1'}{w_1}\le \cdots \le \frac{p_n'}{w_n}\). We define \(L {:}{=}\max _{j \in [n]} \log _b(s'_j)\). For all \(\ell =0,\dotsc ,L\), we denote by \(J_\ell {:}{=}\{ j\in [n] \mid p_j'=w_jb^\ell \}\) the subset of jobs with Smith ratio equal to \(b^\ell \), so by construction we have \([n]=J_0\cup \cdots \cup J_L\). We also define \(x_\ell {:}{=}\sum _{j\in J_\ell } w_j\) and \(y_\ell {:}{=}\sum _{j\in J_\ell } w_j^2\), for all \(\ell =0,\dotsc ,L\).
We first get a handy expression for \(\text {OPT}(\varvec{p}',\varvec{w})\) relying on the vectors \(\varvec{x},\varvec{y}\in \mathbb {R}^{\{0,\dotsc ,L\}}\). By optimality of the \(\text {WSPT}\) rule,
\(\text {OPT}(\varvec{p}',\varvec{w}) = \varvec{x}^\top \varvec{B}_L \varvec{x} + \frac{1}{2}\sum _{\ell =0}^L b^\ell y_\ell .\qquad (3)\)
On the other hand, using the fact that \(F(b^\ell ,b^m){=} b^\ell F(1, b^{m-\ell }){=}b^\ell F(1,b^{\min (m-\ell ,1)})\) for all integers \(\ell \le m\), we obtain
Substituting \(F(1,1)=\frac{2b}{b-1}+1\) and \(F(1,b)=\frac{2b}{b-1}+b+1\), we get
\(\square \)
In order to determine an upper bound for the competitive ratio of \({\mathfrak {D}_{b}}\), we need to bound the last term in the expression from Lemma 5. The latter is the ratio of two quadratic forms, and an upper bound for this term can be derived by computing the maximum eigenvalue of the matrix \(\varvec{Z}_L {:}{=}\varvec{Y}_L^{-\top } \varvec{A}_L \varvec{Y}_L^{-1}\), where \(\varvec{B}_L = \varvec{Y}_L^\top \varvec{Y}_L\) is the Cholesky decomposition of the matrix \(\varvec{B}_L\). An explicit computation of the matrix \(\varvec{Z}_L\) reveals that it is a tridiagonal matrix whose principal submatrix—obtained by deleting the first row and first column—is a (tridiagonal) Toeplitz matrix that we refer to as \(\varvec{T}_{L}\). Finding an upper bound for the largest eigenvalue of \(\varvec{Z}_L\) is the main ingredient of the proof of Theorem 6, while the eigenvector corresponding to this eigenvalue can be used to construct instances that prove the tightness of the bound (Theorem 7).
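As a numerical sanity check of this reduction, the following Python snippet builds \(\varvec{A}_L\) and \(\varvec{B}_L\) for a moderate L, computes the largest generalized eigenvalue of the pair (which equals \(\lambda _{\max }(\varvec{Z}_L)\)), and compares it with the value \(\frac{2(\sqrt{b}-1)}{b-1}\) used in the proof of Theorem 6; the quantity \(1+\frac{2b}{b-1}+b\lambda \) is then compared with \(1+\frac{2b^{\nicefrac {3}{2}}}{b-1}\). The chosen values of b and L are arbitrary.

```python
# Numerical sanity check for the eigenvalue argument of Theorem 6
# (values of b and L chosen arbitrarily).
import numpy as np
from scipy.linalg import eigh

def matrices(b, L):
    idx = np.arange(L + 1)
    M = 0.5 * b ** np.minimum.outer(idx, idx)
    A = M * (1.0 - np.eye(L + 1))   # (1/2) b^{min(l,m)} off the diagonal, 0 on it
    B = M.copy()                    # (1/2) b^{min(l,m)} everywhere
    return A, B

b, L = 3.0, 25
A, B = matrices(b, L)
# max_x x^T A x / x^T B x equals the largest generalized eigenvalue of (A, B)
lam = eigh(A, B, eigvals_only=True).max()
print(lam, 2 * (np.sqrt(b) - 1) / (b - 1))                        # lam stays below the bound
print(1 + 2 * b / (b - 1) + b * lam, 1 + 2 * b ** 1.5 / (b - 1))  # resulting guarantee
```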
Theorem 6
For \(b>1\), \({\mathfrak {D}_{b}}\) is \(\bigl ( 1+\frac{2b^{\nicefrac {3}{2}}}{b-1} \bigr )\)-competitive for \(1\,||\,\sum w_jC_j\). This ratio is minimized for \(b=3\), yielding a performance guarantee of \(1+3\sqrt{3}\approx 6.196\).
Proof of Theorem 6
By Lemma 5 we have
\(\frac{{\mathfrak {D}_{b}}(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})} \le 1+\frac{2b}{b-1} + b\cdot \sup _{L\in \mathbb {N}}\, \sup _{\varvec{x}\in \mathbb {R}^{\{0,\dotsc ,L\}}\setminus \{\varvec{0}\}} \frac{\varvec{x}^\top \varvec{A}_L \varvec{x}}{\varvec{x}^\top \varvec{B}_L \varvec{x}}.\qquad (5)\)
As described above, for every \(L \in \mathbb {N}\),
\(\sup _{\varvec{x}\in \mathbb {R}^{\{0,\dotsc ,L\}}\setminus \{\varvec{0}\}} \frac{\varvec{x}^\top \varvec{A}_L \varvec{x}}{\varvec{x}^\top \varvec{B}_L \varvec{x}}\)
is the maximum eigenvalue of the matrix \(\varvec{Z}_L = \varvec{Y}_L^{-\top } \varvec{A}_L \varvec{Y}_L^{-1}\). For \(\alpha , \beta \in \mathbb {R}\) let \(\varvec{T}_L(\alpha , \beta ) \in \mathbb {R}^{L \times L}\) denote the symmetric tridiagonal Toeplitz matrix with \(\alpha \) on the main diagonal and \(\beta \) on both adjacent diagonals. The explicit representation of \(\varvec{Z}_L\) can be derived by applying Lemma 25 in the appendix with \(a_1 = \cdots = a_L = 1\). As many terms in the general form cancel out, this yields the tridiagonal matrix
We now want to show that \(\lambda _{\max }(\varvec{Z}_L) \le \frac{2 (\sqrt{b} - 1)}{b-1}\). This is equivalent to the matrix \(\varvec{H} {:}{=}\frac{2 (\sqrt{b} - 1)}{b-1} \varvec{I}_{L+1} - \varvec{Z}_{L}\) being positive semidefinite, where \(\varvec{I}_{L+1}\) denotes the identity matrix with indices \(\{0,\dotsc ,L\}\). We compute
This matrix has the form required in Lemma 27 with \(k=1\), \(\alpha = 2-\frac{2}{\sqrt{b}}\), and \({\varvec{v} = -\sqrt{1-\frac{1}{b}} \in \mathbb {R}^1}\). Since \(\alpha - \Vert \varvec{v} \Vert ^2 = 2 - \frac{2}{\sqrt{b}} - \bigl (1 - \frac{1}{b}\bigr ) = \frac{(\sqrt{b} - 1)^2}{b} \ge 0\), the lemma implies that \(\varvec{H}\) is positive semidefinite, so that \(\lambda _{\max }(\varvec{Z}_L) \le \frac{2(\sqrt{b} -1)}{b-1}\). Since this holds for every \(L \in \mathbb {N}\), we obtain with inequality (5)
\(\frac{{\mathfrak {D}_{b}}(\varvec{p},\varvec{w})}{\text {OPT}(\varvec{p},\varvec{w})} \le 1+\frac{2b}{b-1} + b\cdot \frac{2(\sqrt{b}-1)}{b-1} = 1+\frac{2b^{\nicefrac {3}{2}}}{b-1}.\)
The latter is minimized for \(b=3\) yielding the performance guarantee of \(1 + 3 \sqrt{3} \). \(\square \)
Next, we show that our analysis of \({\mathfrak {D}_{b}}\) is asymptotically tight.
Theorem 7
For every \(b > 1\), there exists a sequence of instances \((\varvec{p}_L)_{L \in \mathbb {N}}\) for \(1\,||\,\sum w_jC_j\) such that
\(\frac{{\mathfrak {D}_{b}}(\varvec{p}_L)}{\text {OPT}(\varvec{p}_L)} \xrightarrow {L\rightarrow \infty } 1+\frac{2b^{\nicefrac {3}{2}}}{b-1}.\)
Proof
For \(L \ge 1\) let \(\varvec{Y}_L\), and \( \varvec{Z}_{L}\) be the matrices defined above, and let \(\varvec{T}_L {:}{=}\varvec{T}_L(-\frac{2}{b-1}, \frac{\sqrt{b}}{b-1})\) be the principal submatrix of \(\varvec{Z}_L\) and \(\varvec{z}_L = (z^{(L)}_\ell )_{0 \le \ell \le L}\) with \(z^{(L)}_{\ell } {:}{=}\sqrt{\frac{2}{L+1}} \cdot \sin \bigl (\frac{\ell \pi }{L + 1}\bigr )\). By [51, Theorem 2.4], \(\tilde{\varvec{z}}_{L} {:}{=}(z^{(L)}_{\ell })_{1 \le \ell \le L} \) is the eigenvector of the matrix \(\varvec{T}_{L}\) corresponding to the largest eigenvalue \(\lambda _{\max } (\varvec{T}_{L}) = - \frac{2}{b-1} + 2\frac{\sqrt{b}}{b-1} \, \cos \bigl (\frac{\pi }{L+1} \bigr )\), and we have \(\Vert \varvec{z}_L \Vert ^2 = \Vert \tilde{\varvec{z}}_{L} \Vert ^2 = \frac{2}{L+1} \sum _{\ell =1}^{L} \sin ^2 \big (\frac{\ell \pi }{L+1}\big ) = 1\). Define \(\varvec{x}_L = (x_\ell ^{(L)})_{0 \le \ell \le L} {:}{=}\varvec{Y}_L^{-1} \varvec{z}_L\). By construction, it holds
The idea is to define for every \(L \in \mathbb {N}\) an instance \(\varvec{p}_L\) via a non-negative integer vector \(\varvec{n}_L \in \mathbb {N}_0^{L}\) that is similar to \(\varvec{x}_L\), which contains \(n_{\ell }\) jobs with processing time \(b^{\ell } + \varepsilon \) for all \(\ell \in [L]\) and for some \(\varepsilon >0\). There is an \(\ell ^* \in \mathbb {N}_{>0}\) such that \(x^{(L)}_{\ell } \ge 0\) holds for all \(L \ge \ell \ge \ell ^*\). This follows from Lemma 26 in the appendix because the matrix \(\varvec{Y}\) has exactly the required form, as shown in Lemma 24. Therefore, for \(L \ge \ell ^*\) the vector \(\varvec{n}_L = (n^{(L)}_{\ell })_{0 \le \ell \le L}\) with \(n_\ell ^{(L)} {:}{=}0\) for \(\ell < \ell ^*\) and \(n^{(L)}_\ell {:}{=}\lfloor b^L x^{(L)}_\ell \rfloor \) for \(\ell \ge \ell ^*\) is a non-negative integer vector, so that for every \(\varepsilon \ge 0\) the instance \(\varvec{p}_L(\varepsilon ) = (p_j^{(L)}(\varepsilon ))\) consisting of \(n_\ell ^{(L)}\) jobs with processing time \(b^\ell + \varepsilon \) for \(\ell = 0,\dotsc ,L\), ordered non-increasingly by processing times, is well-defined. Let \(n^{(L)} {:}{=}\sum _{\ell =0}^L n^{(L)}_\ell \) be the number of jobs in \(\varvec{p}_L(\varepsilon )\).
Let now \(\varepsilon > 0\). Clearly, we have
For every job j let \(q_j {:}{=}\lceil \log _b(p_j^{(L)}) \rceil \), i.e., \(q_j = \ell + 1\) for the \(n_\ell \) jobs with processing time \(b^\ell + \varepsilon \). Proceeding similarly as in the proof of Lemma 3, we obtain \(\Delta _{jj}=\frac{b^{q_j}}{b-1}+p_j^{(L)}=b^{q_j-1} (\frac{b}{b-1}+1) + \varepsilon \), and for \(j \ne k\) with \(q_j \le q_k\) we have \(\Delta _{jk}=\frac{2b^{q_j}}{b-1}+p_j^{(L)}=b^{q_j-1}(\frac{2b}{b-1}+1)+\varepsilon \) if \(q_j=q_k\) or \(\Delta _{jk}=\frac{2b^{q_j}}{b-1}+b^{q_j}+p_j^{(L)}=b^{q_j-1}(\frac{2b}{b-1}+b+1)+\varepsilon \) otherwise. This can be rewritten as
Summing over all pairs of jobs, we thus obtain
As the processing times of \(\varvec{p}_L(0)\) are integer powers of b, we can use Equations (4) and (3) with \(\varvec{y}=\varvec{x}=\varvec{n}_L\), resulting in
In the following we compute the limit for \(L \rightarrow \infty \) by computing the limits of the occurring terms separately. By Lemma 26, \(\sum _{\ell =\ell ^*}^L x_\ell ^{(L)} \xrightarrow {L\rightarrow \infty } 0\). Therefore, we have
For \(L \ge \ell ^*\) let \(\varvec{x}_1^{(L)} = (x_\ell ^{(L)})_{0 \le \ell \le \ell ^*-1}\), \(\varvec{x}^{(L)}_2 = (x_\ell ^{(L)})_{\ell ^* \le \ell \le L}\), \(\varvec{A}_{11}^{(L)} = (A_{\ell m}^{(L)})_{0 \le \ell , m \le \ell ^* - 1}\), \(\varvec{A}_{12}^{(L)} = (A_{\ell m}^{(L)})_{\begin{array}{c} 0 \le \ell \le \ell ^*-1\\ \ell ^* \le m \le L \end{array}}\), and \(\varvec{A}_{22}^{(L)} = (A_{\ell m}^{(L)})_{\ell ^* \le \ell ,m \le L}\), so that
and let \(\varvec{n}_2^{(L)} = (n_\ell ^{(L)})_{\ell ^* \le \ell \le L}\). With the definition of \(\varvec{n}_L\), we compute
For the first summand we have
and for the second summand, by Lemma 26, we have
Finally, the third summand satisfies
Similarly, we have
As \(\varvec{x}_L^\top \varvec{B}_L \varvec{x}_L = \Vert \varvec{z}_L \Vert = 1 \ne 0\) for all L, we have thus shown that
By Lemma 28 there is a sequence of problem instances for which the competitive ratio converges to the right-hand side. \(\square \)
4.2 Tight analysis of the randomized b-scaling strategy
We now consider \({\mathfrak {R}_b}= {\mathfrak {S}_{b}}^{\Sigma , \Xi }\), where \(\Sigma \) is a permutation drawn uniformly at random from \(\mathcal {S}_n\) and \(\Xi \) is uniformly distributed on the interval [0, 1].
As in the analysis of the deterministic algorithm, we start with a lemma giving an overestimator of \(\mathbb {E}[\Delta _{jk}]\) for jobs j and k such that \(s_j\le s_k\). This time, our overestimator is not piecewise linear in \(s_j\) and \(s_k\) anymore. Instead, it depends on a concave function applied to the ratio \(\frac{s_k}{s_j}\ge 1\).
Lemma 8
Let \(f :[1,b] \rightarrow \mathbb {R}\) be defined by
\(f(\alpha ) {:}{=}\frac{1}{2} \Bigl (1+\log _b(\alpha ) + \alpha \bigl (1-\log _b(\alpha )\bigr ) + \frac{\alpha +3}{\ln b}\Bigr ).\)
Then f is positive and increasing, and for all \(j\ne k\) such that \(s_j\le s_k\) it holds
\(\mathbb {E}[\Delta _{jk}] \le w_j w_k\, s_j\, f\Bigl (\min \Bigl \{\frac{s_k}{s_j},\, b\Bigr \}\Bigr ).\)
Proof
By straightforward calculus, we obtain
\(f'(\alpha ) = \frac{1}{2}\Bigl (\frac{1}{\alpha \ln b} + 1 - \log _b(\alpha )\Bigr ) \quad \text {and}\quad f''(\alpha ) = -\frac{1+\alpha }{2\alpha ^2 \ln b}.\)
Since \(f''(\alpha ) < 0\), the function \(\alpha \mapsto f'(\alpha )\) is decreasing over [1, b]. Hence, for all \(\alpha \in [1,b]\) we have \(f'(\alpha )\ge f'(b)=\frac{1}{2b\ln b} > 0\), which proves that f is increasing. Therefore, \(f(\alpha ) \ge f(1) = 1 + \frac{2}{\ln b} > 0\), concluding the proof of the first part of the lemma.
Let now \(j, k \in [n]\), \(j \ne k\) with \(s_j \le s_k\) be fixed, and let \(\ell \in \mathbb {Z}\) and \(u \in (0,1]\) be such that \(s_j=b^{\ell +u}\). Moreover, let \(q_j(\Xi ) {:}{=}\lceil \log _b(s_j) - \Xi \rceil = \ell + \mathbb {1}_{\{\Xi < u\}}\) be such that j completes in the round, where jobs are probed for \(w_j b^{q_j(\Xi ) + \Xi }\). We thus have
Therefore, we get
Now, we fix a realization \(\xi \) of \(\Xi \) and compute \(\mathbb {E}[\Delta _{jk}\vert \Xi =\xi ]\) for another job k such that \(s_j\le s_k\). If \(q_j(\xi )=q_k(\xi )\), then \(\mathbb {E}[D_{jk}\vert \Xi =\xi ]=w_j \frac{b^{q_j(\xi )+\xi }}{b-1}+\frac{1}{2}\cdot p_j\) and \(\mathbb {E}[D_{kj}\vert \Xi =\xi ]=w_k \frac{b^{q_j(\xi )+\xi }}{b-1}+\frac{1}{2}\cdot p_k\), where the factors \(\frac{1}{2}\) in front of \(p_j\) and \(p_k\) come from the fact that job j is completed with probability \(\frac{1}{2}\) before job k due to the random permutation of the jobs. Otherwise, it is \(q_j(\xi )<q_k(\xi )\) and we have \(\mathbb {E}[D_{jk}\vert \Xi =\xi ]=w_j \frac{b^{q_j(\xi )+\xi }}{b-1}+p_j\), \(\mathbb {E}[D_{kj}\vert \Xi =\xi ]=w_k \frac{b^{q_j(\xi )+\xi }}{b-1}+w_k \frac{b^{q_j(\xi )+\xi }}{2}\). Putting all together,
Let us first consider the case \(s_k\ge b \cdot s_j\), as in this case \(q_j(\xi )<q_k(\xi )\) for all \(\xi \in (0,1)\). Thus,
\(\mathbb {E}[\Delta _{jk}] = w_j w_k\, s_j \Bigl (1+\frac{b+3}{2\ln b}\Bigr ) = w_j w_k\, s_j\, f(b).\)
It remains to handle the case \(s_j \le s_k < b \cdot s_j\), in which it can occur that jobs j and k are completed in the same round. Let \(\delta \in [0,1)\) such that \(s_k=s_j\cdot b^{\delta }=b^{\ell +u+\delta }\). To compute \(\mathbb {E}[\Delta _{jk}]\), we have to distinguish between the cases \(u+\delta \le 1\) and \(u+\delta > 1\). We only handle the former case, as the latter can be handled similarly and yields the same formula, so in the remainder of this proof we assume \(u+\delta \le 1\). Then, it holds \(q_k(\xi )=\ell +\mathbb {1}_{\{\xi <u+\delta \}}\), so that \((q_j(\xi ),q_k(\xi ))=(\ell +1,\ell +1)\) if \(\xi \in [0,u)\), \((q_j(\xi ),q_k(\xi ))=(\ell ,\ell +1)\) if \(\xi \in [u,u+\delta )\), and \((q_j(\xi ),q_k(\xi ))=(\ell ,\ell )\) if \(\xi \in [u+\delta ,1]\). Thus, we can write
\(\square \)
The expressions of \(\mathbb {E}[\Delta _{jk}]\) derived in the previous lemma show that \({\mathfrak {R}_b}(\alpha \varvec{p},\varvec{w})=\alpha {\mathfrak {R}_b}(\varvec{p},\varvec{w})\) holds for all \(\alpha >0\). Since the same trivially holds for the optimal solution, i.e., \(\text {OPT}(\alpha \varvec{p},\varvec{w})=\alpha \text {OPT}(\varvec{p},\varvec{w})\) we can assume without loss of generality that the instance has been rescaled, so that \(\min _{j\in [n]} s_j=1\). Moreover, we relabel the jobs so that \(s_1 \le \cdots \le s_n\). Then, summing the bounds from the previous lemma yields
In order to obtain an upper bound, we use a similar technique as in Sect. 4.1. However, the proof is more involved because the bound from Lemma 8 is not piecewise linear in \(s_j\) and \(s_k\), so we cannot construct a worse instance \((\varvec{p}',\varvec{w})\) in which all Smith ratios are integer powers of b. Instead, we are going to subdivide the Smith ratios in intervals of the form \([b^{i/K},b^{(i+1)/K})\) for some integer K, and we will get a bound by grouping all jobs in an interval. As in Theorem 6, this bound involves a ratio of two quadratic forms (Lemma 9), but this time the maximization of this fraction amounts to finding the maximum eigenvalue of a banded Toeplitz matrix of bandwidth \(2K-1\).
Let \(K \in \mathbb {N}_{>0}\) and \(\beta {:}{=}b^{\frac{1}{K}}\). For \(L \in \mathbb {N}\) define the symmetric matrices \(\varvec{A}_L {:}{=}(\frac{1}{2} a_{\vert m-\ell \vert } \beta ^{\min (\ell ,m)})_{0 \le \ell , m \le L}\), where \(a_i {:}{=}f(\beta ^{\min (K,i+1)}) - f(\beta )\) for \(i \in \{0,\dotsc ,L\}\), and let \(\varvec{B}_L {:}{=}(\frac{1}{2} \beta ^{\min (\ell ,m)})_{0 \le \ell , m \le L}\).
Lemma 9
For any instance \((\varvec{p}, \varvec{w})\) there exists \(L \in \mathbb {N}\) and a vector \(\varvec{x} \in \mathbb {R}^{\{0,\dotsc ,L\}}\) such that
Proof
Let L be a multiple of K! such that \(s_j \le \beta ^L = b^{L/K}\) for all \(j \in [n]\). For all \(\ell \in \{0,\dotsc ,L\}\) we define \(J_\ell {:}{=}\{j \in [n] \mid \beta ^{\ell }\le s_j < \beta ^{\ell +1}\}\), \(x_\ell {:}{=}\sum _{j\in J_\ell } w_j\), and \(y_\ell {:}{=}\sum _{j\in J_\ell } w_j^2\). Then \([n] = J_0 \cup \cdots \cup J_L\). We obtain as a lower bound on the optimal cost
On the other hand, using (8), we compute

Consequently,
\(\square \)
We next prove the main result of this section.
Theorem 10
For every \(b>1\), \({\mathfrak {R}_b}\) is \(\frac{2b+\sqrt{b}-1}{\sqrt{b} \ln b}\)-competitive for \(1\,||\,\sum w_jC_j\). This ratio is minimized for \(b\approx 8.16\), yielding a performance guarantee smaller than 3.032.
Proof
By Lemma 9 we have for every \(K \in \mathbb {N}_{>1}\) and \(\beta = b^{\frac{1}{K}}\) that
For now let \(K \in \mathbb {N}_{>1}\) be fixed. Similarly to the proof of Theorem 6, Lemma 25 yields that the inner supremum is
for the matrix

where \(\varvec{u}\in \mathbb {R}^{K-1}\) has coordinates \(u_i=\frac{f(\beta ^{i+1})-f(\beta ^i)}{\sqrt{\beta ^i-\beta ^{i-1}}}\), \(i=1,\ldots ,K-1\), and \(\varvec{T}_L\) is the \(L\times L\) banded symmetric Toeplitz matrix of bandwidth \(2(K-1)+1\) with elements
on its kth and \(-k\)th superdiagonals (where for \(k\in \mathbb {Z}\), the kth superdiagonal is the set of coordinates \((\varvec{T}_L)_{\ell ,m}\) such that \(m-\ell =k\); in particular, the 0th superdiagonal corresponds to the main diagonal). Substitution of \(f(\beta ^k)\) with its value \(\frac{1}{2} \bigl (1+\frac{k}{K} + \beta ^k \bigl (1 - \frac{k}{K}\bigr ) + \frac{\beta ^k+3}{\ln b}\bigr )\) yields the following simplified expression for \(t_k\):
We next show that \(\lambda _{\max }(\varvec{Z}_L)\le t_0 + 2\sum _{k=1}^{K-1} t_k\). To this end, we form the matrix
and prove that this matrix is positive semidefinite. For every \(i \in [K-1]\) the sum \(\sum _{k=i}^{K-1} \frac{t_k}{\beta ^{k/2}}\) is a telescoping sum, which sums up to \(\frac{f(\beta ^{i+1}) - f(\beta ^i)}{\beta ^i - \beta ^{i-1}} = \frac{u_i}{\sqrt{\beta ^i-\beta ^{i-1}}}\). In particular, \(\sum _{k=1}^{K-1} \frac{t_k}{\beta ^{k/2}} = \frac{f(\beta ^2)-f(\beta )}{\beta -1}\), so that \(t_0 = -2 \sum _{k=1}^{K-1} \frac{t_k}{\beta ^{k/2}}\). Therefore, we can rewrite \(\varvec{H}\) as a linear combination of \(t_1,\ldots ,t_{K-1}\):

\(\varvec{v}_k\in \mathbb {R}^k\) is a vector with coordinates \((\varvec{v}_k)_i= -\frac{\sqrt{\beta -1}}{\beta ^{(k-i+1)/2}}\), (\(i=1,\ldots ,k\)), and \(\varvec{T}_{k,L}\) is the sparse symmetric Toeplitz matrix of size \(L\times L\) whose only non-zero elements are 2 on the main diagonal and \(-1\) on the kth and \(-k\)th superdiagonals, i.e., \({(\varvec{T}_{k,L})_{ij}= 2\cdot \mathbb {1}_{\{i=j\}} - \mathbb {1}_{\{\vert i-j\vert =k\}}}\). To show that \(\varvec{H}\) is positive semidefinite, it suffices to show that \(t_k\ge 0\) and \(\varvec{H}_k\) is positive semidefinite for all \(k\in [K-1]\).
Let \(k\in [K-1]\). If \(k=K-1\), then \(t_{k}\ge 0\) follows from (10) and the fact that f is non-decreasing over [1, b]. Otherwise, this inequality follows from Equation (11). Next, we use Lemma 27 to show that \(\varvec{H}_k\) is positive semidefinite. This is possible because \(k\mid L\), as L is a multiple of K!, and
This concludes the proof that all \(\varvec{H}_k\) are positive semidefinite, hence \(\lambda _{\max }(\varvec{Z}_L)\le t_0+2\sum _{k=1}^{K-1} t_k\). Together, we have shown that
for every \(K \in \mathbb {N}\).
In the final part of the proof we show that the right-hand side converges to \(\frac{2b + \sqrt{b}- 1}{\sqrt{b} \ln b}\) for \(K \rightarrow \infty \). To this end, let us now compute the sum
Moreover, we have \(t_0\xrightarrow {K\rightarrow \infty }-(1+\frac{1}{\ln b})\) and \(t_{K-1}=\frac{1}{2} b^{-\frac{K-1}{2K}}\bigl (\frac{b}{\ln b}-\frac{b-b^{\frac{1}{K}}}{K(b^{\frac{1}{K}}-1)}\bigr ) \xrightarrow {K\rightarrow \infty } \frac{1}{2\sqrt{b}}(\frac{b}{\ln b}-\frac{b-1}{\ln b})=\frac{1}{2\sqrt{b}\ln b}\). By using \(\beta \xrightarrow {K\rightarrow \infty } 1\) and \(f(\beta )\xrightarrow {K\rightarrow \infty } f(1)=1+\frac{2}{\ln b}\), we obtain the final bound by taking the limit when \(K\rightarrow \infty \):
A numerical minimization yields an optimal value of \(b \approx 8.16\) with a performance guarantee smaller than 3.032. \(\square \)
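This final minimization can be reproduced with a few lines of code (a sketch assuming NumPy/SciPy, not part of the proof):

```python
import numpy as np
from scipy.optimize import minimize_scalar

ratio = lambda b: (2 * b + np.sqrt(b) - 1) / (np.sqrt(b) * np.log(b))
res = minimize_scalar(ratio, bounds=(1.001, 100.0), method="bounded")
print(res.x, res.fun)   # roughly b = 8.16 and a guarantee of about 3.0316 < 3.032
```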
We next show that our analysis is tight.
Theorem 11
For all \(b>1\), there exists a sequence of instances \((\varvec{p}_L)_{L\in \mathbb {N}}\) for \(1\,|\,|\,\sum C_j\) such that
Proof
For \(K \in \mathbb {N}\) let \(\beta = b^{1/K}\), and for \(L \in \mathbb {N}\) let \(\varvec{A}_L' {:}{=}(\frac{1}{2} a_{\vert m-\ell \vert }' \beta ^{\min (\ell ,m)})_{0 \le \ell , m \le L}\), where \(a_i' {:}{=}f(\beta ^{\min (K,i)}) - f(1)\) for \(i \in \{0,\dotsc ,L\}\). Moreover, let \(\varvec{B}_L = \varvec{Y}_L^\top \varvec{Y}_L\) be the Cholesky decomposition of \(\varvec{B}_L\). By Lemma 25, we have

for some \(\varvec{u}'\in \mathbb {R}^{K}\), where \(\varvec{T}'_L\) is the \(L\times L\) banded symmetric Toeplitz matrix of bandwidth \(2K+1\) with elements
on the kth and \(-k\)th superdiagonals. As in the proof of Theorem 7 for the deterministic version of the strategy, define \(\varvec{z}_L = (z_\ell ^{(L)})_{0 \le \ell \le L}\) with \(z_{\ell }^{(L)} {:}{=}\sqrt{\frac{2}{L+1}} \cdot \sin \big ( \frac{\ell \pi }{L+1} \big )\) for \(\ell =0,\dotsc ,L\), and \(\varvec{x}_L {:}{=}\varvec{Y}_L^{-1} \varvec{z}_L \in \mathbb {R}^{\{0,\dotsc ,L\}}\). By construction,
where \(\tilde{\varvec{z}}{:}{=}[z_1,z_2,\ldots ,z_{L}]^\top \in \mathbb {R}^L\) and we have used the fact that \(z_0=0\) for the last equality. Furthermore,
Unlike in the proof of Theorem 7, however, it is no longer true that \(\tilde{\varvec{z}}\) is an eigenvector corresponding to the largest eigenvalue of \(\varvec{T}'_L\), because \(\varvec{T}'_L\) is not tridiagonal.
Lemmas 24 and 26 imply that there is an \(\ell ^* \in \mathbb {N}_{>0}\) such that \(x_\ell ^{(L)} \ge 0\) for all \(L \ge \ell \ge \ell ^*\). Therefore, for \(L \ge \ell ^*\) the vector \(\varvec{n}_L = (n^{(L)}_{\ell })_{\ell =0}^L\) with \(n_\ell ^{(L)} {:}{=}0\) for \(\ell < \ell ^*\) and \(n^{(L)}_\ell {:}{=}\lfloor \beta ^L x^{(L)}_\ell \rfloor \) for \(\ell \ge \ell ^*\) is a non-negative integer vector, defining the instance \(\varvec{p}_L\) that consists of \(n_\ell ^{(L)}\) jobs with processing time \(\beta ^\ell \) for \(\ell = 0,\dotsc ,L\).
For such an instance, we have
and
Then, proceeding similarly as in the proof of Lemma 5, we obtain
Therefore,
As in the proof of Theorem 7, we compute the limits of the occurring terms for \(L \rightarrow \infty \).
where the convergence follows from Lemma 26. Thus, we see that the last summands of numerator and denominator go towards zero. Next, using that for \(\ell \ne m\) the absolute value of the entry of \(\varvec{A}_L'\) indexed by \(\ell \) and m is bounded by \(\frac{1}{2} f(b) \beta ^{\min (\ell , m)}\), exactly the same calculation as in the proof of Theorem 7 shows that
and, analogously,
We are going to show that \(\tilde{\varvec{z}}_L^\top \varvec{T}'_L \tilde{\varvec{z}}_L\) converges to \(t_0'+2\sum _{k=1}^K t_k'\) as the dimension L grows to \(\infty \). For this, define the function \(\Phi :\theta \mapsto t_0' + 2\sum _{k=1}^K t_k'\cos (k\theta )\), which is the Fourier series associated with the Toeplitz matrix \(\varvec{T}'_L\). We claim that for all \(\ell \in [L]\), it holds that
To see this, we first extend the definition of \(\tilde{z}_{\ell }^{(L)} = \sqrt{\frac{2}{L+1}} \sin \big ( \frac{\ell \pi }{L+1} \big )\) to all \(\ell \in \{1-K,\ldots ,L+K\}\), and we observe that for all \(\ell \in [L]\) and \(k\in [K]\) we have
Then, we use the fact that \(\tilde{z}_\ell \le 0\) for all \(\ell \in \{1-K,\ldots ,0\} \cup \{L+1,\ldots ,L+K\}\), so we have
Consequently, using the fact that \(\tilde{z}_\ell ^{(L)}\) is positive for all \(\ell \in [L]\) we obtain
We have thus constructed for every \(K \in \mathbb {N}\) a sequence \(\varvec{p}_L\) for which the competitive ratio converges to \(t_0' + \sum _{k=1}^K t_k'\). Further, similar calculations as in the proof of Theorem 10 show that \(t_0'+2\sum _{k=1}^K t_k'\xrightarrow {K\rightarrow \infty } -1-\frac{1}{\ln b}+\frac{2b-1}{\sqrt{b}\ln b}\). By Lemma 28 there is a sequence of problem instances for which the competitive ratio of \({\mathfrak {R}_b}\) converges to the desired value of
\(\square \)
5 Weighted shortest elapsed time first
In this section we consider the online time model, where each job j arrives at its release date \(r_j\) and is not known before that time. Thus, an instance for our problem is now given by a triple \(I = (\varvec{p}, \varvec{w}, \varvec{r})\) of processing times, weights, and release dates of all jobs. We consider the classical Weighted Shortest Elapsed Time First (\(\text {WSETF}\)) rule for this model. Intuitively, \(\text {WSETF}\) is the limit for \(\varepsilon \rightarrow 0\) of the algorithm that divides the time into time slices of length \(\varepsilon \) and in each time slice processes a job with minimum ratio of elapsed processing time over weight. To formalize this limit process we allow fractional schedules \(\text {S}\) that, at every point in time t, assign each job j a rate \(y_j^{\text {S}}(t) \in [0,1]\) such that \(\sum _{j=1}^n y_j^{\text {S}}(t) \le 1\) for all \(t \in \mathbb {R}_{\ge 0}\) and \(y_j^{\text {S}}(t) = 0\) if \(t < r_j\) or \(t > C_j^{\text {S}}(I)\), where \(C_j^{\text {S}}(I)\) is the smallest t such that \(Y_j^{\text {S}}(I, t) {:}{=}\int _0^t y_j^{\text {S}}(s)\,\text {d} s \ge p_j\) (this requires \(y_j^{\text {S}}\) to be measurable). At any time t let J(t) be the set of all released and unfinished jobs, and let A(t) be the set of all jobs from J(t) that currently have minimum ratio of elapsed time over weight. Then \(\text {WSETF}\) sets the rate for all jobs \(j \in J(t)\) to

In other words, \(\text {WSETF}\) always distributes the available processor rate among the jobs in J(t) so as to maximize \(\min _{j \in J(t)} Y_j^{\text {WSETF}}(I, t)/w_j\). An example is given in Fig. 1.
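The rate-based definition can be approximated via the time-slicing intuition mentioned above. The following sketch (an illustrative, hypothetical helper, not the paper's formalization) discretizes time into slices of length \(\varepsilon \) and, in each slice, runs one released, unfinished job with minimum ratio of elapsed time over weight; as \(\varepsilon \rightarrow 0\) the resulting completion times approach those of \(\text {WSETF}\).

```python
def wsetf_completion_times(p, w, r, eps=1e-3):
    """Discretized sketch of WSETF: in each slice of length eps, run a job with
    minimum elapsed-time-to-weight ratio among released, unfinished jobs."""
    n = len(p)
    elapsed = [0.0] * n
    C = [None] * n
    t = 0.0
    while any(c is None for c in C):
        active = [j for j in range(n) if C[j] is None and r[j] <= t]
        if not active:
            t = min(r[j] for j in range(n) if C[j] is None)   # idle until next release
            continue
        j = min(active, key=lambda j: (elapsed[j] / w[j], j))
        run = min(eps, p[j] - elapsed[j])
        elapsed[j] += run
        t += run
        if elapsed[j] >= p[j] - 1e-12:
            C[j] = t
    return C

# toy instance; completion times approximate the fractional WSETF schedule
print(wsetf_completion_times(p=[2.0, 1.0, 3.0], w=[1.0, 2.0, 1.0], r=[0.0, 0.0, 1.0]))
```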
The following theorem gives the tight competitive ratio of \(\text {WSETF}\) for non-clairvoyant online scheduling on a single machine.
Theorem 12
\(\text {WSETF}\) is 2-competitive for \(1\,|\,r_j,\,\text {pmtn}\,|\,\sum w_j C_j\).
We start by collecting some simple properties of the schedule created by \(\text {WSETF}\).
Lemma 13
Consider an instance \(I = (\varvec{p}, \varvec{w}, \varvec{r})\), and let j, k be two jobs with \(r_k < C_j^{\text {WSETF}}(I)\) and \(p_k/w_k \le p_j/w_j\). Then \(C_k^{\text {WSETF}}(I) \le C_j^{\text {WSETF}}(I)\).
Proof
Suppose that \(C_j^{\text {WSETF}}(I) < C_k^{\text {WSETF}}(I)\). The job j must be processed at a positive rate during some interval \((t, C_j^{\text {WSETF}}(I))\) with \(t < C_j^{\text {WSETF}}(I)\) by the \(\text {WSETF}\) schedule, meaning that \(j \in A(C_j^{\text {WSETF}}(I))\). Hence, we get the contradiction
where the last inequality holds because k completes after \(C_j^{\text {WSETF}}(I)\). \(\square \)
The lemma implies that in an instance with trivial release dates, for which \(\text {WSETF}\) coincides with the Weighted Round-Robin algorithm, analyzed by Kim and Chwa [6], the jobs j are completed in the order of their Smith ratios \(p_j/w_j\). In this case the weighted delay of each job in the \(\text {WSETF}\) schedule compared to the optimal \(\text {WSPT}\) schedule is exactly its processing time multiplied with the total weight of jobs with larger index.
Lemma 14
Let \(I_0 = (\varvec{p},\varvec{w},\varvec{0})\) be an instance with trivial release dates and \(p_1/w_1 \le \cdots \le p_n/w_n\). For every job \(j \in [n]\) we have

Proof
This will be shown by induction on n. Clearly, the statement is true if there is only a single job. So in the following let \(n > 1\). We have
so the statement holds for the first job. In order to show the statement for all other jobs, we consider the problem instance \(I_0'\) with job set \(J' {:}{=}\{2,\dotsc ,n\}\). For every \(j \in J'\) it holds that \(C_j^{\text {WSPT}}(I_0) = p_1 + C_j^{\text {WSPT}}(I'_0)\). In the \(\text {WSETF}\) schedule for \(I_0\) every \(j \in J'\) is processed at a rate of \(w_j/\sum _{k=1}^n w_k\) until time \(C_1^{\text {WSETF}}(I_0)\), while in the \(\text {WSETF}\) schedule for \(I_0'\) it is first processed at a rate of \(w_j/\sum _{k=2}^n w_k\). Since
every job j has received the same amount of processing in the \(\text {WSETF}\) schedule for \(I_0\) at time \(C_1^{\text {WSETF}}(I_0)\) as in the \(\text {WSETF}\) schedule for \(I_0'\) at time \(C_1^{\text {WSETF}}(I_0) - p_1\). Thus, for any time \(t > C_1^{\text {WSETF}}(I_0)\) the \(\text {WSETF}\) schedule for \(I_0\) at time t coincides with the \(\text {WSETF}\) schedule for \(I_0'\) at time \(t-p_1\). Therefore, \(C_j^{\text {WSETF}}(I_0) = p_1 + C_j^{\text {WSETF}}(I_0')\) for all jobs \(j \in J'\). Putting things together, we obtain for all \(j\in J'\)
where we applied the induction hypothesis to the instance \(I_0'\) with \(n-1\) jobs. \(\square \)
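The identity of Lemma 14 is easy to check numerically for small instances. In the sketch below (illustrative code, assuming jobs are already indexed in WSPT order), the weighted delay \(w_j (C_j^{\text {WSETF}}(I_0) - C_j^{\text {WSPT}}(I_0))\) is compared with \(p_j \sum _{k>j} w_k\), the quantity described in the paragraph after Lemma 13.

```python
def wspt_completion_times(p, w):
    # jobs are assumed to be indexed in WSPT order: p[0]/w[0] <= ... <= p[-1]/w[-1]
    C, t = [], 0.0
    for pj in p:
        t += pj
        C.append(t)
    return C

def wrr_completion_times(p, w):
    # with trivial release dates, WSETF is weighted round-robin: all remaining
    # jobs run simultaneously at rates proportional to their weights
    n = len(p)
    rem, C, t = list(p), [None] * n, 0.0
    alive = set(range(n))
    while alive:
        W = sum(w[j] for j in alive)
        dt = min(rem[j] * W / w[j] for j in alive)   # time until next completion
        for j in list(alive):
            rem[j] -= dt * w[j] / W
            if rem[j] <= 1e-9:
                C[j] = t + dt
                alive.remove(j)
        t += dt
    return C

p, w = [1.0, 4.0, 6.0], [2.0, 3.0, 1.0]              # Smith ratios 0.5 <= 4/3 <= 6
wspt, wrr = wspt_completion_times(p, w), wrr_completion_times(p, w)
for j in range(len(p)):
    print(w[j] * (wrr[j] - wspt[j]), p[j] * sum(w[j + 1:]))   # the two sides agree
```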
To bound the optimum objective value from below, we consider the mean busy times
of the jobs j in an arbitrary schedule \(\text {S}\). Since the mean busy time of each job is smaller than its completion time, the sum of weighted mean busy times is a lower bound on the sum of weighted completion times. It is well known [52, 53] that the former is minimized by the Preemptive WSPT (\(\text {PWSPT}\)) rule, which always processes the available job with smallest index (i.e. an available job with smallest Smith ratio \(p_j/w_j\)). Thus, the sum of weighted mean busy times in the \(\text {PWSPT}\) schedule is a lower bound on \(\text {OPT}(I)\), and it suffices to show that
The \(\text {PWSPT}\) rule is illustrated in Fig. 1. Note that the inequality holds with equality for instances \(I_0\) with trivial release dates because for such instances we have \(C_j^{\text {WSPT}} = M_j^{\text {PWSPT}} + p_j/2\), so that, by Lemma 14,
for every \(j \in [n]\). By summing over all jobs j we deduce
This implies the 2-competitiveness of the Weighted Round-Robin algorithm, proved by Kim and Chwa. In the remainder of this section this argument will be generalized to jobs released over time. We start by reducing the instance to a simpler case without changing the values to be compared.
Lemma 15
For every instance \(I = (\varvec{p}, \varvec{w}, \varvec{r})\) there is an instance \(I' = (\varvec{p}', \varvec{w}', \varvec{r}')\) consisting of \(n'\) jobs such that no job is preempted in the \(\text {PWSPT}\) schedule for \(I'\), \(\sum _{j=1}^{n'} w_j' C_j^{\text {WSETF}}(I') = \sum _{j=1}^n w_j C_j^{\text {WSETF}}(I)\), and \(\sum _{j=1}^{n'} w_j' M_j^{\text {PWSPT}}(I') = \sum _{j=1}^n w_j M_j^{\text {PWSPT}}(I)\).
Proof
We split every job into subjobs corresponding to the parts processed without interruption in the \(\text {PWSPT}\) schedule, i.e., we replace each job j by jobs \((j,1),\dotsc ,(j,\ell _j)\) such that \(\sum _{i=1}^{\ell _j} p_{(j,i)}' = p_j\). Moreover, we set the weights to \(w_{(j,i)} {:}{=}\frac{p_{(j,i)}'}{p_j} \cdot w_j\), so that all parts have the same Smith ratio as the original job. Finally, the release dates are set to \(r_{(j,i)} {:}{=}r_j\). This operation does not change the sum of weighted mean busy times in the \(\text {PWSPT}\) schedule. In the \(\text {WSETF}\) schedule, since all these jobs are released simultaneously and have the same Smith ratio, they are always processed in such a way that their weighted elapsed times increase equally, so that they are all completed at the same time. Moreover, the total rate assigned to the jobs (j, i) for a fixed j equals the rate assigned to j in the original schedule. Therefore, all these jobs finish exactly at the time when job j is completed in the original schedule. \(\square \)
From now on we always consider an instance \(I = (\varvec{p}, \varvec{w}, \varvec{r})\) such that no job is preempted in the \(\text {PWSPT}\) schedule and \(p_1/w_1 \le \dotsc \le p_n/w_n\). Since the \(\text {PWSPT}\) and the \(\text {WSETF}\) strategies both fully utilize the machine whenever some jobs are available, the resulting schedules have the same idle intervals. By splitting into multiple independent subinstances, we can additionally assume that I has no idle intervals. We omit the instance in the notation for completion and elapsed times. For every fixed job j let \(\rho _j\) be the first point in time such that during \((\rho _j,C_j^{\text {WSETF}}]\) the machine continuously processes jobs k with \(Y_k^{\text {WSETF}}(C_j^{\text {WSETF}})/w_k \le p_j/w_j\), and let R(j) be the set of jobs processed in this interval. The times \(\rho _j\) are shown in the example in Fig. 1, and the sets R(j) consist exactly of the jobs released within \((\rho _j, C_j^{\text {WSETF}})\) because if a job \(k \in R(j)\) were released before \(\rho _j\), the machine would only process jobs l with \(Y_l^{\text {WSETF}}(\rho _j)/w_l \le Y_k^{\text {WSETF}}(\rho _j)/w_k \le p_j/w_j\) between \(r_k\) and \(\rho _j\), in contradiction to the minimality of \(\rho _j\). Let I(j) be the instance with job set R(j) where all jobs are released at time 0. In the next lemma we compare the completion time of j in the \(\text {WSETF}\) schedule for the original instance I to the corresponding completion time in the instance I(j).
Lemma 16
\(C_j^{\text {WSETF}} = \rho _j + C_j^{\text {WSETF}}(I(j))\) for every job j.
Proof
In the \(\text {WSETF}\) schedule for I every job \(k \in R(j)\) that is completed between the times \(\rho _j\) and \(C_j^{\text {WSETF}}(I)\) has \(p_k/w_k = Y_k^{\text {WSETF}}(C_j^{\text {WSETF}})/w_k \le p_j/w_j\). Hence, by Lemma 13, it also finishes before j in the \(\text {WSETF}\) schedule for I(j). All other jobs \(k \in R(j)\) have received processing time \(\frac{w_k}{w_j} \cdot p_j\) before the completion of j in both schedules. Therefore, the total processing that jobs from R(j) receive in the \(\text {WSETF}\) schedule for I between the times \(\rho _j\) and \(C_j^{\text {WSETF}}\) equals their total processing before the completion of j in the \(\text {WSETF}\) schedule for I(j). Since in both schedules the machine is continuously processing jobs from R(j) in the considered time, this implies the lemma. \(\square \)
Analogously to Lemma 16, we compare in the next lemma the preemptive WSPT schedules for I and I(j).
Lemma 17
For every job j
Proof
By Lemma 13, every job \(k \le j\) released before time \(C_j^{\text {WSETF}}\) has \(C_k^{\text {WSETF}} \le C_j^{\text {WSETF}}\). \(\text {PWSPT}\) always schedules some job \(k \le j\) at a rate of 1 if one is available. Thus, up to every point in time, it has spent at least as much time on these jobs as \(\text {WSETF}\). Hence, at time \(C_j^{\text {WSETF}}\), \(\text {PWSPT}\) must have finished all these jobs as well. In particular, it has completed j. Therefore, we have \(C_j^{\text {PWSPT}} \le C_j^{\text {WSETF}}\) for every job j.
Every job \(k \in R(j)\) completed before \(C_j^{\text {PWSPT}}\) in the \(\text {PWSPT}\) schedule must be completely processed during \((\rho _j, C_j^{\text {PWSPT}}]\). Therefore,
\(\square \)
Now we are ready to prove the theorem.
Proof of Theorem 12
Using the fact that no job is preempted in the \(\text {PWSPT}\) schedule (by the reduction from Lemma 15), we have \(C_j^{\text {PWSPT}} = M_j^{\text {PWSPT}} + \frac{p_j}{2}\) for all j, and thus
We will now bound the summands from the last sum by an expression generalizing (⁎) from Lemma 14. For a fixed \(j \in [n]\) Lemmas 16 and 17 yield
Since I(j) is an instance with trivial release dates, by Lemma 14,
where the first inequality holds because \(w_j p_k \le w_k p_j\) for \(k < j\) and \(w_j p_k \ge w_k p_j\) for \(k > j\). Now we sum over all jobs j, obtaining
Substituting this inequality into (18), we can generalize the computation in Equation (17):
\(\square \)
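The inequality established in this proof, namely that the weighted sum of \(\text {WSETF}\) completion times is at most twice the weighted sum of \(\text {PWSPT}\) mean busy times, can also be checked empirically with a discretized simulation (an illustrative, hypothetical script; the \(\varepsilon \)-discretization introduces small numerical errors):

```python
import random

def simulate(p, w, r, rule, eps=1e-3):
    """Time-sliced simulation: 'rule' picks the job to run in each slice among
    released, unfinished jobs.  Returns completion times and mean busy times."""
    n = len(p)
    elapsed, C, busy = [0.0] * n, [None] * n, [0.0] * n
    t = 0.0
    while any(c is None for c in C):
        active = [j for j in range(n) if C[j] is None and r[j] <= t]
        if not active:
            t = min(r[j] for j in range(n) if C[j] is None)
            continue
        j = rule(active, elapsed)
        run = min(eps, p[j] - elapsed[j])
        busy[j] += (t + run / 2) * run     # contribution to the mean busy time integral
        elapsed[j] += run
        t += run
        if elapsed[j] >= p[j] - 1e-12:
            C[j] = t
    return C, [busy[j] / p[j] for j in range(n)]

random.seed(1)
n = 6
p = [random.uniform(0.5, 3.0) for _ in range(n)]
w = [random.uniform(0.5, 2.0) for _ in range(n)]
r = [random.uniform(0.0, 4.0) for _ in range(n)]

wsetf_rule = lambda active, elapsed: min(active, key=lambda j: (elapsed[j] / w[j], j))
pwspt_rule = lambda active, elapsed: min(active, key=lambda j: (p[j] / w[j], j))

C_wsetf, _ = simulate(p, w, r, wsetf_rule)
_, M_pwspt = simulate(p, w, r, pwspt_rule)
lhs = sum(w[j] * C_wsetf[j] for j in range(n))
rhs = sum(w[j] * M_pwspt[j] for j in range(n))
print(lhs, 2 * rhs, lhs <= 2 * rhs + 1e-6)     # empirical check of the factor-2 bound
```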
6 Extensions of the b-scaling strategy for release dates and parallel machines
In this section, we present extensions of the b-scaling strategies for the settings \(1\,|\,r_j\,|\,\sum w_j C_j\) and \(P\,|\,|\,\sum C_j\). We denote by \((\varvec{p},\varvec{w},\varvec{r},m)\) an instance on m identical parallel machines in which each job j has processing time \(p_j\), weight \(w_j\), and release date \(r_j\). In order to obtain bounds on the competitive ratios of these extensions, we compare the schedules of \({\mathfrak {D}_{b}}\) to schedules of algorithms that are constant-competitive. In particular, we use the 2-competitiveness of \(\text {WSETF}\) from the previous section for \(1\,|\,r_j,\,\text {pmtn}\,|\,\sum w_j C_j\) and the 2-competitiveness of the round-robin (\(\text {RR}\)) schedule for \(P\,|\,\text {pmtn}\,|\,\sum C_j\). We begin with the relation of the optimal costs of two instances whose processing times and release dates differ only by a multiplicative factor.
Lemma 18
Consider an instance \(I=(\varvec{p},\varvec{w},\varvec{r},m)\), and let \(I'=(\varvec{p}',\varvec{w},\varvec{r}',m)\), where \(\varvec{p}'\le \alpha \varvec{p}\) and \(\varvec{r}'\le \alpha \varvec{r}\). Then, we have
Proof
Let \(S_j^\Pi (I)\) denote the starting time of job j in schedule \(\Pi \) for instance I. We define a schedule \(\Pi \) with \(S_j^\Pi (I' )=\alpha \cdot C_j^{\text {OPT}}(I)- p_j'\). By definition \(C_j^\Pi (I')=\alpha \cdot C_j^{\text {OPT}}(I)\). Clearly, \(\Pi (I')=\alpha \cdot \text {OPT}(I)\). We claim that \(\Pi \) is feasible for \(I'\). Indeed, we have
Suppose now that two jobs j, k overlap in \(\Pi \), i.e., \(S_j^\Pi (I') < C_k^\Pi (I') \le C_j^\Pi (I')\). By definition of \(\Pi \), this means that \(\alpha \cdot C_j^{\text {OPT}}(I) - p_j' < \alpha \cdot C_k^{\text {OPT}}(I) \le \alpha \cdot C_j^{\text {OPT}}(I)\). Therefore, \(S_j^{\text {OPT}}(I) \le S_j^{\text {OPT}}(I) + p_j - \frac{p_j'}{\alpha } < C_k^{\text {OPT}}(I) \le C_j^{\text {OPT}}(I)\), meaning that j and k also overlap in the optimal schedule for I. Since the optimal schedule for I always schedules at most m jobs in parallel, this is also the case for \(\Pi \). Hence, \(\Pi \) is a feasible schedule for \(I'\) and \(\text {OPT}(I')\le \Pi (I')=\alpha \cdot \text {OPT}(I)\). \(\square \)
This lemma will turn out to be useful for both investigated settings.
6.1 Release dates
Let us first consider the single machine case where jobs arrive online. For instances of \(1\,|\,r_j\,|\,\sum w_j C_j\), we extend \({\mathfrak {D}_{b}}\) in the following way: The strategy keeps track of the rank \(q_j(\theta )\) of each job j at time \(\theta \), where \(q_j(\theta )\) is the largest integer q such that job j has already been probed for \(w_j b^{q-1}\) before \(\theta \). At any end of a probe occurring at time \(\theta \) it selects the job j with minimum rank and index among all released and not completed jobs and probes it for \(w_j b^{q_j(\theta )}\). We only consider the limit strategy obtained when the rank of a job is set to \(q_0\rightarrow -\infty \) at its release, so that phases of infinitesimal probing occur after each release date. Note that the strategy never interrupts a probe, i.e., each probe \((t_i, j_i, \tau _i)\) is executed until time \(t_i + \min \{p_{j_i}, \tau _i\}\). In order to show an upper bound on the competitive ratio of \({\mathfrak {D}_{b}}\), we actually compare \(\mathfrak {D}_{b}\) against an optimal preemptive offline algorithm. This means that we obtain an upper bound on the “power of preemption” in the online setting with restarts, complementing the unbounded ratio in the model without restarts.
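To make this description concrete, the following event-driven sketch simulates the extended strategy (hedged: it uses a finite starting rank \(q_0\) instead of the limit \(q_0 \rightarrow -\infty \), so the infinitesimal probing phases after releases are only approximated; the instance is the one from the caption of Fig. 2).

```python
def b_scaling_with_release_dates(p, w, r, b=2.0, q0=-20):
    """Event-driven sketch of the kill-and-restart b-scaling strategy with
    online release dates.  A finite starting rank q0 replaces the infinitesimal
    probing limit; probes are never interrupted."""
    n = len(p)
    rank = [q0] * n                 # next probe of j lasts w[j] * b**rank[j]
    done = [False] * n
    C = [None] * n
    t = 0.0
    while not all(done):
        avail = [j for j in range(n) if not done[j] and r[j] <= t]
        if not avail:
            t = min(r[j] for j in range(n) if not done[j])   # idle until next release
            continue
        j = min(avail, key=lambda j: (rank[j], j))           # minimum rank, then index
        tau = w[j] * b ** rank[j]
        if p[j] <= tau:             # probe succeeds: job runs to completion
            t += p[j]
            done[j] = True
            C[j] = t
        else:                       # probe fails: job is killed and restarted later
            t += tau
            rank[j] += 1
    return C

print(b_scaling_with_release_dates(p=[8, 9, 9, 3, 2, 2], w=[1] * 6,
                                   r=[0, 0, 18, 18, 30, 93]))
```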
Theorem 19
\({\mathfrak {D}_{b}}\) is \(\frac{2b^4}{2b^2-3b+1}\)-competitive for \(1\,|\,r_j\,|\,\sum w_j C_j\). This ratio is minimized for \(b=\frac{9+\sqrt{17}}{8}\), yielding a performance guarantee of \(\frac{107+51\sqrt{17}}{32} \approx 9.915\).
To prove this result, we need a bound on the end time of a probe with respect to the point in time at which the probe began.
Lemma 20
For \(I = (\varvec{p}, \varvec{w}, \varvec{r}, m)\) denote by S the schedule produced by \({\mathfrak {D}_{b}}\). If some job j is probed at time t for \(\tau = w_j b^{q}\) for some \(q\in \mathbb {Z}\), then \(t + \tau \le b t\).
Proof
At the start of the probe, \({\mathfrak {D}_{b}}\) has spent \(\sum _{i=-\infty }^{q-1} w_j b^{i} = w_j \frac{b^{q}}{b-1}\) time probing the job j. Since probes of the same job cannot run in parallel on multiple machines, we have that \(t \ge w_j \frac{b^{q}}{b-1}\). Thus, \(t + \tau = t + w_j b^{q} \le t + (b-1)\,t = b\,t\).
\(\square \)
The proof of Theorem 19 consists of two main steps that are carried out in the subsequent two lemmas. In the first step, we compare an arbitrary instance I to an instance \(I'\) with processing times such that the Smith ratios are rounded to integer powers of b and release dates that are shifted to end times of probes. In the second step, we compare the performance of \({\mathfrak {D}_{b}}\) on the instance \(I'\) to the performance of \(\text {WSETF}\) on another instance \(I''\). Figure 2 illustrates these two auxiliary instances for an example.
Fig. 2: An example for the three schedules considered in the proof of Theorem 19 for the instance I with \(\varvec{p} = (8,9,9,3,2,2)^{\top }\), \(\varvec{r} = (0,0,18,18,30,93)^{\top }\), and unit weights \(\varvec{w} = \varvec{1}\), for \(b = 2\). Gray areas indicate infinitesimal probing; thick lines indicate the completion of a job. Top: The schedule of \(\mathfrak {D}_2 (I)\) for the original instance I. Middle: The schedule of \(\mathfrak {D}_2 (I')\) for the modified instance \(I'\) with processing times rounded to the next integer power of \(b=2\). The release dates \(r_3'\) and \(r_4'\) are shifted such that they coincide with the end of a probe. Bottom: The schedule of \(\text {WSETF}\) for the instance \(I''\). The processing times (corresponding to the colored areas) correspond to the total probing times in the schedule of \(\mathfrak {D}_2 (I')\). The completion times in this schedule are greater than or equal to the completion times in the schedule of \(\mathfrak {D}_2 (I')\), as indicated by the vertical dashed lines.
Lemma 21
For an arbitrary instance \(I = (\varvec{p}, \varvec{w}, \varvec{r})\) of there exists another instance \(I' = (\varvec{p}', \varvec{w}, \varvec{r}')\) with \(\varvec{p}'\le \frac{b^3}{2b-1} \varvec{p}\) and \(\varvec{r}'\le \frac{b^3}{2b-1} \varvec{r}\) such that \({\mathfrak {D}_{b}}(I)\le {\mathfrak {D}_{b}}(I')\). Moreover, the instance \(I'\) has the property that for every job j there exists an integer \(q_j\in \mathbb {Z}\) with \(p_j'=w_j b^{q_j}\) and every release date either coincides with the end of a probe in the schedule \({\mathfrak {D}_{b}}(I')\) or is some point in time at which \({\mathfrak {D}_{b}}\) idles in \(I'\).
Proof
Let I be an arbitrary instance and consider the schedule S produced by \({\mathfrak {D}_{b}}\) for I. For a fixed job j with \(r_j>0\), we denote by \(\pi (j) = (t(j), k(j), \tau (j))\) the last probe that is started by \({\mathfrak {D}_{b}}\) before the release date of j, hence, \(\pi (j)\) satisfies \( t (j) < r_j \).
For every job j, let \(p_j' {:}{=}w_j b^{q_j}\),
where \(q_j {:}{=}\big \lceil \log _b \big ( \frac{p_j}{w_j} \big )\big \rceil \). Observe that \(p_j \le p_j' \le b p_j \le \frac{b^3}{2b-1} p_j\).
We define a new schedule \(\bar{S}\) as follows. For every probe \((t, k, \tau )\) in the schedule S, there is a corresponding probe \((t', k', \tau ')\) with \(k'=k\), \(\tau ' = \tau \) and \(t' \ge t\) in the schedule \(\bar{S}\), where the times \(t'\) are chosen such that no additional idle time exists in \(\bar{S}\). In particular, in the schedule \(\bar{S}\) the same jobs are probed for the same times in the same order as in the schedule S. However, the actual duration of any probe \((t', k', \tau ')\) in \(\bar{S}\) depends on the processing time \(p_k'\) rather than \(p_k\), i.e., the duration is \(\min \{p_k', \tau \}\). In particular, all probes have the same duration except for those in which a job completes. These probes may last longer and might shift all subsequent probes. We define new release dates \(\varvec{r}'\) as follows. For every job j with \(r_j > 0\), consider the probe \(\pi '(j) = (t'(j), k'(j), \tau '(j))\) in the schedule \(\bar{S}\) that corresponds to the probe \(\pi (j)\), and set \(r_j' {:}{=}\max \{ r_j, t' (j) + \tau '(j) \}\). For all jobs j with \(r_j = 0\), we set \(r_j' = 0\). Overall, we define a new instance
Then, the schedule produced by \({\mathfrak {D}_{b}}\) for \(I'\) is exactly \(\bar{S}\) by construction. In particular, we have \({\mathfrak {D}_{b}}(I) \le {\mathfrak {D}_{b}}(I')\).
We now argue that \(\varvec{r}' \le \frac{b^3}{2b-1} \varvec{r}\). To this end, consider a fixed job j. If \(r'_j=r_j\), there is nothing to show. Hence, let \(r'_j=t'+\tau '\). We compare the total time devoted to each job k before the start time t of the probe \(\pi (j)\) in \({\mathfrak {D}_{b}}(I)\) with the total time devoted to k before the start time \(t'\) of the corresponding probe \(\pi '(j)\) in \({\mathfrak {D}_{b}}(I')\). If k is not completed before t in \({\mathfrak {D}_{b}}(I)\), the total probing time of k before t in \({\mathfrak {D}_{b}}(I)\) is the same as the total probing time of k before \(t'\) in \({\mathfrak {D}_{b}}(I')\). If k is completed before t in \({\mathfrak {D}_{b}}(I)\), the total time devoted to job k is \(w_k\sum _{i=-\infty }^{q_k - 1} b^i + p_k = w_k \big ( \frac{b^{q_k}}{b-1} + \frac{p_k}{w_k}\big )\). On the other hand, in \({\mathfrak {D}_{b}}(I')\) the total time devoted to k is \(w_k\sum _{i=-\infty }^{q_k} b^i = w_k\frac{b^{q_k + 1}}{b-1}\). The ratio between these two elapsed times is
Since this holds for every job k, we have \(t' \le \frac{b^2}{2b-1} t\).
Using Lemma 20 we obtain
\(\square \)
For every time \(\theta > 0\) and job j, we denote by
the rank of job j at time \(\theta \) in \({\mathfrak {D}_{b}}(I')\), where \((t, j, w_j b^{q-1})\) describes the probe of job j starting at time t for a probing time of \(w_j b^{q-1}\) and \(I'\) is defined as in Lemma 21. In other words, \(q_j(\theta )=q\) holds if and only if job j has already been probed for \(w_j b^{q-1}\) before time \(\theta \), and the next probe of job j will be a probing for \(w_j b^q\) if not completed. Note that the function \(q_j (\theta )\) is piecewise constant and increases by 1 whenever a probe has ended. Observe that \(q_j (\infty )\) is exactly the value \(q_j\) defined in the proof of Lemma 21. Further, we denote by
the minimal rank among all jobs that are released and not completed at time \(\theta \). If there is no such job at time \(\theta \), we set \(q(\theta ) = \infty \). Note that the function \(q(\theta )\) decreases whenever new jobs are released and increases when the minimum rank of all released and not completed jobs increases. The definition is made so that q is left-continuous. We also define
Lemma 22
For an instance \(I' = (\varvec{p}', \varvec{w}, \varvec{r}')\) with the properties described in Lemma 21 the instance \(I'' = (\varvec{p}'', \varvec{w}, \varvec{r}')\) with \(\varvec{p}'' {:}{=}\frac{b}{b-1} \varvec{p}'\) satisfies the inequality \({\mathfrak {D}_{b}}(I') \le \text {WSETF}(I'')\).
Proof
Observe that \(p_j'' = \frac{b}{b-1}\, p_j' = \sum _{\hat{q}=-\infty }^{q_j-1} w_j b^{\hat{q}} + w_j b^{q_j}\),
i.e., \(p''_j\) is obtained by adding all failing times of j to the processing time \(p'_j\). We show for all jobs j that
Let \(0 = r^{(0)}< \cdots < r^{(n')}\) be all distinct release dates, and for every \(i \in \{0,1,\dotsc ,n'\}\) and \(q\in \mathbb {Z}\) let \(J^{(i)} {:}{=}\{j \in [n] \mid r_j = r^{(i)}\}\) and
be the end time of round \(q\) of jobs released at \(r^{(i)}\), i.e., at time \(e^{(i)}(q)\) every job \(j \in J^{(i)}\) not completed at an earlier round has been probed for \(\sum _{\widehat{q}= -\infty }^qw_jb^{\widehat{q}}=w_j\frac{b^{q+1}}{b-1}\) in total in \({\mathfrak {D}_{b}}(I')\).
In the following we will show that for all \(q\in \{q_{\min }- 1, q_{\min }, \dotsc ,q_{\max }\}\) and for all \(i \in \{0,\dotsc ,n'\}\) every job \(j\in J^{(i)}\) has the same elapsed time at \(e^{(i)}(q)\) in the schedule computed by \({\mathfrak {D}_{b}}\) for \(I'\) and the schedule computed by \(\text {WSETF}\) for \(I''\), i.e., \(Y^{{\mathfrak {D}_{b}}}_j(I', e^{(i)}(q)) = Y^{\text {WSETF}}_j(I'', e^{(i)}(q))\). This will be done by induction on \(q\). Figure 3 illustrates the end times of rounds \(e^{(i)}(q)\) as well as the value \(q_{\min }\). For the sake of simplicity, we write for the remainder of the proof \(Y^{{\mathfrak {D}_{b}}}_j (t) = Y^{{\mathfrak {D}_{b}}}_j (I',t)\) and \( Y^{\text {WSETF}}_j (t) = Y^{\text {WSETF}}_j (I'',t)\). We start with the base case \(q= q_{\min }-1\), i.e., we show that for all \(i\in \{0,\ldots ,n'\}\) and for all \(j\in J^{(i)}\) we have

Fig. 3: Illustration of the situation in the proof of Lemma 22. The release dates \(r^{(i)}\) and the end points of the rounds in \({\mathfrak {D}_{b}}\) subdivide the time axis into intervals. The shaded intervals do not contain any successful probings. The value \(q_{\min }= 5\) is chosen such that the end of the round \(e^{(i)}(q_{\min }- 1)\) is before the next release date and such that between \(r^{(i)}\) and \(e^{(i)}(q_{\min }- 1)\) no job completes. Equation (19) refers to the endpoints \(e^{(i)}(q_{\min }- 1)\), while Equation (23) refers to all subsequent endpoints of rounds. Note that some endpoints (e.g., \(e^{(2)}(6)\) or \(e^{(2)} (7)\)) lie after one or more subsequent release dates.
For every i, the definition of \(e^{(i)}\) and the left-continuity of q imply that \(q(t) < q_{\min }\) for all \(t \in (r^{(i)}, e^{(i)}(q_{\min }-1)]\). Therefore, no job is released or completed by \({\mathfrak {D}_{b}}\) during \((r^{(i)},e^{(i)}(q_{\min }-1)]\). Hence,
Assume for a contradiction that Equation (19) is wrong for some i and j, and let i be the minimum index for which the equation is violated for some job j. Clearly, at time \(r^{(i)}\) jobs from \(J^{(i)}\) have neither been processed in \(\text {WSETF}\) nor in \({\mathfrak {D}_{b}}\), while all earlier released jobs with \(r_j < r^{(i)}\) have either been completed in both \(\text {WSETF}\) and \({\mathfrak {D}_{b}}\) or have in both schedules an elapsed time of at least \(w_j \frac{b^{q_{\min }}}{b-1}\), using (20) and the minimality assumption. Thus, at time \(r^{(i)}\), both schedules start by processing only jobs from \(J^{(i)}\). For \({\mathfrak {D}_{b}}\) we know that these are contiguously processed at least until time \(e^{(i)}(q_{\min }-1)\). Since no job from \(J^{(i)}\) is completed before \(e^{(i)}(q_{\min }- 1)\), we know that \(p_j' \ge w_j b^{q_{\min }}\), and thus \(p_j'' \ge w_j \frac{b^{q_{\min }+1}}{b-1}\) for all \(j \in J^{(i)}\). Moreover, for any job \(j\in J^{(i)}\) we have
Therefore,
Until the first completion time \(\text {WSETF}\) processes each job \(j \in J^{(i)}\) with a rate of \(w_j/\sum _{k \in J^{(i)}} w_k\). Suppose the first completing job j is completed at time \( t < e^{(i)}(q_{\min }-1)\). Then it has elapsed time
a contradiction since j would have elapsed time strictly less than its processing time. Therefore, we have \(C_j^{\text {WSETF}}(I'') \ge e^{(i)}(q_{\min }-1)\) for all \(j \in J^{(i)}\), and the elapsed times of these jobs j at this time can be computed as
This contradicts the assumption that Equation (19) is wrong for i.
Now we show an equation analogous to Equation (19) for arbitrary \(q \ge q_{\min }\), i.e., we show that for all \(q \in \{q_{\min },\dotsc ,q_{\max }\}\), for all \(i \in \{0,\dotsc ,n'\}\), and for all \(j \in J^{(i)}\) we have
Moreover, we will show that if \(q_j \ge q\), then this quantity is equal to \(w_j\frac{b^{q+1}}{b-1}\), and if \(q_j = q\), then \(C_j^{\text {WSETF}}(I'')=e^{(i)}(q)\), where we are using the notation from Lemma 21, so that \(p_j' = w_j b^{q_j}\) for every \(j \in [n]\), and thus \(p_j'' = w_j\frac{b^{q_j+1}}{b-1}\).
As announced, this is shown by induction on q. For \(q = q_{\min }-1\), the statements were shown in Equation (19) because every job has \(q_j > q_{\min }- 1\). Hence, in the inductive proof we can assume for \(q \ge q_{\min }\) that the claim is true for all \(q'\) with \(q_{\min }- 1 \le q' < q\). For all \(i \in \{0,\dotsc ,n'\}\), all jobs \(j \in J^{(i)}\) with \(q_j < q\) are completed in \({\mathfrak {D}_{b}}\) before time \(e^{(i)}(q_j)\). The induction hypothesis implies that \(Y_j^{{\mathfrak {D}_{b}}}(e^{(i)}(q_j)) = Y_j^{\text {WSETF}}(e^{(i)}(q_j)) = w_j \frac{b^{q_j + 1}}{b-1} = p_j''\). Hence, also in the \(\text {WSETF}\) schedule these jobs are completed by time \(e^{(i)}(q_j) \le e^{(i)}(q-1)\). In the following we thus restrict attention to jobs with \(q_j \ge q\).
Assume for a contradiction that the claim is wrong for q, and let i be the smallest index for which the statement fails. Let \(e {:}{=}e^{(i)}(q)\), and let \(\mathcal I(e) {:}{=}\bigl \{i' \in \{0,\dotsc ,n'\} \mid e^{(i')}(q) = e\bigr \} = \{\underline{i},\underline{i}+1,\dotsc ,\overline{i}\}\). Let
All jobs \(j \in J^{(i')}\) with \(i' < \underline{i} \le i\) that are not yet completed have \(e^{(i')}(q) < r^{(\underline{i})}\), and hence, as i was chosen to be minimal, an elapsed time of at least \(w_j\frac{b^{q+1}}{b-1}\) in the schedules constructed by both \({\mathfrak {D}_{b}}\) and \(\text {WSETF}\). Therefore, none of these jobs are probed in the interval \([r^{(\underline{i})}, e]\) by \({\mathfrak {D}_{b}}\). We know by the property of \(I'\) that release dates are only at ends of probes, which implies that during the union of intervals
each job \(j \in J'\) is probed for \(w_j b^q\) by \({\mathfrak {D}_{b}}\). Observe that in this formula it is possible that the left bound of an interval is not smaller than the right bound, in which case the interval is empty. Therefore, using the property of \(I'\) that the duration of each probe is exactly the probing time itself, we have
where \((\cdot )^+\) denotes the positive part. We next argue that in the \(\text {WSETF}\) schedule, for all \(t \in \mathcal T \setminus \{e\}\) every job \(j \in J'\) has elapsed time \(Y_j^{\text {WSETF}}(t) < w_j \frac{b^{q+1}}{b-1}\). Since all \(q_j \ge q\) for \(j \in J'\), this, along with the lower bound for the elapsed processing times of previously released jobs shown above, implies that also \(\text {WSETF}\) is not running any jobs released before \(r^{(\underline{i})}\) during this time, and, in view of the construction of the instance \(I''\), that no job from \(J'\) is completed within this time. To show the claimed statement, consider the first moment in time \(t \in \mathcal T\) at which some job \(j \in J'\) has elapsed processing time \(Y_j^{\text {WSETF}}(t) \ge w_j \frac{b^{q+1}}{b-1}\). Let \(i^* \in \mathcal I(e)\) be the index such that \(t \in (e^{(i^*)}(q-1), r^{(i^*+1)})\). Since \(\text {WSETF}\) processes j directly before t, the job j must have minimum weighted elapsed time among all released and unfinished jobs, so in particular among all jobs \(k \in \bigcup _{i'=\underline{i}}^{i^*} J^{(i')}\) with \(q_{k} \ge q\), i.e.,
Furthermore, no job \(k \in J' \cap J^{(i')}\) for \(i' \in \mathcal I(e)\) is processed by \(\text {WSETF}\) during an interval \((r^{(i'')}, e^{(i''+1)}(q-1)]\) for some \(i'' > i'\), so that the time spent on k during \(\mathcal T \cap (0,t)\) is simply \(Y_k^{\text {WSETF}}(t) - Y_k^{\text {WSETF}}(e^{(i')}(q-1))\). Therefore, \(\text {WSETF}\) has in total spent
on jobs \(k \in J' \cap \bigcup _{i'=\underline{i}}^{i^*} J^{(i')}\) during \(\mathcal T \cap (0,t)\), where we used the induction hypothesis for q. Thus, within \(\mathcal T \cap (0,t)\), the strategy \({\mathfrak {D}_{b}}\) has had enough time to probe all jobs from this set for \(w_k b^{q}\). In particular it has probed the jobs from \(J^{(\underline{i})}\) for this time, so by definition of \(e^{(\underline{i})}(q)\), we have \(t \ge e^{(\underline{i})} (q) = e\), hence \(t = e\).
By continuity, all jobs \(j \in J'\) have \(Y_j^{\text {WSETF}}(e) \le w_j \frac{b^{q+1}}{b-1}\). As \({\mathfrak {D}_{b}}\) manages to process all these jobs long enough during \(\mathcal T\) so that at time e they have accumulated exactly this amount of elapsed time and \(\text {WSETF}\) always processes jobs from \(J'\) during \(\mathcal T\), it must have assigned in total the same amount of time within \(\mathcal T\) to jobs from \(J'\), i.e., \(\sum _{j \in J'} Y_j^{\text {WSETF}}(e) = \frac{b^{q+1}}{b-1} \sum _{j \in J'} w_j\). Together with the above upper bound for each individual job \(j \in J'\), this implies that \(Y_j^{\text {WSETF}}(e) = w_j \frac{b^{q+1}}{b-1}\) for all \(j \in J'\). Consequently, \(C_j^{\text {WSETF}}(I'') = e\) if \(q_j = q\). Thus, the statement claimed above is true for all \(i' \in \mathcal I(e)\), so in particular for i, contradicting our assumption.
In total, we have thus shown that \(C^{{\mathfrak {D}_{b}}}_j (I') \le e^{(i)}(q_j) = C^{\text {WSETF}}_j (I'')\) for all jobs \(j \in J^{(i)}\) for all i, and thus, \({\mathfrak {D}_{b}}(I') \le \text {WSETF}(I'')\). \(\square \)
Proof of Theorem 19
By Theorem 12 we know that \(\text {WSETF}\) is 2-competitive for \(1\,|\,r_j,\,\text {pmtn}\,|\,\sum w_j C_j\). Moreover, using Lemmas 21 and 22 together with Lemma 18, we obtain
\(\square \)
6.2 Parallel machines
In this subsection we consider instances \((\varvec{p}, \varvec{1}, \varvec{0}, m)\) of \(P\,|\,|\,\sum C_j\) with processing times \(\varvec{p}\), unit weights, trivial release dates, and m identical parallel machines. For this setting we have to extend the definition of kill-and-restart strategies and in particular of the b-scaling strategy. The set of all currently active probes is added to each state. The intervals chosen by an action need not be disjoint anymore, but instead it is required that any point in time t be covered by at most m intervals. Finally, the transition function \(T_I\), mapping a state and an action to a new state, requires several modifications: It has to deal with the situation that multiple jobs are simultaneously completed, the elapsed probing times of the active probes given in the state have to be taken into account in the determination of the next completion time, and the active probes at the next decision time have to be determined. We do not go into more detail for the general definition.
For the b-scaling strategy \({\mathfrak {D}_{b}}\), we perform the same sequence of probes, but assign them to the parallel machines in a list scheduling manner, i.e., every probe is scheduled on the first available machine. Moreover, at the moment when the number of remaining jobs becomes less than or equal to the number of machines, the jobs are not aborted anymore. The formalization of the actions chosen upon the completion of any job is straightforward. However, it may not be obvious how to formalize the action chosen at time 0 if n is not divisible by m, because then the last planned probes of each round q need not be synchronized. The infinitesimal probing makes it impossible to define the probing intervals in an inductive way. This can be resolved by observing that the planned probes are scheduled in an SPT manner, and it is well known that in an SPT schedule, every mth job goes to the same machine. Hence, it is possible to split the family of all probe operations into m subsequences, each containing every mth element. More precisely, if the probes of the single-machine strategy are denoted by \(\pi _{(q,j)} = (t_{(q,j)}, j, \tau _{(q,j)})\) for \((q,j) \in \mathbb {Z}\times [n]\), then consider the bijection \(\iota :\mathbb {Z}\times [n] \rightarrow \mathbb {Z}\) with \(\iota (q,j) {:}{=}nq + j\), specifying the probing order, and the subfamilies \(a_i = (\pi _{\iota ^{-1}(k m + i)})_{k \in \mathbb {Z}}\) of probes to be assigned to each machine \(i \in [m]\). We define the probes of the initial action for m machines as
where \(\sum _{k'=-\infty }^{k-1} \tau _{\iota ^{-1}(k'm+i)} < \infty \) because of the absolute convergence of the series of all probing times smaller than any given one. The entire initial action of the m-machine b-scaling strategy is then the family \((\pi '_{(i,k)})_{i \in [m], k \in \mathbb {Z}}\).
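A simplified list-scheduling simulation of this m-machine variant is sketched below (an illustration under assumptions: unit weights, a finite starting rank \(q_0\) in place of infinitesimal probing, and probes planned round by round in index order); it assigns each probe to the machine that becomes available first and stops aborting once at most m jobs remain.

```python
def b_scaling_parallel(p, m, b=2.0, q0=-10):
    """Sketch of the m-machine b-scaling strategy with list scheduling:
    keep the single-machine probe sequence (rounds of growing probe lengths
    b**q over the surviving jobs in index order) and put every probe on the
    machine that becomes available first; once at most m jobs remain, they
    are run to completion without further aborts."""
    n = len(p)
    remaining = set(range(n))
    C = [None] * n
    loads = [0.0] * m                         # time at which each machine becomes free
    q = q0
    while len(remaining) > m:
        for j in sorted(remaining):           # round q over the surviving jobs
            if len(remaining) <= m:
                break
            i = min(range(m), key=lambda k: loads[k])
            tau = b ** q
            if p[j] <= tau:                   # probe succeeds: job finishes
                loads[i] += p[j]
                C[j] = loads[i]
                remaining.discard(j)
            else:                             # probe fails after tau time units
                loads[i] += tau
        q += 1
    for j in sorted(remaining):               # final <= m jobs run non-preemptively
        i = min(range(m), key=lambda k: loads[k])
        loads[i] += p[j]
        C[j] = loads[i]
    return C

print(sum(b_scaling_parallel(p=[3.0, 7.0, 2.0, 5.0, 9.0], m=2)))   # total completion time
```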
Theorem 23
\({\mathfrak {D}_{b}}\) is \(\frac{3b^2-b}{b-1}\)-competitive for \(P\,|\,|\,\sum C_j\). This ratio is minimized for \(b=\frac{3+\sqrt{6}}{3}\), yielding a performance guarantee of \(5+2\sqrt{6} \approx 9.899\).
Proof
Let \(I = (\varvec{p}, \varvec{1}, \varvec{0}, m)\) be an arbitrary instance for \(P\,|\,|\,\sum C_j\). We assume that \(n > m\), as otherwise \({\mathfrak {D}_{b}}\) is optimal. We define a new instance \(I' = (\varvec{p}', \varvec{1}, \varvec{0}, m)\) with \(p_j':=b^{q_j}\) and \(q_j = \lceil \log _b(p_j) \rceil \). By definition of \({\mathfrak {D}_{b}}\), the job executed last on each machine is run non-preemptively, i.e., it is probed for an infinite amount of time. We denote by \(\widehat{J}\) this set of m jobs run non-preemptively in instance \(I'\).
We first show that \({\mathfrak {D}_{b}}(I)\le {\mathfrak {D}_{b}}(I')\). Denote by \(\pi _0=(t_0,j_0,\tau _0)\) the first probe in \({\mathfrak {D}_{b}}(I)\) in which a job is completed and denote by \(\pi _0,\pi _1,\ldots ,\pi _N\) the sequence of all probes started at or after time \(t_0\), ordered by starting time of the probing operation. Each probe \(\pi _k=(t_k,j_k,\tau _k)\) is in one-to-one correspondence with a probe \(\pi _k'=(t_k',j_k,\tau _k)\) in \({\mathfrak {D}_{b}}(I')\). Denote by \(\lambda _{k1}\le \lambda _{k2} \le \ldots \le \lambda _{km}\) the ordered loads of the m machines in \({\mathfrak {D}_{b}}(I)\) at time \(t_k\), where the load of a machine at time \(\theta \) is the last end of a probe started before \(\theta \) on that machine. In particular, \(\lambda _{k1}=t_k\) because probe \(\pi _k\) starts at time \(t_k\) on the least loaded machine. Similarly, denote by \(\lambda _{k1}' \le \ldots \le \lambda _{km}'\) the m ordered machine loads at time \(t_k'\) in \({\mathfrak {D}_{b}}(I')\).
We show by induction on k that \(\lambda _{ki}\le \lambda _{ki}'\) holds for all \(i\in [m]\). The base case \(k=0\) is trivial, since the probes are identical in \({\mathfrak {D}_{b}}(I)\) and \({\mathfrak {D}_{b}}(I')\) until time \(t_0\), hence \(\lambda _{0i}=\lambda _{0i}', \forall i\in [m]\). Then, let \(k\in \{0,\ldots ,N-1\}\), and denote by \(\delta _k=\min (p_{j_k},\tau _k)\) the actual duration of the probing operation \(\pi _k\), so the loads \(\varvec{\lambda }_{k+1}\) at time \(t_{k+1}\) in \({\mathfrak {D}_{b}}(I)\) are a permutation of \((\lambda _{k1}+\delta _k,\lambda _{k2},\ldots ,\lambda _{km})\). Similarly, the loads \(\varvec{\lambda }_{k+1}'\) at time \(t_{k+1}'\) in \({\mathfrak {D}_{b}}(I')\) are a permutation of \((\lambda _{k1}'+\delta _k',\lambda _{k2}',\ldots ,\lambda _{km}')\), with \(\delta _k'=\min (p_{j_k}',\tau _k)\ge \delta _k\). By induction hypothesis we have \(\lambda _{k1}+\delta _k \le \lambda _{k1}'+\delta _k \le \lambda _{k1}'+\delta _k'\) and \(\lambda _{ki}\le \lambda _{ki}'\) for all \(i=2,\ldots ,m\). This shows the existence of two permutations \(\sigma \) and \(\sigma '\) such that \(\lambda _{k+1,\sigma (i)}\le \lambda _{k+1,\sigma '(i)}'\) holds for all \(i\in [m]\), which in turn implies that the ordered loads satisfy \(\lambda _{k+1,i}\le \lambda _{k+1,i}'\), for all \(i\in [m]\). This concludes the induction. The end of probe \(\pi _k\) in \({\mathfrak {D}_{b}}(I)\) is \(t_k+\delta _k=\lambda _{k1}+\delta _k\le \lambda _{k1}'+\delta _k'=t_k'+\delta _k'\), where the latter corresponds to the end of probe \(\pi _k'\) in \({\mathfrak {D}_{b}}(I')\). Clearly, this implies \({\mathfrak {D}_{b}}(I)\le {\mathfrak {D}_{b}}(I')\).
Let \(q_{\min }{:}{=}\min _j q_j\) and \(q_{\max }{:}{=}\max _{j \notin \widehat{J}} q_j\). For \(q<q_{\max }\) denote by \(T'_i (q)\) the last end of a \(b^q\)-probing operation on machine i in \({\mathfrak {D}_{b}}(I')\). For \(q=q_{\max }\) we need to define \(T_i'(q_{\max })\) differently to take into account the m jobs probed for an infinite time. Note that the jobs \(j\in \widehat{J}\) have \(p_j'\ge b^{q_{\max }}\) and for each \(i\in [m]\) there is exactly one job \(j(i)\in \widehat{J}\) that is completed on machine i. If job j(i) has already been probed for \(b^{q_{\max }}\) before being run non-preemptively, we define \(T_i'(q_{\max })\) as the last end of a \(b^{q_{\max }}\)-probing operation on i; otherwise we define \(T_i'(q_{\max })\) as the first point in time where j(i) has been processed for at least \(b^{q_{\max }}\). This ensures that every job with \(p_j'\ge b^{q_{\max }}\) is processed on some machine \(i\in [m]\) during one interval of length \(b^{q_{\max }}\) contained in \([T_i'(q_{\max }-1),T_i'(q_{\max })]\).
We define another instance \(I'' = (\varvec{p}'', \varvec{1}, \varvec{0}, m)\) with processing times
For all j such that \(q_j\le q_{\max }\) we have \(p_j'' = \frac{b^{q_j+1}}{b-1} = \frac{b}{b-1} p_j'\) and otherwise we have \(p_j'' = \frac{b^{q_{\max }+1}}{b-1} + p_j' \le \bigl ( \frac{1}{b-1} + 1 \bigr ) p_j' = \frac{b}{b-1} p_j'\), i.e., \(p''_j \le \frac{b}{b-1} p'_j\) holds for all j.
Consider the schedule \(S''\) of \(\text {RR}\) for the instance \(I''\). Denote by \(T''(q)\) the first point in time t with \(Y_j^{\text {RR}} (t) = \frac{b^{q+ 1}}{b-1}\) for all jobs j with \(p_j'' \ge \frac{b^{q+ 1}}{b-1}\). In particular, every job with \(q_j \le q_{\max }\) is completed at \(T''(q_j)\) in the schedule \(S''\). We first prove by induction on \(q\) that \(\sum _{i=1}^m T'_i (q) = m \, T''(q)\), for all \(q=q_{\min },\ldots ,q_{\max }\). Since for all jobs j we have \(q_j \ge q_{\min }\), no job is completed in \({\mathfrak {D}_{b}}\) and \(\text {RR}\) until \(T'_i (q_{\min }-1)\) for any machine i and \(T''(q_{\min }-1)\), respectively. Thus, we have
Let \(q> q_{\min }\). At \(T''(q-1)\), there are, by definition, more than m remaining jobs and each job j with \(q_j \ge q\) has already been processed for \(\frac{b^{q}}{b-1}\) in the \(\text {RR}\)-schedule for instance \(I''\). Therefore, each job receives a rate of \(\frac{m}{n_{\ge q}}<1\) in this round, where \(n_{\ge q}\) is the number of jobs with \(p_j'' \ge \frac{b^{q+ 1}}{b-1}\) and, thus, \(T''(q)=T''(q-1)+\frac{n_{\ge q}}{m}\cdot b^{q}\). On the other hand, if \(q<q_{\max }\), \({\mathfrak {D}_{b}}\) probes \(n_{\ge q}\) jobs for exactly \(b^{q}\), so \(\sum _{i=1}^m T'_i (q) = \sum _{i=1}^m T'_i(q-1) + n_{\ge q} b^{q}\) because there is no idle time in the schedule. For \(q=q_{\max }\), our definition of \(T_i'(q_{\max })\) ensures that \(\sum _{i=1}^m T'_i (q_{\max }) = \sum _{i=1}^m T'_i(q_{\max }-1) + n_{\ge q_{\max }} b^{q_{\max }}\) holds as well. Then, the claim follows from the induction hypothesis.
Consider a job \(j \notin \widehat{J}\). We have
where the second inequality comes from the fact that the probing operations are done in a list-scheduling manner. For a job \(j \in \widehat{J}\), we have \(C^{{\mathfrak {D}_{b}}}_j (I') \le T'_{i(j)} (q_{\max }) +p_j'\), where i(j) denotes the machine on which j is probed for an infinite amount of time. This implies
Since \(\text {RR}\) is 2-competitive for \(P\,|\,\text {pmtn}\,|\,\sum C_j\) (see [5]), we overall obtain
where we used Lemma 18 for the last two inequalities. \(\square \)
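The closed-form optima stated in Theorems 19 and 23 can be confirmed numerically; the following sketch (assuming NumPy/SciPy) evaluates both ratio functions at the claimed minimizers and compares with a bounded scalar minimization.

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Theorem 19: 2 b^4 / (2 b^2 - 3 b + 1), claimed minimizer b = (9 + sqrt(17)) / 8
r19 = lambda b: 2 * b ** 4 / (2 * b ** 2 - 3 * b + 1)
b19 = (9 + np.sqrt(17)) / 8
print(b19, r19(b19), (107 + 51 * np.sqrt(17)) / 32)              # ~1.640, ~9.915, ~9.915
print(minimize_scalar(r19, bounds=(1.001, 10.0), method="bounded").x)

# Theorem 23: (3 b^2 - b) / (b - 1), claimed minimizer b = (3 + sqrt(6)) / 3
r23 = lambda b: (3 * b ** 2 - b) / (b - 1)
b23 = (3 + np.sqrt(6)) / 3
print(b23, r23(b23), 5 + 2 * np.sqrt(6))                         # ~1.816, ~9.899, ~9.899
print(minimize_scalar(r23, bounds=(1.001, 10.0), method="bounded").x)
```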
7 Conclusion
We studied kill-and-restart as well as preemptive strategies for the problem of minimizing the sum of weighted completion times and gave a tight analysis of the deterministic and randomized versions of the natural b-scaling strategy for \(1\,|\,|\,\sum w_j C_j\) as well as of \(\text {WSETF}\) for \(1\,|\,r_j,\,\text {pmtn}\,|\,\sum w_j C_j\).
We hope that this work might lay a basis for obtaining tight bounds on the performance of the b-scaling strategy for more general settings such as non-trivial release dates and parallel machines. Moreover, we think that the class of kill-and-restart strategies combines the best of two worlds. On the one hand, they allow for interruptions leading to small competitive ratios in contrast to non-preemptive algorithms, on the other hand, they reflect the non-preemptive property of only completing a job if it has been processed as a whole.
8 The b-scaling strategy with infinitesimal probing
In this section, we briefly discuss how to define the b-scaling strategy with infinitesimal probing. The b-scaling strategy \({\mathfrak {S}_{b}}^{\text {id},0,q_{0}}\) without infinitesimal probing starts at a fixed round \(q_0\) and is given in Algorithm 1. While \({\mathfrak {S}_{b}}^{\text {id},0,q_{0}}\) can easily be implemented, it is not possible to implement the limit strategy \({\mathfrak {S}_{b}}^{\text {id},0,-\infty }\), for example, on a Turing machine, since at a time arbitrarily close to 0 it has probed each job an infinite number of times.
Let us now formally define the strategy \({\mathfrak {S}_{b}}^{\sigma ,\xi , -\infty }\), by describing the action \(a(s)=\big ((t_i,j_i,\tau _i)\big )_{i\in \mathcal {I}}\in \mathcal {A}(s)\) it takes in any state \(s=(\theta , U, \varvec{\mu })\), in accordance with the kill-and-restart framework described in Sect. 2. Recall that an action is a family of planned probes that the strategy is committed to execute until a job completes and a new action is determined. Moreover, the state s specifies lower bounds \(\mu _j\le p_j\) for every job j, the set U of unfinished jobs, and the current time \(\theta \).
In the initial state \(s_0=(0, [n], \varvec{0})\), we plan to probe all jobs \(j\in [n]\) in rounds, where in each round the jobs are probed in the order given by \(\sigma \) for \(w_jb^{q+\xi }\) for some \(q\in \mathbb {Z}\), and then q is incremented by 1. Hence, \(\sum _{\widehat{q}=-\infty }^{q-1}\sum _{k\in [n]}w_kb^{\widehat{q}+\xi }=\frac{b^{q+ \xi }}{b-1} \sum _{k\in [n]} w_k\) is the point in time at which the first job \(j =\sigma ^{-1}(1)\) is probed for \(w_jb^{q+\xi }\). We define the action of \({\mathfrak {S}_{b}}^{\sigma ,\xi }\) for state \(s_0\) by
In a state \(s=(\theta , U, \varvec{\mu })\) with \(\theta > 0\) occurring at the completion of a job, there exists \(q^*\in \mathbb {Z}\) by construction such that \(\mu _j\in \{w_j b^{q^*+\xi -1}, w_j b^{q^*+\xi }\}\), for all \(j\in U\). The set \(J^*{:}{=}\{j\in U \mid \mu _j=w_jb^{q^*+\xi -1}\}\) consists of those jobs that have not yet been probed for \(w_jb^{q^*+\xi }\). Hence, these jobs must be probed first before the new round \(q^*+1\) can start. For \(q>q^*\) we define
as the point in time when round q starts. We define the actions of \({\mathfrak {S}_{b}}^{\sigma ,\xi }\) for state s by
9 Technical lemmas
Lemma 24
Let \(L \in \mathbb {N}_{>0}\), \(b \ge 1\), and \(\varvec{B} = (\frac{1}{2} b^{\min (\ell , m)})_{0 \le \ell , m \le L}\). Then the Cholesky decomposition of \(\varvec{B}\) is \(\varvec{B} = \varvec{Y}^\top \varvec{Y}\) with \(\varvec{Y} = \bigl (\sqrt{\frac{b^\ell - b^{\ell -1} \cdot \mathbb {1}_{\ell \ge 1}}{2}} \cdot \mathbb {1}_{m \ge \ell }\bigr )_{0 \le \ell , m \le L}\).
Proof
Obviously,
is an upper triangular matrix with positive diagonal elements. An easy computation shows that \(\varvec{Y}^{\top } \varvec{Y} = \varvec{B}\). \(\square \)
Lemma 25
Let \(L \in \mathbb {N}_{>0}\), \(b \ge 1\), and \(0 = a_0 < a_1 \le \cdots \le a_L\). Let \(\varvec{A} = (\frac{1}{2} a_{\vert m-\ell \vert } \cdot b^{\min (\ell ,m)})_{0 \le \ell ,m \le L}\), and \(\varvec{B} = (\frac{1}{2} b^{\min (\ell ,m)})_{0 \le \ell ,m \le L}\). Then
where \(\varvec{Z} = (Z_{\ell ,m})_{0\le \ell ,m \le L}\) with
Proof
Let \(\varvec{B} = \varvec{Y}^\top \varvec{Y}\) be the Cholesky decomposition of \(\varvec{B}\). We can rewrite
So it remains to compute the matrix \(\varvec{Z}\). The matrix \(\varvec{Y}\) is given in Lemma 24, and its inverse is given by
By computing the product \(\varvec{Z} = \varvec{Y}^{-\top } \varvec{A} \varvec{Y}^{-1}\), we see that \(\varvec{Z}\) has the form claimed in the lemma, which is
\(\square \)
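Lemmas 24 and 25 are easy to verify numerically for small dimensions. The sketch below (assuming NumPy; the vector of coefficients a is a random non-decreasing sequence with \(a_0 = 0\), chosen only for illustration) checks that \(\varvec{Y}^\top \varvec{Y} = \varvec{B}\) and that the quadratic-form ratio is preserved under the substitution \(\varvec{z} = \varvec{Y} \varvec{x}\) with \(\varvec{Z} = \varvec{Y}^{-\top } \varvec{A} \varvec{Y}^{-1}\).

```python
import numpy as np

rng = np.random.default_rng(0)
b, L = 2.0, 6

B = np.array([[0.5 * b ** min(l, m) for m in range(L + 1)] for l in range(L + 1)])

# Y from Lemma 24: upper triangular, with constant rows equal to sqrt((b^l - b^(l-1)*[l>=1]) / 2)
d = np.sqrt(np.array([(b ** l - (b ** (l - 1) if l >= 1 else 0.0)) / 2 for l in range(L + 1)]))
Y = np.triu(np.outer(d, np.ones(L + 1)))
print(np.allclose(Y.T @ Y, B))                       # Cholesky factorization of B

# A as in Lemma 25 for a random non-decreasing sequence a with a_0 = 0
a = np.concatenate(([0.0], np.sort(rng.random(L))))
A = np.array([[0.5 * a[abs(m - l)] * b ** min(l, m) for m in range(L + 1)] for l in range(L + 1)])

Yinv = np.linalg.inv(Y)
Z = Yinv.T @ A @ Yinv                                # Z = Y^{-T} A Y^{-1}
x = rng.random(L + 1)
z = Y @ x
print(np.isclose(x @ A @ x / (x @ B @ x), z @ Z @ z / (z @ z)))   # same ratio of quadratic forms
```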
Lemma 26
Let \(b \ge 1\). For \(L \in \mathbb {N}_{>0}\) define \(\varvec{Y}_L {:}{=}\bigl (\sqrt{\frac{b^\ell - b^{\ell -1} \cdot \mathbb {1}_{\ell \ge 1}}{2}} \cdot \mathbb {1}_{m \ge \ell }\bigr )_{0 \le \ell , m \le L}\), \(\varvec{z}_L {:}{=}\bigl (\sqrt{\frac{2}{L+1}} \cdot \sin \bigl (\frac{\ell \pi }{L+1}\bigr )\bigr )_{0 \le \ell \le L}\), and \(\varvec{x}_L = (x_\ell ^{(L)}) {:}{=}\varvec{Y}_L^{-1} \varvec{z}_L\). Then \(\vert x_\ell ^{(L)}\vert \le \frac{2(\sqrt{b} + 1)}{\sqrt{(L+1)b^\ell (b-1)}}\) for all \(0 \le \ell \le L\), and there is an \(\ell ^* \in \mathbb {N}_{>0}\) such that for all \(L \ge \ell \ge \ell ^*\) we have that \(x^{(L)}_\ell \ge 0\). Moreover, \(\lim _{L \rightarrow \infty } \sum _{\ell =\ell ^*}^L x_\ell ^{(L)} = 0\).
Proof
Set \(\ell ' {:}{=}\bigl \lceil \frac{2}{\sqrt{b} - 1}\bigr \rceil \). We bound
so there is an \(L^*\) such that for all \(L \ge L^*\) the left-hand side is bounded by \(\sqrt{b}\). Set \(\ell ^* {:}{=}\max \{\ell ', L^*\}\), and let \(L \ge \ell ^*\) be fixed. By computing the product of the matrix in Equation (25) with \(\varvec{z}_L\) we obtain
This implies the bound on the absolute values. Since the function
is decreasing on [1, L], for all \(\ell \in \{\ell ^*,\dotsc ,L\}\) it holds that
To prove the last claim, observe that \(\sum _{\ell =\ell ^*}^L x^{(L)}_\ell \) is a telescoping sum for every \(L \ge \ell ^*\), and hence
\(\square \)
Lemma 27
For \(k, L \in \mathbb {N}\) denote by \(\varvec{T}_{k,L}\) the \(L \times L\) Toeplitz matrix with 2 on the main diagonal and \(-1\) on the kth and the \((-k)\)th superdiagonal. Let \(k, L \in \mathbb {N}\) with \(k \mid L\), let \(\varvec{v} \in \mathbb {R}^k\), and let \(\alpha \ge \Vert \varvec{v} \Vert ^2\). Then the matrix

is positive semidefinite.
Proof
We use the Schur complement lemma to show that \(\varvec{H}_L(\alpha , \varvec{v})\) is positive semidefinite. Let \(a {:}{=}L/k \in \mathbb {N}\), and observe that the matrix \(\varvec{T}_{k,L}\) is of the form \(\varvec{T}_{1,a} \otimes \varvec{I}_k\), where \(\otimes \) denotes the Kronecker product. The matrix \(\varvec{T}_{1,a}\) is a symmetric tridiagonal Toeplitz matrix, which has minimum eigenvalue \(\lambda _{\min }(\varvec{T}_{1,a}) = 2 \bigl (1-\cos \bigl (\frac{\pi }{a+1}\bigr )\bigr ) > 0\) (see [51, Theorem 2.4]), and is thus positive definite. The reader may verify that the inverse is given by \((\varvec{T}_{1,a}^{-1})_{ij}=\frac{1}{a+1} \cdot \min (i,j) \cdot (a+1-\max (i,j))\). Since the eigenvalues of the Kronecker product are the products of the eigenvalues, we have \(\lambda _{\min }(\varvec{T}_{k,L}) = \lambda _{\min }(\varvec{T}_{1,a}) > 0\), i.e., \(\varvec{T}_{k,L} \succ 0\), and, moreover, \(\varvec{T}_{k,L}^{-1}= \varvec{T}_{1,a}^{-1} \otimes \varvec{I}_k\). In particular, the upper left \(k\times k\) block of \(\varvec{T}_{k,L}^{-1}\) is equal to \((\varvec{T}_{1,a})^{-1}_{11} \cdot \varvec{I}_k=\frac{a}{a+1}\varvec{I}_k\). So we can form the Schur complement
This concludes the proof that \(\varvec{H}_L(\alpha ,\varvec{v})\) is positive semidefinite. \(\square \)
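The structural facts used in this proof can be checked numerically for small k and a. The sketch below (assuming NumPy; the values of k and a are arbitrary small choices) verifies the Kronecker-product identity \(\varvec{T}_{k,L} = \varvec{T}_{1,a} \otimes \varvec{I}_k\), the minimum eigenvalue of \(\varvec{T}_{1,a}\), its stated inverse, and that the upper-left \(k\times k\) block of \(\varvec{T}_{k,L}^{-1}\) equals \(\frac{a}{a+1}\varvec{I}_k\).

```python
import numpy as np

k, a = 3, 4
L = k * a                                     # k divides L

# T_{1,a}: tridiagonal Toeplitz with 2 on the diagonal and -1 next to it
T1a = 2 * np.eye(a) - np.eye(a, k=1) - np.eye(a, k=-1)
# T_{k,L}: 2 on the diagonal, -1 on the k-th and (-k)-th superdiagonals
TkL = 2 * np.eye(L) - np.eye(L, k=k) - np.eye(L, k=-k)

print(np.allclose(TkL, np.kron(T1a, np.eye(k))))          # T_{k,L} = T_{1,a} (x) I_k

lam_min = np.linalg.eigvalsh(T1a)[0]
print(np.isclose(lam_min, 2 * (1 - np.cos(np.pi / (a + 1)))))   # smallest eigenvalue

inv = np.array([[min(i, j) * (a + 1 - max(i, j)) / (a + 1)
                 for j in range(1, a + 1)] for i in range(1, a + 1)])
print(np.allclose(np.linalg.inv(T1a), inv))                # stated inverse of T_{1,a}

# upper-left k x k block of T_{k,L}^{-1} equals (a / (a+1)) * I_k
print(np.allclose(np.linalg.inv(TkL)[:k, :k], a / (a + 1) * np.eye(k)))
```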
Lemma 28
Let \(a_{mn} \in \mathbb {R}\) for all \(m, n \in \mathbb {N}\). Assume that for every \(n \in \mathbb {N}\) the sequence \((a_{mn})_{m \in \mathbb {N}}\) converges to some \(a_n \in \mathbb {R}\) and that the sequence \((a_n)_{n \in \mathbb {N}}\) converges to some \(a \in \mathbb {R}\). Then there is \(s :\mathbb {N}\rightarrow \mathbb {N}\) so that \(a_{s(n)n} \xrightarrow {n \rightarrow \infty } a\).
Proof
For every \(n \in \mathbb {N}\) there is an \(s(n) \in \mathbb {N}\) such that \(\vert a_{s(n)n} - a_n \vert < \frac{1}{n}\). Then the resulting sequence \((a_{s(n)n})_{n \in \mathbb {N}}\) converges to a because for every \(\varepsilon > 0\) there is an \(N \in \mathbb {N}\) such that \(\vert a_n - a \vert < \frac{\varepsilon }{2}\) and \(\frac{1}{n} < \frac{\varepsilon }{2}\) for all \(n \ge N\), and hence, \(\vert a_{s(n)n} - a \vert \le \vert a_{s(n)n} - a_n \vert + \vert a_n - a \vert< \frac{1}{n} + \frac{\varepsilon }{2} < \varepsilon \). \(\square \)
Availability of data and materials
Not applicable.
References
Jäger, S., Sagnol, G., Schmidt genannt Waldschmidt, D., Warode, P.: Competitive kill-and-restart and preemptive strategies for non-clairvoyant scheduling. In: Del Pia, A., Kaibel, V. (eds.) Integer Programming and Combinatorial Optimization (IPCO). LNCS, vol. 13904, pp. 246–260. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-32726-1_18
Smith, W.E.: Various optimizers for single-stage production. Nav. Res. Logist. Q. 3(1–2), 59–66 (1956). https://doi.org/10.1002/nav.3800030106
Feldmann, A., Sgall, J., Teng, S.-H.: Dynamic scheduling on parallel machines. Theor. Comput. Sci. 130(1), 49–72 (1994). https://doi.org/10.1016/0304-3975(94)90152-X
Shmoys, D.B., Wein, J., Williamson, D.P.: Scheduling parallel machines on-line. SIAM J. Comput. 24(6), 1313–1331 (1995). https://doi.org/10.1137/S0097539793248317
Motwani, R., Phillips, S., Torng, E.: Nonclairvoyant scheduling. Theor. Comput. Sci. 130(1), 17–47 (1994). https://doi.org/10.1016/0304-3975(94)90151-1
Kim, J., Chwa, K.: Non-clairvoyant scheduling for weighted flow time. Inf. Process. Lett. 87(1), 31–37 (2003). https://doi.org/10.1016/S0020-0190(03)00231-X
Beaumont, O., Bonichon, N., Eyraud-Dubois, L., Marchal, L.: Minimizing weighted mean completion time for malleable tasks scheduling. In: Oliker, L., Yelick, K. (eds.) 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 273–284 (2012). https://doi.org/10.1109/ipdps.2012.34
Im, S., Kulkarni, J., Munagala, K.: Competitive algorithms from competitive equilibria: non-clairvoyant scheduling under polyhedral constraints. J. ACM 65(1), 1–33 (2017). https://doi.org/10.1145/3136754
Im, S., Kulkarni, J., Munagala, K., Pruhs, K.: SelfishMigrate: A scalable algorithm for non-clairvoyantly scheduling heterogeneous processors. In: Barak, B. (ed.) 55th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pp. 531–540 (2014). https://doi.org/10.1109/FOCS.2014.63
Garg, N., Gupta, A., Kumar, A., Singla, S.: Non-clairvoyant precedence constrained scheduling. In: Baier, C., Chatzigiannakis, I., Flocchini, P., Leonardi, S. (eds.) 46th International Colloquium on Automata, Languages and Programming (ICALP). LIPIcs, vol. 132, pp. 63:1–63:14. Dagstuhl (2019). https://doi.org/10.4230/LIPIcs.ICALP.2019.63
Lassota, A.A., Lindermayr, A., Megow, N., Schlöter, J.: Minimalistic predictions to schedule jobs with online precedence constraints. In: Krause, A., Brunskill, E., Cho, K., Engelhardt, B., Sabato, S., Scarlett, J. (eds.) Proceedings of the 40th International Conference on Machine Learning (ICML). PMLR, vol. 202, pp. 18563–18583 (2023). https://proceedings.mlr.press/v202/lassota23a.html
Lindermayr, A., Megow, N.: Permutation predictions for non-clairvoyant scheduling. In: Lee, I.-T.A. (ed.) Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pp. 357–368. ACM, New York, NY (2022). https://doi.org/10.1145/3490148.3538579
Megow, N., Vredeveld, T.: A tight 2-approximation for preemptive stochastic scheduling. Math. Oper. Res. 39(4), 1297–1310 (2014). https://doi.org/10.1287/moor.2014.0653
Sevcik, K.C.: Scheduling for minimum total loss using service time distributions. J. ACM 21(1), 66–75 (1974). https://doi.org/10.1145/321796.321803
Weiss, G.: On almost optimal priority rules for preemptive scheduling of stochastic jobs on parallel machines. Adv. Appl. Probab. 27(3), 821–839 (1995). https://doi.org/10.2307/1428135
Graham, R.L.: Bounds for certain multiprocessing anomalies. Bell Syst. Tech. J. 45(9), 1563–1581 (1966). https://doi.org/10.1002/j.1538-7305.1966.tb01709.x
Conway, R.W., Maxwell, W.L., Miller, L.W.: Theory of Scheduling. Addison-Wesley, Reading, Palo Alto, London, Don Mills (1967)
Rinnooy Kan, A.H.G.: Machine Scheduling Problems: Classification, Complexity and Computations. Martinus Nijhoff, Den Haag (1976). https://doi.org/10.1007/978-1-4613-4383-7
Afrati, F., Bampis, E., Chekuri, C., Karger, D., Kenyon, C., Khanna, S., Milis, I., Queyranne, M., Skutella, M., Stein, C., Sviridenko, M.: Approximation schemes for minimizing average weighted completion time with release dates. In: 40th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pp. 32–43 (1999). https://doi.org/10.1109/SFFCS.1999.814574
Hoogeveen, J.A., Vestjens, A.P.A.: Optimal on-line algorithms for single-machine scheduling. In: Cunningham, W.H., McCormick, S.T., Queyranne, M. (eds.) Integer Programming and Combinatorial Optimization (IPCO). LNCS, vol. 1084, pp. 404–414. Springer, Berlin, Heidelberg (1996). https://doi.org/10.1007/3-540-61310-2_30
Anderson, E.J., Potts, C.N.: Online scheduling of a single machine to minimize total weighted completion time. Math. Oper. Res. 29(3), 686–697 (2004). https://doi.org/10.1287/moor.1040.0092
Möhring, R.H., Radermacher, F.J., Weiss, G.: Stochastic scheduling problems I - general strategies. Z. Oper. Res. 28(7), 193–260 (1984). https://doi.org/10.1007/BF01919323
Scharbrodt, M., Schickinger, T., Steger, A.: A new average case analysis for completion time scheduling. J. ACM 53(1), 121–146 (2006). https://doi.org/10.1145/1120582.1120585
Rothkopf, M.H.: Scheduling with random service times. Manag. Sci. 12(9), 707–713 (1966). https://doi.org/10.1287/mnsc.12.9.707
Möhring, R.H., Schulz, A.S., Uetz, M.: Approximation in stochastic scheduling: the power of LP-based priority policies. J. ACM 46(6), 924–942 (1999). https://doi.org/10.1145/331524.331530
Jäger, S.J.: Approximation in deterministic and stochastic machine scheduling. PhD thesis, Technische Universität Berlin (2021). https://doi.org/10.14279/depositonce-12198
Jäger, S., Skutella, M.: Generalizing the Kawaguchi-Kyan bound to stochastic parallel machine scheduling. In: Niedermeier, R., Vallée, B. (eds.) 35th International Symposium on Theoretical Aspects of Computer Science (STACS). LIPIcs, vol. 96, pp. 43:1–43:13. Dagstuhl (2018). https://doi.org/10.4230/LIPIcs.STACS.2018.43
Im, S., Moseley, B., Pruhs, K.: Stochastic scheduling of heavy-tailed jobs. In: Mayr, E.W., Ollinger, N. (eds.) 32nd International Symposium on Theoretical Aspects of Computer Science (STACS). LIPIcs, vol. 30, pp. 474–486. Dagstuhl (2015). https://doi.org/10.4230/LIPIcs.STACS.2015.474
McNaughton, R.: Scheduling with deadlines and loss functions. Manag. Sci. 6(1), 1–12 (1959). https://doi.org/10.1287/mnsc.6.1.1
Epstein, L., Levin, A.: The benefit of preemption for single machine scheduling so as to minimize total weighted completion time. Oper. Res. Lett. 44(6), 772–774 (2016). https://doi.org/10.1016/j.orl.2016.09.013
Labetoulle, J., Lawler, E.L., Lenstra, J.K., Rinnooy Kan, A.H.G.: Preemptive scheduling of uniform machines subject to release dates. In: Pulleyblank, W.R. (ed.) Progress in Combinatorial Optimization, pp. 245–261. Academic Press, Don Mills, ON (1984). https://doi.org/10.1016/B978-0-12-566780-7.50020-9
Sitters, R.: Competitive analysis of preemptive single-machine scheduling. Oper. Res. Lett. 38(6), 585–588 (2010). https://doi.org/10.1016/j.orl.2010.08.012
Epstein, L., van Stee, R.: Lower bounds for on-line single-machine scheduling. Theor. Comput. Sci. 299(1), 439–450 (2003). https://doi.org/10.1016/S0304-3975(02)00488-7
Parekh, A.K.J.: A generalized processor sharing approach to flow control in integrated services networks. PhD thesis, Massachusetts Institute of Technology (1992). https://dspace.mit.edu/handle/1721.1/13076
Chandra, A., Adler, M., Goyal, P., Shenoy, P.: Surplus fair scheduling: A proportional-share CPU scheduling algorithm for symmetric multiprocessors. In: 4th Symposium on Operating Systems Design and Implementation (OSDI), pp. 4:1–4:14. USENIX, Berkeley, CA (2000). https://www.usenix.org/conference/osdi-2000/surplus-fair-scheduling-proportional-share-cpu-scheduling-algorithm-symmetric
Becchetti, L., Leonardi, S.: Nonclairvoyant scheduling to minimize the total flow time on single and parallel machines. J. ACM 51(4), 517–539 (2004). https://doi.org/10.1145/1008731.1008732
Kalyanasundaram, B., Pruhs, K.: Speed is as powerful as clairvoyance. J. ACM 47(4), 617–643 (2000). https://doi.org/10.1145/347476.347479
Bansal, N., Dhamdhere, K.: Minimizing weighted flow time. ACM Trans. Algor. 3(4), 39:1–39:14 (2007). https://doi.org/10.1145/1290672.1290676
Bansal, N., Pruhs, K.: Server scheduling in the weighted \(\ell _p\)-norm. In: Farach-Colton, M. (ed.) Theor. Inform. (LATIN). LNCS, vol. 2976, pp. 434–443. Springer, Berlin, Heidelberg (2004). https://doi.org/10.1007/978-3-540-24698-5_47
Vestjens, A.P.A.: On-line machine scheduling. PhD thesis, Technische Universiteit Eindhoven (November 1997). https://doi.org/10.6100/IR500043
van Stee, R., La Poutré, H.: Minimizing the total completion time on-line on a single machine, using restarts. J. Algor. 57(2), 95–129 (2005). https://doi.org/10.1016/j.jalgor.2004.10.001
Chekuri, C., Motwani, R., Natarajan, B., Stein, C.: Approximation techniques for average completion time scheduling. SIAM J. Comput. 31(1), 146–166 (2001). https://doi.org/10.1137/S0097539797327180
Dürr, C., Erlebach, T., Megow, N., Meißner, J.: An adversarial model for scheduling with testing. Algorithmica 82(12), 3630–3675 (2020). https://doi.org/10.1007/s00453-020-00742-2
Albers, S., Eckl, A.: Explorable uncertainty in scheduling with non-uniform testing times. In: Kaklamanis, C., Levin, A. (eds.) Approximation and Online Algorithms (WAOA). LNCS, vol. 12806, pp. 127–142. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-80879-2_9
Gong, M., Lin, G.: Improved approximation algorithms for multiprocessor scheduling with testing. In: Chen, J., Li, M., Zhang, G. (eds.) Frontiers of Algorithmics (IJTCS-FAW). LNCS, vol. 12874, pp. 65–77. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-97099-4_5
Baeza-Yates, R.A., Culberson, J.C., Rawlins, G.J.E.: Searching in the plane. Inf. Comput. 106(2), 234–252 (1993). https://doi.org/10.1006/inco.1993.1054
Beck, A., Newman, D.J.: Yet more on the linear search problem. Israel J. Math. 8(4), 419–429 (1970). https://doi.org/10.1007/BF02798690
Kao, M.-Y., Reif, J.H., Tate, S.R.: Searching in an unknown environment: an optimal randomized algorithm for the cow-path problem. Inf. Comput. 131(1), 63–79 (1996). https://doi.org/10.1006/inco.1996.0092
Kao, M.-Y., Ma, Y., Sipser, M., Yin, Y.: Optimal constructions of hybrid algorithms. J. Algor. 29(1), 142–164 (1998). https://doi.org/10.1006/jagm.1998.0959
Yao, A.C.-C.: Probabilistic computations: Toward a unified measure of complexity. In: 18th Annual IEEE Symposium on Foundations of Computer Science (SFCS), pp. 222–227 (1977). https://doi.org/10.1109/SFCS.1977.24
Böttcher, A., Grudsky, S.M.: Spectral Properties of Banded Toeplitz Matrices. SIAM, Philadelphia, PA (2005)
Goemans, M.X.: A supermodular relaxation for scheduling with release dates. In: Cunningham, W.H., McCormick, S.T., Queyranne, M. (eds.) Integer Programming and Combinatorial Optimization. LNCS, vol. 1084, pp. 288–300. Springer, Berlin, Heidelberg (1996). https://doi.org/10.1007/3-540-61310-2_22
Goemans, M.X.: Improved approximation algorithms for scheduling with release dates. In: Saks, M. (ed.) Proceedings of the Eighth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 591–598 (1997). https://doi.org/10.5555/314161.314394
Acknowledgements
We thank Sungjin Im for helpful comments on an earlier version of this manuscript. The research of the second, third and fourth authors was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy—The Berlin Mathematics Research Center MATH+ (EXC-2046/1, project ID: 390685689).
Funding
Open Access funding enabled and organized by Projekt DEAL. The research of the second, third and fourth authors was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy—The Berlin Mathematics Research Center MATH+ (EXC-2046/1, project ID: 390685689).
Contributions
All authors contributed in equal parts.
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
An extended abstract of this paper appeared in the proceedings of IPCO 2023 [1].
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.