Skip to main content
Springer logoLink to Springer
. 2025 Jul 21;87(11):1518–1563. doi: 10.1007/s00453-025-01335-7

Smoothed Analysis of the 2-Opt Heuristic for the TSP under Gaussian Noise

Marvin Künnemann 1, Bodo Manthey 2,, Rianne Veenstra 2
PMCID: PMC12450235  PMID: 40984866

Abstract

The 2-opt heuristic is a very simple local search heuristic for the traveling salesperson problem. In practice it usually converges quickly to solutions within a few percentages of optimality. In contrast to this, its running-time is exponential and its approximation performance is poor in the worst case. Englert, Röglin, and Vöcking (Algorithmica, 2014) provided a smoothed analysis in the so-called one-step model in order to explain the performance of 2-opt on d-dimensional Euclidean instances, both in terms of running-time and in terms of approximation ratio. However, translating their results to the classical model of smoothed analysis, where points are perturbed by Gaussian distributions with standard deviation Inline graphic, yields only weak bounds. We prove bounds that are polynomial in n and Inline graphic for the smoothed running-time with Gaussian perturbations. In addition, our analysis for Euclidean distances is much simpler than the existing smoothed analysis. Furthermore, we prove a smoothed approximation ratio of Inline graphic. This bound is almost tight, as we also provide a lower bound of Inline graphic for Inline graphic. Our main technical novelty here is that, different from existing smoothed analyses, we do not separately analyze objective values of the global and local optimum on all inputs (which only allows for a bound of Inline graphic), but simultaneously bound them on the same input.

Keywords: Travelling salesperson problem, Local search, Smoothed analysis, Approximation ratio, 2-opt

2-Opt and Smoothed Analysis

The traveling salesperson problem (TSP) is one of the classical combinatorial optimization problems. Euclidean TSP is the following variant: given points Inline graphic, find the shortest Hamiltonian cycle that visits all points in X (also called a tour). Even this restricted variant is NP-hard for Inline graphic [26]. We consider Euclidean TSP with Manhattan and Euclidean distances as well as squared Euclidean distances to measure the distances between points. For the former two, there exist polynomial-time approximation schemes (PTAS) [2, 25]. The latter, which has applications in power assignment problems for wireless networks [15], admits a PTAS for Inline graphic and is APX-hard for Inline graphic [30].

As it is unlikely that there are efficient algorithms for solving Euclidean TSP optimally, heuristics have been developed in order to find near-optimal solutions quickly. One very simple and popular heuristic is 2-opt: starting from an initial tour, we iteratively replace two edges by two other edges to obtain a shorter tour until we have found a local optimum. Experiments indicate that 2-opt converges to near-optimal solutions quite quickly [16, 17], but its worst-case performance is bad: the worst-case running-time is exponential even for Inline graphic [12] and the approximation ratio is Inline graphic for Euclidean instances [5, 7].

An alternative to worst-case analysis is average-case analysis, where the expected performance with respect to some probability distribution is measured. The average-case running-time for Euclidean and random metric instances and the average-case approximation ratio for non-metric instances of 2-opt have been analyzed [4, 7, 11, 19]. However, while worst-case analysis is often too pessimistic because it is dominated by artificial instances that are rarely encountered in practice, average-case analysis is dominated by random instances, which have often very special properties with high probability that they do not share with typical instances.

In order to overcome the drawbacks of both worst-case and average-case analysis and to explain the performance of the simplex method, Spielman and Teng invented smoothed analysis [28], a hybrid of worst-case and average-case analysis: an adversary specifies an instance, and then this instance is slightly randomly perturbed. The smoothed performance is the expected performance, where the expected value is taken over the random perturbation. The underlying assumption is that real-world instances are often subjected to a small amount of random noise. This noise can stem from measurement or rounding errors, or it might be a realistic assumption that the instances are influenced by unknown circumstances, but we do not have any reason to believe that these are adversarial. Smoothed analysis often allows more realistic conclusions about the performance than worst-case or average-case analysis. Since its invention, it has been applied successfully to explain the performance of a variety of algorithms. We refer to two surveys for an overview of smoothed analysis in general [22, 29] and a more recent survey about smoothed analysis applied to local search algorithms [21].

Related Results

Running-time. Englert, Röglin, and Vöcking [12] provided a smoothed analysis of 2-opt in order to explain its performance. They used the one-step model: an adversary specifies n probability density functions Inline graphic. Then the n points Inline graphic are drawn independently according to the densities Inline graphic, respectively. Here, Inline graphic is the perturbation parameter. If Inline graphic, then the only possibility is the uniform distribution on Inline graphic, and we obtain an average-case analysis. The larger Inline graphic, the more powerful the adversary. Englert et al. [12] proved that the expected number of iterations of 2-opt is Inline graphic and Inline graphic for Manhattan and Euclidean distances, respectively. These bounds can be improved slightly by choosing the initial tour with an insertion heuristic. However, if we transfer these bounds to the classical model of points perturbed by Gaussian distributions of standard deviation Inline graphic, we obtain bounds that are polynomial in n and Inline graphic [12, Section 6], since the maximum density of a d-dimensional Gaussian with standard deviation Inline graphic is Inline graphic. While this is polynomial for any fixed d, it is unsatisfactory that the degree of the polynomial depends on d.

Approximation ratio.

Much less is known about the smoothed approximation performance of algorithms. Karger and Onak have shown that multi-dimensional bin packing can be approximated arbitrarily well for smoothed instances [18] and there are frameworks to approximate Euclidean optimization problems such as TSP for smoothed instances [3, 8]. However, these approaches mostly consider algorithms tailored to solving smoothed instances.

With respect to concrete algorithms other than 2-opt, we are only aware of analyses of the jump and lex-jump heuristics for scheduling [6, 13].

Englert et al. [12] proved a bound of Inline graphic. Translated to Gaussians, this yields a bound of Inline graphic if we truncate the Gaussians such that all points lie in a hypercube of constant side length. This result, however, does not explain the approximation performance 2-opt, as the bound is still quite large, even for larger values of Inline graphic or smaller values of Inline graphic.

Our Contribution

In order to improve our understanding of the practical performance of 2-Opt, we provide an improved smoothed analysis of both its running-time and its approximation ratio. To do this, we use the classical smoothed analysis model: an adversary chooses n points from the d-dimensional unit hypercube Inline graphic, and then these points are independently randomly perturbed by Gaussian random variables of standard deviation Inline graphic.

Running-time The bounds that we prove are polynomial in n and Inline graphic. Different to earlier results, the degree of the polynomial is independent of d. As distance measures, we consider Manhattan (Sect. 3.3), Euclidean (Sect. 3.5), and squared Euclidean distances (Sect. 3.4).

The analysis for Manhattan distances is essentially an adaptation of the existing analysis by Englert et al. [12, Section 4.1]. Note that our bound does not have any factor that is exponential in d.

Our analysis for Euclidean distances is considerably simpler than the one by Englert et al., which is rather technical and takes more than 25 pages [12, Section 4.2 and Appendix C].

The analysis for squared Euclidean distances is, to our knowledge, not preceded by a smoothed analysis in the one-step model. Because of the nice properties of squared Euclidean distances and Gaussian perturbations, this smoothed analysis is relatively compact and elegant.

Table 1 summarizes our bounds for the number of iterations.

Table 1.

Our bounds compared to the bounds obtained by Englert et al. [12] for the one-step model

Manhattan Euclidean Squared Euclidean
Englert et al. [12] Inline graphic Inline graphic
General Inline graphic Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic Inline graphic
Remarks Only for Inline graphic

only for Inline graphic

; a weaker bound holds for Inline graphic (Theorem 3.14)

The bounds can roughly be transferred to Gaussian noise by replacing Inline graphic with Inline graphic. For convenience, we added our bounds for small and large values of Inline graphic: for Inline graphic, we have Inline graphic, for larger Inline graphic, we have Inline graphic. The notation Inline graphic means that terms depending on d are hidden in the O. The remarks are only for our bounds

Recently, building on our analysis, Manthey and van Rhijn [23] have improved the running-time bounds for Euclidean distances. Specifically, they have reduced the upper bound from Inline graphic to Inline graphic, at the cost of a significantly more complex proof. Furthermore, while their analysis remains restricted to Euclidean distances, we cover Manhattan and squared Euclidean distances as well. In particular, our analysis using squared Euclidean distances is very compact, in contrast to most applications of smoothed analysis, which makes this case particularly appealing, not least for teaching courses on this subject.

Approximation ratio As the earlier smoothed analysis by Englert et al. [12], we provide bounds on the quality of the worst local optimum. While this measure is rather unrealistic and pessimistic, it decouples the analysis from the seeding of the heuristic. Taking into account the seeding would probably severely complicate the analysis.

Our bound of Inline graphic improves significantly upon the direct translation of the bound of Englert et al. [12] to Gaussian perturbations (see Sect. 4.2 for how to translate the bound to Gaussian perturbations without truncation). It smoothly interpolates between the average-case constant approximation ratio and the worst-case bound of Inline graphic.

In order to obtain our improved bound for the smoothed approximation ratio, we take into account the origins of the points, i.e., their unperturbed positions. Although this information is not available to the algorithm, it can be exploited in the analysis. The smoothed analyses of approximation ratios so far [3, 6, 8, 12, 13, 18] essentially ignored this information. While this simplifies the analysis, being oblivious to the unperturbed positions seems to be too pessimistic. In fact, we see that the bound of Englert et al. [12] cannot be improved beyond Inline graphic by ignoring the positions of the points (Sect. 4.2). The reason for this limitation is that the lower bound for the global optimum is obtained if all points have the same origin, which corresponds to an average-case rather than a smoothed analysis. On the other hand, the upper bound for the local optimum has to hold for all choices of the unperturbed points, most of which yield higher costs for the global optimum than the average-case analysis. Taking this into account carefully yields our bound of Inline graphic (Sect. 4.3).

To complement our upper bound, we show that the lower bound of Inline graphic by Chandra et al. [7] remains true for Inline graphic (Sect. 4.4). This implies that a smoothed bound of Inline graphic is impossible, and, thus, our bound cannot be improved significantly.

2-Opt and Smoothing Model

2-Opt Heuristic for the TSP

Let Inline graphic be a set of n points. The goal of the TSP is to find a Hamiltonian cycle (also called a tour) T through X that has minimum length according to some distance measure. In this paper, we consider standard Euclidean distances for both approximation ratio and running-time as well as squared Euclidean distances and Manhattan distances for the running-time.

Given a tour T, a 2-change replaces two edges Inline graphic and Inline graphic of T by two new edges Inline graphic and Inline graphic, provided that this yields again a tour (this is the case if Inline graphic appear in this order in the tour) and that this decreases the length of the tour, i.e., Inline graphic, where Inline graphic (Euclidean distances), Inline graphic (Manhattan distances), or Inline graphic (squared Euclidean distances). The 2-opt heuristic iteratively improves an initial tour by applying 2-changes until it reaches a local optimum. A local optimum is called a 2-optimal tour.

Smoothing Model

Throughout the rest of this paper, let Inline graphic be a set of n points from the unit hypercube. In the smoothed analysis, these points are chosen by an adversary, and they serve as unperturbed origins. Let Inline graphic be n independent random variables with mean 0 and standard deviation Inline graphic. By slight abuse of notation, Inline graphic refers here to the multivariate normal distribution with covariance matrix Inline graphic. We obtain the perturbed point set Inline graphic by adding Inline graphic for each Inline graphic. We write Inline graphic to make explicit from which point set Inline graphic the points in X are obtained.

We assume that Inline graphic throughout the paper. This is justified by two reasons. First, small Inline graphic are the interesting case, i.e., when the order of magnitude of the perturbation is relatively small. Second, smoothed performance guarantees are monotonically decreasing in Inline graphic: if we have Inline graphic, then this is equivalent to adversarial instances in Inline graphic that are perturbed with standard deviation 1. This in turn is dominated by adversarial instances in Inline graphic that are perturbed with standard deviation 1, as Inline graphic. Thus, any upper bound for Inline graphic (be it for the number of iterations or the approximation ratio) holds also for larger Inline graphic.

Let us make a final remark about the smoothing model: while the algorithm itself, the 2-opt heuristic in our case, only sees X and does not know anything about the origins Inline graphic, we can of course exploit the positions of the unperturbed points in the analysis.

Smoothed Analysis of the Running-Time

In this section, we make the dependence on all parameters (the number n of points, the dimension d, and the perturbation parameter Inline graphic) explicit. This means that the O or Inline graphic do not hide any factors, not even factors depending on d, which is often considered as a constant and therefore ignored. (This is also in contrast to our analysis of the approximation ratio, where the hidden constant can indeed depend on d.)

Probability Theory for the Running-Time

In order to get an upper bound for the length of the initial tour, we need an upper bound for the diameter of the point set X. Such an upper bound is also necessary for the analysis of 2-changes with Euclidean distances (Sect. 3.5). We choose Inline graphic such that Inline graphic with a probability of at least Inline graphic. For fixed d and Inline graphic, we can choose Inline graphic according to the following lemma. For Inline graphic, we have Inline graphic.

Lemma 3.1

Let Inline graphic be a sufficiently large constant, and let Inline graphic. Then Inline graphic.

Proof

We have Inline graphic only if there is a point Inline graphic and a coordinate of Inline graphic that is perturbed by more than Inline graphic. According to Durrett [10, Theorem 1.2.3], the probability that a 1-dimensional Gaussian of standard deviation Inline graphic is more than Inline graphic away from its mean is bounded from above by Inline graphic. Thus, the probability that Inline graphic is bounded from above by Inline graphic. For sufficiently large c, this is at most 1/n!. Inline graphic

Note that the constant c in Lemma 3.1 does not depend on the dimension d.

The following lemma is well known and follows from the fact that the density of a d-dimensional Gaussian with standard deviation Inline graphic is bounded from above by Inline graphic and the volume of a d-dimensional ball of radius Inline graphic is bounded from above by Inline graphic.

Lemma 3.2

Let Inline graphic be drawn according to a d-dimensional Gaussian distribution of standard deviation Inline graphic, and let Inline graphic be a d-dimensional hyperball of radius Inline graphic centered at Inline graphic. Then Inline graphic.

For Inline graphic with Inline graphic, let Inline graphic denote the straight line through x and y.

Lemma 3.3

Let Inline graphic be arbitrary with Inline graphic. Let Inline graphic be drawn according to a d-dimensional Gaussian distribution with standard deviation Inline graphic. Then the probability that c is Inline graphic-close to L(ab), i.e., Inline graphic, is bounded from above by Inline graphic.

Proof

We divide drawing c into drawing a 1-dimensional Gaussian Inline graphic in the direction of Inline graphic and drawing a Inline graphic-dimensional Gaussian Inline graphic in the hyperplane orthogonal to Inline graphic and containing Inline graphic. Then the distance of c to L(ab) is Inline graphic. For every Inline graphic, the point c is Inline graphic-close to L(ab) only if Inline graphic falls into a Inline graphic-dimensional hyperball of radius Inline graphic around Inline graphic in the Inline graphic-dimensional subspace orthogonal to Inline graphic. Now the lemma follows by applying Lemma 3.2. Inline graphic

We need the following lemma in Sect. 3.5.

Lemma 3.4

Let Inline graphic be a differentiable function. Let B be an upper bound for the absolute value of the derivative of f. Let c be distributed according to a Gaussian distribution with standard deviation Inline graphic. Let I be an interval of size Inline graphic, and let Inline graphic be the image of I. Then Inline graphic.

Proof

Since the derivative of f is bounded by B, the set f(I) is contained in some interval of length Inline graphic. The lemma follows since the density of c is bounded from above by Inline graphic. Inline graphic

The chi distribution [14, Section 8] is the distribution of the Euclidean length of a d-dimensional Gaussian random vector of standard deviation Inline graphic and mean 0. In the following, we denote its density function by Inline graphic. It is given by

graphic file with name d33e1743.gif 1

where Inline graphic denotes the gamma function. We need the following lemma several times.

Lemma 3.5

Assume that Inline graphic is a fixed constant and Inline graphic is arbitrary with Inline graphic. Then we have

graphic file with name d33e1779.gif

Proof

The first equality follows by integration. For the second inequality, we observe Inline graphic is a fixed constant (which also never depends on d when we apply the lemma) and that

graphic file with name d33e1797.gif

for some function Inline graphic with Inline graphic according to Stirling’s formula [1, 6.1.37]. We have Inline graphic as Inline graphic and both are integers. Then

graphic file with name d33e1831.gif

Here, the third equality follows from two facts: first, c is a fixed constant, thus Inline graphic. Second, Inline graphic. Thus, Inline graphic and Inline graphic lie between 0 and a constant. Hence, the exponential term is Inline graphic.

Analyzing A and B remains to be done: We have Inline graphic, thus Inline graphic. If Inline graphic, then B is bounded from below by a constant and so is Inline graphic. If Inline graphic, then Inline graphic. Hence, Inline graphic.

We have Inline graphic. Distinguishing the cases Inline graphic and Inline graphic in the same way as for B yields Inline graphic. Thus, Inline graphic. Inline graphic

The analysis with Euclidean and squared Euclidean distances depends on the distribution of the distance between two points perturbed by Gaussians, where a larger distance between the two points is better for the analysis. The following two lemmas show that, given that larger distance is better, we can replace the distribution of the distance by the corresponding chi distribution. Since we do not know the original positions of the points involved, this allows us to replace unknown distributions by the chi distribution.

Lemma 3.6

Assume that a is drawn according to a d-dimensional Gaussian distribution with standard deviation Inline graphic and mean 0. Assume that b is drawn according to a d-dimensional Gaussian distribution with standard deviation Inline graphic and mean Inline graphic. Then Inline graphic stochastically dominates Inline graphic, i.e., Inline graphic for all Inline graphic.

Proof

For Inline graphic, we have the following:

graphic file with name d33e2037.gif

Now we prove the lemma for larger d. Since Gaussian distributions are rotationally symmetric, we can assume that Inline graphic for some Inline graphic.

We observe that Inline graphic dominates Inline graphic if and only if Inline graphic dominates Inline graphic. Let Inline graphic. It suffices to prove the lemma for this choice of Inline graphic, as Inline graphic follows the same distribution as b. Fixing Inline graphic fixes also Inline graphic. Then Inline graphic dominates Inline graphic if Inline graphic dominates Inline graphic. This is true because the lemma holds for Inline graphic. Inline graphic

Lemma 3.7

Let b be as in Lemma 3.6, and let Inline graphic be a monotonically decreasing function. Let g be the density function of Inline graphic. Then

graphic file with name d33e2180.gif

provided that both integrals exist.

Proof

Let a denote the d-dimensional Gaussian random variable of standard deviation Inline graphic and mean 0. Then Inline graphic has density Inline graphic. By Lemma 3.6, Inline graphic is dominated by Inline graphic. This implies that Inline graphic dominates Inline graphic since h is monotonically decreasing. The lemma follows by observing that the two integrals are the two expected values of Inline graphic and Inline graphic. Inline graphic

For Euclidean and squared Euclidean distances, it turns out to be useful to study Inline graphic for points Inline graphic. By abusing notation, we sometimes write Inline graphic instead of Inline graphic for short. A 2-change that replaces Inline graphic and Inline graphic by Inline graphic and Inline graphic improves the tour length by Inline graphic.

2-Opt State Graph and Linked 2-Changes

The number of iterations that 2-opt needs depends of course heavily on the initial tour and on which 2-change is chosen in each iteration. We do not make any assumptions about the initial tour and about which 2-change is chosen. Following Englert et al. [12], we consider the 2-opt state graph: we have a node for every tour and a directed edge from tour T to tour Inline graphic if Inline graphic can be obtained by one 2-change. The 2-opt state graph is a directed acyclic graph, and the length of the longest path in the 2-opt state graph is an upper bound for the number of successful iterations that 2-opt needs.

In order to improve the bounds, we also consider pairs of linked 2-changes [12]. Two 2-changes form a pair of linked 2-changes if there is one edge added in one 2-change and removed in the other 2-change. Formally, one 2-change replaces Inline graphic and Inline graphic by Inline graphic and Inline graphic and the other 2-change replaces Inline graphic and Inline graphic by Inline graphic and Inline graphic. The edge Inline graphic is the one that appears and disappears again (or the other way round). It can happen that Inline graphic and Inline graphic intersect. Englert et al. [12] called a pair of linked 2-changes a type i pair if Inline graphic. As type 2 pairs, which involve only four nodes, are difficult to analyze because of dependencies, we ignore them. Fortunately, the following lemma states that we will find enough disjoint pairs of linked 2-changes of type 0 and 1 in any sufficiently long sequence of 2-changes.

Lemma 3.8

(Englert et al. [12, Lemma 9 of corrected version]) Every sequence of t consecutive 2-changes contains at least Inline graphic disjoint pairs of linked 2-changes of type 0 or type 1.

Following Englert et al. [12, Figure 8], we subdivide type 1 pairs into type 1a and type 1b depending on how Inline graphic and Inline graphic intersect. One of the 2-changes replaces Inline graphic and Inline graphic by Inline graphic and Inline graphic. Then the other 2-change, i.e., the one that removes the edge Inline graphic shared by the linked pair, determines its type:

  • Type 0: Inline graphic and Inline graphic are replaced by Inline graphic and Inline graphic.

  • Type 1a: Inline graphic and Inline graphic are replaced by Inline graphic and Inline graphic.

  • Type 1b: Inline graphic and Inline graphic are replaced by Inline graphic and Inline graphic.

The main idea in the proofs by Englert et al. [12] and also in our proofs is to bound the minimal improvement of any 2-change or the minimal improvement of any pair of linked 2-changes. We denote the smallest improvement of any 2-change by Inline graphic and the smallest improvement of any pair of linked 2-changes of type 0, 1a, or 1b by Inline graphic. It will be clear from the context which distance measure is used for Inline graphic and Inline graphic.

Suppose that the initial tour has a length of at most L, then 2-opt cannot run for more than Inline graphic iterations and not for more than Inline graphic iterations, provided that Inline graphic because of Lemma 3.8.

The following lemma formalizes this and shows how to bound the expected number of iterations using a tail bound for Inline graphic or Inline graphic.

Lemma 3.9

Suppose that, with a probability of at least Inline graphic, any tour has a length of at most L. Let Inline graphic. Then

  1. If Inline graphic, then the expected length of the longest path in the 2-opt state graph is bounded from above by Inline graphic.

  2. If Inline graphic, then the expected length of the longest path in the 2-opt state graph is bounded from above by Inline graphic.

  3. The same bounds as (1) and (2) hold if we replace Inline graphic by Inline graphic, provided that Inline graphic for Case 1 and Inline graphic for Case 2.

Proof

If the length of the longest tour is longer than L, then we use the trivial upper bound of n!. This contributes only O(1) to the expected value, which, by slight abuse of mathematical correctness, we ignore in the following.

Consider the first statement. Let T be the longest path in the 2-opt state graph. If Inline graphic, then Inline graphic. Plugging this in and observing that n! is an upper bound for T yields

graphic file with name d33e2774.gif

Now consider the second statement, and let T be as above. Let Inline graphic. Then

graphic file with name d33e2789.gif

Finally, we consider the third statement. The statement follows from the observation that the maximal number of disjoint pairs of linked 2-changes and the length of the longest path in the 2-opt state graph are asymptotically equal if they are of length at least Inline graphic (Lemma 3.8) and the probability statements become nontrivial only for Inline graphic in the first and Inline graphic in the second case. Inline graphic

Manhattan Distances

The essence of our analysis for Manhattan distances is a straightforward adaptation of the analysis in the one-step model. The extra factor of Inline graphic comes from the bound of the initial tour, and the extra factor of Inline graphic stems from stating the dependence on d explicitly and getting rid of the exponential dependence on d [12, Proofs of Theorem 7 and Lemma 10].

Lemma 3.10

Inline graphic.

Proof

We consider a pair of linked 2-changes as described in Sect. 3.2. The improvement of the first 2-change is

graphic file with name d33e2863.gif

where Inline graphic is the i-th coordinate of Inline graphic. The improvement of the second 2-change is

graphic file with name d33e2885.gif

Note that we can have a type 1 pair, i.e., two of the points Inline graphic can be identical.

Each ordering of the Inline graphic gives rise to a linear combination for Inline graphic and Inline graphic. We have Inline graphic such orderings. If we examine the case analysis by Englert et al. [12, Lemmas 11, 12, 13] closely, we see that any pair of linear combinations is either impossible (it uses a different ordering of the variables for Inline graphic and Inline graphic or one of Inline graphic and Inline graphic is non-positive, thus the corresponding 2-change is in fact not a 2-change) or we have one variable Inline graphic that has a non-zero coefficient in Inline graphic and a coefficient of 0 in Inline graphic and another variable Inline graphic that has a non-zero coefficient in Inline graphic and a coefficient of 0 in Inline graphic. The absolute values of the non-zero coefficients of Inline graphic and Inline graphic is 2. Now Inline graphic falls into Inline graphic only if Inline graphic falls into an interval of length Inline graphic. This happens with a probability of at most Inline graphic. By independence, the same holds for Inline graphic and Inline graphic.

However, we would incur an extra factor of Inline graphic in this way, and we would like to remove all exponential dependence of d. In order to do this, we assume that we know i and Inline graphic already. This comes at the expense of a factor of Inline graphic for taking a union bound over the choices of i and Inline graphic. We let an adversary fix values for all Inline graphic with Inline graphic. Since we know i and Inline graphic, we are left with at most Inline graphic possible linear combinations.

Finally, the lemma follows by taking a union bound over all Inline graphic possible pairs of linked 2-changes. Inline graphic

Theorem 3.11

The expected length of the longest path in the 2-opt state graph corresponding to d-dimensional instances with Manhattan distances is at most Inline graphic.

Proof

The initial tour has a length of at most Inline graphic with a probability of at least Inline graphic by Lemma 3.1. We apply Lemma 3.9 for linked 2-changes using Lemma 3.10 and Inline graphic. Inline graphic

Squared Euclidean Distances

Preparation

In this section, we have Inline graphic for Inline graphic.

Assume that we have a 2-change that replaces Inline graphic and Inline graphic by Inline graphic and Inline graphic. The improvement caused by this 2-change is Inline graphic. Given the positions of the four nodes except for a single Inline graphic, such a 2-change yields a small improvement only if the corresponding Inline graphic falls into some interval of size Inline graphic. The following lemma gives an upper bound for the probability that this happens.

Lemma 3.12

Let Inline graphic, Inline graphic, and let c be drawn according to a Gaussian distribution with standard deviation Inline graphic. Let Inline graphic be an interval of length Inline graphic. Then

graphic file with name d33e3278.gif
Proof

Since Gaussian distributions are rotationally symmetric, we can assume without loss of generality that Inline graphic and Inline graphic with Inline graphic. Let Inline graphic. Then Inline graphic. Thus, Inline graphic if and only if Inline graphic falls into an interval of length Inline graphic. Since Inline graphic is a 1-dimensional Gaussian random variable with a standard deviation of Inline graphic, the probability for this is bounded from above by Inline graphic since the maximum density of a 1-dimensional Gaussian of standard deviation Inline graphic is bounded from above by Inline graphic. Inline graphic

Single 2-Changes

In this section, we prove a simple bound for the expected number of iterations of 2-opt with squared Euclidean distances. This bound holds for all Inline graphic. In the next section, we improve this bound for the case Inline graphic using pairs of linked 2-changes.

Lemma 3.13

For Inline graphic, we have Inline graphic.

Proof

Consider a 2-change where Inline graphic and Inline graphic are replaced by Inline graphic and Inline graphic. Its improvement is given by Inline graphic. We let an adversary fix Inline graphic. Then we draw Inline graphic. This fixes the distance Inline graphic. Now we draw Inline graphic. This fixes Inline graphic. The 2-change yields an improvement of at most Inline graphic only if Inline graphic falls into an interval of size at most Inline graphic. According to Lemma 3.12, the probability that this happens is at most Inline graphic.

Now let g be the probability density of Inline graphic. Then the probability that the 2-change yields an improvement of at most Inline graphic is bounded from above by

graphic file with name d33e3514.gif

The first step is due to Lemma 3.7. The second step is due to Lemma 3.5 using Inline graphic and Inline graphic. The lemma follows by a union bound over the Inline graphic possible 2-changes. Inline graphic

Theorem 3.14

For all Inline graphic, the expected length of the longest path in the 2-opt state graph corresponding to d-dimensional instances with squared Euclidean distances is at most Inline graphic.

Proof

With a probability of at least Inline graphic, the instance is contained in a hypercube of side length Inline graphic. Thus, the longest edge has a length of at most Inline graphic. Therefore, the initial tour has a length of at most Inline graphic. We combine this with Lemmas 3.9 and 3.13 to complete the proof. Inline graphic

Pairs of Linked 2-Changes

We can obtain a better bound than in the previous section by analyzing pairs of linked 2-changes. With the following three lemmas, we analyze the probability that pairs of linked 2-changes of type 0, 1a, or 1b yield an improvement of at most Inline graphic.

Lemma 3.15

For Inline graphic, the probability that there exists a pair of type 0 of linked 2-changes that yields an improvement of at most Inline graphic is bounded from above by Inline graphic.

Proof

Consider a fixed pair of type 0 of linked 2-changes involving the six points Inline graphic as described in Sect. 3.2. We show that the probability that it yields an improvement of at most Inline graphic is at most Inline graphic. A union bound over the Inline graphic possibilities of pairs of type 0 yields the lemma.

The basic idea is that we restrict ourselves to analyzing Inline graphic and Inline graphic only in order to bound the probability that we have a small improvement. In this way, we use the principle of deferred decision to show that we can analyze the improvements of the two 2-changes as if they were independent:

  1. We let an adversary fix Inline graphic arbitrarily.

  2. We draw Inline graphic, which determines the distance Inline graphic.

  3. We draw Inline graphic. This fixes the position of the “bad” interval for Inline graphic. Its size is already fixed since we know the positions of Inline graphic and Inline graphic. The position of Inline graphic is still random.

  4. We draw Inline graphic. The probability that Inline graphic assumes a position such that the first 2-change yields an improvement of at most Inline graphic is thus at most Inline graphic.

  5. We draw Inline graphic. This determines the distance Inline graphic.

  6. We draw Inline graphic. The probability that Inline graphic assumes a position such that the second 2-change yields an improvement of at most Inline graphic is thus at most Inline graphic.

Let g be the probability density function of the distance between Inline graphic and Inline graphic, and let Inline graphic be the probability density function of the distance between Inline graphic and Inline graphic. By independence of the points, the probability that both 2-changes of the pair yield an improvement of at most Inline graphic is bounded from above by

graphic file with name d33e3866.gif

We observe that Inline graphic is monotonically decreasing in Inline graphic. Thus, by Lemma 3.7, we can replace g and Inline graphic by the density Inline graphic of the chi distribution to get the following upper bound for the probability that a pair of type 0 yields an improvement of at most Inline graphic:

graphic file with name d33e3910.gif

Here, we use Lemma 3.5 with Inline graphic, which is allowed since Inline graphic. Inline graphic

Lemma 3.16

For Inline graphic, the probability that there exists a pair of type 1a of linked 2-changes that yields an improvement of at most Inline graphic is bounded from above by Inline graphic.

Proof

We can analyze pairs of type 1a in the same way as type 0 pairs in Lemma 3.15. To do this, we analyze Inline graphic and Inline graphic:

  1. We let an adversary fix the position of Inline graphic.

  2. We draw Inline graphic. This fixes Inline graphic.

  3. We draw Inline graphic. This fixes Inline graphic. In addition, this fixes the positions of the intervals into which Inline graphic and Inline graphic must fall if the first or second 2-change yield an improvement of at most Inline graphic.

  4. We draw Inline graphic.

  5. We draw Inline graphic.

The remainder of the proof is identical to the proof of Lemma 3.15, except that we have to take a union bound only over Inline graphic possible choices. Inline graphic

Lemma 3.17

For Inline graphic, the probability that there exists a pair of type 1b of linked 2-changes that yields an improvement of at most Inline graphic is bounded from above by Inline graphic.

Proof

Again, we proceed similarly to Lemma 3.15. We analyze a fixed pair of type 1b, where Inline graphic and Inline graphic are replaced by Inline graphic and Inline graphic in one step and Inline graphic and Inline graphic are replaced by Inline graphic and Inline graphic, and apply a union bound over the Inline graphic possible type 1a pairs. We analyze the probability that Inline graphic or Inline graphic assume a bad value.

We draw the points in the following order:

  1. We fix Inline graphic.

  2. We draw Inline graphic. This fixes the distance Inline graphic, which is crucial for both 2-changes.

  3. We draw Inline graphic.

  4. We draw Inline graphic. The probability that the first 2-change yields an improvement of at most Inline graphic is at most Inline graphic.

  5. We draw Inline graphic. The probability that the second 2-change yields an improvement of at most Inline graphic is at most Inline graphic.

The main difference to Lemma 3.15 is that the sizes of the bad intervals are not independent. However, once the size of the bad intervals is fixed, we can analyze the probabilities that Inline graphic or Inline graphic fall into their bad intervals as independent. Given that Inline graphic is fixed, the probability that the first and the second 2-change yield an improvement of at most Inline graphic is bounded from above by Inline graphic. Since this is decreasing in Inline graphic, we can replace the distribution of Inline graphic by the chi distribution to obtain an upper bound according to Lemma 3.7. Thus, using Lemma 3.5 with Inline graphic and Inline graphic, we obtain the following upper bound for the probability that a pair of type 1b yields an improvement of at most Inline graphic:

graphic file with name d33e4336.gif

With the three lemmas above, we can obtain a bound on the expected number of iterations of 2-opt for TSP with squared Euclidean distances.

Theorem 3.18

For Inline graphic, the expected length of the longest path in the 2-opt state graph corresponding to d-dimensional instances with squared Euclidean distances is at most Inline graphic.

Proof

The probability that any pair of linked 2-changes of type 0, 1a, or 1b yields an improvement of at most Inline graphic is bounded from above by Inline graphic. We apply Lemma 3.9 with Inline graphic and observe that the initial tour has a length of at most Inline graphic with a probability of at least Inline graphic. Inline graphic

Euclidean Distances

Differences of Euclidean Distances

In this section, we have Inline graphic for Inline graphic. Analyzing Inline graphic turns out to be more difficult than analyzing Inline graphic in the previous section. In particular the case when Inline graphic is close to its maximal value of Inline graphic requires special attention. Intuitively, this is for the following reason: if Inline graphic, then z is close L(ab). Assume that Inline graphic for the moment. Then either z is between a and b, which is fine. Or z is not between a and b. Then moving z in the direction of L(ab) does not change Inline graphic at all.

We observe that Inline graphic behaves essentially 2-dimensionally: it depends only on the distance of z from L(ab) (this is x in the following lemma) and on the position of the projection z onto L(ab) (this is y in the following lemma). It also depends on the distance Inline graphic between a and b (this is Inline graphic in the following lemma, and we had this dependency also in the previous section about squared Euclidean distances). The following lemma makes the connection between x and y explicit for a given Inline graphic. Figure 1 depicts the situation described in the lemma.

Fig. 1.

Fig. 1

The situation for Lemma 3.19

Lemma 3.19

Let Inline graphic, Inline graphic, Inline graphic. Let Inline graphic and Inline graphic be two points at a distance of Inline graphic. Let Inline graphic. Then we have

graphic file with name d33e4646.gif 2

for Inline graphic and

graphic file with name d33e4659.gif 3

for Inline graphic. Furthermore, Inline graphic is impossible.

Proof

The last statement follows from the triangle inequality.

We have Inline graphic. Rearranging terms and squaring implies

graphic file with name d33e4691.gif

Squaring again yields

graphic file with name d33e4697.gif

By rearranging terms again, we obtain

graphic file with name d33e4703.gif

Using the assumption Inline graphic or Inline graphic implies the two claims. Inline graphic

As said before, the difficult case in analyzing Inline graphic is when Inline graphic. In terms of the previous lemma, this can only happen if x is small, i.e., if c is close to L(ab), but not between a and b. The following lemmas makes a quantitative statement about this connection.

Lemma 3.20

Let Inline graphic. Assume that Inline graphic and that z has a distance of x from L(ab). Then

graphic file with name d33e4795.gif 4
Proof

Let y be the distance of z from Inline graphic, and let Inline graphic. Then, according to (3), we have

graphic file with name d33e4827.gif

We have Inline graphic. This and the upper bound Inline graphic yield the following weaker bound:

graphic file with name d33e4845.gif 5

We distinguish two cases. The first case is that Inline graphic. In this case, it suffices to show that Inline graphic in order to prove (4). Since Inline graphic, this holds because Inline graphic.

The second case is that Inline graphic. We have

graphic file with name d33e4888.gif

Replacing Inline graphic by Inline graphic in the numerator and Inline graphic by Inline graphic in the denominator of (5), we obtain

graphic file with name d33e4922.gif

Rearranging terms completes the proof. Inline graphic

In order to be able to apply Lemma 3.4, we need the following upper bound on the derivative of y with respect to Inline graphic, given that x is fixed.

Lemma 3.21

For Inline graphic, let Inline graphic with Inline graphic. Assume further that Inline graphic and that Inline graphic. Then the derivative of y with respect to Inline graphic is bounded by

graphic file with name d33e4994.gif
Proof

The derivative of y with respect to Inline graphic is given by

graphic file with name d33e5012.gif

We observe that Inline graphic for all x and allowed choices of Inline graphic and Inline graphic. For the second term, we have

graphic file with name d33e5040.gif

By assumption, we have Inline graphic and Inline graphic. Thus, we have

graphic file with name d33e5058.gif

Using Lemmas 3.21 and 3.4, we can bound the probability that Inline graphic assumes a value in an interval of size Inline graphic.

Lemma 3.22

Let Inline graphic. Let Inline graphic be arbitrary, Inline graphic, and let z be drawn according to a Gaussian distribution with standard deviation Inline graphic. Let Inline graphic. Let I be an interval of length Inline graphic. Then

graphic file with name d33e5137.gif
Proof

We assume throughout this proof that Inline graphic. The case that this is not satisfied is taken care of by the second term in the upper bound for the probability in the statement of the lemma.

Let x denote the distance of z to L(ab), and let y denote the position of the projection of z onto L(ab). First, let us assume that x is fixed. Then, by Lemmas 3.21 and 3.4, the probability that Inline graphic is bounded from above by

graphic file with name d33e5201.gif

Here, the requirements of Lemma 3.21 are satisfied because of Lemma 3.20, or we have Inline graphic.

We observe that this probability is decreasing in x. Thus, in order to get an upper bound for the probability with random x, we can use the Inline graphic-dimensional chi distribution for x according to Lemma 3.7. We obtain

graphic file with name d33e5241.gif

by Lemma 3.5 using Inline graphic and Inline graphic. Since Inline graphic, the lemma follows. Inline graphic

Analysis of Pairs of 2-Changes

We immediately go to pairs of linked 2-changes, as these yield the better bounds.

Lemma 3.23

For Inline graphic, the probability that a pair of linked 2-changes of type 0 yields an improvement of at most Inline graphic or some point lies outside Inline graphic is bounded from above by

graphic file with name d33e5299.gif
Proof

We proceed similarly as in the proof of Lemma 3.15 for type 0 pairs for squared Euclidean distances. We draw the points of a fixed pair of linked 2-changes as in the proof of Lemma 3.15.

In the same way as in the proof of Lemma 3.15, using Lemma 3.22 instead of Lemma 3.12, we obtain that the probability that one fixed of the two 2-changes yields an improvement of at most Inline graphic is bounded from above by

graphic file with name d33e5331.gif

Here, we applied Lemma 3.5 with Inline graphic.

Again in the same way as in the proof of Lemma 3.15, we can analyze both 2-changes of the type 0 pair as if they are independent. Finally, the lemma follows by a union bound over the Inline graphic possibilities for a type 0 pair. Inline graphic

Lemma 3.24

For Inline graphic, the probability that a pair of linked 2-changes of type 1a yields an improvement of at most Inline graphic or some point lies outside Inline graphic is bounded from above by

graphic file with name d33e5385.gif
Proof

The lemma can be proved in the same way as Lemma 3.16 with differences analogous to the proof of Lemma 3.23. Inline graphic

Lemma 3.25

For Inline graphic, the probability that a pair of linked 2-changes of type 1b yields an improvement of at most Inline graphic or some point lies outside Inline graphic is bounded from above by

graphic file with name d33e5427.gif
Proof

Similar to the proof of Lemma 3.17 and using Lemma 3.22, the probability that the two 2-changes of the pair both yield an improvement of at most Inline graphic is bounded from above by

graphic file with name d33e5448.gif

Now the lemma follows by applying Lemma 3.5 with Inline graphic. Inline graphic

Theorem 3.26

For Inline graphic, the expected length of the longest path in the 2-opt state graph corresponding to d-dimensional instances with Euclidean distances is at most Inline graphic.

Proof

We have Inline graphic by Lemmas 3.23, 3.24, and 3.25. If all points are in Inline graphic, then the longest edge has a length of Inline graphic. Thus, the initial tour has a length of at most Inline graphic. Plugging this into Lemma 3.9 yields the result. Inline graphic

Smoothed Analysis of the Approximation Ratio

Technical Preparation

The following standard lemma provides a convenient way to bound the deviation of a perturbed point from its mean in the two-step model.

Lemma 4.1

(Chi-square bound [28, Cor. 2.19]) Let x be a Gaussian random vector in Inline graphic of standard deviation Inline graphic centered at the origin. Then, for Inline graphic, we have Inline graphic

To give large-deviation bounds on sums of independent variables with bounded support, we will make use of a standard Chernoff-Hoeffding bound.

Lemma 4.2

(Chernoff-Hoeffding Bound [9, Exercise 1.1]) Let Inline graphic, where Inline graphic are independently distributed in [0, 1], and Inline graphic. Then, for Inline graphic, we have

graphic file with name d33e5608.gif

Throughout this part of the paper, we assume that the dimension Inline graphic is a fixed constant. Given a sequence of points Inline graphic, we call a collection Inline graphic of edges a tour, if T is connected and every Inline graphic has in- and outdegree exactly one in T. Note that we consider directed tours, which is useful in the analysis in this chapter, but our distances are always symmetric.

Given any collection of edges S, its length is denoted by Inline graphic, where d(uv) denotes the Euclidean distance Inline graphic between points Inline graphic and Inline graphic.

We call a collection Inline graphic a partial 2-optimal tour if T is a subset of a tour and Inline graphic holds for all edges Inline graphic. Our main interests are the traveling salesperson functional Inline graphic as well as the functional Inline graphic that maps the point set X to the length of the longest 2-optimal tour through X.

We note that the results in Sect. 4.2 hold for metrics induced by arbitrary norms in Inline graphic (Lemma 4.4 and 4.5) or typical Inline graphic norms (Lemma 4.6 and 4.7), not only for the Euclidean metric. We conjecture that also the upper bound in Sect. 4.3 holds for more general metrics, while the lower bound in Sect. 4.4 is probably specific for the Euclidean metric. Still, we think that the construction can be adapted to work for most natural metrics.

For obtaining lower bounds on the length of optimal tours, we consider the boundary functional Inline graphic: Inline graphic is the length of the shortest tour through all points in X, where we are allowed to connect points directly or to the boundary, and traversing the boundary of Inline graphic has zero costs. For a proof of the following lemma and for more details about boundary functions of Euclidean optimization problems, we refer to the monograph by Yukich [31].

Lemma 4.3

(Boundary Functional [31, Lemma 3.7]) There is a constant Inline graphic such that for all sets Inline graphic of n points, we have Inline graphic.

Length of 2-Optimal Tours under Perturbations

In this section, we provide an upper bound for the length of any 2-optimal tour and a lower bound for the length of any global optimum. These two results yield an upper bound of Inline graphic for the approximation ratio.

Chandra et al. [7] proved a bound on the worst-case length of 2-optimal tours that, in fact, already holds for the more general notion of partial 2-optimal tours. For an intuition why this is true, let us point out that their proof strategy is to argue that not too many long arcs in a tour may have similar directions due to the 2-optimality of the edges, while short edges do not contribute much to the length. The claim then follows from a packing argument. It is straight-forward to verify that it is never required that the collection of edges is closed or connected.

Lemma 4.4

(Length of partial 2-optimal tours [7, Theorem 5.1], paraphrased) There exists a constant Inline graphic such that for every sequence X of n points in Inline graphic, any partial 2-optimal tour has length less than Inline graphic.

While this bound directly applies to any perturbed instance under the one-step model, Gaussian perturbations fail to satisfy the premise of bounded support in Inline graphic. However, Gaussian tails are sufficiently light to enable us to translate the result to the two-step model by carefully taking care of outliers.

Lemma 4.5

There exists a constant Inline graphic such that for any Inline graphic the following statement holds. For any Inline graphic, the probability that any partial 2-optimal tour on Inline graphic has length greater than Inline graphic, i.e., Inline graphic, is bounded by Inline graphic. Furthermore,

graphic file with name d33e5928.gif

Proof

By translation, assume without loss of generality that the input points are contained in Inline graphic. We define cubes Inline graphic with Inline graphic. The side length of cube Inline graphic is Inline graphic. We consider the partitioning of Inline graphic into the regions Inline graphic and Inline graphic for Inline graphic. For some cube C and any tour T, let Inline graphic denote the edges in T that are completely contained in C. For any tour T, the sequence Inline graphic defined by Inline graphic and Inline graphic, for Inline graphic, partitions the edges of T. Thus, Inline graphic.

For any outcome of the perturbed points, let T be the longest 2-optimal tour. Then, each Inline graphic is a partial 2-optimal tour in Inline graphic. Let Inline graphic be the (random) number of points in Inline graphic, which is an upper bound on the number of points in Inline graphic. At most Inline graphic vertices are incident to the edges Inline graphic, since each such edge is incident to at least one endpoint in Inline graphic and every point has degree 2 in T. Since Inline graphic is a translated unit cube scaled by Inline graphic, Lemma 4.4 yields Inline graphic.

Observe that Inline graphic is not contained in Inline graphic only if its origin has been perturbed by noise of length at least Inline graphic. Thus, let Inline graphic and note that Inline graphic implies that Inline graphic. Hence, for each point Inline graphic, Lemma 4.1 yields

graphic file with name d33e6175.gif

By linearity of expectation, we conclude that Inline graphic for Inline graphic. This yields

graphic file with name d33e6194.gif

where we used Jensen’s inequality for the first inequality.

To derive tail bounds for the length of any 2-optimal tour, let Inline graphic be the upper bound on Inline graphic derived above. By the Chernoff bound (Lemma 4.2), we have

graphic file with name d33e6217.gif

This guarantee is only strong as long as Inline graphic is sufficiently large. Hence, we regard this guarantee only for Inline graphic, where Inline graphic is chosen such that Inline graphic. Assume that Inline graphic for all Inline graphic. Then, analogously to the above calculation, the contribution of Inline graphic is bounded by

graphic file with name d33e6267.gif

Let Inline graphic denote the probability that some Inline graphic fails to satisfy Inline graphic. Then,

graphic file with name d33e6291.gif

Let us continue assuming that all Inline graphic satisfy Inline graphic. Since in particular Inline graphic, at most Inline graphic vertices remain outside Inline graphic. Let Inline graphic. By a union bound,

graphic file with name d33e6334.gif

Assume that the corresponding event holds (i.e., Inline graphic), then the remaining points outside Inline graphic (and hence, outside Inline graphic) are contained in Inline graphic. We conclude that, with probability at least Inline graphic, we have

graphic file with name d33e6371.gif

This finishes the proof, since we have shown that with probability Inline graphic, both the contribution of Inline graphic and Inline graphic is bounded by Inline graphic. Inline graphic

We complement the bound above by a lower bound on tour lengths of perturbed inputs, making use of the following result by Englert et al. [12] for the one-step model.

Lemma 4.6

(Englert et al. [12, Proof of Theorem 1.4]) Let Inline graphic be a Inline graphic-perturbed instance. Then with probability Inline graphic, any tour on Inline graphic has length at least Inline graphic.

It also follows from their results that this bound translates to the two-step model consistently with the intuitive correspondence of Inline graphic between the one-step and the two-step model.

Lemma 4.7

Let Inline graphic be an instance of points in the unit cube perturbed by Gaussians of standard deviation Inline graphic. Then with probability Inline graphic any tour on Inline graphic has length at least Inline graphic.

Proof

We summarize the arguments of Englert et al. [12, Section 6] first, who considered truncated Gaussian perturbations: Here, we condition the Gaussian perturbation Inline graphic for each input point Inline graphic to be contained in Inline graphic for some Inline graphic. Conditioned on this event, the resulting input instance is contained in the cube Inline graphic. By straight-forward calculations, the conditional distribution of each point in C has maximum density bounded by Inline graphic. Moreover, the probability that the condition fails for a single point is bounded by Inline graphic for all i. Thus, by choosing Inline graphic sufficiently large, each point has at least constant probability to satisfy the condition Inline graphic.

Given any instance (with Gaussian perturbations which are not truncated), first reveal the (random) subinstance of those points for which the condition Inline graphic is satisfied and let Inline graphic be the number of such points. By the Chernoff bound (Lemma 4.2), and Inline graphic, we have Inline graphic for some Inline graphic with probability at least Inline graphic. If this event occurs, we obtain a random instance of Inline graphic points and maximum density Inline graphic. Hence an application of Lemma 4.6 yields that, for some constant Inline graphic, the probability that a tour of length less than Inline graphic exists is at most Inline graphic. Inline graphic

Note that Lemmas 4.5 and 4.7 almost immediately yield the following bound on the approximation performance for the two-step model. (The large-deviation bound is immediate. For the expected approximation ratio, we make use of the older and non-tight worst-case bound of Inline graphic, given in Lemma 4.9 below.)

Observation 4.8

Let Inline graphic be an instance of points in the unit cube perturbed by Gaussians of standard deviation Inline graphic. Then the approximation performance of 2-Opt is bounded by Inline graphic in expectation and with probability Inline graphic.

We remark that this bound is best possible for an analysis of perturbed instances that separately bounds the lengths of any 2-optimal tour from above and gives a lower bound on any optimal tour. To see this, we argue that Lemma 4.6, Lemma 4.4 (even under Inline graphic-perturbed input), Lemma 4.7 and Lemma 4.5 cannot be improved in general. This is straight-forward for Lemma 4.6, since n points distributed uniformly at random in a cube of volume Inline graphic always have, by scaling and Lemma 4.4, a tour of length Inline graphic. Hence, the lower bound on optimal tours on perturbed instances is tight. To see that the upper bound on any 2-optimal tour is tight, take n uniformly distributed points that have, by Lemma 4.6, an optimal tour of length Inline graphic with high probability and thus also in expectation.

Naturally, this transfers to the case of Gaussian perturbations, albeit more technical to verify: If we place n identical points in Inline graphic, say at the origin, and perturb them with Gaussians of standard deviation Inline graphic, then we may without loss of generality scale the unit cube to Inline graphic and perturb the points with standard deviation 1 instead. By Lemma 4.5, any 2-optimal tour and, thus, any optimal tour on these points has a length of Inline graphic on the scaled instance, since the origins are still contained in the unit cube. Thus, the optimal tour on the original instance has a length of at most Inline graphic in expectation and with high probability.

We only sketch that 2-optimal tours can have a length of at least Inline graphic: We distribute the n (unperturbed) points into Inline graphic groups of Inline graphic points each, and we partition the cube Inline graphic into Inline graphic subcubes of equal side length. Let Inline graphic be a constant such that with high probability, at least Inline graphic points of a group remain in their subcube after perturbation. We call these points successful. Since successful points are identically distributed, conditioned on falling into a compact set, the shortest tour through these (at least) Inline graphic points has a length of at least Inline graphic for some other constant Inline graphic [31]. (This is just a scaled version of perturbing and truncating a Gaussian of standard deviation 1 to a unit hypercube, which would result in a tour length of Inline graphic for m points.) By closeness of the tour on all points to the boundary functional and geometric superadditivity of the boundary functional (see Yukich [31] for details), it follows that the optimal tour on all successful points has a length of at least Inline graphic.

Upper Bound on the Approximation Performance

In this section, we establish an upper bound on the approximation performance of 2-Opt under Gaussian perturbations. We achieve a bound of Inline graphic. Due to the lower bound presented in Sect. 4.4, improving the smoothed approximation ratio to Inline graphic is impossible. Thus, our bound is almost tight.

As noted in the previous section, to beat Inline graphic it is essential to exploit the structure of the unperturbed input. This will be achieved by classifying edges of a tour into long and short edges and bounding the length of long edges by a (worst-case) global argument and short edges locally against the partial optimal tour on subinstances (by a reduction to an (almost-)average case). The local arguments for short edges will exploit how many unperturbed origins lie in the vicinity of a given region.

The global argument bounding long edges follows from the older Inline graphic bound on the worst-case approximation performance [7] that we rephrase here for our purposes.

Lemma 4.9

(Chandra et al. [7, Proof of Theorem 4.3]) Let T be a 2-optimal tour and Inline graphic denote the length of the optimal traveling salesperson tour Inline graphic. Let Inline graphic contain the set of all edges in T whose length is in Inline graphic. Then Inline graphic. In particular, it follows that Inline graphic.

In the proof of our bound of Inline graphic, the above lemma accounts for all edges of length Inline graphic. A central idea to bound all shorter edges is to apply the one-step model result to small parts of the input space. In particular, we will condition sets of points to be perturbed into cubes of side length Inline graphic. The following technical lemma helps to capture what values of Inline graphic suffice to express the conditional density function of these points depending on the distance of their unperturbed origins to the cube. This allows for appealing to the one-step model result of Lemma 4.6.

Lemma 4.10

Let Inline graphic and Inline graphic. Let Y be the random variable Inline graphic conditioned on Inline graphic and Inline graphic be the corresponding probability density function. Then Inline graphic is bounded from above by Inline graphic.

Proof

Let Inline graphic be the probability density function of X. Let Inline graphic be the point in Q that is closest to c. Then, since Inline graphic is rotationally invariant around c and decreasing in Inline graphic, the density Inline graphic inside Q is maximized at Inline graphic. Likewise, Inline graphic minimizes the density inside Q. Since Q is a Inline graphic-cube in Inline graphic, Inline graphic, where Inline graphic denotes the all-ones vector. Given Inline graphic, we can thus bound the conditional probability density function Inline graphic for Inline graphic by

graphic file with name d33e7155.gif

It remains to bound, for Inline graphic,

graphic file with name d33e7168.gif

Since for all Inline graphic, Inline graphic, we can bound Inline graphic, yielding the claim. Inline graphic

The main result of this section is the following theorem, which will be proved in the remainder of the section.

Theorem 4.11

Let Inline graphic be an instance of points in Inline graphic perturbed by Gaussians of standard deviation Inline graphic. With probability Inline graphic for any constant Inline graphic, we have Inline graphic. Furthermore,

graphic file with name d33e7240.gif

Since the approximation performance of 2-Opt is bounded by Inline graphic in the worst-case, we may assume that Inline graphic for all constant Inline graphic, since otherwise our smoothed result is superseded by Lemma 4.9. Furthermore, we may also assume that Inline graphic, since otherwise Observation 4.8 already yields the result. In what follows, let Inline graphic and T be any optimal and longest 2-optimal, respectively, traveling salesperson tour on Inline graphic. Furthermore, we let Inline graphic denote the length of the shortest traveling salesperson tour.

Outliers and Long Edges

We will first show that the contribution of almost all points outside Inline graphic is bounded by Inline graphic with high probability and in expectation, similar to Lemma 4.5. For this, we define growing cubes Inline graphic, where we set Inline graphic for Inline graphic and Inline graphic. Let Inline graphic be the number of points not contained in Inline graphic. For every point Inline graphic, Lemma 4.1 with Inline graphic bounds Inline graphic (note that we have chosen the Inline graphic such that Inline graphic). Thus, Inline graphic. We define Inline graphic as the set of edges of the longest 2-optimal tour T contained in Inline graphic with at least one endpoint in Inline graphic. We first bound the contribution of the edges in Inline graphic with Inline graphic.

Lemma 4.12

With probability Inline graphic for any constant Inline graphic, we have

graphic file with name d33e7446.gif

In addition, we have Inline graphic.

Proof

The proof is analogous to the proof of Lemma 4.5. Linearity of expectation, Lemma 4.4, and Jensen’s inequality yield

graphic file with name d33e7468.gif

By observing that Inline graphic is bounded by a constant, we conclude that Inline graphic is bounded by Inline graphic.

Let Inline graphic be the upper bound on Inline graphic derived above. By the Chernoff bounds (Lemma 4.2), we have

graphic file with name d33e7510.gif

Choose Inline graphic such that Inline graphic. Thus, Inline graphic. Assume that Inline graphic for all Inline graphic. Then, analogously to the above calculation, the contribution of Inline graphic is bounded by

graphic file with name d33e7553.gif

Note that the probability that some Inline graphic fails to satisfy Inline graphic is bounded by

graphic file with name d33e7572.gif

for any constant Inline graphic. Since Inline graphic, at most Inline graphic vertices remain outside Inline graphic. Let Inline graphic. By a union bound, for any constant Inline graphic,

graphic file with name d33e7615.gif

Assume that we have the – very likely – event that all points are in Inline graphic, then the remaining points outside Inline graphic are contained in Inline graphic. We conclude that

graphic file with name d33e7640.gif

In the remainder of the proof, we bound the total length of edges inside Inline graphic. Define Inline graphic and note that all edges in C have bounded length Inline graphic. We let Inline graphic contain the set of all those edges within C (in the longest 2-optimal tour T) whose lengths are in Inline graphic. Let Inline graphic be such that Inline graphic. Then Inline graphic for all Inline graphic, since no longer edges exist. Let Inline graphic be such that Inline graphic. Then Inline graphic by Lemma 4.9. This argument bounds the contribution of long edges, i.e., edges longer than Inline graphic, in the worst case, after observing the perturbation of the input points. It remains to bound the length of short edges in C, which we do in the next section.

Short Edges

To account for the length of the remaining edges, we take a different route than for the long edges: Call an edge that is shorter than Inline graphic a short edge and partition the bounding box Inline graphic into a grid of Inline graphic-cubes Inline graphic with Inline graphic, which we call cells. All edges in Inline graphic for Inline graphic, i.e., short edges, are completely contained in a single cell or run from some cell Inline graphic to one of its Inline graphic neighboring cells. For a given tour T, let Inline graphic denote the short edges of T for which at least one of the endpoints lies in Inline graphic.

We aim to relate the length of the edges Inline graphic for the longest 2-optimal tour T to the length of the edges Inline graphic of the optimal tour Inline graphic. This local approach is justified by the following property.

Lemma 4.13

For any tour Inline graphic, the contribution Inline graphic of cell Inline graphic is lower bounded by Inline graphic.

Proof

Consider all edges S in Inline graphic that have at least one endpoint in Inline graphic. Replacing those edges Inline graphic with Inline graphic and Inline graphic by the shortest edge connecting u to the boundary of Inline graphic does not increase the total edge length by triangle inequality. If Inline graphic were the unit cube, Inline graphic would thus be lower bounded by the boundary functional Inline graphic. Instead, we scale the instance Inline graphic by Inline graphic to obtain an instance Inline graphic in the unit cube, satisfying Inline graphic and, as argued above, Inline graphic. Thus an application of Lemma 4.3 yields

graphic file with name d33e7986.gif

Intuitively, a cell Inline graphic is of one of two kinds: either few points are expected to be perturbed into it and hence it cannot contribute much to the length of any 2-optimal tour (a sparse cell), or many unperturbed origins are close to the cell (a heavy cell). In the latter case, either the conditional densities of points perturbed into Inline graphic are small, hence any optimal tour inside Inline graphic has a large value by Lemma 4.6, or we find another cell close to Inline graphic that has a very large contribution to the length of any tour.

To formalize this intuition, fix a cell Inline graphic and let Inline graphic be the expected number of points Inline graphic with Inline graphic. Assume for convenience that Inline graphic and Inline graphic are integer. We describe the position of a cube Inline graphic canonically by indices Inline graphic. For two cells Inline graphic and Inline graphic, we define their distance as Inline graphic. For Inline graphic, let Inline graphic denote all cells of distance k to Inline graphic and let Inline graphic denote the cardinality of unperturbed origins located in a cell in Inline graphic. We call a perturbed point Inline graphic with unperturbed origin Inline graphic, for some Inline graphic, a k-successful point. Let Inline graphic denote the set of all k-successful points. Then Inline graphic.

Our first technical lemma shows that any cell Inline graphic, having (in expectation) a large number Inline graphic of points perturbed into it from cells of distance at most K, contributes at least Inline graphic to the length of the optimal tour.

Lemma 4.14

Let Inline graphic and define Inline graphic as the set of k-successful points for Inline graphic. Let Inline graphic. If Inline graphic, then with probability Inline graphic, we have

graphic file with name d33e8243.gif
Proof

Note that by Lemma 4.13, Inline graphic. Fix any realization of Inline graphic, i.e., choice of unperturbed origins inside some cell in Inline graphic whose perturbed points fall into Inline graphic. We can simulate the distribution of Inline graphic (under this realization of Inline graphic) by appealing to the one-step model. Note that each point in Inline graphic is distributed as a Gaussian conditioned on containment in cell Inline graphic. By rotational invariance of the Gaussian distribution, Lemma 4.10 is applicable and bounds the conditional density function of each point in Inline graphic by Inline graphic. By scaling, we obtain an instance in the unit cube with Inline graphic points distributed according to density functions of maximum density Inline graphic. Hence, by Lemma 4.6 we obtain that any tour has length Inline graphic on the scaled instance with probability Inline graphic. Scaling back to Inline graphic, we obtain Inline graphic. Since by Chernoff bounds (Lemma 4.2), Inline graphic with probability Inline graphic, we finally obtain, using Lemma 4.13,

graphic file with name d33e8379.gif

with probability Inline graphic, where we used that Inline graphic. Inline graphic

The following simple technical lemma shows that with constant probability, a point is perturbed into the cell it originates in.

Lemma 4.15

Let Inline graphic and Inline graphic. Then Inline graphic.

Proof

Let Inline graphic be the probability density function of Z. For all Inline graphic, we have Inline graphic and hence Inline graphic. This yields

graphic file with name d33e8459.gif

We are set-up to formally show the classification of heavy cells. Recall that Inline graphic denotes the number of cells Inline graphic.

Lemma 4.16

Let Inline graphic, Inline graphic and Inline graphic for sufficiently small constants Inline graphic. Then we can classify each cell Inline graphic with Inline graphic into one of the following two types.

  1. With probability Inline graphic for any constant Inline graphic, we have
    graphic file with name d33e8542.gif
  2. There is some Inline graphic such that for any Inline graphic, we have
    graphic file with name d33e8564.gif
    with probability Inline graphic for any constant Inline graphic.
Proof

We start with some intuition. By Lemma 4.4, we can bound Inline graphic. If we have Inline graphic, then Lemma 4.14 already proves Inline graphic to have type (T1). Otherwise, by tail bounds for the Gaussian distribution, we argue that some cell Inline graphic in distance at most Inline graphic contains at least Inline graphic unperturbed origins. These are sufficiently many to let Inline graphic contribute Inline graphic, for any Inline graphic, to the optimal tour length.

To make the intuition formal, note that all edges in Inline graphic are contained in a cube of side length Inline graphic around Inline graphic. By Chernoff bounds (Lemma 4.2), at most Inline graphic points are contained in Inline graphic with probability Inline graphic. Hence, Lemma 4.4 bounds

graphic file with name d33e8692.gif 6

with probability Inline graphic.

Case 1: Inline graphic. In this case, we may appeal to Lemma 4.14 (since Inline graphic) and obtain

graphic file with name d33e8724.gif 7

with probability Inline graphic, since Inline graphic and Inline graphic can be chosen sufficiently small. By a union bound, (6) and (7) hold with probability Inline graphic for any constant Inline graphic, proving that Inline graphic has type (T1).

Case 2: Inline graphic. Every point in Inline graphic has an Inline graphic-distance of at least Inline graphic to every point in Inline graphic. Thus, by Lemma 4.1, we have

graphic file with name d33e8813.gif 8

for sufficiently large k. Since Inline graphic, we can choose a sufficiently small constant Inline graphic such that Inline graphic satisfies Inline graphic. From Inline graphic, we conclude

graphic file with name d33e8854.gif

Hence, we have

graphic file with name d33e8860.gif

By (8), it follows that

graphic file with name d33e8869.gif

unperturbed origins are situated in cells in distance Inline graphic from Inline graphic. Note that there are at most Inline graphic such cells and Inline graphic for any Inline graphic. By pigeon hole principle, there is a cell Inline graphic with Inline graphic many unperturbed origins.

Let Inline graphic be the 0-successful points for cell Inline graphic, i.e., the points with origin in Inline graphic that are perturbed into Inline graphic. By Lemma 4.15, each unperturbed origin Inline graphic has constant probability to be perturbed into Inline graphic, i.e., Inline graphic. Hence, Inline graphic. Thus, Lemma 4.14 bounds

graphic file with name d33e8976.gif 9

with probability Inline graphic. Since (6) and (9) hold simultaneously with probability Inline graphic for any constant Inline graphic, this proves that Inline graphic has type (T2).

Inline graphic

Total Length of 2-Optimal Tours

With the analyses of the previous subsections, we can finally bound the total length of 2-optimal tours. To bound the total length of short edges, consider first sparse cells Inline graphic, i.e., cells containing Inline graphic perturbed points in expectation (recall that Inline graphic, where Inline graphic is the number of cells). For each such cell, the Chernoff bound (Lemma 4.2) yields that with probability Inline graphic, at most Inline graphic points are contained in Inline graphic, since each point is perturbed independently. By a union bound, no sparse cell contains more than Inline graphic points with probability at least Inline graphic for any constant Inline graphic. In this event, Lemma 4.4 allows for bounding the contribution of sparse cells by

graphic file with name d33e9094.gif 10

For bounding the length in the remaining cells (the heavy cells), let Inline graphic and Inline graphic. We observe the following: with probability at least Inline graphic, all type-(T1) cells Inline graphic satisfy Inline graphic. Thus,

graphic file with name d33e9132.gif 11

where the last inequality follows from Inline graphic, which holds since every edge in Inline graphic (inside C) is counted at most twice on the left-hand side.

Let Inline graphic be any function that assigns to each cell Inline graphic of type-(T2) a corresponding cell Inline graphic satisfying the condition (T2). We say that Inline graphic charges Inline graphic. We can choose any Inline graphic and have with probability at least Inline graphic that Inline graphic for all Inline graphic. Assume that this event occurs. Since every cell Inline graphic can only be charged by cells in distance Inline graphic, each cell can only be charged Inline graphic times. Hence,

graphic file with name d33e9233.gif

Since Inline graphic, choosing Inline graphic sufficiently large yields

graphic file with name d33e9251.gif 12
Proof of Theorem 4.11

By a union bound, we can bound by Inline graphic, for any constant Inline graphic, the probability that (i) Inline graphic (by Lemma 4.7), (ii) all edges outside C contribute Inline graphic (by Lemma 4.12), (iii) all sparse cells contribute Inline graphic (by (10)), (iv) the type-(T1) cells Inline graphic induce a cost of Inline graphic (by (11)), and (v) the type-(T2) cells induce a cost of Inline graphic (by (12)). Since the remaining edges are long edges and contribute only Inline graphic, we obtain that every 2-optimal tour has a length of at most Inline graphic with probability Inline graphic.

Since a 2-optimal tour always constitutes a Inline graphic-approximation to the optimal tour length by Lemma 4.9, we also obtain that the expected cost of the worst 2-optimal tour is bounded by

graphic file with name d33e9358.gif

Lower Bound on the Approximation Ratio

We complement our upper bound on the approximation performance by the following lower bound: for Inline graphic, the worst-case lower bound is robust against perturbations. For this, we face the technical difficulty that in general, a single outlier might destroy the 2-optimality of a desired long tour, potentially cascading into a series of 2-Opt iterations that result in a substantially different or even optimal tour.

Theorem 4.17

Let Inline graphic. For infinitely many n, there is an instance X of points in Inline graphic perturbed by normally distributed noise of standard deviation Inline graphic such that with probability Inline graphic for any constant Inline graphic, we have Inline graphic. This also yields

graphic file with name d33e9425.gif

We remark that our result transfers naturally to the one-step model with Inline graphic and interestingly, holds with probability 1 over such random perturbations.

Proof of Theorem 4.17. We alter the construction of Chandra et al. [7] to strengthen it against Gaussian perturbations with standard deviation Inline graphic (see Fig. 2). Let Inline graphic be an odd integer and Inline graphic. The original instance of [7] is a subset of the Inline graphic-grid, which we embed into Inline graphic by scaling by 1/P, and consists of three parts Inline graphic, Inline graphic and Inline graphic. The vertices in Inline graphic are partitioned into the layers Inline graphic. Layer i consists of Inline graphic equidistant vertices, each of which has a vertical distance of Inline graphic to the point above it in Layer Inline graphic and a horizontal distance of Inline graphic to the nearest neighbor(s) in the same layer. The set Inline graphic is a copy of Inline graphic shifted to the right by a distance of 2/3. The remaining part Inline graphic consists of a copy of Layer p of Inline graphic shifted to the right by 1/3 to connect Inline graphic and Inline graphic by a path of points. We regard Inline graphic as the set of Layer-i points in Inline graphic.

Fig. 2.

Fig. 2

Parts Inline graphic and Inline graphic of the lower bound instance. Each point is contained in a corresponding small container (depicted as brown circle) with high probability. The black lines indicate the constructed 2-optimal tour, which runs analogously on Inline graphic

As in the original construction, we will construct an instance of Inline graphic points, which implies Inline graphic. Let Inline graphic be the largest odd integer such that Inline graphic. In our construction, we drop all Layers Inline graphic in both Inline graphic and Inline graphic, as well as Layer p in Inline graphic. Instead, we connect Inline graphic and Inline graphic already in Layer t by an altered copy of Layer t of Inline graphic shifted to the right by 1/3. Let C be an arbitrary point of our construction, for convenience we will use the central point of Layer t in Inline graphic. We introduce Inline graphic additional copies of this point C. These surplus points serve as a “padding” of the instance to ensure Inline graphic. Note that the resulting instance has Inline graphic layers Inline graphic. We choose t such that the magnitude of perturbation is negligible compared to the pairwise distances of all non-padding points. Furthermore, the restriction on Inline graphic ensures that incorporating the padding points increases the optimal tour length only by a constant.

Lemma 4.18

With probability Inline graphic for any constant Inline graphic, the optimal tour has length O(1).

Proof

Let n be the number of points in the constructed instance. Note that Inline graphic consists of (i) a subset Inline graphic of the instance of Chandra et al. [7], plus (ii) an additional copy Inline graphic of Layer t and (iii) the padding points Inline graphic in Inline graphic. Denote the number of points in Inline graphic by Inline graphic. We have

graphic file with name d33e9834.gif

by choice of t. Hence Inline graphic. It is easy to see [7] that the original instance of Chandra et al. has a minimum spanning tree of length Inline graphic. (This is achieved by the spanning tree that includes, for each Layer-i vertex with Inline graphic, the vertical edge to the point above it, and each edge between consecutive points on Layer p.) Clearly,

graphic file with name d33e9872.gif

Consider the perturbed instance Inline graphic. Note that for every constant Inline graphic, we have Inline graphic for sufficiently large n. Thus for each Inline graphic, the Gaussian noise Inline graphic satisfies Inline graphic with probability at least Inline graphic by Lemma 4.1. By a union bound, we have Inline graphic with probability at least Inline graphic. In this case, by the triangle inequality, the fact that Inline graphic for all point sets Y and since only a constant number of edges connects the three parts, we obtain

graphic file with name d33e9949.gif

Note that we may translate and scale Inline graphic to be contained in Inline graphic, by which Inline graphic may be regarded as the optimal tour length on an instance of Inline graphic points in Inline graphic perturbed by Gaussians with standard deviation 1. By Lemma 4.5, any 2-optimal tour and hence also the optimal tour on the scaled instance has length Inline graphic with probability Inline graphic. Scaling back to the original instance, we obtain Inline graphic with probability Inline graphic. This yields the result by a union bound. Inline graphic

We find a long 2-optimal tour on all non-padding points analogously to the original construction by taking a shortcut of the original 2-optimal tour, which connects Inline graphic and Inline graphic already in Layer t (see Fig. 2).

Consider the padding points, which are yet to be connected. Let Inline graphic denote the nearest point in Layer t of Inline graphic that is to the left of C. Symmetrically, Inline graphic is the nearest point to the right of C. Let Inline graphic be any 2-optimal path from Inline graphic to Inline graphic that passes through all the padding points (including C). We replace the edges Inline graphic and Inline graphic by the path Inline graphic, completing the construction of our tour T.

Lemma 4.19

Let Inline graphic be arbitrary. With probability Inline graphic, T is 2-optimal and has a length of Inline graphic.

Note that given Lemma 4.19, Theorem 4.17 follows directly using Lemma 4.18. The (rather technical) proof of Lemma 4.19 hence concludes our lower bound.

Probability of 2-optimality.

To account for the perturbation in the analysis, we define a safe region for every point. More formally, let Inline graphic be any unperturbed origin. We define its container Inline graphic as the circle centered at Inline graphic with radius Inline graphic. Very likely, all perturbed points lie in their containers.

Lemma 4.20

For sufficiently large p, the tour T constructed as described in Sect. 4.4 is 2-optimal, provided that all points Inline graphic lie in their corresponding containers Inline graphic.

We first show that this lemma implies Lemma 4.19.

Proof of Lemma 4.19

Let Inline graphic, and let Inline graphic be arbitrary. Since Inline graphic, we have Inline graphic for sufficiently large n. By definition of the containers, Lemma 4.1 yields that for any point Inline graphic and sufficiently large n,

graphic file with name d33e10260.gif

By a union bound, we conclude that with probability Inline graphic, all points are contained in their corresponding containers and hence, by the previous lemma, T is 2-optimal.

Recall that t is the largest odd integer satisfying Inline graphic. Since Inline graphic, this implies Inline graphic. Observe that T visits Inline graphic many layers and crosses a horizontal distance of 2/3 in each of them. Hence, it has a length of at least Inline graphic. Inline graphic

In the remainder of this section, we prove Lemma 4.20, i.e., show that the constructed tour is 2-optimal, provided all points stay inside their respective containers. Clearly, it suffices to show for any pair of edges (uv) and (wz) in the tour, the corresponding 2-change, i.e., replacing these edges by (uw) and (vz) does not reduce the tour length, i.e., Inline graphic. We first state the technical lemmas capturing the ideas behind the construction.

The first lemma treats pairs of horizontal edges and establishes how large their vertical distance must be in order to make swapping these edges increase the length of the tour. It is a generalization of a similar lemma of Chandra et al. [7] to a perturbation setting, in which points are placed arbitrarily into small containers.

Note that in what follows, for a point Inline graphic, we let Inline graphic denote its x-coordinate and Inline graphic its y-coordinate. Furthermore, for any points Inline graphic, we let Inline graphic and Inline graphic denote their horizontal and vertical distance, respectively.

Lemma 4.21

Let Inline graphic and Inline graphic be horizontal line segments in the Euclidean plane with Inline graphic and Inline graphic. Let Inline graphic, Inline graphic, Inline graphic and Inline graphic be circles of radius Inline graphic with centers p, q, r and s, respectively. If Inline graphic and the vertical distance Inline graphic between Inline graphic and Inline graphic is at least

graphic file with name d33e10505.gif

then, for all Inline graphic, we have

graphic file with name d33e10517.gif

Proof

Note that Inline graphic. Furthermore, we have that

graphic file with name d33e10532.gif

and hence

graphic file with name d33e10538.gif

where the right-hand side expression is at least 0, since Inline graphic by assumption. Let Inline graphic and Inline graphic, then it is straight-forward to verify that the expression

graphic file with name d33e10563.gif 13

subject to Inline graphic is minimized when Inline graphic.

Hence, we can bound (13) by

graphic file with name d33e10587.gif

where the third line follows from our assumption on v. Inline graphic

The following very basic lemma shows that a sequence of edges that share roughly the same direction will always be 2-optimal.

Lemma 4.22

Let Inline graphic and Inline graphic be a sequence of points in Inline graphic such that all connecting segments Inline graphic fulfill Inline graphic. Then,

graphic file with name d33e10639.gif

Proof

For any point p, let Inline graphic denote the cone Inline graphic. Let Inline graphic, then by assumption, we have Inline graphic and thus Inline graphic. Let us assume that Inline graphic (the other case is symmetric). Since by assumption, Inline graphic, we have for Inline graphic that Inline graphic and Inline graphic for some Inline graphic and Inline graphic with Inline graphic. If Inline graphic, the claim is immediate from Inline graphic. Otherwise, for Inline graphic, we obtain

graphic file with name d33e10750.gif

By an analogous computation, Inline graphic follows and hence the claim. Inline graphic

We can now prove Lemma 4.20. Assume that all points are contained in their respective containers. We call an edge between Inline graphic and Inline graphic horizontal (or vertical) if the edge between Inline graphic and Inline graphic is horizontal (or vertical) and neither Inline graphic nor Inline graphic belong to the set of padding points. In what follows, we will first consider horizontal-horizontal, horizontal-vertical and vertical-vertical edge pairs and then turn to pairs of edges for which at least one edge is adjacent to some padding point. Recall that Inline graphic is chosen such as to satisfy Inline graphic.

Horizontal-horizontal edge pair Let Inline graphic and Inline graphic be two horizontal edges. Horizontal edges Inline graphic with Inline graphic appear only if Inline graphic. We distinguish the following cases.

  1. Inline graphic: Both edges are in the same layer. Note that no 2-change swaps neighboring edges. Assume without loss of generality that Inline graphic (the other case is symmetric). Since Inline graphic, we have that
    graphic file with name d33e10885.gif
    Similarly, Inline graphic and Inline graphic. This shows that Lemma 4.22 is applicable to Inline graphic, which yields that no 2-change can be profitable.
  2. Inline graphic, and Inline graphic. By construction of T, the edges have opposite direction. Assume that Inline graphic and hence Inline graphic (the other case is symmetric). By construction Inline graphic. We have that Inline graphic. The same reasoning shows that Inline graphic. Similarly, one can show that Inline graphic for all Inline graphic and Inline graphic. Hence the 2-change to Inline graphic and Inline graphic has a crossing, which by triangle inequality cannot be profitable.

  3. Inline graphic, and Inline graphic with Inline graphic and Inline graphic. Either both edges have opposite directions, then the previous argument shows that a 2-change is not profitable. Otherwise, note that the first requirement of Lemma 4.21, Inline graphic, is fulfilled. Also note that Inline graphic, since Inline graphic. We have
    graphic file with name d33e11044.gif
    since for sufficiently large p, we have Inline graphic. Consequently, Lemma 4.21 applies and shows that the 2-change does not yield an improvement.

Horizontal-vertical edge pair. Let Inline graphic be a vertical edge and Inline graphic be a horizontal edge. We assume that the vertical edge is in Inline graphic, since the case Inline graphic is symmetric. Exactly one of the following cases occurs.

  1. Inline graphic and Inline graphic with Inline graphic. The horizontal edge is in the same layer as one of the end points of the vertical edge. Clearly, Inline graphic and Inline graphic. Since a 2-change cannot swap neighboring edges, at least one horizontal segment lies between both edges. By construction of the tour, one of the edges Inline graphic and Inline graphic crosses a vertical distance of at least Inline graphic and the other a horizontal distance of at least Inline graphic. Hence
    graphic file with name d33e11150.gif
    since Inline graphic.
  2. Inline graphic and Inline graphic with Inline graphic. As in the previous case, Inline graphic and Inline graphic. Consider first the case that Inline graphic, then by construction of the tour, one of the edges Inline graphic and Inline graphic crosses a horizontal distance of at least Inline graphic and the other edge crosses a vertical distance of at least Inline graphic, yielding
    graphic file with name d33e11227.gif
    since Inline graphic. Otherwise, if Inline graphic, the edge Inline graphic crosses a vertical distance of at least Inline graphic and hence
    graphic file with name d33e11258.gif
    since Inline graphic. Thus in both cases, a 2-change is not profitable.

Vertical-vertical edge pair. Let Inline graphic and Inline graphic be vertical edges.

  1. Inline graphic and Inline graphic with Inline graphic, i.e., the vertical edges are above each other. By swapping the x- and y-axis in Lemma 4.22, we can show that a 2-change is not profitable, since it is easy to see that Inline graphic for all consecutive pairs (pq) in Inline graphic.

  2. Inline graphic and Inline graphic with Inline graphic. Clearly, Inline graphic and Inline graphic, while Inline graphic and Inline graphic. Hence a 2-change is not profitable, since Inline graphic.

Padding points.

Since we assumed for convenience that the padding points are placed at the central vertex C of Layer t in Inline graphic, only the edges with at least one endpoint in Inline graphic are relevant candidates for the treatment of padding points. This is because all other edges have both endpoints at a distance of 1/6 to the padding points, which can never be accounted for by its edge length, since all edges except in Layer 0 are much shorter than 1/3. Separately, the Layer-0 edges can be handled easily as well: an edge Inline graphic with Inline graphic is a horizontal edge, hence the pair Inline graphic and a Layer-0 edge trigger the corresponding case of horizontal-horizontal edge pairs with even smaller edge length of the edge Inline graphic in Layer t.

It remains to handle the following cases, where we regard C as a padding point, i.e., Inline graphic, not as a Layer-t point.

  1. Inline graphic, and Inline graphic. Clearly, Inline graphic and Inline graphic. Furthermore, at least one of Inline graphic has a horizontal distance of at least Inline graphic to Inline graphic. Hence,
    graphic file with name d33e11501.gif
  2. Inline graphic and Inline graphic. These edge pairs are exactly as regular pairs of Layer-t edges and the corresponding case of horizontal-horizontal edge pairs applies.

  3. Inline graphic. All such edges are 2-optimal by construction, since a 2-optimal path from Inline graphic to Inline graphic passing by all padding points was used.

This concludes the case analysis and thus the proof of Lemma 4.20.

Concluding Remarks

Running-time. Our approach for Euclidean distances does not work for Inline graphic and Inline graphic. However, we can use the bound of Englert et al. [12] for Euclidean distances, which yields a bound polynomial in n and Inline graphic for Inline graphic.

In the same way as Englert et al. [12], we can slightly improve the smoothed number of iterations by using an insertion heuristic to choose the initial tour. We save a factor of Inline graphic for Manhattan and Euclidean distances and a factor of Inline graphic for squared Euclidean distances. The reason is that there always exist tours of length Inline graphic for n points in Inline graphic for Euclidean and Manhattan distances and of length Inline graphic for squared Euclidean distances for Inline graphic [31] (the constants in these upper bounds depend on d). Taking into account also that, because Gaussians have light tails, only few points are far away from the hypercube Inline graphic after perturbation, one might get an even better bound. However, we did not take these improvements into account in our analysis to keep the paper concise.

Of course, even our improved bounds do not fully explain the linear number of iterations observed in experiments. However, we believe that new approaches, beyond analyzing the smallest improvement, are needed in order to further improve the smoothed bounds on the running-time.

Approximation ratio.

We have proved an upper bound of Inline graphic for the smoothed approximation ratio of 2-Opt. Furthermore, we have proved that the lower bound of Chandra et al. [7] remains robust even for Inline graphic. We leave as an open problem to generalize our upper bounds to the one-step model to improve the current bound of Inline graphic [12], but we conjecture that this might be difficult, because of the lack of the nice structure that Gaussian distributions provide.

Given the recent improvement from Inline graphic to Inline graphic by Brodowsky et al. [5], we raise the question of tightening our upper bound to Inline graphic.

While our bound significantly improves the previously known bound for the smoothed approximation ratio of 2-Opt, we readily admit that it still does not explain the performance observed in practice. A possible explanation is that when the initial tour is not picked by an adversary or the nearest neighbor heuristic, but using a construction heuristic such as the spanning tree heuristic or an insertion heuristic, an approximation factor of 2 is guaranteed even before 2-Opt has begun to improve the tour [27]. We chose to compare the worst local optimum to the global optimum in order, as this is arguably the simplest of all technically difficult possibilities.

However, a smoothed analysis of the approximation ratio of 2-Opt initialized with a good heuristic might be difficult: even in the average case, it is only known that the length of an optimal TSP tour is concentrated around Inline graphic for some constant Inline graphic. But the precise value of Inline graphic is unknown [31]. Since experiments suggest that 2-Opt even with good initialization does not achieve an approximation ratio of Inline graphic [16, 17], one has to deal with the precise constants, which seems challenging.

Finally, we conjecture that many examples for showing lower bounds for the approximation ratio of concrete algorithms for Euclidean optimization such as the TSP remain stable under perturbation for Inline graphic. The question remains whether such small values of Inline graphic, although they often suffice to prove polynomial smoothed running time, are essential to explain practical approximation ratios or if already slower decreasing Inline graphic provide a sufficient explanation.

Footnotes

This paper is based on results presented at ISAAC 2013 [25] and ICALP 2015 [20].

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Abramowitz, M., Stegun, I. A.: editors. Pocketbook of mathematical functions. Harri Deutsch, (1984)
  • 2.Arora, S.: Polynomial time approximation schemes for Euclidean traveling salesman and other geometric problems. J. ACM 45(5), 753–782 (1998) [Google Scholar]
  • 3.Bläser, M., Manthey, B., Raghavendra Rao, B.V.: Smoothed analysis of partitioning algorithms for Euclidean functionals. Algorithmica 66(2), 397–418 (2013) [Google Scholar]
  • 4.Bringmann, K., Engels, C., Manthey, B., Raghavendra Rao, B. V.: Random shortest paths: Non-euclidean instances for metric optimization problems. In Krishnendu Chatterjee and Jiří Sgall, editors, Proc. of the 38th Int. Symp. on mathematical foundations of computer science (MFCS), volume 8087 of lecture notes in computer science, pages 219–230. Springer, (2013)
  • 5.Brodowsky, U.A., Hougardy, S., Zhong, X.: The approximation ratio of the k-opt heuristic for the Euclidean traveling salesman problem. SIAM J. Comput. 52(4), 841–864 (2023) [Google Scholar]
  • 6.Brunsch, T., Röglin, H., Rutten, C., Vredeveld, T.: Smoothed performance guarantees for local search. Math. Program. 146(1–2), 185–218 (2014) [Google Scholar]
  • 7.Chandra, B., Karloff, H., Tovey, C.: New results on the old [CDATA[k]]Inline graphic-opt algorithm for the traveling salesman problem. SIAM J. Comput. 28(6), 1998–2029 (1999) [Google Scholar]
  • 8.Curticapean, R., Künnemann, M.: A quantization framework for smoothed analysis of Euclidean optimization problems. Algorithmica 73(3), 1–28 (2015) [Google Scholar]
  • 9.Dubhashi, D.P., Panconesi, A.: Concentration of Measure for the Analysis of Randomized Algorithms. Cambridge University Press, Cambridge (2009) [Google Scholar]
  • 10.Durrett, R.: Probability: Theory and Examples. Cambridge University Press, Cambridge (2013) [Google Scholar]
  • 11.Engels, C., Manthey, B.: Average-case approximation ratio of the 2-opt algorithm for the TSP. Oper. Res. Lett. 37(2), 83–84 (2009) [Google Scholar]
  • 12.Englert, M., Röglin, H., Vöcking, B.: Worst case and probabilistic analysis of the 2-Opt algorithm for the TSP. Algorithmica 68(1), 190–264 (2014) [Google Scholar]
  • 13.Etscheid, M.: Performance guarantees for scheduling algorithms under perturbed machine speeds. Discret. Appl. Math. 195, 84–100 (2015) [Google Scholar]
  • 14.Evans, M., Hastings, N., Peacock, B.: Statistical Distributions, 3rd edn. Wiley, Hoboken (2000) [Google Scholar]
  • 15.Funke, S., Laue, S., Lotker, Z., Naujoks, R.: Power assignment problems in wireless communication: Covering points by disks, reaching few receivers quickly, and energy-efficient travelling salesman tours. Ad Hoc Netw. 9(6), 1028–1035 (2011) [Google Scholar]
  • 16.Johnson, D.S., McGeoch, L.A.: The traveling salesman problem: a case study. In: Aarts, E., Lenstra, J.K. (eds.) Local Search in Combinatorial Optimization. Wiley, Hoboken (1997) [Google Scholar]
  • 17.Johnson, D.S., McGeoch, L.A.: Experimental analysis of heuristics for the STSP. In: Gutin, G., Punnen, A.P. (eds.) The Traveling Salesman Problem and its Variations. Kluwer Academic Publishers, Dordrecht (2002) [Google Scholar]
  • 18.Karger, D., Onak, K.: Polynomial approximation schemes for smoothed and random instances of multidimensional packing problems. In Proc. of the 18th Ann. ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 1207–1216. SIAM, (2007)
  • 19.Kern, W.: A probabilistic analysis of the switching algorithm for the TSP. Math. Program. 44(2), 213–219 (1989) [Google Scholar]
  • 20.Künnemann, M., Manthey, B.: Towards understanding the smoothed approximation ratio of the 2-opt heuristic. In Magnús M. Halldórsson, Kazuo Iwama, Naoki Kobayashi, and Bettina Speckmann, editors, In: Proc. of the 42nd Int. Coll. on Automata, Languages and Programming (ICALP), volume 9134 of Lecture Notes in Computer Science, pages 859–871. Springer, (2015)
  • 21.Manthey, B.: Smoothed analysis of local search. In: Roughgarden, T. (ed.) Beyond the Worst-Case Analysis of Algorithms, pp. 285–308. Cambridge University Press, Cambridge (2020) [Google Scholar]
  • 22.Manthey, B., Röglin, H.: Smoothed analysis: analysis of algorithms beyond worst case. IT Inf Technol 53(6), 280–286 (2011) [Google Scholar]
  • 23.Manthey, B.,Rhijn, J.V.: Improved smoothed analysis of 2-opt for the Euclidean TSP. In Satoru Iwata and Naonori Kakimura, editors, In: Proc. 34th Int. Symposium on Algorithms and Computation (ISAAC), volume 283 of LIPIcs, pages 52:1–52:16. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, (2023)
  • 24.Manthey, B.,Veenstra, R.: Smoothed analysis of the 2-Opt heuristic for the TSP: Polynomial bounds for Gaussian noise. In Leizhen Cai, Siu-Wing Cheng, and Tak-Wah Lam, editors, In: Proc. of the 24th Ann. Int. Symp. on Algorithms and Computation (ISAAC), volume 8283 of Lecture Notes in Computer Science, pages 579–589. Springer, (2013)
  • 25.Mitchell, J.S.B.: Guillotine subdivisions approximate polygonal subdivisions: a simple polynomial-time approximation scheme for Geometric TSP, Inline graphic[CDATA[k]]-MST, and related problems. SIAM J. Comput. 28(4), 1298–1309 (1999) [Google Scholar]
  • 26.Papadimitriou, C.H.: The Euclidean traveling salesman problem is NP-complete. Theoret. Comput. Sci. 4(3), 237–244 (1977) [Google Scholar]
  • 27.Rosenkrantz, D.J., Stearns, R.E., Lewis, P.M., II.: An analysis of several heuristics for the traveling salesman problem. SIAM J. Comput. 6(3), 563–581 (1977) [Google Scholar]
  • 28.Spielman, D.A., Teng, S.-H.: Smoothed analysis of algorithms: Why the simplex algorithm usually takes polynomial time. J. ACM 51(3), 385–463 (2004) [Google Scholar]
  • 29.Spielman, D.A., Teng, S.-H.: Smoothed analysis: an attempt to explain the behavior of algorithms in practice. Commun. ACM 52(10), 76–84 (2009) [Google Scholar]
  • 30.Nijnatten, F.v., Sitters, R., Woeginger, G. J., Wolff, A., de Berg, M.: The traveling salesman problem under squared Euclidean distances. In Jean-Yves Marion and Thomas Schwentick, editors, In: Proc. of the 27th Int. Symp. on Theoretical Aspects of Computer Science (STACS), volume 5 of LIPIcs, pages 239–250. Schloss Dagstuhl– Leibniz-Zentrum für Informatik, (2010)
  • 31.Yukich, J.E.: Probability theory of classical Euclidean optimization problems. Springer, Berlin (1998) [Google Scholar]

Articles from Algorithmica are provided here courtesy of Springer

RESOURCES