Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1998 Jan 20;95(2):450–454. doi: 10.1073/pnas.95.2.450

An extension of the steepest descent method for Riemann-Hilbert problems: The small dispersion limit of the Korteweg-de Vries (KdV) equation

P Deift *,, S Venakides , X Zhou
PMCID: PMC18440  PMID: 11038618

Abstract

This paper extends the steepest descent method for Riemann-Hilbert problems introduced by Deift and Zhou in a critical new way. We present, in particular, an algorithm, to obtain the support of the Riemann-Hilbert problem for leading asymptotics. Applying this extended method to small dispersion KdV (Korteweg-de Vries) equation, we (i) recover the variational formulation of P. D. Lax and C. D. Levermore [(1979) Proc. Natl. Acad. Sci. USA 76, 3602–3606] for the weak limit of the solution, (ii) derive, without using an ansatz, the hyperelliptic asymptotic solution of S. Venakides that describes the oscillations; and (iii) are now able to compute the phase shifts, integrating the modulation equations exactly. The procedure of this paper is a version of fully nonlinear geometrical optics for integrable systems. With some additional analysis the theory can provide rigorous error estimates between the solution and its computed asymptotic expression.

Introduction

The Korteweg-de Vries (KdV) equation

graphic file with name M1.gif 1

was solved by Gardner, Greene, Kruskal, and Miura (1) who discovered its close connection with the Schrödinger equation,

graphic file with name M2.gif

and introduced the inverse scattering transformation as a fundamental tool of applied mathematics and physics.

The initial value problem for KdV with small dispersion ɛ↘0, was first analyzed by Lax and Levermore (2, 3) for initial data consisting of a single potential well u0(x) < 0, and then by Venakides (4) for initial data consisting of a single “hump” u0(x) > 0. The dispersion coefficient tending to zero (ɛ↘0), corresponds to considering the Schrödinger operator in its semiclassical limit, a regime that allows explicit scattering calculations by the WKB method. In both studies, the inverse scattering transformation procedure led, at each (x,t), to a variational problem over spectral density functions of the eigenvalue parameter λ. The variational problem was shown to possess a unique solution and the weak limit ū(x,t) of u(x,t,ɛ) as ɛ↘0 emerged as a simple linear functional of the maximizing function ψ.

The nonexistence of a strong limit beyond a break time tcrit was shown by the fact that, over wide space-time regions, the weak limit of u2 did not equal ū2, a fact that demonstrated the emergence of small-scale oscillations in these regions. The analysis of the variational problem by Lax and Levermore (2, 3) revealed the deep significance of the nature of the support of the maximizing function ψ. Under the assumption that the support of ψ is a finite set of intervals, Lax and Levermore derived a formula for the weak limit in terms of the endpoints of these intervals. The number 2N + 1 of the end points (N = 0, 1, 2, … ), and the end points themselves, were dependent on (x,t). At t = 0, it was seen that N = 0 and the unique end point for any given value of x was easily derived from the initial data u0(x). In space-time regions of constant N, Lax and Levermore showed that the end points were the solution of a nonlinear hyperbolic system in Riemann invariant form. The system turned out to be identical to the modulation equations satisfied by the evolving spectral end points that describe modulated multiphase KdV waves. The latter had been derived earlier by Whitham (5) for single-phase traveling waves and concurrently with the Lax–Levermore calculation by Flaschka, Forest, and McLaughlin (6) (N > 1) for multiphase waves. Thus, Lax–Levermore theory produced overwhelming evidence that the small-scale oscillations in the initial value problem for Eq. 1 consisted of multiphase KdV waves, and that the Lax–Levermore end points should be identified with the spectral endpoints of quasi-periodic N-phase KdV waves. We will refer to the end points as branch points, because in both theories they are the branch points of an algebraic curve. The precise identification between the branch points of the two theories was not made, and a direct derivation of the small-scale oscillations was still lacking.

The open question could be phrased as follows: Derive the leading order asymptotic behavior for the solution u(x,t,ɛ) after break-time, that is after the time at which oscillations emerge. The task would require resolving the small-scale wave structure, i.e., identifying the local wave as a KdV single-phase or multiphase wave and calculating the small-scale wave parameters, i.e., the mean value (weak limit), the wave numbers, the frequencies, and the phase shifts. The local mean value was calculated in Lax–Levermore theory. The wave numbers and frequencies also would follow if the above identification of two types of branch points were made. Even then however, the phase-shifts (i.e., the order one term in the phase function of the oscillatory structure) lay well beyond any existing theory.

A further step toward the above identification was made by Venakides (4) in two ways, first by using Abel sums (7, 8), and later by a sharpening of the derivation of the variational problem in ref. 9. This sharpening led to the introduction of an additional “quantum condition” or quantum constraint on the variational problem. As a consequence, the unique maximizer of the variational problem was replaced by a sequence of nearby maximizers that carried the information not only of the solution average, but also of the structure of the oscillations (9). As a result, a version of Eq. 12 below was derived in ref. 9 that describes the leading asymptotic behavior of the solution u(x,t,ɛ) as ɛ↘0. The derivation (9) relied on an averaging Ansatz and, as stated in that study, there was some uncertainty in the resolution of the phase shifts. During this time the theory for the periodic small dispersion KdV equation (10) also was developed.

Our approach in this paper follows the Shabat (11) (see also ref. 12) reformulation of the inverse scattering transformation as a Riemann–Hilbert problem (RHP). It is motivated by the steepest descent method for oscillatory RHPs introduced by Deift and Zhou (13) and developed further, in particular, in refs. 14 and 15. For simplicity, we consider only the case u0(x) > 0. The case u0(x) < 0, as well as the case when u0 changes sign, also can be handled with slight modifications of the same method (see Remark below). A more detailed version of the results of this paper is given in ref. 16. The extension of the nonlinear steepest descent method of Deift and Zhou consists of introducing a rigorous fully nonlinear version of geometrical optics in the following way.

As in refs. 15 and 16, we introduce in the Shabat vector RHP, which describes inverse scattering, the phase function g (see section 1) as the nonlinear analogue of the phase function in the geometrical optics representation of a linear wave. The form of the function g cannot be easily guessed in this case; we introduce a systematic procedure for determining g, which reduces its calculation to a scalar RHP. The solution of the scalar RHP corresponds to the solution of the eikonal equation and the leading transport equation in linear wave theory (amplitude is coupled to the other wave parameters) with g yielding the local number of phases, wave numbers, frequencies, and the weak limit. Pursuing the analogy to the linear theory, we call the scalar RHP, the eikonal RHP. Once g is determined, we solve the remaining vector RHP explicitly in terms of theta functions.

We present three main results for small dispersion KdV.

First, the leading asymptotic behavior of u(x,t,ɛ), including the phase shifts, is obtained in terms of the modulating branch points described above. The identification between the branch points of refs. 2 and 3 on the one hand, and refs. 5 and 6 on the other, is made precise (there is no use of an Ansatz). Our Eq. 12 indicates that the corresponding formula of ref. 9 has the correct phase shifts to order ɛ. Our method actually allows one to write down higher order correction terms in the asymptotic expansion of u(x,t,ɛ), but this work has not yet been carried out.

Our second result relates to the determination of the branch points. In the Lax–Levermore theory, the calculation of the branch points is reduced to solving the initial value problem for the modulation system. This goal motivated a large body of research both in the former Soviet Union and in the West in the eighties and nineties. Leaving aside the discussion of the work of many authors (see ref. 16) we mention the fact that the KdV modulation equations are themselves integrable, as was first shown by Tsarev (17), and that the local existence of the solution of the modulation system beyond break-time was established constructively for N = 1 by Tian (18). In the present work, we write down explicit first integrals for the solution of the modulation equations, for arbitrary N. If N is known, and (x,t) is fixed, our Eqs. 7 and 8 provide a complete algebraic system of 2N + 1 equations for the 2N + 1 unknown branch points. To find the branch points at some given (x,t), we have to solve this nonlinear algebraic system starting with the value N = 0, and proceeding to N = 1, … until the corresponding function g (see section 1) satisfies conditions (Eqs. 4 ad) below. The function g corresponds (see below) to the Lax–Levermore–Venakides maximizer. For illustrative purposes (see ref. 16), we have carried out this construction globally in (x,t) when the graph of u0 has the shape of a tent, u0(x)= 1 −  x  for  x  < 1, and u0(x) = 0 otherwise. We remark that we derive Eqs. 7 and 8 for the branch points directly from the RHP and without resorting to the modulation equations. We can derive the modulation equations from the compatibility condition gxt = gtx exactly as in Lax–Levermore theory, but they do not play such a prominent role in our calculation.

Our third result consists of establishing a connection between our steepest descent approach and the variational approach of Lax and Levermore.

We assume a positive, and hence soliton-free, u0(x) that is analytic and consists of a “hump” of unit height, restrictions that are convenient but not essential to our method. This is the problem considered in ref. 4. After changing from the Inline graphic-plane to the λ-plane, taking the right half Inline graphic-plane onto C/(−∞,0], the Shabat RHP consists of finding a row-vector-valued function m(λ) = m(λ;x,t,ɛ) = (m1,m2), analytic for complex λ off the real axis, with boundary values m± (λ;x,t,ɛ) ≡ m(λ ± i0;x,t,ɛ) ∈ (1,1) + L2(|dλ1/2|) that satisfy the jump condition

graphic file with name M5.gif

on the real axis. The jump matrix v(0)(λ;ɛ) is given by

graphic file with name M6.gif

1 reflects the Inline graphic to −Inline graphic symmetry of the RHP in the Inline graphic-plane), while

graphic file with name M10.gif

Here r(λ;ɛ), λ > 0, is the reflection coefficient for the Schrödinger equation (Eq. 2) at t = 0 and α = 4tλ3/2 + xλ1/2. The space and time variables are simply parameters in the solution process. The solution to Eq. 1 is given by

graphic file with name M11.gif

where

graphic file with name M12.gif

We identify r with its WKB approximation (cf e.g., ref. 19)

graphic file with name M13.gif
graphic file with name M14.gif
graphic file with name M15.gif

the quantities x(λ) < x+(λ) being defined by u0(x± (λ)) = λ. Here χ[0,1] is characteristic function of the interval [0,1]. The jump matrix reduces to the identity for λ > 1 and our problem reduces to a RHP on the interval (−∞,1]. Further reductions of the RHP m+(n) = m(n)v(n), where m+(n) and m(n) are the left and right boundary values of m(n) on the oriented contour Σ(n) with jump matrix v(n), will not affect the jump matrix outside 0 < λ < 1.

1. The Eikonal Problem and the Differentiated Eikonal RHP

We introduce the complex phase function,

graphic file with name M16.gif

to be determined below, by making the change of dependent variable

graphic file with name M17.gif

The function g is required to be analytic in λ off the set (−∞,1] and satisfy g+(λ) + g(λ) = 0 for λ ∈ (−∞,0), to maintain the jump σ1 for m(1) across (-∞,0). The formula for u becomes

graphic file with name M18.gif 2

where mi(1)(λ) = 1 + m11(1)1/2 + O(1/λ); in the region 0 < λ < 1, the RHP transforms to

graphic file with name M19.gif 3

where h = g+ + g − 2ρ + 2α.

Our strategy is to determine g in such a way that for each value of 0 < λ < 1, as ɛ↘0, the jump matrix v(1) reduces to one of the following three forms with exponentially small errors,

graphic file with name M20.gif
graphic file with name M21.gif

where h′ < 0 in the first form, h′ = 0 in the second form, and h′ > 0 in the third form. As can be seen in the more detailed presentation (16), according to the steepest descent method, the first and third forms can be replaced by the identity matrix through the use of an intermediate RHP (m(2),v(2), Σ(2)). This reduces the RHP for m(1) to one with a piecewise constant jump matrix (h′ = 0) of the second form, which is exactly solvable. To implement this strategy we require that, at each 0 < λ < 1, exactly one of the conditions (Eqs. 4 ac) be satisfied, and that condition (Eq. 4d) below be satisfied as required earlier.

graphic file with name M22.gif 4a
graphic file with name M23.gif
graphic file with name M24.gif
graphic file with name M25.gif 4b
graphic file with name M26.gif 4c
graphic file with name M27.gif 4d

In the remaining RHP (m(3), v(3), Σ(3)) the contour Σ(3) consists of the union of (-∞,0] with the set of λs where Eq. 4b holds. We pose the ansatz that the latter consists of disjoint intervals Ij = (αjj),j = 0, … , N < ∞, ordered from left to right and lying in the open interval (0,1), with α0 = 0. For λ in Ij, the jump matrix v(3)(λ) = −iσ1e−Ωj/ɛσ3 where Ωj = −h(λ). The equalities in Eqs. 4 ad define a scalar RHP for the function g. This is the eikonal problem.

Differentiating with respect to λ, we obtain a RHP for g′ namely

graphic file with name M28.gif 4e
graphic file with name M29.gif 4f
graphic file with name M30.gif 4g

This is the differentiated eikonal problem. To have a well-posed RHP for g′, we require additionally that

graphic file with name M31.gif 5

2. Calculation of the Branch Points αj and βj

The solution to the differentiated eikonal RHP is given in terms of the initial data of Eq. 1 by

graphic file with name M32.gif
graphic file with name M33.gif 6

where E = 1 in case A and E = βN in case B,

graphic file with name M34.gif

with Inline graphic determined to be positive for λ > βN, and Inline graphic+ = Inline graphic.

We obtain 2N + 1 equations to determine the 2N + 1 unknown branch points β0jj,j = 1,… N as follows. The condition g(λ) = O(λ½) for large λ implies g′/ Inline graphic = ON−1), and leads to the N + 1 moment equations, for k = 0, … , N,

graphic file with name M39.gif 7

An additional set of N equations is obtained by integrating g′ around each Ij and by using Eqs. 4a and 4c. In case A we obtain for j = 1, … N,

graphic file with name M40.gif 8

while in case B, τ(βN) must be replaced by zero. To calculate the branch points at some (x,t), we solve the above equations for N = 0. If g, as determined from g′ by integration, satisfies the conditions (Eqs. 4 ac), it is the desired solution of the scalar RHP. If not, we increase the value of N by one and repeat the calculation. We expect that the procedure always terminates with a finite N for generic initial data. For a more systematic determination of N = N(x,t), we construct for each t the separatrix F(t)=[(x,λ): λ = β0(x,t), α1(x,t),… ,βN(x,t), −∞ < x < ∞,] (see Fig. 1a and b).

graphic file with name pq0183156001.jpg

If a vertical line is drawn at some x, its intersection with the points in the plane on the one side of the curve (the “lower” one) gives the contour Σ(3). For example, if x > x1 in Fig. 1b, then Σ(3) consists of a single interval (−∞,β0(x)): on the other hand, for x2 < x < x1, Σ(3) consists of two intervals, (−∞,β (x))∪(α,(x),β(x)), etc.

For times t less than the critical value tcrit at which Eq. 1 with ɛ = 0 breaks down, we take N = N(x,t) = 0. Solving Eq. 7 for β0, we find that F(t)=[(x,λ):λ = u(x,t)], where u(x,t) is the solution of Eq. 1 with ɛ = 0, and we verify that the corresponding function g satisfies Eqs. 4 ac. For x > xmax(t), we are in case B and for x < xmax(t), we are in case A. For fixed t > tcrit, we construct the separatrix of Fig. 1b starting with x ≫ 1 and N(x,t) = 0. We then move x to the left, calculating the parameters αjj, and readjusting the value of N as soon as one of the inequalities (Eqs. 4 ac) is violated. In Fig. 1b, interval I0 splits into two intervals at x1 and x2, whereas interval I1 collapses to a point and disappears at x3 and x4. As noted earlier, we have carried out (see ref. 16) this procedure explicitly in the case of the nontrivial example u(x,0)= 1 −  |x| , when  |x|  < 1 and u(x,0) = 0 otherwise.

3. Calculation of the Ωj

We obtain a formula for g by solving the eikonal RHP,

graphic file with name M41.gif

The original RHP for m has a Inline graphic symmetry, and we require that g be an odd function of Inline graphic to retain the symmetry for m(1); this leads to the condition Ω0 = 0. We determine the remaining Ωjs from the N moment conditions that g/ Inline graphic = ON) must satisfy at λ = ∞. We find for j > 0, Ωj = xΩj1 + tΩj2 + Ωj3, where

graphic file with name M45.gif
graphic file with name M46.gif 9a

In these formulae, ωi = pi(λ)/Inline graphic dλ, i = 1, … , N, and pi(λ) are polynomials of degree ≤N − 1 uniquely determined from the N relations ∫aj ωi = δij (contour aj circles around interval Ij clockwise). Solving the RHP for ∂xg we obtain

graphic file with name M48.gif

where the meromorphic differential (q/Inline graphic)dλ = (λN + qN−1λN−1 + … +q0)/Inline graphicdλ, has N coefficients q0, … qN−1 uniquely determined by the N relations ∫a>j(q/Inline graphic)dλ = 0,j = 1, … , N.

From the RHP for ∂xg and ∂tg, we obtain the relations

graphic file with name M52.gif 9b

that are not at all obvious because Ω(1), Ω(2), and Ω(3) all depend on x and t through ωj.

4. Calculation of u(x,t,ɛ)

We verify (also see refs. 2 and 15) that the solution to the RHP for m(3) is

graphic file with name M53.gif 10
graphic file with name M54.gif

where the Riemann theta function θ has period matrix (∫bjΩi) (contour bj proceeds from β0 to αj on the upper half of the lower sheet of the Riemann surface and returns to β0 on the lower half of the upper sheet), w(λ) = ∫λ ω, ω = (ω1, … , ωN)t, and Ω = (Ω1, … , ΩN)t. We obtain finally, as ɛ → 0,

graphic file with name M55.gif 11

In a space-time window of scale ɛ about (x,t), we have

graphic file with name M56.gif 12
graphic file with name M57.gif

By using Eqs. 9 a and b, we see that Eq. 12 has precisely the form of a modulated KdV wave (see e.g., ref. 6).

5. A Variational Principle for g′(λ)

We make the change of variable λ = 1 − η2, taking the upper complex half-plane onto C/(−∞, 1]. The function g transforms to G(η) = g(λ). We extend the definition of G onto the lower complex η half-plane, by the relation G(−η) = − G(η), and we observe that G is analytic off the real η axis. We transform the RHP for g into a problem for G, observing that G+G = g+ + g and G+ + G = −(g+g)sgnη. On the real axis, we define the function ψ (η) to equal zero outside the interval (−1,1) while on (−1,1)

graphic file with name M58.gif 13

and one verifies that

graphic file with name M59.gif 14

After a nontrivial calculation, conditions (Eqs. 4 ac) transform to the variational conditions of ref. 4 (a misprint in equation 1.33, line i of this reference in which a factor η is missing from the second term in the expression for a(η) is corrected below). This allows us to use the variational formulation of Lax–Levermore–Venakides for the derivation of ψ, and hence of g. Indeed, the function ψ(η), restricted to the interval [0,1] is the unique maximizer of the functional Q(ψ) = (2a,ψ) − (2γ,ψ+) + (Lψ,ψ) where

graphic file with name M60.gif
graphic file with name M61.gif
graphic file with name M62.gif

ψ+(η) is the positive part of the function ψ. The benefit of the variational formulation is that, because of the convexity of the maximization problem, the uniqueness of ψ, and hence of G and g, are guaranteed.

Remark.

The RH steepest descent method also can be used to analyze potential wells u0(x) < 0: here the critical behavior of the RHP takes place on the interval [minxu0(x),0], rather than the interval [0,1 = maxxu0(x)] as in the case of a “bump.” If u0 consists of a “bump” separated from a well, then the critical region for the RHP is the interval [minxu0(x),maxxu0(x)].

Acknowledgments

The work of P.D. was supported in part by National Science Foundation Grants DMS-9203771 and DMS-9500867. The work of S.V. was supported by Army Research Office Grants DAAH04-93-G-0011 and DAAH 04-96-1-0157 and National Science Foundation Grants DMS-9103386 and DMS-9500623. The work of X.Z. was supported by National Science Foundation Grant DMS-9401403.

ABBREVIATIONS

KdV

Korteweg-de Vries

RHP

Riemann–Hilbert problem

References

  • 1.Gardner C S, Greene J M, Kruskal M D, Miura R M. Phys Rev Lett. 1976;19:1095–1097. [Google Scholar]
  • 2.Lax P D, Levermore C D. Proc Natl Acad Sci USA. 1979;76:3602–3606. doi: 10.1073/pnas.76.8.3602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Lax P D, Levermore C D. Commun Pure Appl Math. 1983;36:253–290. , 571–593, 809–829. [Google Scholar]
  • 4.Venakides S. Commun Pure Appl Math. 1985;38:125–155. [Google Scholar]
  • 5.Whitham G B. Linear and Nonlinear Waves. New York: Wiley-Interscience; 1974. [Google Scholar]
  • 6.Flaschka H, Forest M G, McLaughlin D W. Commun Pure Appl Math. 1980;33:739–784. [Google Scholar]
  • 7.Venakides S. Commun Pure Appl Math. 1985;38:883–909. [Google Scholar]
  • 8.Venakides S. AMS Trans. 1987;301:189–226. [Google Scholar]
  • 9.Venakides S. Commun Pure Appl Math. 1990;43:335–361. [Google Scholar]
  • 10.Venakides S. AMS Trans. 1987;301:189–225. [Google Scholar]
  • 11.Shabat A B. Problems in Mechanics and Mathematical Physics. Moscow: Nauka; 1976. [Google Scholar]
  • 12.Beals, R., Deift, P. & Tomei, C. (1988) AMS Mathematical Surveys Monographs, No. 28 (Amer. Math. Soc., Providence).
  • 13.Deift P, Zhou X. Ann Math. 1993;137:295–368. [Google Scholar]
  • 14.Deift P, Zhou X. Commun Pure Appl Math. 1995;48:277–337. [Google Scholar]
  • 15.Deift P, Venakides S, Zhou X. Commun Pure Appl Math. 1994;47:199–206. [Google Scholar]
  • 16.Deift P, Zhou X, Venakides S. Int Math Res Not. 1997;6:285–299. [Google Scholar]
  • 17.Tsarev S P. Soviet Math Dokl. 1985;31:488–491. [Google Scholar]
  • 18.Tian F-R. Commun Pure Appl Math. 1993;46:1093–1129. [Google Scholar]
  • 19.Bender C M, Orszag S A. Advanced Mathematical Methods for Scientists and Engineers. New York: McGraw-Hill; 1978. [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES