Scaling relations for auxin waves

Bente Hilde Bakker; Timothy E Faver; Hermen Jan Hupkes; Roeland M H Merks; Jelle van der Voort

doi:10.1007/s00285-022-01793-5

2022 Sep 26;85(4):41.

PMCID: PMC9512763 PMID: 36163567

Abstract

We analyze an ‘up-the-gradient’ model for the formation of transport channels of the phytohormone auxin, through auxin-mediated polarization of the PIN1 auxin transporter. We show that this model admits a family of travelling wave solutions that is parameterized by the height of the auxin-pulse. We uncover scaling relations for the speed and width of these waves and verify these rigorous results with numerical computations. In addition, we provide explicit expressions for the leading-order wave profiles, which allows the influence of the biological parameters in the problem to be readily identified. Our proofs are based on a generalization of the scaling principle developed by Friesecke and Pego to construct pulse solutions to the classic Fermi–Pasta–Ulam–Tsingou model, which describes a one-dimensional chain of coupled nonlinear springs.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00285-022-01793-5.

Keywords: Travelling waves, Polar auxin transport, Up-the-gradient models, Scaling limits, Cross-diffusion, Lattice differential equations

Introduction

Polar auxin transport

The phytohormone auxin is a central player in practically all aspects of the development and growth of plants, for example in phyllotaxis, root development and the initiation of lateral roots, the formation of vascular tissues in stems, the patterning of leaf veins, and flower development (Paque and Weijers 2016). The pattern formation principles underlying these developmental mechanisms have been uncovered to a large part through an intensive cross-talk between experimental approaches and mathematical modeling (Shi and Vernoux 2018; Autran et al. 2021; Cieslak et al. 2021). Auxin is transported between cells, and between cells and the cell walls, both through diffusion and through transport proteins that are localized at the plasma membrane (PM). Some of these transport proteins, most notably several members of the PIN-FORMED family including PIN1 (Adamowski and Friml 2015), are distributed in a polarised manner inside the cells. Such polarised localisation of PINs is coordinated in plant tissue, leading to a directed transport of auxin through plant tissues in a mechanism called polar auxin transport (PAT) (Adamowski and Friml 2015). For example, in fully developed seed plants, auxin is synthesized in leaves, then is transported through the central tissues of the stem and the root towards the root tips, where it is redirected along the superficial tissues of the root back towards the stem and recycled towards the internal tissues of the root (Adamowski and Friml 2015).

Despite new details being uncovered incessantly (see e.g. Verna et al. 2019; Hajný et al. 2020), it is still incompletely understood what mechanisms drive the polarization of PINs inside cells and the coordinated polarization among adjacent cells. In a series of classical experiments, Sachs applied artificial auxin to bean plants, and observed that the application sites become the source of new vascular tissue that then joins the existing vasculature; see e.g. Sachs (1975) and the review (Hajný et al. 2022). These initial observations, together with the discovery of PIN1 and subsequently discovered members of the PIN-FORMED protein family, suggested that auxin drives the polarization of its own transporters, and hence the direction of its own transport (reviewed in Merks et al. 2007; Hajný et al. 2022). Initial models aimed to explain the formation of transport channels as observed in Sachs’ experiments. These models therefore assumed that the rate of auxin flux from cell to cell further polarised auxin transport. This positive feedback led to the self-organised formation of auxin transport channels in a process called auxin canalisation. When it was realised that auxin accumulations mark the formation of new leaves at the shoot apex, an alternative model was proposed, in which cells polarised towards the locally increased concentrations of auxin, thus forming self-organised accumulation of auxin (Reinhardt et al. 2003). Mathematical models of the self-organisation of polar auxin transport therefore follow these two broad categories. ‘With-the-gradient’ models formalise the canalisation hypothesis and assume that the rate of cell polarisation depends on the auxin flux towards the relevant neighbour (Mitchison 1980, 1981; Rolland-Lagan and Prusinkiewicz 2005; Rolland-Lagan 2008). ‘Up-the-gradient’ models assume that PIN polarizes in the direction of neighbouring cells at a rate that positively depends on the auxin concentration in that neighbour (Jönsson et al. 2006; Smith et al. 2006).

Attempts to reconcile these two seemingly contradicting ideas have followed two broad approaches. The first approach proposed that with-the-gradient and up-the-gradient models act at different positions of the plant or at different stages during development. For example, Bayer et al. (2009) proposed that the up-the-gradient model acts at superficial tissue layers of the shoot apical meristem, where it forms auxin accumulation points leading to the initiation of new leaves. The deeper tissue layers could follow the with-the-gradient model, channeling auxin away from the auxin accumulation point towards the vascular tissues (Bayer et al. 2009). A similar approach was recently taken to explain the leaf venation patterning in combination with auxin convergence at the edge of the leaf primordium (Holloway and Wenzel 2021). The second approach looked for variants of the with-the-gradient or up-the-gradient models that could explain both auxin canalisation and auxin accumulation depending on the parameter settings. In this line of reasoning Walker et al. have proposed a with-the-gradient hypothesis for phyllotaxis (Walker et al. 2013), whereas one of us has proposed an up-the-gradient hypothesis for canalisation (Merks et al. 2007).

More recent analyses of the role of auxin and PINs in the formation of leaf veins (Verna et al. 2019) put the key role of a feedback between auxin signaling and the polar localisation of PINs into question, and therefore the validity of the canalisation hypothesis or its alternatives, including the traveling-wave hypothesis (Merks et al. 2007), for the formation of vascular tissues. In particular, quadruple mutants strongly reducing functionality of all plasma-membrane-localised PINs, i.e., of all PINs that are responsible for PAT in the leaf veins, show relatively mild venation pattern phenotypes. Further knock-out of PIN6 and PIN8, expressed in the leaf veins but not localised in the PM, thus excluding a role of these PINs in PAT, led to further defects in leaf venation patterning (Verna et al. 2019) identical to those due to a chemical block of auxin transport. Nevertheless, in these mutants the polar ordering of the cells in the vasculature stays intact and supernumerary veins are induced by exogenous application of auxin, showing that auxin can induce veins in the absence of polar transport. Using further mutations of auxin sensing proteins, it was found that this PAT-independent vein formation requires auxin sensing and the activity of GNOM, a protein regulating the constitutive recycling of PM-localised proteins, including PINs. How the available mathematical models of auxin-regulated patterning in plants will need to be updated or rejected is a topic of ongoing investigation, but what seems clear at this moment is that such models must involve auxin sensing and coordination of cell polarisation, possibly through polar transport of other small chemicals besides auxin [e.g., acidification of the cell walls (Fendrych et al. 2016)], facilitated diffusion (Mitchison 1980) or coordination of polarity through other means such as mechanical signaling as studied in mathematical models of phyllotaxis (Julien et al. 2019) and leaf venation patterning (Kneuper et al. 2020).

In this paper we formally analyse an existing up-the-gradient model for establishment of polar auxin transport during leaf venation patterning (Jönsson et al. 2006; Heisler and Jonsson 2006; Merks et al. 2007). Although this model is a strong oversimplification of the experimental state-of-the-art, which in part invalidates it, it includes (1) auxin sensing, (2) polar transport, and (3) constitutive recycling, and thus likely contains key elements of updated, future models while still retaining the simplicity required for mathematical analysis. Thus, despite clear discrepancies of recent experimental insights with both the up-the-gradient and with-the-gradient models, the insights obtained in a formal analysis as well as the mathematical approaches developed in this work will likely apply to future, updated models of auxin-regulated patterning in plants.

Mathematical motivation

In order to distinguish between the available phenomenological models of auxin-driven pattern formation and the general developmental principles that they represent, mathematical insight into the models’ structure and the models’ solutions will be crucial. This will help pinpoint key differences between the model structures and may uncover potential structural instabilities in the models upon which evolution may have acted, so as to produce new developmental patterning modules (Benítez et al. 2018). From the mathematical side, almost all previous studies have focused on the types of patterns that can be generated by different models once the transitory dynamics have died out. An important example is the study by Van Berkel and coworkers (van Berkel et al. 2013), where a number of models for polar auxin transport are recast into a common mathematical framework that allows them to be compared. A steady state analysis for a general class of active transport models can be found in Draelants et al. (2015), using advanced tools such as snaking from the field of bifurcation theory. Both periodic and stationary patterns are examined in Allen and Ptashnyk (2020), where the authors consider an extended with-the-gradient model. Haskovec and his coworkers derive local and global existence results together with an appropriate continuum limit for their graph-based diffusion model in Haskovec et al. (2019).

Important qualitative examples of the up-the-gradient model are the formation of regularly spaced auxin maximums that lead to the growth of new leaves, as well as the formation of auxin channels that have been hypothesized to precede the formation of veins. Our goal here is to move beyond the well-studied equilibrium settings above and focus instead on understanding the dynamical behavior that leads to these patterns. In particular, we provide a rigorous framework to study a class of wave solutions that underpin the dynamical behaviour associated to up-the-gradient models. Ultimately, we hope that this analytic approach will provide an additional lens through which models of PAT can be examined and compared.

The model

Inspired by Jönsson et al. (2006), Heisler and Jonsson (2006) and Merks et al. (2007), the system we will study is given by

\begin{matrix} \left\{ \begin{matrix} {\dot{A}}_{j} = T_{act} \left( R_{j - 1} \frac{A_{j - 1}}{k_{a} + A_{j - 1}} - R_{j} \frac{A_{j}}{k_{a} + A_{j}} \right) + T_{diff} (A_{j + 1} - 2 A_{j} + A_{j - 1}), \\ {\dot{P}}_{j} = - k_{1} \frac{A_{j + 1}}{k_{r} + A_{j + 1}} \left( \frac{P_{j}}{k_{m} + P_{j}} \right) + α A_{j}, \\ {\dot{R}}_{j} = k_{1} \frac{A_{j + 1}}{k_{r} + A_{j + 1}} \left( \frac{P_{j}}{k_{m} + P_{j}} \right), \end{matrix} \right. \end{matrix}

1.1

posed on the one-dimensional lattice $j \in Z$; see Fig. 1. The variable $A_{j} (t)$ denotes the auxin concentration in cell $j \in Z$, while $P_{j} (t)$ and $R_{j} (t)$ represent the unpolarized and right-polarized PIN1 in this cell, respectively. PIN1 is the PIN-variant that is believed to play a central role during auxin-based pattern formation in the shoot apical meristem and during leaf venation patterning (Reinhardt et al. 2003; Jönsson et al. 2006; Smith et al. 2006; Scarpella et al. 2006; Verna et al. 2019), and we therefore consider PIN1 here. However, note that the general structure of this model would apply to other polarised transporter proteins with similar behavior.

The parameters appearing in the problem are all strictly positive and labelled in the same manner as in Merks et al. (2007). In particular, $T_{act}$ and $T_{diff}$ denote the strengths of the active PIN1-induced rightward auxin transport and its diffusive counterpart, respectively. Unpolarized PIN1 is formed in the presence of auxin at a rate $α$, while $k_{1}$ denotes the polarization rate. Finally, $k_{a}$, $k_{r}$, and $k_{m}$ are the Michaelis constants associated to the active transport of auxin and the polarization of PIN1, which depends on the auxin-concentration in the right-hand neighbouring cell. In particular, this model is of ‘up-the-gradient’ type.

The main difference compared to Merks et al. (2007) is that we are neglecting the presence of left-polarized PIN1 and have set the decay and depolarization rates of PIN1 to zero. Although this step of course imposes a pre-existing polarity on the system, we need to do this for technical reasons that we explain in the sequel. For now we simply point out that we wish to focus our attention on the dynamics of rightward auxin propagation, which takes place on timescales that are much faster than these decay and depolarization processes, and that the results will give novel insight into the full problem.

We will look for solutions of the special type

\begin{matrix} (A_{j}, P_{j}, R_{j}) (t) = (ϕ_{A}, ϕ_{P}, ϕ_{R}) (j - c t), \end{matrix}

1.2

with $c > 0$ , in which we impose the limits

\begin{matrix} lim_{ξ \to - \infty} ϕ_{A} (ξ) = 0, lim_{ξ \to \infty} (ϕ_{A}, ϕ_{P}, ϕ_{R}) (ξ) = 0 ; \end{matrix}

1.3

see Fig. 2. From a modelling perspective, such solutions represent a pulse of auxin that moves to the right through a one-dimensional row of cells. Ahead of the wave the cells are clear of both polarized and unpolarized PIN, but behind the wavefront a residual amount of PIN is left in the cells, representing the coordinated polarisation of the tissue.

In reality these residues start to depolarize and decay, which can be included by adding linear decay terms to (1.1). This leads to the expanded system

\begin{matrix} \left\{ \begin{matrix} {\dot{A}}_{j} = T_{act} \left( R_{j - 1} \frac{A_{j - 1}}{k_{a} + A_{j - 1}} - R_{j} \frac{A_{j}}{k_{a} + A_{j}} \right) + T_{diff} (A_{j + 1} - 2 A_{j} + A_{j - 1}), \\ {\dot{P}}_{j} = - k_{1} \frac{A_{j + 1}}{k_{r} + A_{j + 1}} \left( \frac{P_{j}}{k_{m} + P_{j}} \right) + α A_{j} + k_{2} R_{j} - δ P_{j}, \\ {\dot{R}}_{j} = k_{1} \frac{A_{j + 1}}{k_{r} + A_{j + 1}} \left( \frac{P_{j}}{k_{m} + P_{j}} \right) - k_{2} R_{j}, \end{matrix} \right. \end{matrix}

1.4

in which the positive parameters $δ$ and $k_{2}$ represent the decay and depolarization rates of PIN1, respectively. Mathematically, these terms can be included into our framework provided that the parameters $δ$ and $k_{2}$ are small compared to the amplitude of the pulses, but we do not pursue this level of generality in the current paper for presentational clarity. Note in any case that in Merks et al. (2007) these parameters were chosen to be orders of magnitude smaller than $α$ and $k_{1}$.

Travelling waves have played a fundamental role in the analysis of many spatially discrete systems (Kevrekidis 2011; Mallet-Paret 1999; Chen et al. 2008; Hupkes and Sandstede 2010; Keener 1987). They can be seen as a lossless mechanism to transport matter or energy over arbitrary distances. As such, they are interesting in their own right, but they can also be viewed as building blocks to describe more complicated behaviour of nonlinear systems (Aronson and Weinberger 1975, 1978). In the present case for example, one can construct wavetrain solutions to (1.4) by adding a persistent auxin source; see Fig. 3 and Supplementary Video S1. Initially, these solutions can be seen in an approximate sense as a concatenation of the individual auxin pulses that we consider here (Moser 2021). As a consequence of the amplitude variations, small speed differences occur between these pulses which leads to highly interesting collision processes. Due to this type of versatility, travelling waves play an important role in many applications and have been extensively studied in a variety of settings (Sandstede 2002; Kevrekidis 2011; Hochstrasser et al. 1989; Jones et al. 1991).

Fig. 3 — Six snapshots of a wavetrain simulation for the expanded system (1.4). Higher pulses travel faster than lower pulses, in correspondence with the scaling relations (1.7). These speed differences lead to merge events where even higher pulses are formed, which detach from the bulk. We used the procedure described in Sect. 1.4, taking $A_{1} (0) = A_{⋄} = 0.0$ but adding 0.025 to ${\dot{A}}_{1} (t)$ to simulate a constant auxin influx at the left boundary. We picked $δ = 0.1$ and $k_{2} = 0.2$ , leaving the remaining parameters from Fig. 2 unchanged. The full simulation can be found in supplementary video S1

Main results

Our goal will be to obtain quantitative scaling information concerning the speed and shape of these waves. In particular, we will show rigorously that (1.1) admits a family of travelling wave solutions that are parameterized by the amplitude of the auxin-pulse. In addition, we show that the speed and width of these waves scale with this amplitude via a fractional power law. We state our results in full technical detail in Theorem 6.3 below.

More precisely, we provide an explicit triplet of functions $(ϕ_{A}^{*}, ϕ_{P}^{*}, ϕ_{R}^{*})$ that satisfy the limits (1.3) and construct solutions to (1.1) of the form

\begin{matrix} (A_{j}, P_{j}, R_{j}) (t) = (ϵ ϕ_{A}^{*}, ϵ^{1 / 5} ϕ_{P}^{*}, ϵ^{2 / 5} ϕ_{R}^{*}) (ϵ^{2 / 5} (j - c_{*} ϵ^{2 / 5} t)) + (O (ϵ^{17 / 15}), O (ϵ^{1 / 3}), O (ϵ^{3 / 5})), \end{matrix}

1.5

for a constant $c_{*}$, which we state exactly in (1.8). Here the limiting profile $ϕ_{A}^{*}$ is scaled in such a way that $‖ ϕ_{A}^{*} ‖_{L^{\infty}} = 1$. Upon introducing the heights

\begin{matrix} (h_{A}, h_{P}, h_{R}) = (‖ A ‖_{\infty}, ‖ P ‖_{\infty}, ‖ R ‖_{\infty}) \end{matrix}

1.6

associated to the three components of our waves, this choice ensures that the auxin-height $h_{A}$ is equal to the parameter $ϵ > 0$ at leading order. In particular, comparing this to (1.2) we uncover the leading order scaling relations

\begin{matrix} c \sim c_{*} h_{A}^{2 / 5}, w \sim w_{*} h_{A}^{- 2 / 5}, h_{P} \sim h_{P}^{*} h_{A}^{1 / 5}, h_{R} \sim h_{R}^{*} h_{A}^{2 / 5} \end{matrix}

1.7

for the speed $c$, width $w$ and heights of the wave. Here the constant $w_{*}$ denotes the width of the limiting profile $ϕ_{A}^{*}$, while the other constants are given explicitly by

\begin{matrix} \begin{matrix} c_{*} & = & {(\frac{9 α k_{1} T_{act} T_{diff}^{2}}{8 k_{a} k_{m} k_{r}})}^{1 / 5}, \\ h_{P}^{*} & = & \sqrt{6} {(\frac{9 α^{6} k_{a}^{4} k_{m}^{4} k_{r}^{4} T_{diff}^{2}}{8 k_{1}^{4} T_{act}^{4}})}^{1 / 10}, \\ h_{R}^{*} & = & 3 {(\frac{9 α k_{a}^{4} k_{1} T_{diff}^{2}}{8 k_{r} k_{m} T_{act}^{4}})}^{1 / 5} . \end{matrix} \end{matrix}

1.8

In particular, for a fixed height of the auxin-pulse our results state that the speed and residual PIN1 will increase as the PIN1-production parameter $α > 0$ is increased.
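To make the dependence on the biological parameters concrete, the constants in (1.8) and the leading-order relations (1.7) can be evaluated directly. The following is a minimal sketch; the function names and the parameter values in `params` are illustrative assumptions and are not taken from the paper.

```python
import math

def wave_constants(alpha, k1, ka, km, kr, T_act, T_diff):
    """Constants c_*, h_P^*, h_R^* transcribed from (1.8)."""
    c_star = (9 * alpha * k1 * T_act * T_diff**2 / (8 * ka * km * kr)) ** (1 / 5)
    hP_star = math.sqrt(6) * (
        9 * alpha**6 * ka**4 * km**4 * kr**4 * T_diff**2 / (8 * k1**4 * T_act**4)
    ) ** (1 / 10)
    hR_star = 3 * (
        9 * alpha * ka**4 * k1 * T_diff**2 / (8 * kr * km * T_act**4)
    ) ** (1 / 5)
    return c_star, hP_star, hR_star

def leading_order(h_A, **params):
    """Leading-order speed and PIN heights from (1.7) for auxin height h_A."""
    c_star, hP_star, hR_star = wave_constants(**params)
    return {
        "c": c_star * h_A ** (2 / 5),
        "h_P": hP_star * h_A ** (1 / 5),
        "h_R": hR_star * h_A ** (2 / 5),
    }

# Placeholder values, assumed purely for illustration.
params = dict(alpha=0.1, k1=1.0, ka=1.0, km=1.0, kr=1.0, T_act=10.0, T_diff=1.0)
low = leading_order(0.01, **params)
high = leading_order(0.01, **{**params, "alpha": 0.2})
```

Comparing `low` and `high` illustrates the statement above: at fixed $h_{A}$, increasing $α$ increases the predicted speed and residual PIN1 heights. The width relation is omitted because $w_{*}$ is not given in closed form here.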

Although our proof requires the parameter $ϵ > 0$ and hence the amplitude of the auxin-pulses to be small, this branch of solutions continues to exist well beyond this asymptotic regime. Indeed, we numerically confirmed the existence (and stability) of these waves by a direct simulation of (1.1) on a row of cells $j \in \{1, \dots, 500\}$, initialized with $A_{j} (0) = P_{j} (0) = R_{j} (0) = 0$ for $2 \leq j \leq 500$, together with $P_{1} (0) = R_{1} (0) = 0$ and $A_{1} (0) = A_{⋄}$ for some $A_{⋄} > 0$ that we varied between simulations. In order to close the system, we used the Neumann-type condition $A_{0} (t) = A_{1} (t)$ on the left boundary, together with $R_{0} (t) = 0$ and a sink condition $A_{501} (t) = 0$ on the right. An example of such a simulation can be found in Fig. 2 (right). By varying the initial auxin concentration $A_{⋄}$, we were able to generate waves with a range of amplitudes. We subsequently numerically computed the speed and width of these waves, which allowed us to confirm the leading order behaviour (1.7); see Fig. 4. In addition, we verified the convergence to the limiting profiles $(ϕ_{A}^{*}, ϕ_{P}^{*}, ϕ_{R}^{*})$ by comparing the appropriately rescaled numerical wave profiles; see Fig. 5.
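The simulation setup described above can be sketched as follows. This is a minimal illustration and not the authors' code: the time integrator (a fixed-step classical Runge-Kutta scheme), the step size, and all values in `PARAMS` are assumptions, since the parameter values of Fig. 2 are not restated in this section.

```python
import numpy as np

# Placeholder parameter values (assumed; not the values used for Fig. 2).
PARAMS = dict(T_act=10.0, T_diff=1.0, k1=1.0, ka=1.0, kr=1.0, km=1.0, alpha=0.1)

def rhs(A, P, R, p=PARAMS):
    """Right-hand side of (1.1) on cells j = 1..N with the closing conditions
    A_0 = A_1, R_0 = 0 (left) and A_{N+1} = 0 (right)."""
    A_ext = np.concatenate(([A[0]], A, [0.0]))        # A_0, A_1, ..., A_N, A_{N+1}
    flux = R * A / (p["ka"] + A)                      # active transport out of cell j
    flux_in = np.concatenate(([0.0], flux[:-1]))      # inflow from cell j-1 (R_0 = 0)
    lap = A_ext[2:] - 2.0 * A_ext[1:-1] + A_ext[:-2]  # discrete Laplacian of A
    dA = p["T_act"] * (flux_in - flux) + p["T_diff"] * lap
    A_right = A_ext[2:]                               # A_{j+1}
    pol = p["k1"] * A_right / (p["kr"] + A_right) * P / (p["km"] + P)
    return dA, -pol + p["alpha"] * A, pol

def rk4_step(y, dt, p=PARAMS):
    """One classical RK4 step for the stacked state y = (A, P, R)."""
    f = lambda z: np.stack(rhs(z[0], z[1], z[2], p))
    s1 = f(y)
    s2 = f(y + 0.5 * dt * s1)
    s3 = f(y + 0.5 * dt * s2)
    s4 = f(y + dt * s3)
    return y + dt / 6.0 * (s1 + 2.0 * s2 + 2.0 * s3 + s4)

def simulate(A_init, n_cells=500, t_end=50.0, dt=0.01, p=PARAMS):
    """Start from A_1(0) = A_init with all other variables zero."""
    y = np.zeros((3, n_cells))
    y[0, 0] = A_init
    for _ in range(int(round(t_end / dt))):
        y = rk4_step(y, dt, p)
    return y  # rows: A, P, R at time t_end
```

From the returned state one can read off the pulse height as `y[0].max()`; tracking the position of this maximum over time gives an estimate of the speed that can be compared with (1.7).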

Fig. 4 — Scaling behaviour of the wavespeed c (left) and the auxin width w (right) against the height $h_{A}$ of the auxin pulse. The dashed lines represent the explicit predictions (1.7). The circles arise from numerical simulations, following the procedure described in Sect. 1.4 with several different values for $A_{⋄}$ . The other parameters were chosen as in Fig. 2

Fig. 5 — Convergence of the (scaled) profiles $ϕ_{A}$ (left), $ϕ_{P}$ (center) and $ϕ_{R}$ (right) to their limits $(ϕ_{A}^{*}, ϕ_{P}^{*}, ϕ_{R}^{*})$ . To perform the scalings, we wrote $h_{A} = {‖ ϕ_{A} ‖}_{L^{\infty}}$ , compressed space by a factor of $h_{A}^{2 / 5}$ and divided the three profiles by the respective factors $(h_{A}, h_{A}^{1 / 5}, h_{A}^{2 / 5})$ , in line with the relations (1.7)

Cross-diffusion

From a mathematical perspective, the problem (1.1) is interesting due to its interpretation as a so-called cross-diffusion problem, where the transport coefficient of one component is influenced by one of the other components. Work in this area was stimulated by developments in the modeling of bacterial cell membranes (Shih et al. 2019) and biofilms (Emerenini et al. 2015), where self-organization of biological molecules plays an important role. In the continuum regime, such problems are tough to analyze on account of potential degeneracies in the coefficients. The well-posedness of the underlying problem was analyzed in Sonner et al. (2011), while a numerical method for such problems was developed in Ghasemi et al. (2018).

The key phenomenological assumption behind such models is that particles behave differently when they are isolated compared to when they are part of a cluster. A simplified agent-based approach to capture this mechanism can be found in Johnston et al. (2017), which reduces naturally to a scalar PDE with nonlinear diffusion in the continuum limit. After adding a small regularization term, it is possible to use geometric singular perturbation theory to show that this PDE admits travelling wave solutions (Li et al. 2021). In this setting, the steepness of the wavefronts provides the necessary scale-separation required for rigorous results.

Our approach in this paper proceeds along entirely different lines, using the amplitude of the auxin pulse as a small continuation parameter to construct a family of travelling wave solutions to (1.1). The key insight is that one can extract an effective limiting system by scaling the width and speed of the wave in an appropriate fashion and sending the amplitude to zero. By means of a fixed-point analysis one can show in a rigorous fashion that solutions to this limiting system can be continued to form a family of solutions to the full system.
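As a heuristic illustration of how such a scaling can be guessed (a formal power count only, not the rigorous argument of this paper), suppose that $A \sim ϵ$, $P \sim ϵ^{p}$ and $R \sim ϵ^{r}$, and that the wave depends on the variable $ϵ^{β} (j - c_{0} ϵ^{γ} t)$. The exponents $p$, $r$, $β$, $γ$ and the constant $c_{0}$ are bookkeeping symbols introduced only for this sketch. Each time derivative then contributes a factor $ϵ^{β + γ}$ and each lattice difference a factor $ϵ^{β}$. Replacing the Michaelis-Menten factors in (1.1) by their small-argument leading terms and balancing powers of $ϵ$ gives:

```latex
\begin{aligned}
\dot{P}_j \approx \alpha A_j
  &: \quad \beta + \gamma + p = 1, \\
\dot{R}_j \approx \tfrac{k_1}{k_r k_m} A_{j+1} P_j
  &: \quad \beta + \gamma + r = 1 + p, \\
\dot{A}_j \sim T_{diff}\,(A_{j+1} - 2 A_j + A_{j-1})
  &: \quad \beta + \gamma + 1 = 2\beta + 1, \\
\dot{A}_j \sim \tfrac{T_{act}}{k_a}\,(R_{j-1} A_{j-1} - R_j A_j)
  &: \quad \beta + \gamma + 1 = \beta + r + 1 .
\end{aligned}
```

The last two lines give $γ = β = r$; the first two then yield $p = 1 - 2 β$ and $3 β = 1 + p$, so that $β = γ = r = 2/5$ and $p = 1/5$. These are the exponents appearing in (1.5) and (1.7).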

Relation to FPUT pulses

Our technique is a generalization of the approach developed by Friesecke and Pego (1999) to construct small-amplitude travelling pulse solutions to the Fermi–Pasta–Ulam–Tsingou (FPUT) problem (Fermi et al. 1955; Dauxois 2008)

\begin{matrix} {\ddot{x}}_{j} = F (x_{j + 1} - x_{j}) - F (x_{j} - x_{j - 1}), j \in Z . \end{matrix}

1.9

This models an infinite, one-dimensional chain of particles that can only move horizontally and are connected to their nearest neighbours by springs. These springs transmit a force

\begin{matrix} F (r) = r + r^{2} \end{matrix}

1.10

that hence depends nonlinearly on the relative distance r between neighbouring particles; see Friesecke and Pego (1999), Herrmann and Matthies (2015) and Pankov (2005) for the impact of other choices. The FPUT system is well-established as a fundamental model to study the propagation of disturbances through spatially discrete systems, such as granular media, artificial metamaterials, DNA strands, and electrical transmission lines (Brillouin 1953; Kevrekidis 2011).

Looking for a travelling wave in the relative displacement coordinates, one introduces an Ansatz of the form

\begin{matrix} x_{j + 1} (t) - x_{j} (t) = ϕ (j - σ t), \end{matrix}

1.11

which leads to the scalar functional differential equation of mixed type (MFDE)

\begin{matrix} σ^{2} ϕ^{''} (ξ) = F (ϕ (ξ + 1)) - 2 F (ϕ (ξ)) + F (ϕ (ξ - 1)) . \end{matrix}

1.12

Following the classic papers by Friesecke in combination with Wattis (Friesecke and Wattis 1994) and Pego (Friesecke and Pego 1999, 2002, 2004a, b), we introduce the scaling

\begin{matrix} ϕ (ξ) = ϵ^{2} φ_{ϵ} (ϵ ξ) \end{matrix}

1.13

and write $σ = σ_{ϵ}$ , which transforms (1.12) into the MFDE

\begin{matrix} σ_{ϵ}^{2} ϵ^{2} φ_{ϵ}^{''} = (S^{ϵ} + S^{- ϵ} - 2) [φ_{ϵ} + ϵ^{2} φ_{ϵ}^{2}] . \end{matrix}

1.14

Here the shift operator $S^{d}$ acts as

\begin{matrix} (S^{d} f) (ξ) = f (ξ + d) \end{matrix}

1.15

for any $d \in R$ . Since the symbol $S^{ϵ} + S^{- ϵ} - 2$ represents a discrete Laplacian, we can interpret (1.14) as a wave equation with a nonlinear diffusion term. To some extent, this clarifies the link with our original problem (1.1) and the discussion above.

Applying the Fourier transform to (1.14) with k as the frequency variable, we arrive at

\begin{matrix} - σ_{ϵ}^{2} ϵ^{2} k^{2} {\hat{φ}}_{ϵ} (k) = & 2 (cos (ϵ k) - 1) [\hat{φ_{ϵ}} + ϵ^{2} \hat{φ_{ϵ}^{2}}] (k) \\ = & - 4 {sin}^{2} (ϵ k / 2) [\hat{φ_{ϵ}} + ϵ^{2} \hat{φ_{ϵ}^{2}}] (k) . \end{matrix}

1.16

Upon introducing the symbol

\begin{matrix} {\tilde{M}}_{FPUT}^{(ϵ)} (k) = \frac{4 ϵ^{2} {sin}^{2} (ϵ k / 2)}{σ_{ϵ}^{2} ϵ^{2} k^{2} - 4 {sin}^{2} (ϵ k / 2)}, \end{matrix}

1.17

this can be recast into the compact form

\begin{matrix} \hat{φ_{ϵ}} (k) = {\tilde{M}}_{FPUT}^{(ϵ)} (k) \hat{φ_{ϵ}^{2}} (k) . \end{matrix}

1.18

Upon choosing the speed

\begin{matrix} σ_{ϵ} = 1 + \frac{ϵ^{2}}{3}, \end{matrix}

1.19

we can exploit the expansion ${sin}^{2} (z / 2) = \frac{1}{4} z^{2} - \frac{1}{48} z^{4} + O (z^{6})$ to obtain the pointwise limit

\begin{matrix} {\tilde{M}}_{FPUT}^{(ϵ)} (k) \to \frac{12}{8 + k^{2}}, ϵ \to 0 . \end{matrix}

1.20
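The pointwise limit (1.20) can be checked numerically. The sketch below evaluates the symbol (1.17) with the speed (1.19) and compares it with $12 / (8 + k^{2})$; the function names and the sampled range of $k$ are illustrative choices. The point $k = 0$ is excluded because the symbol is a $0/0$ expression there, and $ϵ$ is not taken too small in order to avoid cancellation errors in the denominator.

```python
import numpy as np

def M_fput(k, eps):
    """Symbol (1.17) evaluated with sigma_eps = 1 + eps**2 / 3 from (1.19)."""
    sigma = 1.0 + eps**2 / 3.0
    four_sin2 = 4.0 * np.sin(eps * k / 2.0) ** 2
    return eps**2 * four_sin2 / (sigma**2 * eps**2 * k**2 - four_sin2)

def M_limit(k):
    """Formal limit 12 / (8 + k^2) from (1.20)."""
    return 12.0 / (8.0 + k**2)

k = np.linspace(0.1, 5.0, 50)  # fixed, moderate frequencies (k = 0 excluded)
errors = {eps: np.max(np.abs(M_fput(k, eps) - M_limit(k))) for eps in (0.1, 0.01)}
```

On this fixed range of $k$ the discrepancy shrinks as $ϵ$ decreases. The convergence is only pointwise: the symbol depends on $k$ through the product $ϵ k$, so for any fixed $ϵ$ it deviates from the limit once $k$ is of order $1 / ϵ$.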

Using the fact that $(8 + k^{2})$ is the Fourier symbol for $8 - \partial_{ξ}^{2}$ , this suggests that the relevant system for $φ_{ϵ}$ in the formal $ϵ \to 0$ limit is given by

\begin{matrix} 8 φ_{*} - φ_{*}^{''} = 12 φ_{*}^{2}, \end{matrix}

1.21

which has the nontrivial even solution

\begin{matrix} φ_{*} (ξ) = {sech}^{2} (\sqrt{2} ξ) . \end{matrix}

1.22
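That (1.22) indeed solves (1.21) can be verified by a direct computation. Writing $φ (ξ) = {sech}^{2} (a ξ)$ with an auxiliary constant $a > 0$ (a symbol introduced only for this check) and using ${tanh}^{2} = 1 - {sech}^{2}$, one finds:

```latex
\begin{aligned}
\varphi'(\xi)  &= -2a\,\mathrm{sech}^2(a\xi)\tanh(a\xi), \\
\varphi''(\xi) &= 4a^2\,\mathrm{sech}^2(a\xi)\tanh^2(a\xi) - 2a^2\,\mathrm{sech}^4(a\xi)
                = 4a^2\,\varphi(\xi) - 6a^2\,\varphi(\xi)^2 .
\end{aligned}
```

For $a = \sqrt{2}$ this reads $φ^{''} = 8 φ - 12 φ^{2}$, which is precisely (1.21).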

By casting the problem in an appropriate functional analytic framework, one can show that this explicit solution $φ_{*}$ can be continued to yield solutions $φ_{ϵ}$ to (1.14) for small $ϵ > 0$ . In this fashion, one establishes the existence of a family of pulse solutions (Friesecke and Pego 1999)

\begin{matrix} x_{j + 1} (t) - x_{j} (t) = ϵ^{2} {sech}^{2} (\sqrt{2} ϵ (j - σ_{ϵ} t)) + O (ϵ^{4}) . \end{matrix}

1.23

Roughly speaking, the main mathematical contribution in this paper is that we show how this analysis can be generalized to the setting of (1.1). The first main obstacle is that this is a multi-component system, which requires us to explicitly reduce the order before a tractable limit can be obtained. The second main obstacle is that the analysis of our Fourier symbol is considerably more delicate, since in our setting the wavespeed c converges to zero instead of one as $ϵ \to 0$ . Indeed, the denominator of ${\tilde{M}}_{FPUT}^{(ϵ)}$ above depends only on the product $ϵ k$ , while in our case there is a separate dependence on $ϵ^{2} k$ . This introduces a quasi-periodicity into the problem that requires our convergence analysis to carefully distinguish between ‘small’ values of k and several separate regions of ‘large’ k.

The third main difference is that we cannot use formal spectral arguments to analyze the limiting linear operator, which in our case is related to the Bernoulli equation. Instead, we apply a direct solution technique using variation-of-constants formulas. On the one hand this is much more explicit, but on the other hand the resulting estimates are rather delicate on account of the custom function spaces involved.

Discussion

Due to the important organizing role that wave solutions often play in complex systems, scaling information such as (1.7) can be used as the starting point to uncover more general dynamical information concerning models such as (1.1) and related models of polar auxin transport. As such, we hope that the ideas we present here will provide a robust analytical tool to analyze different types of models as well. The resulting insights and predictions could help to prioritize competing models on the basis of dynamical experimental observations. Indeed, scaling laws appear to play a role in many aspects of biological systems, such as the structural properties of vascular systems (Razavi et al. 2018), the mass dependence of metabolic rates (West and Brown 2004) and the functional constraints imposed by size (Schmidt-Nielsen and Knut 1984).

Although we have included only right-polarizing PIN in our system, we believe that our techniques can be adapted to cover the full case where left-polarizing PIN is also included. However, the computations rapidly become unwieldy and the limiting system is expected to differ qualitatively. For this reason, we have chosen not to pursue this level of generality in the present paper, as it would only obscure the main ideas behind our framework. One of the main generalizations that we intend to pursue in the future is to study the model in two spatial dimensions. This is motivated by recent numerical observations concerning the formation of auxin channels and their associated PIN polarization under the influence of travelling patterns that are localized in both spatial dimensions (Althuis 2021; Merks et al. 2007).

Notation

We summarize a few aspects of our (mostly standard) notation.

If $f = f (X)$ is a differentiable function on $R$ , then we sometimes write $f^{'} = \partial_{X} [f]$ .
If $X$ and $Y$ are normed spaces, then we denote the space of bounded linear operators from $X$ to $Y$ by $B (X, Y)$ . We put $B (X) : = B (X, X)$ .
We sometimes abbreviate $R_{+} : = (0, \infty)$ and $R_{-} : = (- \infty, 0)$ .

The travelling wave problem

Rewriting the original problem (1.1)

We will reduce the problem (1.1) to a system of equations involving only $A_{j}$ and $P_{j}$ , and it will be this resulting system on which we make the long wave-scaled travelling wave Ansatz.

Changes of notation

We begin by rewriting (1.1) in a slightly more compressed manner that also exposes more transparently the leading order terms in the nonlinearities. Let $δ^{\pm}$ be the left and right difference operators that act on sequences $(x_{j})$ in $R$ via

\begin{matrix} δ^{+} x_{j} : = x_{j + 1} - x_{j} and δ^{-} x_{j} : = x_{j} - x_{j - 1} . \end{matrix}

Next, for $k, x \in R$ with $k \neq 0$ and $k + x \neq 0$ we have

\begin{matrix} \frac{x}{k + x} = \frac{x}{k} - \frac{x^{2}}{k (k + x)} . \end{matrix}

We put

\begin{matrix} Q_{1} (x, y) : = \frac{x^{2} y}{k_{a} + x} \end{matrix}

2.1

and compress

\begin{matrix} τ_{1} : = \frac{T_{act}}{k_{a}} and τ_{2} : = T_{diff} \end{matrix}

2.2

to see that our equation for $A_{j}$ now reads

\begin{matrix} {\dot{A}}_{j} = τ_{2} δ^{+} δ^{-} A_{j} - τ_{1} δ^{-} (R_{j} A_{j}) + τ_{1} δ^{-} Q_{1} (A_{j}, R_{j}) . \end{matrix}

Next, we abbreviate

\begin{matrix} κ : = \frac{k_{1}}{k_{r} k_{m}} \end{matrix}

2.3

and put

\begin{matrix} Q_{2} (x, y) : = κ (\frac{k_{r} y + k_{m} x + x y}{(k_{r} + x) (k_{m} + y)}) x y \end{matrix}

2.4

to see that the equation for $P_{j}$ is

\begin{matrix} {\dot{P}}_{j} = - κ A_{j + 1} P_{j} + α A_{j} + Q_{2} (A_{j + 1}, P_{j}) . \end{matrix}

The equation for $R_{j}$ is updated similarly, and so we have rewritten (1.1) as

\begin{matrix} \{\begin{matrix} {\dot{A}}_{j} = τ_{2} δ^{+} δ^{-} A_{j} - τ_{1} δ^{-} (R_{j} A_{j}) + τ_{1} δ^{-} Q_{1} (A_{j}, R_{j}), \\ {\dot{P}}_{j} = - κ A_{j + 1} P_{j} + α A_{j} + Q_{2} (A_{j + 1}, P_{j}), \\ {\dot{R}}_{j} = κ A_{j + 1} P_{j} - Q_{2} (A_{j + 1}, P_{j}) . \end{matrix}) \end{matrix}

2.5
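As a sanity check on the definition (2.4), the following minimal sketch verifies numerically that $κ x y - Q_{2} (x, y)$ collapses to a single saturating product. The function `saturating_term` and all parameter values are illustrative assumptions; in particular, the saturating form is only our guess at how such a term would appear before the rewriting, not a quotation of (1.1).

```python
def Q2(x, y, k1, kr, km):
    """Remainder term Q_2 from (2.4), with kappa = k1 / (kr * km) as in (2.3)."""
    kappa = k1 / (kr * km)
    return kappa * (kr * y + km * x + x * y) / ((kr + x) * (km + y)) * x * y


def saturating_term(x, y, k1, kr, km):
    """Hypothetical saturating product k1 * x/(kr + x) * y/(km + y) (an assumption)."""
    return k1 * x / (kr + x) * y / (km + y)


# Arbitrary illustrative parameter values, not taken from the paper.
k1, kr, km = 1.3, 0.7, 2.1
kappa = k1 / (kr * km)
samples = [(0.05, 0.2), (0.5, 1.0), (3.0, 0.01)]

# kappa*x*y - Q_2(x, y) should coincide with the saturating product.
residuals = [
    abs(kappa * x * y - Q2(x, y, k1, kr, km) - saturating_term(x, y, k1, kr, km))
    for x, y in samples
]
```

The leading term $κ x y$ is bilinear, while $Q_{2}$ is cubic for small arguments, which is what makes it a higher-order correction in the later scaling analysis.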

We observe that the equation for $R_{j}$ depends only on $A_{j + 1}$ and $P_{j}$ and therefore can be solved by direct integration. Before we do that, however, we rewrite the new equation for $P_{j}$ using Duhamel’s formula.

Rewriting the $P_{j}$ equation

We can view the equation for $P_{j}$ in (2.5) as a first-order linear differential equation forced by $α A_{j} + Q_{2} (A_{j + 1}, P_{j})$ , and so we can solve it via the integrating factor method. For f, $g \in L^{1}$ and $h \in L^{\infty}$ we introduce the operators

\begin{matrix} E (f) (s, t) : = exp (- κ \int_{s}^{t} f (ξ) d ξ), s, t \in R, \end{matrix}

2.6

\begin{matrix} P_{1} (f, g) (t) : = α \int_{- \infty}^{t} E (f) (s, t) g (s) d s, \end{matrix}

2.7

and

\begin{matrix} P_{2} (f, h) (t) : = \int_{- \infty}^{t} E (f) (s, t) Q_{2} (f (s), h (s)) d s . \end{matrix}

2.8

Recall from (1.3) that we want $P_{j}$ to vanish at $- \infty$ . The unique solution for $P_{j}$ in (2.5) that does vanish at $- \infty$ must satisfy

\begin{matrix} P_{j} (t) = P_{1} (A_{j + 1}, A_{j}) (t) + P_{2} (A_{j + 1}, P_{j}) (t) . \end{matrix}

Solving the $R_{j}$ equation

Since, per (1.3), we want $R_{j}$ to vanish at $- \infty$ , and since we are assuming that each $A_{j}$ vanishes sufficiently fast at both $\pm \infty$ and $P_{j}$ vanishes at $- \infty$ and remains bounded at $+ \infty$ , we may solve for $R_{j}$ by integrating the third equation in (2.5) from $- \infty$ to t. For f, $g \in L^{1}$ and $h \in L^{\infty}$ , we define more integral operators:

\begin{matrix} R_{1} (f, g) (t) : = & κ τ_{1} \int_{- \infty}^{t} f (s) P_{1} (f, g) (s) d s, t \in R, \end{matrix}

2.9

\begin{matrix} R_{2} (f, g, h) (t) : = & \int_{- \infty}^{t} (κ f (s) P_{2} (f, h) (s) - Q_{2} (f (s), P_{1} (f, g) (s) \\ + P_{2} (f, h) (s))) d s, \end{matrix}

2.10

and

\begin{matrix} R (f, g, h) (t) : = R_{1} (f, g) (t) + R_{2} (f, g, h) (t) . \end{matrix}

2.11

We have defined $P_{1}$ and $P_{2}$ just above, respectively, in (2.7) and (2.8) and $Q_{2}$ earlier in (2.4). Then the solution to the third equation in (2.5) that vanishes at $- \infty$ is

\begin{matrix} R_{j} (t) = R (A_{j + 1}, A_{j}, P_{j}) (t) = R_{1} (A_{j + 1}, A_{j}) (t) + R_{2} (A_{j + 1}, A_{j}, P_{j}) (t) . \end{matrix}

2.12

The final system for $A_{j}$ and $P_{j}$

We rewrite (part of) the $A_{j}$ equation once more to incorporate the new expression for $R_{j}$ . For f, $g \in L^{1}$ and $h \in L^{\infty}$ and $t \in R$ put

\begin{matrix} N (f, g, h) (t) : = τ_{1} Q_{1} (g (t), R (f, g, h) (t)) - τ_{1} R_{2} (f, g, h) (t) g (t), \end{matrix}

2.13

where we defined $Q_{1}$ in (2.1). Then $A_{j}$ must satisfy

\begin{matrix} {\dot{A}}_{j} = τ_{2} δ^{+} δ^{-} A_{j} - δ^{-} (R_{1} (A_{j + 1}, A_{j}) A_{j}) + δ^{-} N (A_{j + 1}, A_{j}, P_{j}), \end{matrix}

and so our system for $A_{j}$ and $P_{j}$ is now

\begin{matrix} \{\begin{matrix} {\dot{A}}_{j} = τ_{2} δ^{+} δ^{-} A_{j} - δ^{-} (R_{1} (A_{j + 1}, A_{j}) A_{j}) + δ^{-} N (A_{j + 1}, A_{j}, P_{j}), \\ P_{j} = P_{1} (A_{j + 1}, A_{j}) + P_{2} (A_{j + 1}, P_{j}) . \end{matrix}) \end{matrix}

2.14

That is, using the formula (2.12) for $R_{j}$ in terms of $A_{j}$ and $P_{j}$ , we can solve (2.5) if we can solve (2.14).

We will make two changes of variables on (2.14). First, in Sect. 2.2, we make a travelling wave Ansatz for $A_{j}$ and $P_{j}$ . We reformulate (2.14) for the travelling wave profiles as the system (2.28) below. Then, in Sect. 3.1, we introduce our long wave scaling on these travelling wave profiles. After numerous adjustments, we arrive at the final system (3.42) for the scaled travelling wave profiles, which we solve in Sect. 6. The reader uninterested in these intermediate stages may wish to proceed directly to Proposition 3.5, which discusses the equivalence of the problem (2.14) for $A_{j}$ and $P_{j}$ and the ultimate long wave system (3.42). Of course, our notation must keep up with these changes of variables, and we summarize in Table 1 the evolution of a typical operator’s typesetting across these different problems.

Table 1.

Summary of notational evolution

Symbol	Use
$R$	The original problem (2.14)
${\tilde{R}}^{c}$	The travelling wave problem (2.28)
${\overset{˘}{R}}^{ϵ}$	The preliminary long wave problem (3.14)
$R^{ν}$	The final long wave problem (3.42)


Remark 2.1

The linearization of (2.14) at 0 yields

\begin{matrix} {\dot{A}}_{j} = τ_{2} δ^{+} δ^{-} A_{j}, P_{j} = R_{j} = 0 . \end{matrix}

If we follow the discussion after Friesecke and Pego (1999, Thm. 1.1), as well as Faver and Wright (2018, Rem. 2.2), and look for plane wave solutions $A_{j} (t) = e^{i k j - i ω t}$ with $ω$ , $k \in R$ , we find the dispersion relation

\begin{matrix} - i ω = 2 τ_{2} (cos (k) - 1) . \end{matrix}

2.15

The only real solutions are $ω = 0$ and $k \in 2 π Z$ . Previously, in Friesecke and Pego (1999) and Faver and Wright (2018) a nontrivial dispersion relation $ω = ω (k)$ was found by making the same kind of plane wave Ansatz, and the resulting ‘phase speed’ $k \mapsto ω (k) / k$ had a nonzero maximum $c_{s}$ , which was called the ‘speed of sound.’ These articles then proceeded to look for travelling waves with speed slightly above their respective values of $c_{s}$ ; these were ‘supersonic’ waves. For us, $ω (k)$ is identically zero, which suggests that the speed of sound for our auxin problem is 0. Our long wave scaling in Sect. 3.1 analytically justifies this intuition.
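The dispersion relation (2.15) can be checked with a short numerical sketch. It evaluates the factor that $δ^{+} δ^{-}$ produces on a plane wave and solves (2.15) for $ω$ ; the value of `tau2` is an illustrative assumption.

```python
import cmath
import math


def laplacian_symbol(k):
    """Factor picked up when delta^+ delta^- acts on the plane wave e^{i k j}."""
    return cmath.exp(1j * k) - 2 + cmath.exp(-1j * k)


def omega(k, tau2):
    """Solve -i * omega = 2 * tau2 * (cos k - 1) for omega."""
    return 1j * 2 * tau2 * (math.cos(k) - 1)


tau2 = 0.9  # illustrative value only
wavenumbers = [0.0, 0.5, 1.0, math.pi, 2 * math.pi]

# The lattice Laplacian acts on e^{i k j} as multiplication by 2 (cos k - 1).
symbol_errors = [abs(laplacian_symbol(k) - 2 * (math.cos(k) - 1)) for k in wavenumbers]

# omega is purely imaginary, so it is real only where cos k = 1.
omegas = [omega(k, tau2) for k in wavenumbers]
```

Away from $k \in 2 π Z$ the computed $ω$ has strictly negative imaginary part, so the plane waves decay in time rather than propagate, which is consistent with a vanishing ‘speed of sound.’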

The travelling wave Ansatz

We now look for solutions $A_{j}$ and $P_{j}$ to (2.14) of the form

\begin{matrix} A_{j} = ϕ_{1} (j - c t) and P_{j} = ϕ_{2} (j - c t) . \end{matrix}

2.16

The profiles $ϕ_{1}$ and $ϕ_{2}$ are real-valued functions of a single real variable and $c \in R$ . The following manipulations will be justified if we assume $ϕ_{1} \in H_{q}^{1}$ and $ϕ_{2} \in W^{1, \infty}$ ; we discuss the exponentially localized Sobolev space $H_{q}^{1}$ in Appendix A.3. Working on an exponentially localized space, as opposed to an algebraically weighted space, both allows us to capture precisely certain very fast decay properties and permits us to use some technical results on approximating Fourier multipliers. Furthermore, since we want $P_{j}$ to vanish at $- \infty$ and be asymptotically constant at $+ \infty$ , per the limits (1.3) and the numerical predictions of Fig. 2, we expect that $ϕ_{2}$ should vanish at $+ \infty$ and be asymptotically constant at $- \infty$ .

We will convert the problem (2.14) for $A_{j}$ and $P_{j}$ into a nonlocal system for $ϕ_{1}$ and $ϕ_{2}$ , with c as a parameter. Doing so amounts to little more than changing variables many times in the integral operators defined in Sects. 2.1.2 and 2.1.3 and gives us a host of new integral operators that will constitute the problem for $ϕ_{1}$ and $ϕ_{2}$ .

In what follows we assume $f \in L^{1}$ and $g \in L^{\infty}$ , so that the operators below are defined in the special cases of $f = ϕ_{1} \in H_{q}^{1}$ and $g = ϕ_{2} \in W^{1, \infty}$ . First, for x, $v \in R$ , put

\begin{matrix} {\tilde{E}}^{c} (f) (v, x) : = exp (\frac{κ}{c} \int_{v}^{x} f (u + 1) d u) \end{matrix}

2.17

and

\begin{matrix} {\tilde{P}}_{1}^{c} (f) (x) : = \frac{α}{c} \int_{x}^{\infty} {\tilde{E}}^{c} (f) (v, x) f (v) d v . \end{matrix}

2.18

Then we use the Ansatz (2.16) and the definition of $P_{1}$ in (2.7) to find

\begin{matrix} P_{1} (A_{j + 1}, A_{j}) (t) = & α \int_{- \infty}^{t} exp (- κ \int_{s}^{t} ϕ_{1} (j - c ξ + 1) d ξ) ϕ_{1} (j - c s) d s \\ = & {\tilde{P}}_{1}^{c} (ϕ_{1}) (j - c t) . \end{matrix}

Here we have substituted $u = j - c ξ$ in the exponential’s integral and then $v = j - c s$ throughout.
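The change of variables above can be illustrated numerically. The sketch below evaluates the time-domain operator (2.7) and the travelling wave operator (2.17)–(2.18) by nested trapezoid rules and compares them at $x = j - c t$ . The Gaussian profile `phi1`, the truncation length `L`, and all parameter values are assumptions made purely for illustration.

```python
import math


def trapz(func, a, b, n=200):
    """Composite trapezoid rule on [a, b]; also valid for b < a (signed integral)."""
    h = (b - a) / n
    total = 0.5 * (func(a) + func(b))
    for i in range(1, n):
        total += func(a + i * h)
    return h * total


def phi1(x):
    """Hypothetical localized profile standing in for phi_1 (an assumption)."""
    return math.exp(-x * x)


alpha, kappa, c = 0.8, 1.5, 0.5  # illustrative values only
j, t, L = 3, 1.2, 8.0            # lattice site, time, truncation length in x


def P1_time(t):
    """P_1(A_{j+1}, A_j)(t) from (2.7) with A_j(t) = phi1(j - c t)."""
    def integrand(s):
        inner = trapz(lambda xi: phi1(j - c * xi + 1), s, t)
        return math.exp(-kappa * inner) * phi1(j - c * s)
    return alpha * trapz(integrand, t - L / c, t)


def P1_wave(x):
    """tilde P_1^c(phi1)(x) from (2.17)-(2.18), truncated at x + L."""
    def integrand(v):
        inner = trapz(lambda u: phi1(u + 1), v, x)
        return math.exp(kappa / c * inner) * phi1(v)
    return alpha / c * trapz(integrand, x, x + L)


lhs = P1_time(t)
rhs = P1_wave(j - c * t)
```

Because the substitutions $u = j - c ξ$ and $v = j - c s$ are linear, they map the quadrature nodes of one integral onto those of the other, so the two values agree up to rounding rather than merely up to discretization error.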

Similar substitutions, which we do not discuss, yield the following identities. Put

\begin{matrix} {\tilde{P}}_{2}^{c} (f, g) (x) : = \frac{1}{c} \int_{x}^{\infty} {\tilde{E}}^{c} (f) (v, x) Q_{2} (f (v + 1), g (v)) d v, \end{matrix}

2.19

so that with $P_{2}$ defined in (2.8) we have

\begin{matrix} P_{2} (A_{j + 1}, P_{j}) (t) = {\tilde{P}}_{2}^{c} (ϕ_{1}, ϕ_{2}) (j - c t) . \end{matrix}

Thus $ϕ_{2}$ must satisfy

\begin{matrix} ϕ_{2} = {\tilde{P}}_{1}^{c} (ϕ_{1}) + {\tilde{P}}_{2}^{c} (ϕ_{1}, ϕ_{2}), \end{matrix}

2.20

which indicates that, as expected, $ϕ_{2}$ should vanish at $+ \infty$ and be asymptotically constant at $- \infty$ .

Now we reformulate the equation for $A_{j}$ , equivalently, for $ϕ_{1}$ . Put

\begin{matrix} {\tilde{R}}_{1}^{c} (f) (x) : = \frac{κ τ_{1}}{c} \int_{x}^{\infty} f (u + 1) {\tilde{P}}_{1}^{c} (f) (u) d u, \end{matrix}

2.21

so that with $R_{1}$ defined in (2.9) we have

\begin{matrix} R_{1} (A_{j + 1}, A_{j}) (t) = {\tilde{R}}_{1}^{c} (ϕ_{1}) (j - c t) . \end{matrix}

Put

\begin{matrix} {\tilde{R}}_{2}^{c} (f, g) (x) : = & \frac{1}{c} \int_{x}^{\infty} (κ f (u + 1) {\tilde{P}}_{2}^{c} (f, g) (u) \\ - Q_{2} (f (u + 1), g (u))) d u \end{matrix}

2.22

and

\begin{matrix} {\tilde{R}}^{c} (f, g) : = {\tilde{R}}_{1}^{c} (f) + {\tilde{R}}_{2}^{c} (f, g), \end{matrix}

2.23

so that with $R_{2}$ defined in (2.10) and $R$ in (2.11) we have

\begin{matrix} R_{2} (A_{j + 1}, A_{j}, P_{j}) (t) = {\tilde{R}}_{2}^{c} (ϕ_{1}, ϕ_{2}) (j - c t) and \\ R (A_{j + 1}, A_{j}, P_{j}) (t) = {\tilde{R}}^{c} (ϕ_{1}, ϕ_{2}) (j - c t) . \end{matrix}

Last, put

\begin{matrix} {\tilde{N}}^{c} (f, g) (x) : = τ_{1} {\tilde{R}}_{2}^{c} (f, g) (x) f (x) - τ_{1} Q_{1} (f (x), {\tilde{R}}^{c} (f, g) (x)), \end{matrix}

2.24

so that with $N$ defined in (2.13) we have

\begin{matrix} N (A_{j + 1}, A_{j}, P_{j}) (t) = {\tilde{N}}^{c} (ϕ_{1}, ϕ_{2}) (j - c t) . \end{matrix}

For a function $f : R \to R$ and $d \in R$ , define, as in (1.15), the shift operator $S^{d}$ by

\begin{matrix} (S^{d} f) (x) : = f (x + d) . \end{matrix}

2.25

This final piece of notation, along with the Eq. (2.20), allows us to convert the problem (2.14) for $A_{j}$ and $P_{j}$ into the following nonlocal system for $ϕ_{1}$ and $ϕ_{2}$ :

\begin{matrix} \{\begin{matrix} - c ϕ_{1}^{'} = τ_{2} (S^{1} - 2 + S^{- 1}) ϕ_{1} + (S^{- 1} - 1) ({\tilde{R}}_{1}^{c} (ϕ_{1}) ϕ_{1} + {\tilde{N}}^{c} (ϕ_{1}, ϕ_{2})), \\ ϕ_{2} = {\tilde{P}}_{1}^{c} (ϕ_{1}) + {\tilde{P}}_{2}^{c} (ϕ_{1}, ϕ_{2}) . \end{matrix}) \end{matrix}

2.26

The Fourier multiplier structure

We summarize our conventions and definitions for Fourier transforms and Fourier multipliers in Appendix A. If we take the Fourier transform of the equation for $ϕ_{1}$ in (2.26), we find

\begin{matrix} (i c k + 2 τ_{2} (cos (k) - 1)) {\hat{ϕ}}_{1} (k) = (1 - e^{- i k}) F [{\tilde{R}}_{1}^{c} (ϕ_{1}) ϕ_{1} + {\tilde{N}}^{c} (ϕ_{1}, ϕ_{2})] (k) . \end{matrix}

For $k \in R$ , we have $i c k + 2 τ_{2} (cos (k) - 1) = 0$ if and only if $k = 0$ . Consequently, the function

\begin{matrix} {\tilde{M}}_{c} (k) : = \frac{1 - e^{- i k}}{i c k + 2 τ_{2} (cos (k) - 1)} \end{matrix}

2.27

has a removable singularity at 0 and is in fact analytic on $R$ . We therefore define $M_{c}$ to be the Fourier multiplier with symbol ${\tilde{M}}_{c}$ , i.e., $M_{c}$ satisfies

\begin{matrix} \hat{M_{c} f} (k) = {\tilde{M}}_{c} (k) \hat{f} (k) . \end{matrix}

We discuss some further properties of Fourier multipliers in Appendix A.2. Now the problem (2.26) is equivalent to

\begin{matrix} \{\begin{matrix} ϕ_{1} = M_{c} ({\tilde{R}}_{1}^{c} (ϕ_{1}) ϕ_{1} + {\tilde{N}}^{c} (ϕ_{1}, ϕ_{2})) \\ ϕ_{2} = {\tilde{P}}_{1}^{c} (ϕ_{1}) + {\tilde{P}}_{2}^{c} (ϕ_{1}, ϕ_{2}) . \end{matrix}) \end{matrix}

2.28
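The removable singularity of the symbol (2.27) can be made concrete with a small sketch: both numerator and denominator vanish linearly at $k = 0$ , and their ratio tends to $1 / c$ . The helper name `M_symbol` and the parameter values are illustrative assumptions.

```python
import cmath
import math


def M_symbol(k, c, tau2):
    """Symbol tilde M_c(k) from (2.27); at k = 0 we use the limiting value 1/c."""
    if k == 0:
        return 1 / c
    numerator = 1 - cmath.exp(-1j * k)
    denominator = 1j * c * k + 2 * tau2 * (math.cos(k) - 1)
    return numerator / denominator


c, tau2 = 0.5, 1.0  # illustrative values only

# Approaching k = 0 from either side, the symbol tends to 1/c.
near_zero = [M_symbol(k, c, tau2) for k in (1e-2, 1e-3, 1e-4, -1e-4)]
gaps = [abs(value - 1 / c) for value in near_zero]

# At nonzero multiples of 2*pi the real part of the denominator vanishes,
# but the imaginary part i*c*k does not, so the symbol stays finite.
value_at_2pi = M_symbol(2 * math.pi, c, tau2)
```

The last evaluation illustrates why $k = 0$ is the only real zero of the denominator when $c \neq 0$ .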

The long wave problem

The long wave scaling

We now make the long wave Ansatz

\begin{matrix} ϕ_{1} (x) = ϵ ψ_{1} (ϵ^{μ} x), ϕ_{2} (x) = ϵ^{β} ψ_{2} (ϵ^{μ} x), and c = ϵ^{γ} c_{0} . \end{matrix}

3.1

We assume, as with $ϕ_{1}$ and $ϕ_{2}$ , that the scaled profiles satisfy $ψ_{1} \in H_{q}^{1}$ and $ψ_{2} \in W^{1, \infty}$ . We think of $ϵ > 0$ as small and keep the exponents $β$ , $γ$ , $μ > 0$ arbitrary for now; eventually we will pick

\begin{matrix} γ = μ = \frac{2}{5} and β = \frac{1}{5} . \end{matrix}

The reasoning behind this choice is by no means obvious at this point and will not be for some time; leaving $μ$ , $β$ , and $γ$ arbitrary will allow this choice to appear more naturally (at the cost of temporarily more cumbersome notation).

As we intuited in Remark 2.1, our wave speed is now close to 0, which is the auxin problem’s natural ‘speed of sound’. The parameter $c_{0} \neq 0$ affords us some additional flexibility in choosing the wave speed. A properly chosen value of $c_{0}$ will cause the maximum of the leading-order term of $ϕ_{1}$ to be $ϵ$ , which will fulfill our promise in Sect. 1.4 that the auxin-height is, to leading order, $ϵ$ . Friesecke and Pego introduce a similar auxiliary parameter into their $ϵ$ -dependent wave speed, see Friesecke and Pego (1999, Eq. (2.5), (2.13)). This parameter allows them to prove that the dependence of their travelling wave profile on wave speed is sufficiently regular in different function spaces, a result needed for their subsequent stability arguments in Friesecke and Pego (2002, 2004a, 2004b). We did not provide this extra parameter in our version (1.19) of the Friesecke–Pego wave speed, but rather we selected it so that the amplitude of the leading order ${sech}^{2}$ -profile term in (1.22) is 1. Similarly, we will not pursue their depth of wave-speed analysis on our profiles’ dependence on $c_{0}$ .

We convert (2.28) to another nonlocal system for $ψ_{1}$ and $ψ_{2}$ , which now depends heavily on the parameter $ϵ$ . As before, this process mostly amounts to changing variables in many integrals. For example, we use the definition of ${\tilde{P}}_{1}^{c}$ in (2.18) and the Ansatz (3.1) to find

\begin{matrix} {\tilde{P}}_{1}^{c} (ϕ_{1}) (x) = \frac{α}{ϵ^{γ} c_{0}} \int_{x}^{\infty} {\tilde{E}}^{c} (ϕ_{1}) (v, x) ϵ ψ_{1} (ϵ^{μ} v) d v, \end{matrix}

3.2

where, using the definition of ${\tilde{E}}^{c}$ in (2.17), we have

\begin{matrix} {\tilde{E}}^{c} (ϕ_{1}) (v, x) = & exp (\frac{κ}{ϵ^{γ} c_{0}} \int_{v}^{x} ϵ ψ_{1} (ϵ^{μ} u + ϵ^{μ}) d u) \\ = & exp (\frac{κ}{c_{0}} ϵ^{1 - (γ + μ)} \int_{ϵ^{μ} v}^{ϵ^{μ} x} ψ_{1} (U + ϵ^{μ}) d U) . \end{matrix}

Here we have substituted $U = ϵ^{μ} u$ .

Now for $f \in L^{1}$ we put

\begin{matrix} E (f) (V, X) : = exp (\frac{κ}{c_{0}} \int_{V}^{X} f (U) d U), V, X \in R, \end{matrix}

3.3

so that (3.2) becomes

\begin{matrix} {\tilde{P}}_{1}^{c} (ϕ_{1}) (x) = \frac{α}{c_{0}} ϵ^{1 - γ} \int_{x}^{\infty} E (ϵ^{1 - (γ + μ)} S^{ϵ^{μ}} ψ_{1}) (ϵ^{μ} v, ϵ^{μ} x) ψ_{1} (ϵ^{μ} v) d v . \end{matrix}

Here $S^{ϵ^{μ}}$ is the shift operator defined in (2.25) with $d = ϵ^{μ}$ . We substitute again with $V = ϵ^{μ} v$ and define

\begin{matrix} {\overset{˘}{P}}_{1}^{ϵ} (f) (X) : = \frac{α}{c_{0}} \int_{X}^{\infty} E (ϵ^{1 - (γ + μ)} S^{ϵ^{μ}} f) (V, X) f (V) d V \end{matrix}

3.4

to conclude that

\begin{matrix} {\tilde{P}}_{1}^{c} (ϕ_{1}) (x) = ϵ^{1 - (γ + μ)} {\overset{˘}{P}}_{1}^{ϵ} (ψ_{1}) (ϵ^{μ} x) . \end{matrix}

Similar careful substitutions will allow us to reformulate the integral operators from Sect. 2.2 in terms of the long wave Ansatz. First, however, we define

\begin{matrix} {\overset{˘}{Q}}_{1}^{ϵ} (X, Y) : = \frac{X^{2} Y}{k_{a} + ϵ X} and {\overset{˘}{Q}}_{2}^{ϵ} (X, Y) : = κ \frac{k_{r} Y + k_{m} ϵ^{1 - β} X + ϵ XY}{(k_{r} + ϵ X) (k_{m} + ϵ^{β} Y)} X Y . \end{matrix}

3.5

When $ϵ \neq 0$ , this definition permits the very convenient factorizations

\begin{matrix} Q_{1} (ϵ X, ϵ^{1 - (γ + μ)} Y) = ϵ^{3 - (γ + μ)} {\overset{˘}{Q}}_{1}^{ϵ} (X, Y) and Q_{2} (ϵ X, ϵ^{β} Y) = ϵ^{1 + 2 β} {\overset{˘}{Q}}_{2}^{ϵ} (X, Y), \end{matrix}

where $Q_{1}$ was defined in (2.1) and $Q_{2}$ in (2.4).
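The two factorizations can be confirmed numerically for generic exponents. In the sketch below the function names ending in `_breve` mirror the definitions in (3.5); the exponents and parameter values are arbitrary illustrative choices, deliberately different from the values selected later.

```python
def Q1(x, y, ka):
    """Q_1 from (2.1)."""
    return x * x * y / (ka + x)


def Q2(x, y, kappa, kr, km):
    """Q_2 from (2.4)."""
    return kappa * (kr * y + km * x + x * y) / ((kr + x) * (km + y)) * x * y


def Q1_breve(X, Y, eps, ka):
    """Scaled nonlinearity breve Q_1^eps from (3.5)."""
    return X * X * Y / (ka + eps * X)


def Q2_breve(X, Y, eps, beta, kappa, kr, km):
    """Scaled nonlinearity breve Q_2^eps from (3.5)."""
    num = kr * Y + km * eps ** (1 - beta) * X + eps * X * Y
    den = (kr + eps * X) * (km + eps ** beta * Y)
    return kappa * num / den * X * Y


# Arbitrary illustrative values; the exponents are intentionally generic.
ka, kr, km, kappa = 1.1, 0.7, 2.1, 0.9
gamma, mu, beta = 0.3, 0.25, 0.15
eps, X, Y = 0.01, 1.7, 0.4

lhs1 = Q1(eps * X, eps ** (1 - (gamma + mu)) * Y, ka)
rhs1 = eps ** (3 - (gamma + mu)) * Q1_breve(X, Y, eps, ka)

lhs2 = Q2(eps * X, eps ** beta * Y, kappa, kr, km)
rhs2 = eps ** (1 + 2 * beta) * Q2_breve(X, Y, eps, beta, kappa, kr, km)
```

Both identities hold for any positive exponents, which is why the scaled nonlinearities can be introduced before $γ$ , $μ$ , and $β$ are fixed.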

Now we work on the travelling wave integral operators. Below we will assume $f \in L^{1}$ and $g \in L^{\infty}$ . Put

\begin{matrix} {\overset{˘}{P}}_{2}^{ϵ} (f, g) (X) : = \frac{1}{c_{0}} \int_{X}^{\infty} E (ϵ^{1 - (γ + μ)} S^{ϵ^{μ}} f) (V, X) {\overset{˘}{Q}}_{2}^{ϵ} (f (V + ϵ^{μ}), g (V)) d V, \end{matrix}

3.6

so that with ${\tilde{P}}_{2}^{c}$ defined in (2.19) we have

\begin{matrix} {\tilde{P}}_{2}^{c} (ϕ_{1}, ϕ_{2}) (x) = ϵ^{1 - (γ + μ) + 2 β} {\overset{˘}{P}}_{2}^{ϵ} (ψ_{1}, ψ_{2}) (ϵ^{μ} x) . \end{matrix}

This converts the second equation in (2.28) for $ϕ_{2}$ to

\begin{matrix} ϵ^{β} ψ_{2} (ϵ^{μ} x) = ϵ^{1 - (γ + μ)} {\overset{˘}{P}}_{1}^{ϵ} (ψ_{1}) (ϵ^{μ} x) + ϵ^{1 - (γ + μ) + 2 β} {\overset{˘}{P}}_{2}^{ϵ} (ψ_{1}, ψ_{2}) (ϵ^{μ} x) . \end{matrix}

Passing to $X = ϵ^{μ} x$ , we find that $ψ_{2}$ must satisfy

\begin{matrix} ψ_{2} (X) = ϵ^{1 - (γ + μ) - β} {\overset{˘}{P}}_{1}^{ϵ} (ψ_{1}) (X) + ϵ^{1 - (γ + μ) + β} {\overset{˘}{P}}_{2}^{ϵ} (ψ_{1}, ψ_{2}) (X) . \end{matrix}

3.7

Now put

\begin{matrix} {\overset{˘}{R}}_{1}^{ϵ} (f) (X) : = \frac{κ τ_{1}}{c_{0}} \int_{X}^{\infty} {\overset{˘}{P}}_{1}^{ϵ} (f) (V) f (V + ϵ^{μ}) d V, \end{matrix}

3.8

so that with ${\tilde{R}}_{1}^{c}$ defined in (2.21) we have

\begin{matrix} {\tilde{R}}_{1}^{c} (ϕ_{1}) (x) = ϵ^{2 (1 - (γ + μ))} {\overset{˘}{R}}_{1}^{ϵ} (ψ_{1}) (ϵ^{μ} x) . \end{matrix}

Put

\begin{matrix} {\overset{˘}{R}}_{2}^{ϵ} (f, g) (X) : = & \frac{1}{c_{0}} \int_{X}^{\infty} (ϵ^{1 - (γ + μ)} κ f (V + ϵ^{μ}) {\overset{˘}{P}}_{2}^{ϵ} (f, g) (V) \\ - {\overset{˘}{Q}}_{2}^{ϵ} (f (V + ϵ^{μ}), g (V))) d V \end{matrix}

3.9

and

\begin{matrix} {\overset{˘}{R}}^{ϵ} (f, g) (X) : = ϵ^{1 - (γ + μ)} {\overset{˘}{R}}_{1}^{ϵ} (f) (X) + ϵ^{2 β} {\overset{˘}{R}}_{2}^{ϵ} (f, g) (X), \end{matrix}

3.10

so that with ${\tilde{R}}_{2}^{c}$ defined in (2.22) and ${\tilde{R}}^{c}$ defined in (2.23) we have

\begin{matrix} {\tilde{R}}_{2}^{c} (ϕ_{1}, ϕ_{2}) (x) = ϵ^{1 - (γ + μ) + 2 β} {\overset{˘}{R}}_{2}^{ϵ} (ψ_{1}, ψ_{2}) (ϵ^{μ} x) and \\ {\tilde{R}}^{c} (ϕ_{1}, ϕ_{2}) (x) = ϵ^{1 - (γ + μ)} {\overset{˘}{R}}^{ϵ} (ψ_{1}, ψ_{2}) (ϵ^{μ} x) . \end{matrix}

Finally, put

\begin{matrix} {\overset{˘}{N}}^{ϵ} (f, g) (X) : = τ_{1} {\overset{˘}{R}}_{2}^{ϵ} (f, g) (X) f (X) - ϵ^{1 - 2 β} τ_{1} {\overset{˘}{Q}}_{1}^{ϵ} (f (X), {\overset{˘}{R}}^{ϵ} (f, g) (X)), \end{matrix}

3.11

so that with ${\tilde{N}}^{c}$ defined in (2.24) we have

\begin{matrix} {\tilde{N}}^{c} (ϕ_{1}, ϕ_{2}) (x) = ϵ^{2 - (γ + μ) + 2 β} {\overset{˘}{N}}^{ϵ} (ψ_{1}, ψ_{2}) (ϵ^{μ} x) . \end{matrix}

The definition of scaled Fourier multipliers from (A.3) tells us that, for $ϵ > 0$ , $M_{ϵ^{γ} c_{0}}^{(ϵ^{μ})}$ is the Fourier multiplier satisfying

\begin{matrix} \hat{M_{ϵ^{γ} c_{0}}^{(ϵ^{μ})} f} (k) = {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) \hat{f} (k), \end{matrix}

where ${\tilde{M}}_{ϵ^{γ} c_{0}}$ is defined by taking $c = ϵ^{γ} c_{0}$ in (2.27). This converts the first equation in (2.28) for $ϕ_{1}$ to

\begin{matrix} ϵ ψ_{1} (ϵ^{μ} x) = M_{ϵ^{γ} c_{0}}^{(ϵ^{μ})} [ϵ^{2 (1 - (γ + μ))} {\overset{˘}{R}}_{1}^{ϵ} (ψ_{1}) ϵ ψ_{1} + ϵ^{2 - (γ + μ) + 2 β} {\overset{˘}{N}}^{ϵ} (ψ_{1}, ψ_{2})] (ϵ^{μ} x) . \end{matrix}

We factor this to reveal

\begin{matrix} ψ_{1} (X) = ϵ^{2 (1 - (γ + μ))} M_{ϵ^{γ} c_{0}}^{(ϵ^{μ})} [{\overset{˘}{R}}_{1}^{ϵ} (ψ_{1}) ψ_{1} + ϵ^{- 1 + γ + μ + 2 β} {\overset{˘}{N}}^{ϵ} (ψ_{1}, ψ_{2})] (X) . \end{matrix}

3.12

We abbreviate

\begin{matrix} {\overset{˘}{M}}_{ϵ} : = ϵ^{2 (1 - (γ + μ))} M_{ϵ^{γ} c_{0}}^{(ϵ^{μ})} \end{matrix}

3.13

to conclude from (3.12) and the prior Eq. (3.7) for $ψ_{2}$ that the long wave profiles must satisfy

\begin{matrix} \{\begin{matrix} ψ_{1} = {\overset{˘}{M}}_{ϵ} [{\overset{˘}{R}}_{1}^{ϵ} (ψ_{1}) ψ_{1} + ϵ^{- 1 + γ + μ + 2 β} {\overset{˘}{N}}^{ϵ} (ψ_{1}, ψ_{2})] \\ ψ_{2} = ϵ^{1 - (γ + μ + β)} {\overset{˘}{P}}_{1}^{ϵ} (ψ_{1}) + ϵ^{1 - (γ + μ) + β} {\overset{˘}{P}}_{2}^{ϵ} (ψ_{1}, ψ_{2}) . \end{matrix}) \end{matrix}

3.14

We have been tacitly assuming that all of the exponents on powers of $ϵ$ above are nonnegative so that the various $ϵ$ -dependent operators and prefactors are actually defined at $ϵ = 0$ . In particular, this demands

\begin{matrix} 1 - 2 β \geq 0, - 1 + γ + μ + 2 β \geq 0, and 1 - (γ + μ + β) \geq 0 . \end{matrix}

3.15

The formal long wave limit and exponent selection

Our intention is now to take the limit $ϵ \to 0$ in the Eq. (3.14) for $ψ_{1}$ and $ψ_{2}$ . Doing so in a way that the limit is both meaningful (i.e., defined and nontrivial) and reflective of what the numerics predict at $ϵ = 0$ will teach us what the exponents $μ$ , $γ$ , and $β$ should be, beyond the requirements of (3.15).

The formal limit on ${\overset{˘}{M}}_{ϵ}$ and the selection of the exponents $γ$ and $μ$

We want to assign a ‘natural’ definition to ${\overset{˘}{M}}_{0}$ , where ${\overset{˘}{M}}_{ϵ}$ was defined, for $ϵ > 0$ , in (3.13). However, we relied above on having $ϵ > 0$ to invoke the scaled Fourier multiplier identity (A.3) that gave us ${\overset{˘}{M}}_{ϵ}$ , and naively setting $ϵ = 0$ in that identity is meaningless. Additionally, we should be careful that the prefactor $ϵ^{2 (1 - (γ + μ))}$ in (3.13) does not lead us to define ${\overset{˘}{M}}_{0} = 0$ ; otherwise, we would have $ψ_{1} = 0$ when $ϵ = 0$ , and that is not what the numerics in Fig. 2 predict.

A natural starting point, then, is to study ${\overset{˘}{M}}_{ϵ}$ in the limit $ϵ \to 0^{+}$ , and this amounts to considering the limit of its symbol, whose definition we extract from the definition of ${\overset{˘}{M}}_{ϵ}$ in (3.13) and the definition of the scaled Fourier multiplier in (A.3). Thus, for each $k \in R$ , we want the limit

\begin{matrix} lim_{ϵ \to 0^{+}} ϵ^{2 (1 - (γ + μ))} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) \end{matrix}

3.16

to exist without being identically zero. The function ${\tilde{M}}_{ϵ^{γ} c_{0}}$ was defined in (2.27).

To calculate this limit, we first state the Taylor expansions

\begin{matrix} 1 - e^{- i z} = i z + i z^{2} N_{1} (z) and cos (z) - 1 = - \frac{z^{2}}{2} + \frac{i z^{4} N_{2} (z)}{2 τ_{2}} \end{matrix}

3.17

for $z \in C$ . The functions $N_{1}$ and $N_{2}$ are analytic and uniformly bounded on strips in the sense that

\begin{matrix} C_{q} : = sup_{x \in R} | N_{1} (x \pm iq) | + | N_{2} (x \pm iq) | < \infty \end{matrix}

3.18

for any $q > 0$ . The choice of constants on $N_{1}$ and $N_{2}$ will permit some useful cancellations later. Then

\begin{matrix} {\tilde{M}}_{c} (k) = \frac{i k + i k^{2} N_{1} (k)}{i c k - τ_{2} k^{2} + i k^{4} N_{2} (k)} = \frac{1 + k N_{1} (k)}{c + i τ_{2} k + k^{3} N_{2} (k)}, \end{matrix}

and so

\begin{matrix} ϵ^{2 (1 - (γ + μ))} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) = ϵ^{2 (1 - (γ + μ))} \frac{1 + ϵ^{μ} k N_{1} (ϵ^{μ} k)}{ϵ^{γ} c_{0} + i τ_{2} ϵ^{μ} k + ϵ^{3 μ} k^{3} N_{2} (ϵ^{μ} k)} . \end{matrix}

3.19

At this point it does not make sense to set $ϵ = 0$ , as then the denominator would be identically zero. So, we would like to factor some power of $ϵ$ out of the denominator. Since the first term in the denominator has a factor of $ϵ^{γ}$ and the second a factor of $ϵ^{μ}$ , we assume $γ = μ$ and remove the power of $ϵ$ from both the first and the second terms. We discuss the choice of $γ = μ$ further in Remark 3.2.

Then

\begin{matrix} ϵ^{2 (1 - (γ + μ))} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) = & ϵ^{2 (1 - 2 γ)} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{γ} k) \\ = & ϵ^{2 (1 - 2 γ) - γ} \frac{1 + ϵ^{γ} k N_{1} (ϵ^{γ} k)}{c_{0} + i τ_{2} k + ϵ^{2 γ} k^{3} N_{2} (ϵ^{γ} k)} . \end{matrix}

3.20

Pointwise in k we have

\begin{matrix} lim_{ϵ \to 0^{+}} \frac{1 + ϵ^{γ} k N_{1} (ϵ^{γ} k)}{c_{0} + i τ_{2} k + ϵ^{2 γ} k^{3} N_{2} (ϵ^{γ} k)} = \frac{1}{c_{0} + i τ_{2} k}, \end{matrix}

and so we want

\begin{matrix} 2 (1 - 2 γ) - γ = 0 \end{matrix}

so that the prefactor of $ϵ^{2 (1 - 2 γ) - γ}$ in (3.20) does not induce a trivial or undefined limit. Thus we take

\begin{matrix} γ = μ = \frac{2}{5} . \end{matrix}

Certainly doing so does not contradict any of the inequalities in (3.15), provided that $β$ is chosen appropriately. Moreover, the power of 2/5 agrees with the height-speed-width relations suggested in Fig. 4. And so

\begin{matrix} lim_{ϵ \to 0^{+}} ϵ^{2 (1 - (γ + μ))} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) = lim_{ϵ \to 0^{+}} ϵ^{2 / 5} {\tilde{M}}_{ϵ^{2 / 5} c_{0}} (ϵ^{2 / 5} k) = \frac{1}{c_{0} + τ_{2} i k} . \end{matrix}

Put

\begin{matrix} {\tilde{M}}^{(0)} (z) : = \frac{1}{c_{0} + i τ_{2} z}, \end{matrix}

3.21

so ${\tilde{M}}^{(0)}$ , whose only pole lies at $z = i c_{0} / τ_{2}$ , is analytic on any strip $\{z \in C | | Im (z) | < q\}$ for $q \in (0, c_{0} / τ_{2})$ . Let $M^{(0)}$ be the Fourier multiplier with symbol ${\tilde{M}}^{(0)}$ .

Lemma A.2 then gives the following properties of $M^{(0)}$ ; the identities (3.22) are direct calculations with the Fourier transform.

Lemma 3.1

Fix $q \in (0, c_{0} / τ_{2})$ . Then $M^{(0)} \in B (H_{q}^{r}, H_{q}^{r + 1})$ for all r. More generally, if $f \in H^{1}$ and $g \in L^{2}$ , then

\begin{matrix} M^{(0)} (c_{0} + τ_{2} \partial_{X}) f = f and (c_{0} + τ_{2} \partial_{X}) M^{(0)} g = g . \end{matrix}

3.22

Because of the identities (3.22), we write $M^{(0)} = {(c_{0} + τ_{2} \partial_{X})}^{- 1}$ . The formal analysis above then leads us to expect

\begin{matrix} lim_{ϵ \to 0^{+}} {\overset{˘}{M}}_{ϵ} = M^{(0)} = {(c_{0} + τ_{2} \partial_{X})}^{- 1} . \end{matrix}

3.23

However, we have not yet proved this rigorously by any means.
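Although the limit (3.23) is only formal at this stage, the pointwise convergence of the symbols is easy to observe numerically. The following sketch, with illustrative parameter values and hypothetical helper names, compares the scaled symbol for $γ = μ = 2 / 5$ against $1 / (c_{0} + i τ_{2} k)$ at a fixed wavenumber.

```python
import cmath
import math


def M_symbol(k, c, tau2):
    """Symbol tilde M_c(k) from (2.27), for k != 0."""
    return (1 - cmath.exp(-1j * k)) / (1j * c * k + 2 * tau2 * (math.cos(k) - 1))


def scaled_symbol(k, eps, c0, tau2):
    """eps^{2/5} * tilde M_{eps^{2/5} c0}(eps^{2/5} k), i.e. gamma = mu = 2/5."""
    nu = eps ** 0.4
    return nu * M_symbol(nu * k, nu * c0, tau2)


def limit_symbol(k, c0, tau2):
    """Formal limit symbol tilde M^{(0)}(k) from (3.21)."""
    return 1 / (c0 + 1j * tau2 * k)


c0, tau2, k = 0.6, 1.2, 1.5  # illustrative values only
errors = [
    abs(scaled_symbol(k, eps, c0, tau2) - limit_symbol(k, c0, tau2))
    for eps in (1e-2, 1e-5, 1e-10)
]
```

The errors shrink roughly in proportion to $ϵ^{2 / 5}$ , as the expansion (3.20) suggests; this is of course no substitute for the operator-norm estimates needed to make (3.23) rigorous.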

Remark 3.2

Here is why we take $γ = μ$ when factoring the power of $ϵ$ out of the denominator in (3.19). First, taking $γ > μ$ produces

\begin{matrix} ϵ^{2 (1 - (γ + μ))} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) = ϵ^{2 (1 - (γ + μ)) - μ} \frac{1 + ϵ^{μ} k N_{1} (ϵ^{μ} k)}{ϵ^{γ - μ} c_{0} + i τ_{2} k + ϵ^{2 μ} k^{3} N_{2} (ϵ^{μ} k)} \end{matrix}

instead of (3.20). If $2 (1 - (γ + μ)) - μ > 0$ , then the right side above is identically zero at $ϵ = 0$ , and so we demand $2 (1 - (γ + μ)) - μ = 0$ ; there are many pairs of $γ$ and $μ$ that work here. But then

\begin{matrix} lim_{ϵ \to 0^{+}} \frac{1 + ϵ^{μ} k N_{1} (ϵ^{μ} k)}{ϵ^{γ - μ} c_{0} + i τ_{2} k + ϵ^{2 μ} k^{3} N_{2} (ϵ^{μ} k)} = \frac{1}{i τ_{2} k} . \end{matrix}

This suggests that instead of (3.23), we have

\begin{matrix} lim_{ϵ \to 0^{+}} {\overset{˘}{M}}_{ϵ} = {(τ_{2} \partial_{X})}^{- 1} . \end{matrix}

However, this is meaningless: differentiation is not invertible from $H_{q}^{r}$ to $H_{q}^{r + 1}$ .

Taking $γ < μ$ also does not work. In that case, instead of (3.20) we would have found

\begin{matrix} ϵ^{2 (1 - (γ + μ))} {\tilde{M}}_{ϵ^{γ} c_{0}} (ϵ^{μ} k) = ϵ^{2 (1 - (γ + μ)) - γ} \frac{1 + ϵ^{μ} k N_{1} (ϵ^{μ} k)}{c_{0} + i τ_{2} ϵ^{μ - γ} k + ϵ^{3 μ - γ} k^{3} N_{2} (ϵ^{μ} k)} . \end{matrix}

Since $γ < μ$ we find

\begin{matrix} lim_{ϵ \to 0^{+}} \frac{1 + ϵ^{μ} k N_{1} (ϵ^{μ} k)}{c_{0} + i τ_{2} ϵ^{μ - γ} k + ϵ^{3 μ - γ} k^{3} N_{2} (ϵ^{μ} k)} = \frac{1}{c_{0}} . \end{matrix}

We would then want $2 - 3 γ - 2 μ = 0$ to prevent a trivial limit.

Choosing $γ$ and $μ$ appropriately, we conclude that at $ϵ = 0$ the equation for $ψ_{1}$ from (3.14) formally reduces to

\begin{matrix} ψ_{1} = \frac{1}{c_{0}} {\overset{˘}{R}}_{1}^{0} (ψ_{1}) ψ_{1} . \end{matrix}

Numerically we expect $ψ_{1} (X) > 0$ for all X when $ϵ = 0$ , and so, using the definition of ${\overset{˘}{R}}_{1}^{0}$ from (3.8), we have

\begin{matrix} c_{0} = {\overset{˘}{R}}_{1}^{0} (ψ_{1}) (X) = \frac{α κ τ_{1}}{c_{0}^{2}} \int_{X}^{\infty} (\int_{V}^{\infty} ψ_{1} (W) d W) ψ_{1} (V) d V . \end{matrix}

Differentiating, we find

\begin{matrix} (\int_{X}^{\infty} ψ_{1} (W) d W) ψ_{1} (X) = 0 . \end{matrix}

But since $ψ_{1} (W) > 0$ for all W, we cancel the integral factor to find $ψ_{1} (X) = 0$ , a contradiction to our numerical predictions.

The formal leading order equation for $ψ_{1}$

At $ϵ = 0$ the equation for $ψ_{1}$ in (3.14) becomes (again, formally)

\begin{matrix} ψ_{1} = M^{(0)} ({\overset{˘}{R}}_{1}^{0} (ψ_{1}) ψ_{1}) = {(c_{0} + τ_{2} \partial_{X})}^{- 1} ({\overset{˘}{R}}_{1}^{0} (ψ_{1}) ψ_{1}) . \end{matrix}

This is equivalent to

\begin{matrix} c_{0} ψ_{1} + τ_{2} ψ_{1}^{'} = {\overset{˘}{R}}_{1}^{0} (ψ_{1}) ψ_{1} . \end{matrix}

3.24

We will rewrite this equation so that each term is a perfect derivative.

The definition of ${\overset{˘}{R}}_{1}^{ϵ}$ in (3.8), valid for all $ϵ$ , gives

\begin{matrix} {\overset{˘}{R}}_{1}^{0} (ψ_{1}) (X) = \frac{α κ τ_{1}}{c_{0}^{2}} \int_{X}^{\infty} (\int_{V}^{\infty} ψ_{1} (W) d W) ψ_{1} (V) d V . \end{matrix}

3.25

Write

\begin{matrix} Ψ_{1} (X) : = \int_{X}^{\infty} ψ_{1} (W) d W, \end{matrix}

so that $Ψ_{1}^{'} = - ψ_{1}$ . The double integral from (3.25) is

\begin{matrix} \int_{X}^{\infty} (\int_{V}^{\infty} ψ_{1} (W) d W) ψ_{1} (V) d V = & - \int_{X}^{\infty} Ψ_{1} (V) Ψ_{1}^{'} (V) d V \\ = & - \int_{X}^{\infty} \partial_{V} [\frac{Ψ_{1}^{2} (V)}{2}] d V \\ = & \frac{Ψ_{1} {(X)}^{2}}{2} . \end{matrix}

Here we are using the requirement that $ψ_{1} \in H_{q}^{1}$ , which implies $Ψ_{1} (X) \to 0$ as $X \to \infty$ . Thus

\begin{matrix} {\overset{˘}{R}}_{1}^{0} (ψ_{1}) ψ_{1} = - (\frac{α κ τ_{1}}{2 c_{0}^{2}}) Ψ_{1}^{2} Ψ_{1}^{'} = - (\frac{α κ τ_{1}}{6 c_{0}^{2}}) \partial_{X} [Ψ_{1}^{3}] . \end{matrix}

Then (3.24) is equivalent to

\begin{matrix} τ_{2} Ψ_{1}^{''} + c_{0} Ψ_{1}^{'} - (\frac{α κ τ_{1}}{6 c_{0}^{2}}) \partial_{X} [Ψ_{1}^{3}] = 0 . \end{matrix}

We integrate both sides from $X$ to $\infty$ and use the aforementioned fact that $Ψ_{1}$ and its derivatives are required to vanish at $\infty$ to find

\begin{matrix} τ_{2} Ψ_{1}^{'} + c_{0} Ψ_{1} - (\frac{α κ τ_{1}}{6 c_{0}^{2}}) Ψ_{1}^{3} = 0 . \end{matrix}

3.26

This is a Bernoulli equation, and it has the solution

\begin{matrix} Ψ_{1} (X) = Σ (X) : = {(\frac{6 c_{0}^{3}}{α κ τ_{1} + 6 c_{0}^{2} exp (2 c_{0} X / τ_{2} + θ)})}^{1 / 2} . \end{matrix}

3.27

Here $θ \in R$ is an arbitrary phase shift. It follows that putting

\begin{matrix} ψ_{1} (X) = σ (X) : = - Σ^{'} (X) = \frac{{(6 c_{0}^{3})}^{3 / 2} exp (2 c_{0} X / τ_{2} + θ)}{τ_{2} [α κ τ_{1} + 6 c_{0}^{2} exp (2 c_{0} X / τ_{2} + θ)]^{3 / 2}} \end{matrix}

3.28

solves (3.24).
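The explicit profiles can be checked against the Bernoulli equation (3.26) with a few lines of code. The sketch below evaluates $Σ$ from (3.27) and $σ$ from (3.28), inserts $Ψ_{1} = Σ$ and $Ψ_{1}^{'} = - σ$ into (3.26), and separately compares $σ$ with a central finite difference of $- Σ$ . All parameter values are illustrative assumptions.

```python
import math

# Illustrative parameter values only; none are taken from the paper.
alpha, kappa, tau1, tau2, c0, theta = 0.8, 1.5, 1.0, 1.2, 0.6, 0.3


def growth(X):
    """Exponential factor exp(2 c0 X / tau2 + theta) shared by (3.27) and (3.28)."""
    return math.exp(2 * c0 * X / tau2 + theta)


def Sigma(X):
    """Integrated profile Sigma from (3.27)."""
    return math.sqrt(6 * c0**3 / (alpha * kappa * tau1 + 6 * c0**2 * growth(X)))


def sigma(X):
    """Leading-order auxin profile sigma = -Sigma' from (3.28)."""
    denom = alpha * kappa * tau1 + 6 * c0**2 * growth(X)
    return (6 * c0**3) ** 1.5 * growth(X) / (tau2 * denom**1.5)


def bernoulli_residual(X):
    """Left side of (3.26) with Psi_1 = Sigma and Psi_1' = -sigma."""
    a = alpha * kappa * tau1 / (6 * c0**2)
    return -tau2 * sigma(X) + c0 * Sigma(X) - a * Sigma(X) ** 3


def derivative_mismatch(X, h=1e-5):
    """|sigma(X) + Sigma'(X)| with Sigma' approximated by a central difference."""
    central = (Sigma(X + h) - Sigma(X - h)) / (2 * h)
    return abs(sigma(X) + central)


points = [-3.0, 0.0, 2.0]
residuals = [abs(bernoulli_residual(X)) for X in points]
mismatches = [derivative_mismatch(X) for X in points]
```

Changing `theta` only translates both profiles, in line with the role of $θ$ as a phase shift.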

Remark 3.3

We view the free parameter $θ$ in (3.28) as an artifact of the translation invariance of our original problem (1.1). Friesecke and Pego (1999) do not incorporate a phase shift like $θ$ into their leading order ${sech}^{2}$ -type KdV solution, since their broader existence result relies on working in spaces of even functions, and phase shifts destroy evenness. We will not need such symmetry in our subsequent arguments (nor could we achieve it, since no translation of $σ$ is even or odd), and so we will leave $θ$ as an arbitrary free parameter and not specify its value. Instead, we will restrain the extra degree of freedom of translation invariance by imposing a certain integral condition, which we make precise in (5.2).

The formal leading order equation for $ψ_{2}$ and the selection of the exponent $β$

From our choice of $γ = μ = 2 / 5$ and the inequalities in (3.15), we need, at the very least,

\begin{matrix} \frac{1}{10} \leq β \leq \frac{1}{5} . \end{matrix}
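The admissible range for $β$ follows from the three inequalities in (3.15) by elementary arithmetic, which the following sketch reproduces with exact rational numbers. The variable names are ours.

```python
from fractions import Fraction

# Exponents selected from the multiplier analysis: 2(1 - 2*gamma) - gamma = 0.
gamma = mu = Fraction(2, 5)
prefactor_exponent = 2 * (1 - 2 * gamma) - gamma

# Constraints (3.15), each solved for beta:
#   1 - 2*beta >= 0                 ->  beta <= 1/2
#   -1 + gamma + mu + 2*beta >= 0   ->  beta >= (1 - gamma - mu) / 2
#   1 - (gamma + mu + beta) >= 0    ->  beta <= 1 - (gamma + mu)
beta_lower = (1 - gamma - mu) / 2
beta_upper = min(Fraction(1, 2), 1 - (gamma + mu))

# Exponent on the leading term of the psi_2 equation in (3.14).
def psi2_leading_exponent(beta):
    return 1 - (gamma + mu + beta)
```

Only the choice $β = 1 / 5$ makes `psi2_leading_exponent` vanish, so it is the only value in the admissible range for which the leading term of the $ψ_{2}$ equation survives at $ϵ = 0$ .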

If the strict inequality $β < 1 / 5$ holds, then at $ϵ = 0$ the equation for $ψ_{2}$ in (3.14) reduces to the trivial result $ψ_{2} = 0$ . This is not at all what we expect numerically from Fig. 2; rather, we anticipate that $ψ_{2}$ will asymptote to some nonzero constant at $- \infty$ .

However, if we instead take $β$ so that

\begin{matrix} 0 = 1 - (γ + μ + β) = \frac{1}{5} - β, \end{matrix}

which is to say,

\begin{matrix} β = \frac{1}{5}, \end{matrix}

then the equation for $ψ_{2}$ in (3.14) at $ϵ = 0$ becomes

\begin{matrix} ψ_{2} = {\overset{˘}{P}}_{1}^{0} (ψ_{1}) . \end{matrix}

Putting

\begin{matrix} ψ_{2} (X) = ζ (X) : = {\overset{˘}{P}}_{1}^{0} (σ) (X) = \frac{α}{c_{0}} \int_{X}^{\infty} σ (V) d V \end{matrix}

3.29

therefore solves the leading order equation for $ψ_{2}$ . In fact, since $Σ^{'} = - σ$ and $Σ$ vanishes at $+ \infty$ , we have

\begin{matrix} ζ (X) = \frac{α}{c_{0}} Σ (X) = \frac{α}{c_{0}} {(\frac{6 c_{0}^{3}}{α κ τ_{1} + 6 c_{0}^{2} e^{2 c_{0} X / τ_{2} + θ}})}^{1 / 2}, \end{matrix}

where $Σ$ was defined in (3.27).
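As a quick consistency check of (3.29), one can compare the integral definition of $ζ$ with the closed form $(α / c_{0}) Σ$ and evaluate the plateau that $ζ$ approaches as $X \to - \infty$. The sketch below uses illustrative parameter values and a hypothetical Simpson helper; truncating the integral at a finite upper limit is an additional assumption, justified by the exponential decay of $σ$.

```python
import math

# Illustrative parameter values (assumptions, not taken from the paper).
ALPHA, KAPPA, TAU1, TAU2, C0, THETA = 0.7, 1.3, 0.9, 1.5, 0.8, 0.3


def Sigma(X):
    e = math.exp(2.0 * C0 * X / TAU2 + THETA)
    return math.sqrt(6.0 * C0**3 / (ALPHA * KAPPA * TAU1 + 6.0 * C0**2 * e))


def sigma(X):
    e = math.exp(2.0 * C0 * X / TAU2 + THETA)
    denom = TAU2 * (ALPHA * KAPPA * TAU1 + 6.0 * C0**2 * e) ** 1.5
    return (6.0 * C0**3) ** 1.5 * e / denom


def simpson(f, a, b, n=8000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def zeta_closed_form(X):
    return ALPHA / C0 * Sigma(X)


def zeta_by_quadrature(X, tail=60.0):
    # The infinite upper limit is truncated at X + tail (an approximation).
    return ALPHA / C0 * simpson(sigma, X, X + tail)


max_gap = max(abs(zeta_closed_form(X) - zeta_by_quadrature(X)) for X in (-2.0, 0.0, 1.0))

# Plateau of zeta far behind the pulse, read off from (3.27) as X -> -infinity.
plateau = ALPHA / C0 * math.sqrt(6.0 * C0**3 / (ALPHA * KAPPA * TAU1))
plateau_gap = abs(zeta_closed_form(-40.0) - plateau)
print(max_gap, plateau, plateau_gap)
```

The plateau value is nonzero for any positive parameters, in line with the expectation that $ψ_{2}$ does not decay on both sides.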

The final long wave system

With the choices of exponents $γ = μ = 2 / 5$ and $β = 1 / 5$ , it becomes convenient to introduce the new small parameter

\begin{matrix} ν : = ϵ^{2 / 5} \end{matrix}

3.30

into the problem (3.14) and then recast that problem more cleanly in terms of $ν$ . First, the long wave Ansatz (3.1) becomes

\begin{matrix} ϕ_{1} (x) = ν^{5 / 2} ψ_{1} (ν x), ϕ_{2} (x) = ν^{1 / 2} ψ_{2} (ν x), and c = ν c_{0} . \end{matrix}

3.31
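The exponent bookkeeping behind (3.31) can be reproduced with exact rational arithmetic. The sketch assumes that the long wave Ansatz (3.1) has the form $ϕ_{1} (x) = ϵ ψ_{1} (ϵ^{γ} x)$, $ϕ_{2} (x) = ϵ^{β} ψ_{2} (ϵ^{γ} x)$, $c = ϵ^{μ} c_{0}$; this form is inferred from (3.30) and (3.31) rather than quoted, and the variable names are illustrative.

```python
from fractions import Fraction as Fr

# Exponents selected in the text.
gamma = mu = Fr(2, 5)
beta = Fr(1, 5)

# nu = eps**(2/5), so a power eps**a equals nu**(a / (2/5)).
NU_EXPONENT = Fr(2, 5)


def in_powers_of_nu(eps_exponent):
    """Convert an exponent of eps into the corresponding exponent of nu."""
    return eps_exponent / NU_EXPONENT


auxin_amplitude = in_powers_of_nu(Fr(1))  # assumed prefactor eps of psi_1
pin_amplitude = in_powers_of_nu(beta)     # prefactor eps**beta of psi_2
inverse_width = in_powers_of_nu(gamma)    # spatial scaling eps**gamma
speed = in_powers_of_nu(mu)               # wave speed eps**mu * c0

# Balance condition that fixes beta: 1 - (gamma + mu + beta) = 0.
balance = 1 - (gamma + mu + beta)

print(auxin_amplitude, pin_amplitude, inverse_width, speed, balance)
```

Read this way, the auxin amplitude scales like $ν^{5 / 2}$, the polarization amplitude like $ν^{1 / 2}$, and both the speed and the inverse width like $ν$.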

Proceeding very much as in Sect. 3.1, we then define

\begin{matrix} Q_{1}^{ν} (X, Y) : = & \frac{X^{2} Y}{k_{a} (k_{a} + ν^{5 / 2} X)} and \\ Q_{2}^{ν} (X, Y) : = & κ \frac{k_{r} Y + k_{m} ν^{2} X + ν^{5 / 2} XY}{(k_{r} + ν^{5 / 2} X) (k_{m} + ν^{1 / 2} Y)} X Y \end{matrix}

3.32

for X, $Y \in R$ , while for $f \in L^{1}$ and $g \in L^{\infty}$ , we put

\begin{matrix} P_{1}^{ν} (f) (X) : = \frac{α}{c_{0}} \int_{X}^{\infty} E (ν^{1 / 2} S^{ν} f) (V, X) f (V) d V, \end{matrix}

3.33

where $E$ was defined in (3.3), and

\begin{matrix} P_{2}^{ν} (f, g) (X) : = & \frac{1}{c_{0}} \int_{X}^{\infty} E (ν^{1 / 2} S^{ν} f) (V, X) Q_{2}^{ν} (f (V + ν), g (V)) d V, \end{matrix}

3.34

\begin{matrix} R_{1}^{ν} (f) (X) : = & \frac{κ τ_{1}}{c_{0}} \int_{X}^{\infty} P_{1}^{ν} (f) (V) f (V + ν) d V, \end{matrix}

3.35

\begin{matrix} R_{2}^{ν} (f, g) (X) : = & \frac{1}{c_{0}} \int_{X}^{\infty} (ν^{1 / 2} κ f (V + ν) P_{2}^{ν} (f, g) (V) \\ - Q_{2}^{ν} (f (V + ν), g (V))) d V, \end{matrix}

3.36

\begin{matrix} R^{ν} (f, g) (X) : = & R_{1}^{ν} (f) (X) + ν^{1 / 2} R_{2}^{ν} (f, g) (X), \end{matrix}

3.37

and

\begin{matrix} N^{ν} (f, g) (X) : = τ_{1} R_{2}^{ν} (f, g) (X) f (X) - ν^{3 / 2} τ_{1} Q_{1}^{ν} (f (X), R^{ν} (f, g) (X)) . \end{matrix}

3.38

Remark 3.4

The operators $P_{1}^{ν}$ and $R_{1}^{ν}$ map $L^{1}$ into $L^{\infty}$ , while $P_{2}^{ν}$ and $R_{2}^{ν}$ map $L^{1} \times L^{\infty}$ into $L^{\infty}$ , and $N^{ν}$ maps $L^{1} \times L^{\infty}$ into $L^{1}$ . More precisely, we could replace $L^{1}$ with $H_{q}^{1}$ and $L^{\infty}$ with $W^{1, \infty}$ and the preceding statement would still be true; see the estimates in Appendix B.1.

The operator $R_{1}^{0}$ has the especially simple form

\begin{matrix} R_{1}^{0} (f) (X) = (\frac{α κ τ_{1}}{c_{0}^{2}}) \int_{X}^{\infty} (\int_{V}^{\infty} f (W) d W) f (V) d V \end{matrix}

3.39

and therefore is differentiable from $L^{1}$ to $L^{\infty}$ .

Last, for $ν > 0$ , let $M^{(ν)}$ be the Fourier multiplier with symbol

\begin{matrix} {\tilde{M}}^{(ν)} (z) : = ν \frac{1 - e^{- i ν z}}{i c_{0} ν^{2} z + 2 τ_{2} (cos (ν z) - 1)} . \end{matrix}

3.40

When $ν = 0$ we have already defined $M^{(0)}$ as the Fourier multiplier whose symbol ${\tilde{M}}^{(0)}$ is given in (3.21). We will show momentarily in Sect. 4 how $M^{(0)}$ is a good approximation of $M^{(ν)}$ in the operator norm topology, which we formally anticipated in the limit (3.23).

We now summarize the work of this section and the preceding one. In Sect. 2 we reformulated our original problem (1.1) into the more concise structure (2.14) and then made a travelling wave ansatz on this latter system. That led to the travelling wave problem (2.28). In this section, we introduced the long wave Ansatz (3.1) into this travelling wave problem and studied several formal expansions and limits to deduce the ‘correct’ choice of exponents and scalings. With the operators defined above, we can summarize our long wave problem in the following form.

Proposition 3.5

Suppose

\begin{matrix} \left\{\begin{matrix} A_{j} (t) = ν^{5 / 2} ψ_{1} (ν (j - ν c_{0} t)), \\ P_{j} (t) = ν^{1 / 2} ψ_{2} (ν (j - ν c_{0} t)) \end{matrix}\right. \end{matrix}

3.41

for some $ψ_{1} \in H_{q}^{1}$ and $ψ_{2} \in L^{\infty}$ , where $c_{0}$ , $ν > 0$ and $q \in (0, c_{0} / τ_{2})$ . Then $A_{j}$ and $P_{j}$ satisfy (2.14) if and only if $ψ_{1}$ and $ψ_{2}$ satisfy

\begin{matrix} \left\{\begin{matrix} ψ_{1} = M^{(ν)} (R_{1}^{ν} (ψ_{1}) ψ_{1} + ν^{1 / 2} N^{ν} (ψ_{1}, ψ_{2})), \\ ψ_{2} = P_{1}^{ν} (ψ_{1}) + ν P_{2}^{ν} (ψ_{1}, ψ_{2}) . \end{matrix}\right. \end{matrix}

3.42

Moreover, taking

\begin{matrix} ψ_{1} = σ and ψ_{2} = ζ = P_{1}^{0} (σ), \end{matrix}

where $σ$ is defined in (3.28) and $ζ$ is given explicitly in (3.29), solves (3.42) when $ν = 0$ .

We will analyze the system (3.42) with a quantitative contraction mapping argument that tracks its dependence on $ν$ . Specifically, we will look for solutions $ψ_{1}$ and $ψ_{2}$ to (3.42) that are close to $σ$ and $ζ$ , respectively, when $ν$ is close to 0. We provide the details of this argument in Sect. 6.

Before doing so, we need to understand the behavior of two key operators, on whose good behavior the successful contraction argument will hinge. Our first task, as we mentioned above, is to study how $M^{(0)}$ approximates $M^{(ν)}$ , and we do this in Sect. 4. Next, since we are looking for solutions $(ψ_{1}, ψ_{2})$ to (3.42) that are close to $(σ, ζ)$ , it is natural to study the linearization of (3.42) at $(σ, ζ)$ for $ν = 0$ . It turns out that we will only need the linearization of the first equation, which is the operator

\begin{matrix} T ψ : = ψ - M^{(0)} [R_{1}^{0} (σ) ψ + (D R_{1}^{0} (σ) ψ) σ] . \end{matrix}

3.43

We will show in Sect. 5 that $T$ is surjective from $H_{q}^{1}$ to $H_{q}^{1}$ with a one-dimensional kernel. Restricting $T$ off this kernel will yield an extremely useful bijectivity result.

Analysis of the Fourier multiplier $M^{(ν)}$

We show that the Fourier multiplier $M^{(ν)}$ , whose symbol was defined in (3.40), converges to the multiplier $M^{(0)}$ , whose symbol was defined in (3.21). More precisely, we prove the following estimate.

Proposition 4.1

Fix $q \in (0, c_{0} / τ_{2})$ . There exist $ν_{M}$ , $C_{M} > 0$ such that if $0 < ν < ν_{M}$ , then

\begin{matrix} ‖ M^{(ν)} - M^{(0)} ‖_{B (H_{q}^{1})} \leq C_{M} ν^{1 / 3} . \end{matrix}

The proof of this estimate depends on the following lemma, whose proof we give in the remainder of this section.

Lemma 4.2

Let $q \in (0, c_{0} / τ_{2})$ . There exist $C_{M}$ , $ν_{M} > 0$ such that if $0 < ν < ν_{M}$ , then the map ${\tilde{M}}^{(ν)}$ defined in (3.40) is analytic on the strip ${\bar{U}}_{q} : = \{z \in C | | Im (z) | \leq q\}$ and satisfies

\begin{matrix} sup_{k \in R} | {\tilde{M}}^{(ν)} (k \pm i q) - {\tilde{M}}^{(0)} (k \pm iq) | < C_{M} ν^{1 / 3}, \end{matrix}

4.1

where ${\tilde{M}}^{(0)}$ was defined in (3.21).

This lemma allows us to invoke Beale’s result in Lemma A.2 (with $r = 1$ and $s = 0$ ) to prove Proposition 4.1. Beale’s result depends very much on our working in exponentially weighted Sobolev spaces, and this is another of our reasons for preferring these spaces to algebraically weighted ones.
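The convergence claimed in Lemma 4.2 can be probed numerically by sampling the two symbols on the lines $Im (z) = \pm q$. The sketch below assumes that the limiting symbol is ${\tilde{M}}^{(0)} (z) = 1 / (c_{0} + i τ_{2} z)$, which is what the rewritten form of ${\tilde{M}}^{(ν)}$ later in this section reduces to at $ν = 0$; the parameter values, the grid, and the function names are illustrative. A finite grid only gives a lower bound for the true supremum, so this is an illustration and not a verification of the lemma.

```python
import cmath

# Illustrative parameters (assumptions); Q must lie in (0, C0 / TAU2).
C0, TAU2, Q = 1.0, 1.0, 0.5


def symbol_nu(z, nu):
    """Symbol from (3.40)."""
    numerator = nu * (1.0 - cmath.exp(-1j * nu * z))
    denominator = 1j * C0 * nu**2 * z + 2.0 * TAU2 * (cmath.cos(nu * z) - 1.0)
    return numerator / denominator


def symbol_0(z):
    """Assumed limiting symbol 1 / (c0 + i*tau2*z)."""
    return 1.0 / (C0 + 1j * TAU2 * z)


def sup_gap(nu, k_max=400.0, dk=0.01):
    """Largest sampled gap between the symbols on the lines Im(z) = +Q and -Q."""
    steps = int(round(2.0 * k_max / dk))
    worst = 0.0
    for i in range(steps + 1):
        k = -k_max + i * dk
        for sign in (1.0, -1.0):
            z = complex(k, sign * Q)
            worst = max(worst, abs(symbol_nu(z, nu) - symbol_0(z)))
    return worst


gaps = {nu: sup_gap(nu) for nu in (0.2, 0.05)}
for nu, gap in gaps.items():
    print(nu, gap, nu ** (1.0 / 3.0))
```

The grid is chosen wide enough to include several points where $Re (ν z)$ passes through a multiple of $2 π$, since these are the locations where the denominator of ${\tilde{M}}^{(ν)}$ is smallest.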

We will estimate the difference $| {\tilde{M}}^{(ν)} (z) - {\tilde{M}}^{(0)} (z) |$ over two regimes, one in which $z = k \pm iq$ is ‘close’ to 0, and the other in which z is ‘far from’ 0. Part of these estimates will involve bounding the denominator of ${\tilde{M}}^{(ν)}$ away from zero; this will ensure the analyticity of ${\tilde{M}}^{(ν)}$ , since it is the quotient of two analytic functions.

To quantify these regimes, we introduce two positive constants p and m; we say that z is ‘close’ to 0 if $| z | \leq ν^{- p}$ and ‘far from’ 0 if $| z | > ν^{- p}$ . The constant m will later control how close the real part of $ν z$ is to an integer multiple of $2 π$ , a bound that will be very useful in certain estimates to come. All constants C in the work below are allowed to depend on m, p, and q, but they are always independent of $ν$ and z.

Our estimates will depend on the parameters p and m; once we have all the estimates together, we will choose useful values for p and m. We feel that this approach allows the otherwise nonobvious final values for p and m to emerge very naturally. This strategy of splitting the estimates over regions close to and far from 0 is modeled on the proofs of Faver and Wright (2018, Lem. A.13) and Stefanov and Wright (2020, Lem. 3) and the strategy in Johnson and Wright (2020, App. A.3). Friesecke and Pego (1999, Sec. 3) give a rather different proof of symbol convergence that relies on more knowledge of the poles of ${\tilde{M}}^{(ν)}$ than we care to discover.

Estimates for z ‘close to’ 0

In this regime we fix $| z | \leq ν^{- p}$ . We recall the Taylor expansions

\begin{matrix} 1 - e^{- i z} = i z + i z^{2} N_{1} (z) and cos (z) - 1 = - \frac{z^{2}}{2} + \frac{i z^{4} N_{2} (z)}{2 τ_{2}} \end{matrix}

from (3.17), as well as the estimate

\begin{matrix} C_{q} : = sup_{x \in R} | N_{1} (x \pm iq) | + | N_{2} (x \pm iq) | < \infty . \end{matrix}

Now we can write

\begin{matrix} {\tilde{M}}^{(ν)} (z) = \frac{1 + ν z N_{1} (ν z)}{c_{0} + τ_{2} i z + ν^{2} z^{3} N_{2} (ν z)} . \end{matrix}
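This rewriting can be checked numerically by defining $N_{1}$ and $N_{2}$ directly from the two Taylor identities above and comparing the result with the original symbol (3.40). The parameter values, sample points, and function names in the sketch are illustrative assumptions.

```python
import cmath

# Illustrative parameters (assumptions, not taken from the paper).
C0, TAU2 = 1.0, 1.0


def N1(w):
    """Remainder defined by 1 - exp(-i w) = i w + i w^2 N1(w)."""
    return (1.0 - cmath.exp(-1j * w) - 1j * w) / (1j * w**2)


def N2(w):
    """Remainder defined by cos(w) - 1 = -w^2/2 + i w^4 N2(w) / (2 tau2)."""
    return 2.0 * TAU2 * (cmath.cos(w) - 1.0 + w**2 / 2.0) / (1j * w**4)


def symbol_original(z, nu):
    """Symbol as written in (3.40)."""
    numerator = nu * (1.0 - cmath.exp(-1j * nu * z))
    denominator = 1j * C0 * nu**2 * z + 2.0 * TAU2 * (cmath.cos(nu * z) - 1.0)
    return numerator / denominator


def symbol_rewritten(z, nu):
    """Symbol after dividing numerator and denominator by i nu^2 z."""
    w = nu * z
    return (1.0 + w * N1(w)) / (C0 + 1j * TAU2 * z + nu**2 * z**3 * N2(w))


nu = 0.1
samples = [0.5 + 0.3j, 3.0 - 0.3j, -7.0 + 0.3j]
max_gap = max(abs(symbol_original(z, nu) - symbol_rewritten(z, nu)) for z in samples)
print(max_gap)
```

The two expressions should agree to rounding error, and setting the remainder terms to zero recovers $1 / (c_{0} + i τ_{2} z)$.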

With this expression we find the following equality

\begin{matrix} {\tilde{M}}^{(ν)} (z) - {\tilde{M}}^{(0)} (z) = I_{ν} (z) + I I_{ν} (z), \end{matrix}

where

\begin{matrix} I_{ν} (z) : = \frac{c_{0} ν z N_{1} (ν z) - ν^{2} z^{3} N_{2} (ν z)}{(c_{0} + τ_{2} i z + ν^{2} z^{3} N_{2} (ν z)) (c_{0} + i τ_{2} z)} \end{matrix}

4.2

and

\begin{matrix} I I_{ν} (z) : = \frac{i τ_{2} ν z^{2} N_{1} (ν z)}{(c_{0} + τ_{2} i z + ν^{2} z^{3} N_{2} (ν z)) (c_{0} + i τ_{2} z)} . \end{matrix}

4.3

We work on the denominators. We use the reverse triangle inequality to find

\begin{matrix} | (c_{0} + τ_{2} i z - 2 τ_{2} i ν^{2} z^{3} N_{2} (ν z)) (c_{0} + τ_{2} i z) | \geq | c_{0} - τ_{2} q - 2 τ_{2} ν^{2} {| z |}^{3} | N_{2} (ν z) | | \\ | c_{0} - τ_{2} q | . \end{matrix}

As $q \in (0, c_{0} / τ_{2})$ , we have $| c_{0} - τ_{2} q | > 0$ . Also, since $| z | \leq ν^{- p}$ , we have $ν^{2} {| z |}^{3} \leq ν^{2 - 3 p}$ . If we take

\begin{matrix} 0 < p < \frac{2}{3} \end{matrix}

4.4

and assume $ν \in (0, ν_{1})$ , where

\begin{matrix} ν_{1} : = min \{1, {(\frac{| c_{0} - τ_{2} q |}{4 C_{q} τ_{2}})}^{1 / (2 - 3 p)}\}, \end{matrix}

4.5

then

\begin{matrix} | c_{0} - τ_{2} q - 2 τ_{2} ν^{2} {| z |}^{3} | N_{2} (ν z) | | \geq \frac{| c_{0} - τ_{2} q |}{2} . \end{matrix}

4.6

In particular,

\begin{matrix} | c_{0} - τ_{2} q - 2 τ_{2} ν^{2} {| z |}^{3} | N_{2} (ν z) | | | c_{0} - τ_{2} q | \geq \frac{{| c_{0} - τ_{2} q |}^{2}}{2}, \end{matrix}

4.7

and this inequality guarantees that ${\tilde{M}}^{(ν)}$ is defined (and analytic) for $| z | \leq ν^{- p}$ and $| Im (z) | < q$ . Then we use (4.7) to estimate $I_{ν} (z)$ from (4.2) as

\begin{matrix} | I_{ν} (z) | \leq C ν^{1 - p} + C ν^{2 - 3 p} . \end{matrix}

Next, we use (4.6) to estimate $I I_{ν} (z)$ from (4.3) as

\begin{matrix} | I I_{ν} (z) | \leq C ν^{1 - p} \frac{| z |}{| c_{0} + i τ_{2} z |} . \end{matrix}

Setting $z = x \pm i q$ we note

\begin{matrix} \frac{{| z |}^{2}}{{| c_{0} + i τ_{2} z |}^{2}} = \frac{x^{2} + q^{2}}{{(c_{0} \mp τ_{2} q)}^{2} + τ_{2}^{2} x^{2}} \leq \frac{x^{2} + q^{2}}{{(c_{0} - τ_{2} q)}^{2} + τ_{2}^{2} x^{2}} . \end{matrix}

We know

\begin{matrix} D : = sup_{x \in R} \frac{x^{2} + q^{2}}{{(c_{0} - τ_{2} q)}^{2} + τ_{2}^{2} x^{2}} < \infty, \end{matrix}

and thus

\begin{matrix} | I I_{ν} (z) | \leq C ν^{1 - p} . \end{matrix}

We conclude

\begin{matrix} | {\tilde{M}}^{(ν)} (z) - {\tilde{M}}^{(0)} (z) | \leq | I_{ν} (z) | + | I I_{ν} (z) | \leq C (ν^{1 - p} + ν^{2 - 3 p}) . \end{matrix}

4.8

As we required $p \in (0, 2 / 3)$ , the final estimate contains only positive powers of $ν$ . Since we will always consider $0 < ν < ν_{1}$ in the future, the definition of $ν_{1}$ in (4.5) ensures $0 < ν < 1$ in the following regimes.

Estimates for z ‘far from’ 0

In this regime we assume $| z | > ν^{- p}$ . Take

\begin{matrix} ν_{2} < min \{ν_{1}, {(\frac{τ_{2}}{2 c_{0}})}^{1 / p}\}, \end{matrix}

4.9

with $ν_{1}$ defined in (4.5), so that if $0 < ν < ν_{2}$ , then $| z | > c_{0} / τ_{2}$ . With the reverse triangle inequality we find

\begin{matrix} | {\tilde{M}}^{(0)} (z) | \leq \frac{1}{| | c_{0} | - | τ_{2} z | |} < \frac{1}{τ_{2} ν^{- p} - c_{0}} < \frac{2}{τ_{2}} ν^{p} . \end{matrix}

4.10

Consequently, it suffices in this regime to show that ${\tilde{M}}^{(ν)}$ is bounded by a multiple of some power of $ν$ . It will be convenient now to rewrite ${\tilde{M}}^{(ν)}$ as

\begin{matrix} {\tilde{M}}^{(ν)} (z) = \frac{ν {\tilde{M}}_{1}^{(ν)} (z)}{{\tilde{M}}_{2}^{(ν)} (z)}, \end{matrix}

where

\begin{matrix} {\tilde{M}}_{1}^{(ν)} (z) : = 1 - e^{- i ν z} and {\tilde{M}}_{2}^{(ν)} (z) : = i c_{0} ν^{2} z + 2 τ_{2} (cos (ν z) - 1) . \end{matrix}

4.11

The analyticity of ${\tilde{M}}^{(ν)}$ for $| z | > ν^{- p}$ will follow if we bound ${\tilde{M}}_{2}^{(ν)}$ away from zero here.

The presence of the factor $cos (ν z) - 1$ in the denominator of ${\tilde{M}}^{(ν)}$ suggests that the behavior of this function may be different when $Re (ν z)$ is ‘close’ to an integer multiple of $2 π$ and when it is not. For this reason, we expand $z = x \pm i q$ and let $n \in Z$ be the unique integer such that $| ν x - 2 π n | \leq π$ . We consider three cases on the behavior of $ν x$ and n.

Estimates for $Re (ν z)$ ‘close to’ a nonzero integer multiple of $2 π$

In this regime we assume $| ν x - 2 π n | \leq ν^{m}$ with $n \neq 0$ .

We first rewrite the numerator as

\begin{matrix} {\tilde{M}}_{1}^{(ν)} (z) = 1 - e^{- i (ν x - 2 π n)} + e^{- i ν x} (1 - e^{\pm ν q}) . \end{matrix}

Since the map $y \mapsto e^{- i y}$ is uniformly Lipschitz on $R$ we have

\begin{matrix} | 1 - e^{- i (ν x - 2 π n)} | \leq | ν x - 2 π n | \leq ν^{m} . \end{matrix}

Since the map $y \mapsto e^{- y}$ is locally Lipschitz on $R$ we have, if we take $0 < ν < ν_{3}$ with

\begin{matrix} ν_{3} : = min \{ν_{2}, \frac{1}{q}\} \end{matrix}

4.12

and $ν_{2}$ defined in (4.9), the estimate

\begin{matrix} | 1 - e^{\pm ν q} | \leq 2 ν q . \end{matrix}

Then

\begin{matrix} | {\tilde{M}}_{1}^{(ν)} (z) | \leq ν^{m} + 2 ν q \leq C (ν^{m} + ν) . \end{matrix}

4.13

We remark that we did not need $n \neq 0$ here, although we will momentarily.

We now turn to the denominator, ${\tilde{M}}_{2}^{(ν)} (z)$ . Using the identity

\begin{matrix} cos (a + b i) = cos (a) cosh (b) - i sin (a) sinh (b) \end{matrix}

4.14

for $a, b \in R$ we find

\begin{matrix} Im ({\tilde{M}}_{2}^{(ν)} (z)) = c_{0} ν^{2} x - 2 τ_{2} sin (ν x) sinh (ν q) . \end{matrix}

We estimate

\begin{matrix} | Im ({\tilde{M}}_{2}^{(ν)} (z)) | \geq C (ν | n | - ν | ν x - 2 n π | - | sin (ν x - 2 n π) | | sinh (ν q) |) \end{matrix}

We control the three terms on the right as follows. First, $| n | \geq 1$ . Next, we are in the regime $| ν x - 2 n π | \leq ν^{m}$ . Finally, we have

\begin{matrix} | sin (ν x - 2 n π) | \leq | ν x - 2 n π | \leq ν^{m} and | sinh (ν q) | \leq 2 | ν q |, \end{matrix}

since $| ν q | \leq 1$ . We thus find

\begin{matrix} | Im ({\tilde{M}}_{2}^{(ν)} (z)) | \geq C (ν - ν^{m + 1}) . \end{matrix}

Now that we have the numerator and the denominator bounded, we can conclude

\begin{matrix} | {\tilde{M}}^{(ν)} (z) | \leq C ν \frac{ν^{m} + ν}{ν - ν^{m + 1}} = C \frac{ν^{m} + ν}{1 - ν^{m}} \leq C ν^{m} . \end{matrix}

4.15

Here we need to assume

\begin{matrix} 0 < m < 1 . \end{matrix}

4.16

Estimates for $Re (ν z)$ ‘close to’ 0

In this regime we assume $| ν x | \leq ν^{m}$ ; in particular, we are taking $n = 0$ . We will need the following bound on the cosine, which is a consequence of an elementary argument with Taylor’s theorem.

Lemma 4.3

Let $Q \geq 0$ . There exist $C_{1, Q}$ , $C_{2, Q} > 0$ such that if $Z \in C$ with $| Z | \leq C_{1, Q}$ and $| Im (Z) | \leq Q$ , then

\begin{matrix} | cos (Z) - 1 | \geq C_{2, Q} {| Z |}^{2} . \end{matrix}

In particular, if $Q = 0$ , then $C_{1, 0} > π$ .

We use the reverse triangle inequality on ${\tilde{M}}_{2}^{(ν)}$ from (4.11) to find

\begin{matrix} | {\tilde{M}}_{2}^{(ν)} (z) | \geq 2 τ_{2} | cos (ν z) - 1 | - c_{0} ν^{2} q - c_{0} ν^{2} | x | . \end{matrix}

4.17

Take $0 < ν < ν_{M}$ , where

\begin{matrix} ν_{M} < min \{ν_{3}, {(\frac{1}{q})}^{1 / (1 - m)}, {(\frac{C_{1, q}}{2})}^{1 / m}\}, \end{matrix}

4.18

with $ν_{3}$ defined in (4.12), to find

\begin{matrix} | ν z | \leq ν^{m} + ν q \leq 2 ν^{m} < C_{1, q} . \end{matrix}

Lemma 4.3 then guarantees

\begin{matrix} | cos (ν z) - 1 | \geq C {| ν z |}^{2} \geq C ν^{2 - 2 p} . \end{matrix}

Finally, since $| x | \leq ν^{m - 1}$ in this regime we use the bound (4.17) to conclude

\begin{matrix} | {\tilde{M}}_{2}^{(ν)} (z) | \geq C (ν^{2 - 2 p} - ν^{2} - ν^{m + 1}) . \end{matrix}

We remark that the derivation of the estimate (4.13) only assumed $| ν x - 2 π n | \leq ν^{m}$ and did not rely on having $n \neq 0$ . So it is still valid here, and we conclude

\begin{matrix} | {\tilde{M}}^{(ν)} (z) | \leq & C ν \frac{ν^{m} + ν}{ν^{2 - 2 p} - ν^{2} - ν^{m + 1}} = C \frac{ν^{m + 2 p - 1} + ν^{2 p}}{1 - ν^{2 p} - ν^{m + 2 p - 1}} \\ \leq & C (ν^{m + 2 p - 1} + ν^{2 p}) . \end{matrix}

4.19

Here we are assuming

\begin{matrix} 0 \leq 1 - 2 p < min {1, m} . \end{matrix}

4.20

Estimates for $Re (ν z)$ ‘far from’ every integer multiple of $2 π$

In this regime we assume $| ν x - 2 π n | > ν^{m}$ . We do not perform separate work on $n = 0$ and $n \neq 0$ .

Via (4.14) we find

\begin{matrix} Re ({\tilde{M}}_{2}^{(ν)} (z)) = - c_{0} ν^{2} q + 2 τ_{2} (cos (ν x - 2 n π) - 1) + 2 τ_{2} cos (ν x) (cosh (ν q) - 1) . \end{matrix}

We estimate

\begin{matrix} | Re ({\tilde{M}}_{2}^{(ν)} (z)) | \geq C (| cos (ν x - 2 n π) - 1 | - | cos (ν x) | | cosh (ν q) - 1 | - ν^{2}) . \end{matrix}

Now we use Lemma 4.3 with $Q = 0$ to bound

\begin{matrix} | cos (ν x - 2 n π) - 1 | \geq C {| ν x - 2 n π |}^{2} \geq C ν^{2 m} . \end{matrix}

Also, a routine Lipschitz estimate on the hyperbolic cosine gives

\begin{matrix} | cos (ν x) | | cosh (ν q) - 1 | \leq C ν^{2} \end{matrix}

since $| ν q | \leq 1$ . We thus find

\begin{matrix} | Re ({\tilde{M}}_{2}^{(ν)} (z)) | \geq C (ν^{2 m} - ν^{2}) . \end{matrix}

As we are assuming $0 < m < 1$ from (4.16), this is a positive lower bound.

Finally, we bound the numerator ${\tilde{M}}_{1}^{(ν)} (z)$ crudely as $| {\tilde{M}}_{1}^{(ν)} (z) | \leq C$ for all $z \in C$ with $| Im (z) | = q$ . This follows from the boundedness of $Z \mapsto e^{iZ}$ on strips. We conclude

\begin{matrix} | {\tilde{M}}^{(ν)} (z) | < C \frac{ν}{ν^{2 m} - ν^{2}} \leq C ν^{1 - 2 m} . \end{matrix}

4.21

The exponent $1 - 2 m$ in this bound is positive if we now require

\begin{matrix} 0 < m < \frac{1}{2} . \end{matrix}

4.22

Overall estimates

Suppose $0 < ν < ν_{M}$ , where $ν_{M}$ was specified in (4.18). We conclude from (4.8) that

\begin{matrix} sup_{\begin{matrix} | Im (z) | = q \\ | z | \leq ν^{- p} \end{matrix}} | {\tilde{M}}^{(ν)} (z) - {\tilde{M}}^{(0)} (z) | \leq C (ν^{1 - p} + ν^{2 - 3 p}) \end{matrix}

4.23

and, by combining (4.10), (4.15), (4.19), and (4.21), that

\begin{matrix} sup_{\begin{matrix} | Im (z) | = q \\ | z | > ν^{- p} \end{matrix}} | {\tilde{M}}^{(ν)} (z) - {\tilde{M}}^{(0)} (z) | \leq C ν^{p} + C max {ν^{m}, ν^{m + 2 p - 1} + ν^{2 p}, ν^{1 - 2 m}} . \end{matrix}

4.24

Additionally, we need, per (4.4), (4.20), and (4.22), the exponents p and m to satisfy

\begin{matrix} 0 < p < \frac{2}{3}, 0 < m < \frac{1}{2}, and 0 \leq 1 - 2 p < min \{1, m\} . \end{matrix}

4.25

There are many possible choices of p and m that will satisfy (4.25). Purely for convenience, we elect to take $m = 1 / 3$ and then $p = 1 / 2$ . We combine (4.23) and (4.24) to conclude the estimate (4.1).
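The effect of this choice can be tabulated with exact fractions. The sketch below collects the powers of $ν$ appearing in (4.23) and (4.24) for $p = 1 / 2$ and $m = 1 / 3$, checks the constraints in the non-strict form of (4.20), and reads off the smallest exponent, which governs the overall rate; the variable names are illustrative.

```python
from fractions import Fraction as Fr

p, m = Fr(1, 2), Fr(1, 3)

# Constraints on the exponents, using the non-strict inequality from (4.20).
constraints_hold = (
    0 < p < Fr(2, 3)
    and 0 < m < Fr(1, 2)
    and 0 <= 1 - 2 * p < min(Fr(1), m)
)

# Powers of nu in the 'close to 0' estimate (4.23).
close_exponents = [1 - p, 2 - 3 * p]
# Powers of nu in the 'far from 0' estimate (4.24).
far_exponents = [p, m, m + 2 * p - 1, 2 * p, 1 - 2 * m]

# For 0 < nu < 1 the smallest exponent dominates the sum of all these terms.
rate = min(close_exponents + far_exponents)
print(constraints_hold, close_exponents, far_exponents, rate)
```

With these values every term is bounded by a multiple of $ν^{1 / 3}$, which matches the exponent in (4.1).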

Analysis of the linearization $T$

The operator

\begin{matrix} T ψ = ψ - M^{(0)} [R_{1}^{0} (σ) ψ + (D R_{1}^{0} (σ) ψ) σ] \end{matrix}

5.1

is the linearization of the first equation in our long wave-scaled travelling wave problem (3.42) at $(ψ_{1}, ψ_{2}) = (σ, ζ)$ and $ν = 0$ . We recall that the symbol of the Fourier multiplier $M^{(0)}$ was defined in (3.21) and the operator $R_{1}^{0}$ was defined in (3.39).

We will work with $T$ defined on the following subspace of $H_{q}^{1}$ :

\begin{matrix} H_{q, 0}^{1} : = \{f \in H_{q}^{1} | \int_{0}^{\infty} f (W) d W = 0\} . \end{matrix}

5.2

We norm $H_{q, 0}^{1}$ with the $H_{q}^{1}$ -norm. In Lemma 5.2 below we show that the kernel of $T$ in $H_{q}^{1}$ is spanned by $σ^{'}$ . Restricting $T$ to $H_{q, 0}^{1}$ removes this kernel and guarantees injectivity. We will then prove that $T$ is surjective onto $H_{q}^{1}$ and so conclude the following result.

Proposition 5.1

For $q \in (0, c_{0} / τ_{2})$ , the operator $T : H_{q, 0}^{1} \to H_{q}^{1}$ is invertible with bounded inverse.

The linearization at the limiting localized solution appears as a key operator in numerous FPUT problems, including (Friesecke and Pego 1999; Faver and Wright 2018; Hoffman and Wright 2017), and the invertibility of this operator is a property essential to the development of the right fixed point formula for the given problem. Our treatment of the invertibility of $T$ is rather different from the analogous inversions in those papers, as the problem $T f = g$ is really a linearized Bernoulli equation in disguise, rather than the linearized KdV travelling wave profile equation. In particular, solving $T f = g$ amounts to studying a first-order linear problem, which we can solve explicitly with an integrating factor. In doing so, we avoid the more abstract spectral theory that controls the second-order KdV linearizations (see, e.g., Friesecke and Pego (1999, Lem. 4.2)).

It will be convenient to abbreviate

\begin{matrix} q_{*} : = \frac{c_{0}}{τ_{2}}, \end{matrix}

5.3

and in the following we always assume $0 < q < q_{*}$ .

The proof of Proposition 5.1

We will reformulate the equation $T f = g$ in a very convenient manner, which we summarize in Lemma 5.2 below. This lemma will be the key to deducing the injectivity and surjectivity of $T$ .

First, for $h \in H_{q}^{1}$ , we introduce the operator

\begin{matrix} (A h) (X) : = \int_{X}^{\infty} h (W) d W, \end{matrix}

5.4

so that

\begin{matrix} h = - \partial_{X} [A h] . \end{matrix}

5.5

Then $h \in H_{q, 0}^{1}$ if and only if $h \in H_{q}^{1}$ and $(A h) (0) = 0$ . We will show that the equation $T f = g$ for f, $g \in H_{q}^{1}$ is equivalent to a statement about $A f$ and $A g$ , which we give precisely in Lemma 5.2.

The following steps are quite similar to the derivation of the Bernoulli solution $σ$ in Sect. 3.2.2. From the definition of $T$ in (5.1), we have $T f = g$ if and only if

\begin{matrix} (c_{0} + τ_{2} \partial_{X}) f - R_{1}^{0} (σ) f - (D R_{1}^{0} (σ) f) σ = (c_{0} + τ_{2} \partial_{X}) g . \end{matrix}

5.6

From the definition of $R_{1}^{0}$ in (3.39), we find

\begin{matrix} (D R_{1}^{0} (σ) f) (X) = & (\frac{α κ τ_{1}}{c_{0}^{2}}) \int_{X}^{\infty} (\int_{W}^{\infty} σ (V) d V) f (W) d W \\ & + (\frac{α κ τ_{1}}{c_{0}^{2}}) \int_{X}^{\infty} (\int_{W}^{\infty} f (V) d V) σ (W) d W . \end{matrix}

5.7

Since g, $σ \in H_{q}^{1}$ , and since we seek $f \in H_{q}^{1}$ , we abbreviate

\begin{matrix} F : = A f, G : = A g, and Σ : = A σ . \end{matrix}

5.8

Then (5.6) is equivalent to

\begin{matrix} - τ_{2} F^{''} (X) - c_{0} F^{'} (X) - (\frac{α κ τ_{1}}{c_{0}^{2}}) F^{'} (X) \int_{X}^{\infty} Σ (W) Σ^{'} (W) d W \\ - (\frac{α κ τ_{1}}{c_{0}^{2}}) Σ^{'} (X) \int_{X}^{\infty} (Σ (W) F^{'} (W) + F (W) Σ^{'} (W)) d W \\ = - c_{0} G^{'} (X) - τ_{2} G^{''} (X) . \end{matrix}

5.9

Although it may not be apparent at first glance, every term in this equation is a perfect derivative. First, since $Σ$ and F must vanish at $+ \infty$ , we have

\begin{matrix} \int_{X}^{\infty} Σ (W) Σ^{'} (W) d W = - \frac{Σ {(X)}^{2}}{2} \end{matrix}

and

\begin{matrix} \int_{X}^{\infty} (Σ (W) F^{'} (W) + F (W) Σ^{'} (W)) d W = - Σ (X) F (X) . \end{matrix}

Hence (5.9) really is

\begin{matrix} - τ_{2} F^{''} - c_{0} F^{'} + (\frac{α κ τ_{1}}{c_{0}^{2}}) (\frac{F^{'} Σ^{2}}{2} + Σ^{'} Σ F) = - c_{0} G^{'} - τ_{2} G^{''}, \end{matrix}

5.10

where

\begin{matrix} \frac{F^{'} Σ^{2}}{2} + Σ^{'} Σ F = \frac{1}{2} \partial_{X} [Σ^{2} F] . \end{matrix}

So, we deduce that F and G must satisfy

\begin{matrix} - τ_{2} F^{''} - c_{0} F^{'} + (\frac{α κ τ_{1}}{c_{0}^{2}}) \partial_{X} [\frac{Σ^{2} F}{2}] = - c_{0} G^{'} - τ_{2} G^{''} . \end{matrix}

5.11

Since both F and G must vanish at $+ \infty$ , we may integrate (5.11) to find

\begin{matrix} \underset{L F}{\underset{⏟}{τ_{2} F^{'} + c_{0} F - (\frac{α κ τ_{1}}{c_{0}^{2}}) \frac{Σ^{2} F}{2}}} = c_{0} G + τ_{2} G^{'} . \end{matrix}

5.12

The operator $L$ defined above is the linearization of the Bernoulli equation (3.26) at its solution $Σ$ , and so

\begin{matrix} 0 = L Σ^{'} = L (- σ) . \end{matrix}

5.13

The operator $L$ , or, more precisely, $τ_{2}^{- 1} L$ , is also a first-order linear differential operator, and so we can solve (5.12) with an integrating factor. Namely, let $P$ satisfy

\begin{matrix} P^{'} = \frac{c_{0}}{τ_{2}} - (\frac{α κ τ_{1}}{2 c_{0}^{2} τ_{2}}) Σ^{2} . \end{matrix}

5.14

Then F solves (5.12) if and only if

\begin{matrix} F (X) = F (0) e^{P (0) - P (X)} + e^{- P (X)} \int_{0}^{X} e^{P (W)} (q_{*} G (W) + G^{'} (W)) d W . \end{matrix}

5.15

In particular, any solution H to $L H = 0$ must be a scalar multiple of $e^{- P (\cdot)}$ , and so, by (5.13), $σ$ is also a scalar multiple of $e^{- P (\cdot)}$ . Consequently, we can rewrite (5.15) as

\begin{matrix} F (X) = \frac{F (0)}{σ (0)} σ (X) + σ (X) \int_{0}^{X} \frac{q_{*} G (W) + G^{'} (W)}{σ (W)} d W . \end{matrix}

5.16

Conversely, if F satisfies (5.16), then we may undo all of the work above to see that $f : = - F^{'}$ solves $T f = g$ . Using the identities (5.8), we can recast this result in terms of the original functions f and g.
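The integrating-factor formula (5.16) can be tested numerically against the first-order equation (5.12). In the sketch below the function $G$ is a hypothetical Gaussian test function, the parameters are illustrative, and the integral is approximated by a Simpson rule, so the residual is only expected to vanish up to discretization error.

```python
import math

# Illustrative parameter values (assumptions, not taken from the paper).
ALPHA = KAPPA = TAU1 = TAU2 = C0 = 1.0
THETA = 0.0
Q_STAR = C0 / TAU2
COEFF = ALPHA * KAPPA * TAU1 / C0**2


def Sigma(X):
    e = math.exp(2.0 * C0 * X / TAU2 + THETA)
    return math.sqrt(6.0 * C0**3 / (ALPHA * KAPPA * TAU1 + 6.0 * C0**2 * e))


def sigma(X):
    e = math.exp(2.0 * C0 * X / TAU2 + THETA)
    denom = TAU2 * (ALPHA * KAPPA * TAU1 + 6.0 * C0**2 * e) ** 1.5
    return (6.0 * C0**3) ** 1.5 * e / denom


def G(X):
    """Hypothetical test function standing in for A g."""
    return math.exp(-X * X)


def dG(X):
    return -2.0 * X * math.exp(-X * X)


def simpson(f, a, b, n=2000):
    """Composite Simpson rule; also valid for b < a (the step is then negative)."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def F(X, F0=0.3):
    """Right-hand side of (5.16) with the free value F(0) = F0."""
    integral = simpson(lambda W: (Q_STAR * G(W) + dG(W)) / sigma(W), 0.0, X)
    return F0 / sigma(0.0) * sigma(X) + sigma(X) * integral


def residual(X, h=1e-4):
    """tau2 F' + c0 F - (coeff/2) Sigma^2 F - (c0 G + tau2 G'), cf. (5.12)."""
    dF = (F(X + h) - F(X - h)) / (2.0 * h)
    lhs = TAU2 * dF + C0 * F(X) - COEFF * Sigma(X) ** 2 * F(X) / 2.0
    return lhs - (C0 * G(X) + TAU2 * dG(X))


max_residual = max(abs(residual(X)) for X in (-1.5, -0.5, 0.7, 2.0))
print(max_residual)
```

Changing `F0` adds a multiple of $σ$ to $F$ and should leave the residual unchanged, which reflects the one-dimensional kernel discussed after Lemma 5.2.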

Lemma 5.2

For $g \in H_{q}^{1}$ , define

\begin{matrix} (H g) (X) : = q_{*} (A g) (X) - g (X) \end{matrix}

5.17

and

\begin{matrix} (K g) (X) : = σ (X) \int_{0}^{X} \frac{(H g) (W)}{σ (W)} d W . \end{matrix}

5.18

Then $f \in H_{q}^{1}$ satisfies $T f = g$ if and only if

\begin{matrix} (A f) (X) = \frac{(A f) (0)}{σ (0)} σ (X) + (K g) (X) . \end{matrix}

5.19

In particular, a function $f \in H_{q, 0}^{1}$ satisfies $T f = g$ if and only if

\begin{matrix} (A f) (X) = (K g) (X) . \end{matrix}

5.20

The identity (5.20) allows us to prove the bijectivity of $T : H_{q, 0}^{1} \to H_{q}^{1}$ . The proof of injectivity is very easy. If $T f = 0$ for some $f \in H_{q, 0}^{1}$ , then (5.20) implies $A f = 0$ , and so the identity (5.5) gives $f = 0$ . Observe that if we were working on all of $H_{q}^{1}$ , then (5.19) tells us that $T$ would have a one-dimensional kernel in $H_{q}^{1}$ spanned by $σ^{'}$ . But since $(A σ^{'}) (0) = - σ (0) \neq 0$ , we have $σ^{'} \notin H_{q, 0}^{1}$ .

Toward surjectivity, suppose $T f = g$ for some $f \in H_{q, 0}^{1}$ and $g \in H_{q}^{1}$ . Then (5.5) and (5.20) imply

\begin{matrix} f = - \partial_{X} [K g] = : S g . \end{matrix}

5.21

That is, we expect $T^{- 1} = S$ . Now we make this rigorous.

Lemma 5.3

The operator $S$ , defined in (5.21), is a bounded linear operator from $H_{q}^{1}$ to $H_{q, 0}^{1}$ that satisfies $T S g = g$ for all $g \in H_{q}^{1}$ .

Proof

Let $g \in H_{q}^{1}$ . In part (i) of Lemma 5.4 below we show that $σ^{'} = - ρ σ$ for a certain function $ρ \in L^{\infty}$ . Then the definition of $K$ in (5.18) gives

\begin{matrix} S g = ρ (K g) - H g, \end{matrix}

5.22

and the definition of $H$ in (5.17) shows

\begin{matrix} \partial_{X} [S g] = ρ^{'} (K g) - ρ (S g) + q_{*} g + g^{'} . \end{matrix}

5.23

We claim there is a constant C, independent of g, such that

\begin{matrix} {‖ ρ^{'} (K g) ‖}_{L_{q}^{2}} \leq C {‖ g ‖}_{H_{q}^{1}} \end{matrix}

5.24

and

\begin{matrix} {‖ S g ‖}_{L_{q}^{2}} \leq C {‖ g ‖}_{H_{q}^{1}} \end{matrix}

5.25

Since $ρ \in L^{\infty}$ , the identities (5.22) and (5.23) show that $S$ is a bounded operator on $H_{q}^{1}$ . We prove the estimate (5.24) in Sect. 5.2.1 below and the estimate (5.25) in Sect. 5.2.2.

To show both that $S g \in H_{q, 0}^{1}$ and that taking $f = S g$ satisfies (5.20), we first use the definition of $A$ in (5.4) to compute

\begin{matrix} (A S g) (X) = (K g) (X) - lim_{B \to \infty} (K g) (B) . \end{matrix}

5.26

We claim that

\begin{matrix} lim_{B \to \infty} (K g) (B) = 0 . \end{matrix}

5.27

Indeed, since $S g \in H_{q}^{1}$ , we know that $S g$ vanishes at infinity; so does $H g$ by the definition of $H$ in (5.17). However, $ρ (X) \to q_{*} \neq 0$ as $X \to \infty$ by part (ii) of Lemma 5.4 below. The identity (5.22) then forces the limit (5.27) to be true.

Thus

\begin{matrix} (A S g) (X) = (K g) (X) . \end{matrix}

5.28

In particular,

\begin{matrix} (A S g) (0) = (K g) (0) = 0 \end{matrix}

by the definition of $K$ in (5.18). Consequently, $S g \in H_{q, 0}^{1}$ , and so (5.28) shows that $f = S g$ satisfies (5.20). This implies $T (S g) = g$ . $□$

Auxiliary results for the proof of Lemma 5.3

We first study some properties of $σ$ and its derivative.

Lemma 5.4

(i)
There exists $ρ \in L^{\infty}$ such that $σ^{'} = - ρ σ$ .
(ii)
There exist $ς_{1}^{+}$ , $ς_{2}^{+}$ , $ϱ^{+} \in L^{\infty} (R_{+})$ , $ς_{1}^{-}$ , $ς_{2}^{-}$ , $ϱ^{-} \in L^{\infty} (R_{-})$ , and $C_{1}$ , $C_{2} \in R$ such that
$\begin{matrix} \frac{1}{σ (X)} = \left\{\begin{matrix} C_{1} e^{q_{*} X} + e^{- q_{*} X} ς_{1}^{+} (X), X > 0 \\ C_{2} e^{- 2 q_{*} X} + ς_{1}^{-} (X), X < 0, \end{matrix}\right. \end{matrix}$ 5.29

$\begin{matrix} σ (X) = \left\{\begin{matrix} C_{1}^{- 1} e^{- q_{*} X} + e^{- 3 q_{*} X} ς_{2}^{+} (X), X > 0 \\ C_{2}^{- 1} e^{2 q_{*} X} + e^{4 q_{*} X} ς_{2}^{-} (X), X < 0, \end{matrix}\right. \end{matrix}$ 5.30
and
$\begin{matrix} ρ (X) = \left\{\begin{matrix} q_{*} + e^{- q_{*} X} ϱ^{+} (X), X > 0 \\ - 2 q_{*} + e^{q_{*} X} ϱ^{-} (X), X < 0 . \end{matrix}\right. \end{matrix}$ 5.31
(iii)
There is $C_{3} > 0$ such that
$\begin{matrix} | ρ^{'} (X) | \leq C_{3} e^{- q_{*} | X |} \end{matrix}$ 5.32
for all $X \in R$ .

Proof

(i)
Recall that $σ = - Σ^{'}$ , where $Σ$ satisfies the Bernoulli equation (3.26). That is,
$\begin{matrix} σ = - Σ^{'} = \frac{c_{0}}{τ_{2}} Σ - (\frac{α κ τ_{1}}{6 c_{0}^{2} τ_{2}}) Σ^{3} . \end{matrix}$
Then
$\begin{matrix} σ^{'} = \frac{c_{0}}{τ_{2}} Σ^{'} - (\frac{α κ τ_{1}}{2 c_{0}^{2} τ_{2}}) Σ^{2} Σ^{'} = [(\frac{α κ τ_{1}}{2 c_{0}^{2} τ_{2}}) Σ^{2} - \frac{c_{0}}{τ_{2}}] σ . \end{matrix}$
Put
$\begin{matrix} ρ = \frac{c_{0}}{τ_{2}} - (\frac{α κ τ_{1}}{2 c_{0}^{2} τ_{2}}) Σ^{2} . \end{matrix}$ 5.33
By the definition of $Σ$ in (3.27), we have $ρ \in L^{\infty}$ . Note, incidentally, that $ρ$ coincides with the derivative $P^{'}$ of the integrating-factor exponent $P$ from (5.14).
(ii)
The expansions (5.30) and (5.29) follow directly from the formula for $σ$ in (3.28). The expansion (5.31) follows from the definition of $ρ$ in (5.33) and the definition of $Σ$ in (3.27), which gives
$\begin{matrix} lim_{X \to \infty} Σ {(X)}^{2} = 0 and lim_{X \to - \infty} (\frac{α κ τ_{1}}{2 c_{0}^{2} τ_{2}}) Σ {(X)}^{2} = \frac{3 c_{0}}{τ_{2}} = 3 q_{*} . \end{matrix}$
(iii)
This is a direct consequence of (5.33).

$□$
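Parts (i) and (ii) of Lemma 5.4 lend themselves to a direct numerical check: the relation $σ^{'} = - ρ σ$ can be tested with finite differences, and the limits of $ρ$ at $\pm \infty$ can be compared with $q_{*}$ and $- 2 q_{*}$. The parameter values and helper names below are illustrative assumptions.

```python
import math

# Illustrative parameter values (assumptions, not taken from the paper).
ALPHA, KAPPA, TAU1, TAU2, C0, THETA = 0.7, 1.3, 0.9, 1.5, 0.8, 0.3
Q_STAR = C0 / TAU2


def Sigma(X):
    e = math.exp(2.0 * C0 * X / TAU2 + THETA)
    return math.sqrt(6.0 * C0**3 / (ALPHA * KAPPA * TAU1 + 6.0 * C0**2 * e))


def sigma(X):
    e = math.exp(2.0 * C0 * X / TAU2 + THETA)
    denom = TAU2 * (ALPHA * KAPPA * TAU1 + 6.0 * C0**2 * e) ** 1.5
    return (6.0 * C0**3) ** 1.5 * e / denom


def rho(X):
    """Function rho from (5.33)."""
    return C0 / TAU2 - ALPHA * KAPPA * TAU1 / (2.0 * C0**2 * TAU2) * Sigma(X) ** 2


def dsigma(X, h=1e-5):
    """Central finite-difference approximation of sigma'."""
    return (sigma(X + h) - sigma(X - h)) / (2.0 * h)


points = [-4.0, -1.0, 0.0, 1.0, 4.0]
# sigma' = -rho * sigma, so the sum below should vanish up to discretization error.
max_gap = max(abs(dsigma(X) + rho(X) * sigma(X)) for X in points)

limit_right = rho(30.0)   # expected to be close to q_*
limit_left = rho(-30.0)   # expected to be close to -2 q_*
print(max_gap, limit_right, limit_left, Q_STAR)
```

The sign change of $ρ$ between the two limits corresponds to the single maximum of the pulse $σ$.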

We will also need estimates on $A$ and $H$ .

Lemma 5.5

There is $C > 0$ such that

\begin{matrix} {‖ A g ‖}_{L^{\infty}} + {‖ H g ‖}_{L^{\infty}} \leq C {‖ g ‖}_{H_{q}^{1}} \end{matrix}

5.34

for all $g \in H_{q}^{1}$ .

Proof

We use the definition of $A$ in (5.4) to bound

\begin{matrix} | (A g) (X) | \leq {‖ g ‖}_{L^{1}} \leq C {‖ g ‖}_{H_{q}^{1}} \end{matrix}

by the embedding of $H_{q}^{1}$ into $L^{1}$ , which we discuss in Appendix A.3. The estimate for $H$ then follows from the triangle inequality. $□$

In the following we again recall that $0 < q < q_{*}$ .

The proof of the estimate (5.24)

We first use the definition of $K$ in (5.18) and the estimates (5.32) on $ρ^{'}$ and (5.34) on $H g$ to bound

\begin{matrix} | ρ^{'} (X) (K g) (X) | \leq C {‖ g ‖}_{H_{q}^{1}} e^{- q_{*} | X |} σ (X) | \int_{0}^{X} \frac{d W}{σ (W)} | . \end{matrix}

If $X > 0$ , we use the estimates (5.30) on $σ (X)$ and (5.29) on $1 / σ (W)$ to bound

\begin{matrix} σ (X) \int_{0}^{X} \frac{dW}{σ (W)} \leq C e^{- q_{*} X} \int_{0}^{X} e^{q_{*} W} d W \leq C . \end{matrix}

If $X < 0$ , we use the negative versions of these estimates to bound

\begin{matrix} σ (X) \left| \int_{0}^{X} \frac{dW}{σ (W)} \right| \leq C e^{2 q_{*} X} \int_{X}^{0} e^{- 2 q_{*} W} d W \leq C . \end{matrix}

We conclude

\begin{matrix} | ρ^{'} (X) (K g) (X) | \leq C {‖ g ‖}_{H_{q}^{1}} e^{- q_{*} | X |} \end{matrix}

for all $X$ . Since $0 < q < q_{*}$ , this gives ${‖ ρ^{'} (K g) ‖}_{L_{q}^{2}} \leq C {‖ g ‖}_{H_{q}^{1}}$ .

The proof of the estimate (5.25)

It suffices to find $C > 0$ such that for all $g \in H_{q}^{1}$ , we have

\begin{matrix} {‖ S g ‖}_{L_{q}^{2} (R_{+})} + {‖ S g ‖}_{L_{q}^{2} (R_{-})} \leq C {‖ g ‖}_{H_{q}^{1}} . \end{matrix}

We will rewrite $(S g) (X)$ in different ways for $X > 0$ and $X < 0$ to exploit the different decay rates of $σ$ at $+ \infty$ and $- \infty$ .

First suppose $X > 0$ . The formula (5.22) for $S g$ , the formula (5.18) for $K$ and the expansions in Lemma 5.4 allow us to write

\begin{matrix} (S g) (X) = (C_{1}^{- 1} q_{*} e^{- q_{*} X} + e^{- 2 q_{*} X} ς_{3}^{+} (X)) \\ \int_{0}^{X} (C_{1} e^{q_{*} W} + e^{- q_{*} W} ς_{1}^{+} (W)) (H g) (W) d W - (H g) (X), \end{matrix}

where $ς_{3}^{+} \in L^{\infty} (R_{+})$ . We expand this to give

\begin{matrix} (S g) (X) = \sum_{j = 1}^{4} (S_{j}^{+} g) (X), \end{matrix}

where

\begin{matrix} (S_{1}^{+} g) (X) & : = q_{*} e^{- q_{*} X} \int_{0}^{X} e^{q_{*} W} (H g) (W) d W - (H g) (X) \\ (S_{2}^{+} g) (X) & : = C_{1}^{- 1} q_{*} e^{- q_{*} X} \int_{0}^{X} e^{- q_{*} W} ς_{1}^{+} (W) (H g) (W) d W \\ (S_{3}^{+} g) (X) & : = C_{1} e^{- 2 q_{*} X} ς_{3}^{+} (X) \int_{0}^{X} e^{q_{*} W} (H g) (W) d W \\ (S_{4}^{+} g) (X) & : = e^{- 2 q_{*} X} ς_{3}^{+} (X) \int_{0}^{X} e^{- q_{*} W} ς_{1}^{+} (W) (H g) (W) d W . \end{matrix}

The estimate (5.34) from Lemma 5.5 allows us to bound the last three terms with

\begin{matrix} | (S_{2}^{+} g) (X) | + | (S_{4}^{+} g) (X) | \leq C e^{- q_{*} X} {‖ H g ‖}_{L^{\infty}} \int_{0}^{X} e^{- q_{*} W} d W \leq C e^{- q_{*} X} {‖ g ‖}_{H_{q}^{1}} \end{matrix}

5.35

and

\begin{matrix} | (S_{3}^{+} g) (X) | \leq C e^{- 2 q_{*} X} {‖ H g ‖}_{L^{\infty}} \int_{0}^{X} e^{q_{*} W} d W \leq C e^{- q_{*} X} {‖ g ‖}_{H_{q}^{1}} . \end{matrix}

5.36

To control $S_{1}^{+} g$ , we first integrate by parts:

\begin{matrix} \int_{0}^{X} e^{q_{*} W} (H g) (W) d W = \frac{e^{q_{*} X} (H g) (X) - (H g) (0)}{q_{*}} - \frac{1}{q_{*}} \int_{0}^{X} e^{q_{*} W} {(H g)}^{'} (W) d W . \end{matrix}

The definition of $H$ in (5.17) gives

\begin{matrix} \int_{0}^{X} e^{q_{*} W} {(H g)}^{'} (W) d W \\ = - \int_{0}^{X} e^{q_{*} W} (q_{*} g (W) + g^{'} (W)) d W = - \int_{0}^{X} \partial_{W} [e^{q_{*} W} g (W)] d W \\ = g (0) - e^{q_{*} X} g (X) . \end{matrix}

5.37

It follows that

\begin{matrix} (S_{1}^{+} g) (X) = - e^{- q_{*} X} (H g) (0) - e^{- q_{*} X} g (0) + g (X) . \end{matrix}

5.38

We apply Lemma 5.5 to the factor $(H g) (0)$ and use the Sobolev embedding to estimate g(0), so that

\begin{matrix} | (S_{1}^{+} g) (X) | \leq C e^{- q_{*} X} {‖ g ‖}_{H_{q}^{1}} + | g (X) | . \end{matrix}

Since $0 < q < q_{*}$ , the estimates (5.35) and (5.36) and the identity (5.38) give

\begin{matrix} e^{qX} | (S g) (X) | \leq C e^{(q - q_{*}) X} {‖ g ‖}_{H_{q}^{1}} + e^{qX} | g (X) | \end{matrix}

for $X > 0$ , from which the bound

\begin{matrix} {‖ S g ‖}_{L_{q}^{2} (R_{+})} \leq C {‖ g ‖}_{H_{q}^{1}} \end{matrix}

follows.

Now suppose $X < 0$ . Using the expansions in Lemma 5.4 valid for $X < 0$ , we rewrite

\begin{matrix} (S g) (X) = (2 q_{*} C_{2}^{- 1} e^{2 q_{*} X} + e^{3 q_{*} X} ς_{3}^{-} (X)) \int_{X}^{0} (C_{2} e^{- 2 q_{*} W} + ς_{1}^{-} (W)) (H g) (W) d W \\ - (H g) (X), \end{matrix}

where $ς_{3}^{-} \in L^{\infty} (R_{-})$ . We expand this as

\begin{matrix} (S g) (X) = \sum_{j = 1}^{4} (S_{j}^{-} g) (X), \end{matrix}

where

\begin{matrix} (S_{1}^{-} g) (X) & : = 2 q_{*} e^{2 q_{*} X} \int_{X}^{0} e^{- 2 q_{*} W} (H g) (W) d W - (H g) (X) \\ (S_{2}^{-} g) (X) & : = 2 q_{*} C_{2}^{- 1} e^{2 q_{*} X} \int_{X}^{0} ς_{1}^{-} (W) (H g) (W) d W \\ (S_{3}^{-} g) (X) & : = C_{2} e^{3 q_{*} X} ς_{3}^{-} (X) \int_{X}^{0} e^{- 2 q_{*} W} (H g) (W) d W \\ (S_{4}^{-} g) (X) & : = e^{3 q_{*} X} ς_{3}^{-} (X) \int_{X}^{0} ς_{1}^{-} (W) (H g) (W) d W . \end{matrix}

We crudely estimate the last three terms as

\begin{matrix} | (S_{2}^{-} g) (X) | + | (S_{3}^{-} g) (X) | + | (S_{4}^{-} g) (X) | & \leq C e^{2 q_{*} X} | X | {‖ H g ‖}_{L^{\infty}} \\ & \leq C e^{q_{*} X} {‖ g ‖}_{H_{q}^{1}} . \end{matrix}

5.39

For the first term, we integrate by parts to find

\begin{matrix} \int_{X}^{0} e^{- 2 q_{*} W} (H g) (W) d W = & \frac{e^{- 2 q_{*} X} (H g) (X) - (H g) (0)}{2 q_{*}} \\ & - \frac{1}{2 q_{*}} \int_{X}^{0} e^{- 2 q_{*} W} (q_{*} g (W) + g^{'} (W)) d W . \end{matrix}

The difference compared to (5.37) in our treatment of $S_{1}^{+} g$ is that we no longer have a perfect derivative as the integrand on the right; this is a consequence of the different asymptotic behavior of $ρ$ and $σ$ at $- \infty$ compared to $+ \infty$ , as specified in Lemma 5.4. Thus

\begin{matrix} (S_{1}^{-} g) (X) = - (H g) (0) e^{2 q_{*} X} + I [g] (X), \end{matrix}

5.40

where

\begin{matrix} I [g] (X) : = - e^{2 q_{*} X} \int_{X}^{0} e^{- 2 q_{*} W} (q_{*} g (W) + g^{'} (W)) d W . \end{matrix}

To control this integral term, we will use the following lemma, whose proof we defer to Sect. 5.2.3.

Lemma 5.6

There exists $C > 0$ such that

\begin{matrix} \int_{- \infty}^{0} e^{2 X} {\left| \int_{X}^{0} e^{- W} h (W) d W \right|}^{2} d X \leq C {‖ h ‖}_{L^{2}}^{2} \end{matrix}

5.41

for all $h \in L^{2}$ .

Since $q_{*} g + g^{'} \in L_{q}^{2}$ , we can write

\begin{matrix} q_{*} g (X) + g^{'} (X) = e^{- q | X |} h (X) \end{matrix}

for some $h \in L^{2}$ . Then

\begin{matrix} \int_{- \infty}^{0} e^{- 2 q X} {| I [g] (X) |}^{2} d X = & \int_{- \infty}^{0} e^{2 (2 q_{*} - q) X} {\left| \int_{X}^{0} e^{- (2 q_{*} - q) W} h (W) d W \right|}^{2} d X \\ = & \frac{1}{{(2 q_{*} - q)}^{3}} \int_{- \infty}^{0} e^{2 U} {\left| \int_{U}^{0} e^{- V} h (\frac{V}{2 q_{*} - q}) d V \right|}^{2} d U . \end{matrix}

Applying Lemma 5.6, we obtain

\begin{matrix} {‖ I [g] ‖}_{L_{q}^{2} (R_{-})} \leq C {‖ h (\frac{\cdot}{2 q_{*} - q}) ‖}_{L^{2}} \leq C {‖ e^{- q | \cdot |} (e^{q | \cdot |} h) ‖}_{L^{2}} \leq C {‖ g ‖}_{H_{q}^{1}} . \end{matrix}

All together, we use the estimates (5.39) and the identity (5.40) to bound

\begin{matrix} | (S g) (X) | \leq C e^{q_{*} X} {‖ g ‖}_{H_{q}^{1}} + | I [g] (X) |, \end{matrix}

from which we obtain

\begin{matrix} {‖ S g ‖}_{L_{q}^{2} (R_{-})} \leq C {‖ g ‖}_{H_{q}^{1}} . \end{matrix}

The proof of Lemma 5.6

Put

\begin{matrix} \mathcal{W} : = \{(X, W, Y) \in R^{3} | - \infty < X \leq 0, X \leq W \leq 0, X \leq Y \leq 0\}, \end{matrix}

so that, after using the triangle inequality, the integral in (5.41) is bounded by

\begin{matrix} J : = & \int_{- \infty}^{0} e^{2 X} {(\int_{X}^{0} e^{- W} | h (W) | d W)}^{2} d X \\ = & ∭_{\mathcal{W}} e^{2 X} e^{- W} e^{- Y} | h (W) h (Y) | d Y d W d X . \end{matrix}

Next, put

\begin{matrix} \mathcal{W}_{1} : = \{(X, W, Y) \in R^{3} | - \infty < X \leq W, W \leq Y \leq 0, - \infty < W \leq 0\} \end{matrix}

and

\begin{matrix} \mathcal{W}_{2} : = \{(X, W, Y) \in R^{3} | - \infty < X \leq Y, Y \leq W \leq 0, - \infty < Y \leq 0\}, \end{matrix}

so $\mathcal{W} = \mathcal{W}_{1} \cup \mathcal{W}_{2}$ and $\mathcal{W}_{1} \cap \mathcal{W}_{2}$ has measure zero. Then

\begin{matrix} J = J_{1} + J_{2}, \end{matrix}

where

\begin{matrix} J_{1} : = ∭_{\mathcal{W}_{1}} e^{2 X} e^{- W} e^{- Y} | h (W) h (Y) | d Y d W d X \end{matrix}

and

\begin{matrix} J_{2} : = ∭_{\mathcal{W}_{2}} e^{2 X} e^{- W} e^{- Y} | h (W) h (Y) | d Y d W d X . \end{matrix}

Since the integrands are symmetric in W and Y, it suffices to show

\begin{matrix} J_{1} \leq C \int_{- \infty}^{0} {| h (X) |}^{2} d X . \end{matrix}

Change the order of integration to obtain

\begin{matrix} J_{1} = & \int_{- \infty}^{0} \int_{W}^{0} (\int_{- \infty}^{W} e^{2 X} d X) e^{- W} e^{- Y} | h (W) h (Y) | d Y d W \\ = & \frac{1}{2} \int_{- \infty}^{0} \int_{W}^{0} e^{W} e^{- Y} | h (W) h (Y) | d Y d W . \end{matrix}

Now we estimate

\begin{matrix} 4 | J_{1} | \leq J_{12} + J_{13}, \end{matrix}

5.42

where

\begin{matrix} J_{12} & : = \int_{- \infty}^{0} \int_{W}^{0} e^{W} e^{- Y} {| h (W) |}^{2} d Y d W \quad \text{and} \\ J_{13} & : = \int_{- \infty}^{0} \int_{W}^{0} e^{W} e^{- Y} {| h (Y) |}^{2} d Y d W . \end{matrix}

We first evaluate

\begin{matrix} J_{12} = \int_{- \infty}^{0} (\int_{W}^{0} e^{- Y} d Y) e^{W} {| h (W) |}^{2} d W = \int_{- \infty}^{0} (1 - e^{W}) {| h (W) |}^{2} d W . \end{matrix}

Since $W \leq 0$ we have $| 1 - e^{W} | \leq 2$ , and so

\begin{matrix} J_{12} \leq 2 \int_{- \infty}^{0} {| h (W) |}^{2} d W \leq C {‖ h ‖}_{L^{2}}^{2} . \end{matrix}

5.43

Next, we change the order of integration in $J_{13}$ to find

\begin{matrix} J_{13} = \int_{- \infty}^{0} (\int_{- \infty}^{Y} e^{W} d W) e^{- Y} {| h (Y) |}^{2} d Y = \int_{- \infty}^{0} {| h (Y) |}^{2} d Y \leq {‖ h ‖}_{L^{2}}^{2} . \end{matrix}

5.44

Combining the decomposition (5.42) and the estimates (5.43) and (5.44) gives

\begin{matrix} | J_{1} | \leq C {‖ h ‖}_{L^{2}}^{2}, \end{matrix}

as desired.
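The bound just established, $J \leq C {‖ h ‖}_{L^{2}}^{2}$ , can be sanity-checked numerically. The following is a minimal sketch, not part of the original argument: the function name `lemma_5_6_sides`, the truncation length `L` and the grid size `n` are our own illustrative choices, and the integrals are approximated by the trapezoid rule on $[- L, 0]$ . For the test function $h (W) = e^{W}$ the exact values are $1 / 4$ for the left side of (5.41) and $1 / 2$ for ${‖ h ‖}_{L^{2} (R_{-})}^{2}$ . Tracking constants in the proof with the sharper bound $0 \leq 1 - e^{W} \leq 1$ suggests that $C = 1$ suffices; this is our reading of the proof, not a statement made in the text.

```python
import math

def lemma_5_6_sides(h, L=30.0, n=6000):
    """Trapezoid approximations on [-L, 0] of
    lhs     = int e^{2X} ( int_X^0 e^{-W} h(W) dW )^2 dX   (left side of (5.41))
    norm_sq = int h(W)^2 dW                                (squared L^2 norm)."""
    dx = L / n
    xs = [-L + i * dx for i in range(n + 1)]
    g = [math.exp(-x) * h(x) for x in xs]            # inner integrand e^{-W} h(W)
    inner = [0.0] * (n + 1)                          # inner[i] = int_{xs[i]}^{0} g
    for i in range(n - 1, -1, -1):
        inner[i] = inner[i + 1] + 0.5 * dx * (g[i] + g[i + 1])
    outer = [math.exp(2.0 * x) * inner[i] ** 2 for i, x in enumerate(xs)]
    sq = [h(x) ** 2 for x in xs]

    def trap(values):
        return dx * (sum(values) - 0.5 * (values[0] + values[-1]))

    return trap(outer), trap(sq)

# Test function with known exact values (1/4 and 1/2).
lhs, norm_sq = lemma_5_6_sides(math.exp)

# An oscillating test function (hypothetical choice) for the inequality itself.
lhs_osc, norm_sq_osc = lemma_5_6_sides(lambda w: math.cos(5.0 * w) * math.exp(0.5 * w))
```

In both cases the computed left side stays below the computed squared norm, consistent with the lemma.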

Analysis of the long wave problem

The perturbation Ansatz for the long wave problem (3.42)

Throughout this section we keep $q \in (0, c_{0} / τ_{2})$ fixed. We make the perturbation Ansatz

\begin{matrix} ψ_{1} = σ + η_{1} \quad \text{and} \quad ψ_{2} = ζ + η_{2} \end{matrix}

6.1

for the long wave problem (3.42). Here $η_{1} \in H_{q, 0}^{1}$ , which was defined in (5.2), and $η_{2} \in W^{1, \infty}$ . We abbreviate

\begin{matrix} η = (η_{1}, η_{2}) \in X : = H_{q, 0}^{1} \times W^{1, \infty}, \end{matrix}

where $X$ has the norm

\begin{matrix} {‖ η ‖}_{X} : = ‖ η_{1} ‖_{H_{q}^{1}} + {‖ η_{2} ‖}_{W^{1, \infty}} . \end{matrix}

The Ansatz (6.1) solves the system (3.42) if and only if $η_{1}$ and $η_{2}$ solve

\begin{matrix} \left\{ \begin{matrix} T η_{1} = \sum_{k = 1}^{5} V_{1 k}^{ν} (η), \\ η_{2} = \sum_{k = 1}^{3} V_{2 k}^{ν} (η), \end{matrix} \right. \end{matrix}

6.2

where $T$ was defined in (3.43) and the $V$ -operators are given by

\begin{matrix} \begin{matrix} V_{11}^{ν} (η) & : = (M^{(ν)} - M^{(0)}) [R_{1}^{ν} (σ + η_{1}) (σ + η_{1})], \\ V_{12}^{ν} (η) & : = M^{(0)} [(R_{1}^{ν} (σ + η_{1}) - R_{1}^{0} (σ + η_{1})) (σ + η_{1})], \\ V_{13}^{ν} (η) & : = M^{(0)} [(R_{1}^{0} (σ + η_{1}) - R_{1}^{0} (σ) - D R_{1}^{0} (σ) η_{1}) σ], \\ V_{14}^{ν} (η) & : = M^{(0)} [(R_{1}^{0} (σ + η_{1}) - R_{1}^{0} (σ)) η_{1}], \\ V_{15}^{ν} (η) & : = ν^{1 / 2} M^{(0)} N^{ν} (σ + η_{1}, ζ + η_{2}) \end{matrix} \end{matrix}

6.3

and

\begin{matrix} \begin{matrix} V_{21}^{ν} (η) & : = P_{1}^{ν} (σ + η_{1}) - P_{1}^{0} (σ + η_{1}), \\ V_{22}^{ν} (η) & : = P_{1}^{0} (σ + η_{1}) - ζ, \\ V_{23}^{ν} (η) & : = ν P_{2}^{ν} (σ + η_{1}, ζ + η_{2}) . \end{matrix} \end{matrix}

6.4

We recall that the symbol of $M^{(ν)}$ was defined in (3.40) and the symbol of $M^{(0)}$ in (3.21). The operator $R_{1}^{ν}$ was defined in (3.35), the operator $N^{ν}$ in (3.38), the operator $P_{1}^{ν}$ in (3.33) and the operator $P_{2}^{ν}$ in (3.34).

Due to Proposition 5.1, the first equation in (6.2) is equivalent to

\begin{matrix} η_{1} = T^{- 1} \sum_{k = 1}^{5} V_{1 k}^{ν} (η) = : N_{1}^{ν} (η) . \end{matrix}

6.5

Subsequently, $η_{1}$ and $η_{2}$ solve (6.2) if and only if

\begin{matrix} η_{2} = V_{21}^{ν} (η) + V_{22}^{ν} (N_{1}^{ν} (η)) + V_{23}^{ν} (η) = : N_{2}^{ν} (η) . \end{matrix}

6.6

We have replaced $η_{1}$ with its fixed point expression (6.5) in $V_{22}^{ν}$ for the sake of better estimates later; see Appendix B.2.7 for a more precise discussion. Finally, set

\begin{matrix} N^{ν} (η) : = (N_{1}^{ν} (η), N_{2}^{ν} (η)), \end{matrix}

6.7

so $N^{ν}$ maps $X$ to $X$ . More precisely, this follows from the mapping estimates in Appendix B.3. We conclude that the problem (6.2) is equivalent to the fixed point problem

\begin{matrix} η = N^{ν} (η), \end{matrix}

6.8

which we now solve.

The solution of the fixed point problem (6.8)

For $r > 0$ , we define the ball

\begin{matrix} B (r) : = \{η \in X | {‖ η ‖}_{X} \leq r\} . \end{matrix}

We prove the following estimates in Appendix B; their verifications are routine, but detailed, so we do not present them here.

Proposition 6.1

There exist $C_{⋆}$ , $ν_{⋆} > 0$ such that if $0 < ν < ν_{⋆}$ then the following hold.

(i)
If $η \in B (C_{⋆} ν^{1 / 3})$ , then $N^{ν} (η) \in B (C_{⋆} ν^{1 / 3})$ .
(ii)
If $η$ , $\overset{`}{η} \in B (C_{⋆} ν^{1 / 3})$ , then
$\begin{matrix} ‖ N^{ν} (η) - N^{ν} (\overset{`}{η}) ‖_{X} \leq \frac{1}{2} {‖ η - \overset{`}{η} ‖}_{X} . \end{matrix}$

Proposition 6.1 then guarantees that $N^{ν}$ is a contraction on $B (C_{⋆} ν^{1 / 3})$ for each $0 < ν < ν_{⋆}$ , and so Banach’s fixed point theorem gives the following solution to (6.2).

Proposition 6.2

Let $C_{⋆}$ , $ν_{⋆} > 0$ be as in Proposition 6.1. For each $0 < ν < ν_{⋆}$ , there exists a unique $η^{ν} \in B (C_{⋆} ν^{1 / 3})$ such that $η^{ν} = N^{ν} (η^{ν})$ .
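The mechanism behind Proposition 6.2 can be illustrated with a scalar toy problem. The sketch below is purely illustrative and is not the operator $N^{ν}$ of the paper: the map `toy_map`, which combines a small forcing term with a quadratic nonlinearity, and the helper `iterate_fixed_point` are hypothetical stand-ins. For $ν = 0.01$ the toy map sends the ball of radius $ν^{1 / 3}$ into itself and has Lipschitz constant at most $2 ν^{1 / 3} < 1 / 2$ there, so Picard iteration converges to the unique fixed point in that ball.

```python
def iterate_fixed_point(N, eta0, tol=1e-14, max_iter=200):
    """Picard iteration eta_{k+1} = N(eta_k); returns (fixed point, iterations)."""
    eta = eta0
    for k in range(max_iter):
        nxt = N(eta)
        if abs(nxt - eta) < tol:
            return nxt, k + 1
        eta = nxt
    raise RuntimeError("fixed point iteration did not converge")

nu = 0.01
radius = nu ** (1.0 / 3.0)          # radius of the invariant ball (with C = 1)

def toy_map(eta):
    # Hypothetical toy nonlinearity: small forcing plus a quadratic term.
    return nu ** 0.5 + eta ** 2

eta_star, steps = iterate_fixed_point(toy_map, 0.0)
# The exact fixed point in the ball solves eta = nu^{1/2} + eta^2,
# i.e. eta = (1 - sqrt(1 - 4 nu^{1/2})) / 2.
```

The iteration starts at the centre of the ball; any other starting point in the ball leads to the same limit, which is the uniqueness statement of Banach's theorem in this toy setting.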

The existence of the perturbation terms $η^{ν}$ enables us to conclude our main results, which are paraphrased nontechnically in (1.5).

Theorem 6.3

Let $α$ , $κ$ , $τ_{1}$ , $τ_{2} > 0$ , $q \in (0, c_{0} / τ_{2})$ , and $θ \in R$ . Define the leading-order profile terms

\begin{matrix} ϕ_{A}^{*} (X) : = (\frac{6 \sqrt{6} c_{0}^{9 / 2}}{τ_{2}}) (\frac{\exp (2 c_{0} X / τ_{2} + θ)}{{[α κ τ_{1} + 6 c_{0}^{2} \exp (2 c_{0} X / τ_{2} + θ)]}^{3 / 2}}), \\ ϕ_{P}^{*} (X) : = ({(6 c_{0})}^{1 / 2} α) (\frac{1}{{[α κ τ_{1} + 6 c_{0}^{2} \exp (2 c_{0} X / τ_{2} + θ)]}^{1 / 2}}), \end{matrix}

and

\begin{matrix} ϕ_{R}^{*} (X) : = (3 α κ c_{0}) (\frac{1}{α κ τ_{1} + 6 c_{0}^{2} \exp (2 c_{0} X / τ_{2} + θ)}) . \end{matrix}

There exists $ϵ_{⋆} > 0$ such that for each $0 < ϵ < ϵ_{⋆}$ , there are $ϕ_{A}^{ϵ} \in H_{q}^{1} \cap C^{\infty}$ and $ϕ_{P}^{ϵ}$ , $ϕ_{R}^{ϵ} \in W^{1, \infty} \cap C^{\infty}$ with the following properties.

(i)
Let
$\begin{matrix} A_{j} (t) = ϵ ϕ_{A}^{*} (ϵ^{2 / 5} (j - ϵ^{2 / 5} c_{0} t)) + ϵ^{17 / 15} ϕ_{A}^{ϵ} (ϵ^{2 / 5} (j - ϵ^{2 / 5} c_{0} t)), \\ P_{j} (t) = ϵ^{1 / 5} ϕ_{P}^{*} (ϵ^{2 / 5} (j - ϵ^{2 / 5} c_{0} t)) + ϵ^{1 / 3} ϕ_{P}^{ϵ} (ϵ^{2 / 5} (j - ϵ^{2 / 5} c_{0} t)), \end{matrix}$
and
$\begin{matrix} R_{j} (t) = ϵ^{2 / 5} ϕ_{R}^{*} (ϵ^{2 / 5} (j - ϵ^{2 / 5} c_{0} t)) + ϵ^{3 / 5} ϕ_{R}^{ϵ} (ϵ^{2 / 5} (j - ϵ^{2 / 5} c_{0} t)) . \end{matrix}$
Then the triple $(A_{j}, P_{j}, R_{j})$ solves (1.1).
(ii)
The remainder terms $ϕ_{A}^{ϵ}$ , $ϕ_{P}^{ϵ}$ , and $ϕ_{R}^{ϵ}$ satisfy
$\begin{matrix} sup_{0 < ϵ < ϵ_{⋆}} ‖ ϕ_{A}^{ϵ} ‖_{H_{q}^{1}} + ‖ ϕ_{P}^{ϵ} ‖_{W^{1, \infty}} + {‖ ϕ_{R}^{ϵ} ‖}_{W^{1, \infty}} < \infty . \end{matrix}$
(iii)
The functions $ϕ_{P}^{ϵ}$ and $ϕ_{R}^{ϵ}$ vanish exponentially fast at $+ \infty$ and are asymptotically constant at $- \infty$ in the following sense: there exist $ℓ_{P}^{ϵ}$ , $ℓ_{R}^{ϵ} \in R$ such that
$\begin{matrix} \sup_{0 < ϵ < ϵ_{⋆}} (| ℓ_{P}^{ϵ} | + \sup_{X \geq 0} e^{qX} | ϕ_{P}^{ϵ} (X) | + \sup_{X \leq 0} e^{- q X} | ϕ_{P}^{ϵ} (X) - ℓ_{P}^{ϵ} |) < \infty \end{matrix}$
and
$\begin{matrix} \sup_{0 < ϵ < ϵ_{⋆}} (| ℓ_{R}^{ϵ} | + \sup_{X \geq 0} e^{qX} | ϕ_{R}^{ϵ} (X) | + \sup_{X \leq 0} e^{- q X} | ϕ_{R}^{ϵ} (X) - ℓ_{R}^{ϵ} |) < \infty . \end{matrix}$

Proof

Write the solution of (6.8), which exists due to Proposition 6.2, as $η^{ν} = (η_{1}^{ν}, η_{2}^{ν})$ . Define

\begin{matrix} ψ_{1}^{ν} : = σ + η_{1}^{ν} \quad \text{and} \quad ψ_{2}^{ν} : = ζ + η_{2}^{ν} . \end{matrix}

6.9

By the discussion at the start of Sect. 6.1, the pair $(ψ_{1}^{ν}, ψ_{2}^{ν})$ then solves the system (3.42).

Now take

\begin{matrix} A_{j} (t) = ν^{5 / 2} ψ_{1}^{ν} (ν (j - ν c_{0} t)) \quad \text{and} \quad P_{j} (t) = ν^{1 / 2} ψ_{2}^{ν} (ν (j - ν c_{0} t)) . \end{matrix}

This is the scaled travelling wave ansatz from (3.41), and so Proposition 3.5 guarantees that $A_{j}$ and $P_{j}$ thus defined solve the simplified system (2.14). Let $R_{j}$ be given by (2.12). Then the discussion in Sect. 2.1 shows that $A_{j}$ , $P_{j}$ and $R_{j}$ together solve our original problem (1.1).

Next, we use the identity $ϵ = ν^{5 / 2}$ from (3.30) to reintroduce the original long wave parameter $ϵ$ into the solutions. The expansions in (i) and the estimates in (ii) above then follow from (6.9) and the estimates in Proposition 6.2. The exact formulas for the leading order terms follow from the definitions of $σ$ in (3.28) and $ζ$ in (3.29). The asymptotics of part (iii) follow from the definition of $ζ$ , the fixed point property $η_{2}^{ν} = N_{2}^{ν} (η^{ν})$ and the definition of $N_{2}^{ν}$ in (6.6), and the definition of $R_{j}$ from (2.12).

Finally, to obtain the smoothness of the solutions, we first note that $σ$ , $ζ \in C^{\infty}$ . Next, crude estimates on the symbol of the Fourier multiplier $M^{(ν)}$ , which we omit, allow us to invoke Lemma A.2 to conclude $M^{(ν)} \in B (H_{q}^{r}, H_{q}^{r + 1})$ for each $ν \geq 0$ and $r \geq 0$ ; we make no claim about uniform estimates in $ν$ here. This smoothing property of $M^{(ν)}$ , as well as the smoothing properties of the integral operators that compose $N_{2}^{ν}$ , show that if $η \in C^{r} \times C^{r}$ , then $N^{ν} (η) \in C^{r + 1} \times C^{r + 1}$ . By bootstrapping, we obtain $η^{ν} \in C^{\infty} \times C^{\infty}$ . $□$
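The powers of $ϵ$ attached to $A_{j}$ and $P_{j}$ in part (i) of Theorem 6.3 follow from the identity $ϵ = ν^{5 / 2}$ by exponent bookkeeping. The short sketch below carries this out with exact rational arithmetic. The helper name `eps_exponent` is ours, and the only inputs are the amplitudes $ν^{5 / 2}$ and $ν^{1 / 2}$ from the scaled ansatz in the proof together with the radius $ν^{1 / 3}$ of the ball in Proposition 6.2; the exponents for $R_{j}$ come from (2.12) and are not reproduced here.

```python
from fractions import Fraction

# epsilon = nu^(5/2)  is equivalent to  nu = epsilon^(2/5)
NU_AS_POWER_OF_EPS = Fraction(2, 5)

def eps_exponent(nu_exponent):
    """Rewrite nu^a as epsilon^b and return b."""
    return Fraction(nu_exponent) * NU_AS_POWER_OF_EPS

ball = Fraction(1, 3)                        # size of eta^nu: O(nu^(1/3))

A_lead = eps_exponent(Fraction(5, 2))        # amplitude of the auxin pulse
A_rem = eps_exponent(Fraction(5, 2) + ball)  # its remainder term
P_lead = eps_exponent(Fraction(1, 2))        # amplitude of the PIN1 profile
P_rem = eps_exponent(Fraction(1, 2) + ball)  # its remainder term
space_scale = eps_exponent(1)                # X = eps^(2/5) (j - eps^(2/5) c_0 t)

print(A_lead, A_rem, P_lead, P_rem, space_scale)
```

The printed exponents agree with those in the formulas for $A_{j}$ and $P_{j}$ in the theorem.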

In order to achieve the normalization $‖ ϕ_{A}^{*} ‖_{L^{\infty}} = 1$ , as discussed in Sect. 1.4, we need to use the explicit choice

\begin{matrix} c_{0} = {(\frac{9 α κ τ_{1} τ_{2}^{2}}{8})}^{1 / 5} = : c_{*}, \end{matrix}

6.10

as employed in (1.5). From (6.10), we obtain

\begin{matrix} ‖ ϕ_{P}^{*} ‖_{L^{\infty}} = {(\frac{6 α}{κ τ_{1}})}^{1 / 2} {(\frac{9 α κ τ_{1} τ_{2}^{2}}{8})}^{1 / 10} \quad \text{and} \quad {‖ ϕ_{R}^{*} ‖}_{L^{\infty}} = \frac{3}{τ_{1}} {(\frac{9 α κ τ_{1} τ_{2}^{2}}{8})}^{1 / 5} . \end{matrix}

6.11

Substituting the abbreviations (2.2) and (2.3) into the quantities in (6.10) and (6.11) then leads to the identities (1.8). This completes the rigorous derivation of the main results that we discussed more informally in Sect. 1.4.
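The normalization (6.10) and the sup-norms (6.11) can be checked numerically from the explicit leading-order profiles of Theorem 6.3. The sketch below is a minimal illustration under stated assumptions: the parameter values are hypothetical, the function names are ours, and we assume that all three profiles share the exponential $\exp (2 c_{0} X / τ_{2} + θ)$ appearing in $ϕ_{A}^{*}$ . Suprema are approximated by maxima over a uniform grid.

```python
import math

def c_star(alpha, kappa, tau1, tau2):
    """Wave speed coefficient from (6.10)."""
    return (9.0 * alpha * kappa * tau1 * tau2 ** 2 / 8.0) ** 0.2

def leading_profiles(X, alpha, kappa, tau1, tau2, c0, theta=0.0):
    """Leading-order profiles (phi_A^*, phi_P^*, phi_R^*) evaluated at X."""
    E = math.exp(2.0 * c0 * X / tau2 + theta)
    D = alpha * kappa * tau1 + 6.0 * c0 ** 2 * E      # shared denominator
    phi_A = (6.0 * math.sqrt(6.0) * c0 ** 4.5 / tau2) * E / D ** 1.5
    phi_P = math.sqrt(6.0 * c0) * alpha / math.sqrt(D)
    phi_R = 3.0 * alpha * kappa * c0 / D
    return phi_A, phi_P, phi_R

# Hypothetical parameter values, chosen only for illustration.
params = dict(alpha=2.0, kappa=0.5, tau1=1.5, tau2=0.8)
c0 = c_star(**params)

grid = [-20.0 + 0.001 * i for i in range(40001)]      # X in [-20, 20]
values = [leading_profiles(X, c0=c0, **params) for X in grid]
sup_A = max(v[0] for v in values)   # expected to be close to 1
sup_P = max(v[1] for v in values)   # compare with the first formula in (6.11)
sup_R = max(v[2] for v in values)   # compare with the second formula in (6.11)
```

The pulse $ϕ_{A}^{*}$ attains its maximum at an interior point, while $ϕ_{P}^{*}$ and $ϕ_{R}^{*}$ are monotone and approach their suprema as $X \to - \infty$ .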

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (mp4, 4441 KB)

Acknowledgements

We acknowledge support from the Netherlands Organization for Scientific Research (NWO) (Grants 639.032.612, HJH, TEF; 865.17.004, RM).

Appendix A. Fourier analysis

A.1. The Fourier transform

We use the following conventions for Fourier transforms. If $f \in L^{1}$ , then its Fourier transform is

\begin{matrix} F [f] (k) = \hat{f} (k) : = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} f (x) e^{- i k x} d x, \end{matrix}

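and its inverse Fourier transform is

\begin{matrix} F^{- 1} [f] (x) : = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} f (k) e^{i k x} d k . \end{matrix}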

A.2. Fourier multipliers on Sobolev spaces

For integers $r \geq 0$ , we denote by $H^{r} = H^{r} (R)$ the usual Sobolev space of all r-times weakly differentiable functions whose weak derivatives are square-integrable.

Our fundamental operator on Sobolev spaces is the Fourier multiplier. The following result is standard; see, e.g., Faver (2018, Lem. D.2.1).

Lemma A.1

Let $\tilde{M} : R \to C$ be measurable and suppose

\begin{matrix} N_{\tilde{M}} (r, s) : = sup_{k \in R} \frac{| \tilde{M} (k) |}{{(1 + k^{2})}^{(r - s) / 2}} < \infty . \end{matrix}

Then the Fourier multiplier $M$ with symbol $\tilde{M}$ defined by

\begin{matrix} M f : = F^{- 1} [\tilde{M} \hat{f}], \end{matrix}

A.1

i.e., by $\hat{M f} (k) = \tilde{M} (k) \hat{f} (k)$ , is a bounded operator from $H^{r}$ to $H^{s}$ , and

\begin{matrix} {‖ M ‖}_{B (H^{r}, H^{s})} = N_{\tilde{M}} (r, s) . \end{matrix}

A.2

We also need a convenient expression for ‘scaled’ Fourier multipliers. If f is a function on $R$ and $ν \in R \setminus \{0\}$ , let $f (ν \cdot)$ be the ‘scaled’ map $x \mapsto f (ν x)$ . Now let $M$ be the Fourier multiplier with symbol $\tilde{M}$ and define ${\tilde{M}}^{(ν)} (k) : = \tilde{M} (ν k)$ . Let $M^{(ν)}$ be the Fourier multiplier with symbol ${\tilde{M}}^{(ν)}$ . Then standard scaling properties of the Fourier transform imply that

\begin{matrix} M [f (ν \cdot)] (x) = (M^{(ν)} f) (ν x) . \end{matrix}

A.3

A.3. Fourier multipliers on weighted Sobolev spaces

We frequently work with weighted Sobolev spaces. For $q \in R$ , let

\begin{matrix} H_{q}^{r} : = \{f \in H^{r} | e^{q | \cdot |} f \in H^{r}\} . \end{matrix}

We norm this space by

\begin{matrix} {‖ f ‖}_{H_{q}^{r}} : = {‖ e^{q | \cdot |} f ‖}_{H^{r}}, \end{matrix}

and, see Faver (2018, App. C), this norm is equivalent to

\begin{matrix} f \mapsto \sum_{j = 0}^{r} {‖ e^{q | \cdot |} \partial_{x}^{j} [f] ‖}_{L^{2}} . \end{matrix}

We put $L_{q}^{2} : = H_{q}^{0}$ . The Cauchy–Schwarz inequality guarantees that $L_{q}^{2}$ embeds into $L^{1}$ :

\begin{matrix} {‖ f ‖}_{L^{1}} = {‖ e^{- q | \cdot |} (e^{q | \cdot |} f) ‖}_{L^{1}} \leq {‖ e^{- q | \cdot |} ‖}_{L^{2}} {‖ e^{q | \cdot |} f ‖}_{L^{2}} \leq C_{q} {‖ f ‖}_{L_{q}^{2}} . \end{matrix}
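For $q > 0$ the constant in this embedding can be taken to be $C_{q} = {‖ e^{- q | \cdot |} ‖}_{L^{2}} = q^{- 1 / 2}$ , since $\int_{- \infty}^{\infty} e^{- 2 q | x |} d x = 1 / q$ . The following minimal sketch checks the inequality numerically for a Gaussian; the choice of test function, the value $q = 1 / 2$ , the truncation to $[- 12, 12]$ and the helper `trapezoid` are our own illustrative assumptions.

```python
import math

def trapezoid(fn, a, b, n=40000):
    """Composite trapezoid rule for fn on [a, b] with n subintervals."""
    dx = (b - a) / n
    total = 0.5 * (fn(a) + fn(b)) + sum(fn(a + i * dx) for i in range(1, n))
    return dx * total

q = 0.5

def f(x):
    # Hypothetical test function: a Gaussian, which lies in L^2_q for every q.
    return math.exp(-x * x)

l1_norm = trapezoid(lambda x: abs(f(x)), -12.0, 12.0)
weighted_norm = math.sqrt(
    trapezoid(lambda x: math.exp(2.0 * q * abs(x)) * f(x) ** 2, -12.0, 12.0)
)
C_q = 1.0 / math.sqrt(q)            # L^2 norm of exp(-q |x|)

print(l1_norm, C_q * weighted_norm)
```

The first printed number is the $L^{1}$ -norm of the test function and the second is the upper bound $C_{q} {‖ f ‖}_{L_{q}^{2}}$ .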

Finally, if $I \subseteq R$ is an interval, we sometimes denote by $L_{q}^{2} (I)$ the set of all measurable functions $f : I \to C$ such that

\begin{matrix} \int_{I} e^{2 q | X |} {| f (X) |}^{2} d X < \infty . \end{matrix}

Since $H_{q}^{r} \subseteq H^{r}$ , any Fourier multiplier defined on $H^{r}$ is defined on $H_{q}^{r}$ . A variation on a result of Beale (1980, Lem. 5.1) gives sufficient conditions for a Fourier multiplier on $H_{q}^{r}$ to map into another weighted space $H_{q}^{s}$ .

Lemma A.2

(Beale) Fix $q > 0$ and let

\begin{matrix} {\bar{U}}_{q} : = \{z \in C | | Im (z) | \leq q\} . \end{matrix}

Let $\tilde{M}$ be analytic on ${\bar{U}}_{q}$ . Suppose there exist $s \geq 0$ and C, $r_{0} > 0$ such that if $z \in {\bar{U}}_{q}$ and $r_{0} < | z |$ , then

\begin{matrix} | \tilde{M} (z) | \leq \frac{C}{{| Re (z) |}^{(s - r)}} . \end{matrix}

Then the Fourier multiplier $M$ with symbol $\tilde{M}$ , defined by (A.1), is a bounded operator from $H_{q}^{r}$ to $H_{q}^{s}$ and

\begin{matrix} {‖ M ‖}_{B (H_{q}^{r}, H_{q}^{s})} \leq sup_{k \in R} | {(1 + k^{2})}^{(s - r) / 2} \tilde{M} (k \pm i q) | . \end{matrix}

Appendix B. The Proof of Proposition 6.1

Our proof depends on the following lemma, which we prove in the subsequent parts of this appendix.

Lemma B.1

Let $ν_{M} > 0$ be as in Proposition 4.1. There exist $C_{N}$ , $ρ_{N} > 0$ such that if $0 < ν < ν_{M}$ and $‖ η_{1} ‖_{H_{q}^{1}}$ , $‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}}$ , $‖ η_{2} ‖_{W^{1, \infty}}$ , $‖ {\overset{`}{η}}_{2} ‖_{W^{1, \infty}} \leq ρ_{N}$ , then the following hold.

(i)
${‖ N^{ν} (η) ‖}_{X} \leq C_{N} (ν^{1 / 3} + {‖ η ‖}_{X}^{2})$ .
(ii)
$‖ N^{ν} (η) - N^{ν} (\overset{`}{η}) ‖_{X} \leq C_{N} (ν^{1 / 3} + {‖ η ‖}_{X} + ‖ \overset{`}{η} ‖_{X}) {‖ η - \overset{`}{η} ‖}_{X}$ .

Assuming this lemma to be true, we define

\begin{matrix} C_{⋆} : = C_{N} \quad \text{and} \quad ν_{⋆} : = \frac{1}{2} \min \{ν_{M}, 1, \frac{1}{{(1 + C_{⋆}^{2})}^{6}}, \frac{1}{64 C_{⋆}^{6} {(1 + 2 C_{⋆})}^{6}}, ρ_{N}\} . \end{matrix}

Take $0 < ν < ν_{⋆}$ and $η$ , $\overset{`}{η} \in B (C_{⋆} ν^{1 / 3})$ . Then by part (i) of Lemma B.1 we have

\begin{matrix} ‖ N^{ν} {(η) ‖}_{X} \leq C_{N} (ν^{1 / 3} + {‖ η ‖}_{X}^{2}) \leq C_{⋆} [C_{⋆} (1 + C_{⋆}^{2}) ν^{1 / 6}] ν^{1 / 3} \leq C_{⋆} ν^{1 / 3} . \end{matrix}

This proves part (i) of Proposition 6.1. Next, part (ii) of that lemma gives

\begin{matrix} ‖ N^{ν} (η) - N^{ν} (\overset{`}{η}) ‖_{X} \leq C_{N} (ν^{1 / 3} + 2 C_{N} ν^{1 / 6}) {‖ η - \overset{`}{η} ‖}_{X} \\ \leq C_{⋆} (1 + 2 C_{⋆}) ν^{1 / 6} ‖ η - \overset{`}{η} ‖_{X} \leq \frac{1}{2} {‖ η - \overset{`}{η} ‖}_{X} . \end{matrix}

This proves part (ii) of Proposition 6.1.

In the remainder of this appendix, we first give some essential auxiliary estimates in Appendix B.1. Then we prove the Lipschitz estimates of part (ii) of Proposition 6.1 in Appendix B.2. Finally, using in part these Lipschitz estimates, we prove in Appendix B.3 the mapping estimates from part (i) of the proposition.

B.1. Auxiliary estimates

Throughout this appendix we will frequently obtain estimates in terms of the $L^{1}$ - or $L^{\infty}$ -norms of a function $f \in H_{q}^{1}$ . Afterwards we can use the embedding of $L_{q}^{2}$ into $L^{1}$ and the corresponding inequalities

\begin{matrix} {‖ f ‖}_{L^{1}} \leq C {‖ f ‖}_{L_{q}^{2}} \leq C {‖ f ‖}_{H_{q}^{1}} \end{matrix}

for $f \in H_{q}^{1}$ , as well as the Sobolev embedding, to turn these $L^{1}$ - and $L^{\infty}$ -estimates into $H_{q}^{1}$ estimates. For brevity, we will omit those details.

We will also use the operator $A$ defined in (5.4); we recall

\begin{matrix} (A f) (X) = \int_{X}^{\infty} f (W) d W \end{matrix}

for $f \in H_{q}^{1}$ , so that

\begin{matrix} f = - \partial_{X} [A f] \quad \text{and} \quad {‖ A f ‖}_{L^{\infty}} \leq C {‖ f ‖}_{L^{1}} . \end{matrix}

Last, we will use the shift operator $S^{ν}$ , which satisfies

\begin{matrix} (S^{ν} f) (X) = f (X + ν) \end{matrix}

for any function f defined on $R$ and any $ν \in R$ .

The operator $A$ and the definitions of our fundamental operators $P_{1}^{ν}$ in (3.33) and $R_{1}^{ν}$ in (3.35) allow us to rewrite

\begin{matrix} P_{1}^{ν} (f) (X) = \frac{α}{c_{0}} A [E (ν^{1 / 2} S^{ν} f) (\cdot, X) f] (X) \end{matrix}

and

\begin{matrix} R_{1}^{ν} (f) (X) = \frac{α}{c_{0}^{2} τ_{1}} \int_{X}^{\infty} A [E (ν^{1 / 2} S^{ν} f) (\cdot, V) f] (V) (S^{ν} f) (V) d V . \end{matrix}

Now we begin our estimates on $E$ , $P_{1}^{ν}$ , and $R_{1}^{ν}$ in earnest.

Lemma B.2

There exists $C > 0$ such that if ${‖ f ‖}_{L^{1}}$ , $‖ \overset{`}{f} ‖_{L^{1}} \leq 1$ , then the following hold.

(i)
$| E (f) (V, X) - E (\overset{`}{f}) (V, X) | \leq C ‖ f - \overset{`}{f} ‖_{L^{1}}$ for all V, $X \in R$ .
(ii)
$| E (f) (V, X) | \leq C$ for all V, $X \in R$ .

Proof

(i)
Since
$\begin{matrix} \left| \frac{κ}{c_{0}} \int_{V}^{X} f (U) d U \right| \leq \frac{κ}{c_{0}} {‖ f ‖}_{L^{1}} \end{matrix}$
for all V, $X \in R$ , a local Lipschitz estimate on the exponential yields $C > 0$ such that if ${‖ f ‖}_{L^{1}}$ , $‖ \overset{`}{f} ‖_{L^{1}} \leq 1$ , then
$\begin{matrix} | E (f) (V, X) - E (\overset{`}{f}) (V, X) | \leq C |\int_{V}^{X} f (U) d U - \int_{V}^{X} \overset{`}{f} (U) d U| \leq C {‖ f - \overset{`}{f} ‖}_{L^{1}} . \end{matrix}$
(ii)
Since $E (0) = 1$ , this follows from part (i) by taking $\overset{`}{f} = 0$ and using ${‖ f ‖}_{L^{1}} \leq 1$ .

$□$

The following lemma guarantees that $P_{1}^{ν}$ maps $H_{q}^{1}$ to $W^{1, \infty}$ , among other results.

Lemma B.3

There exists $C > 0$ such that if $0 \leq ν < 1$ and f, $\overset{`}{f} \in H_{q}^{1}$ with ${‖ f ‖}_{H_{q}^{1}}$ , $‖ \overset{`}{f} ‖_{H_{q}^{1}} \leq 1$ , then the following hold.

(i)
$‖ P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} \leq C (ν^{1 / 2} {‖ f ‖}_{H_{q}^{1}} + 1) {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}}$ .
(ii)
$‖ P_{1}^{ν} {(f) ‖}_{L^{\infty}} \leq C {‖ f ‖}_{H_{q}^{1}}$ .
(iii)
$‖ P_{1}^{ν} (f) - P_{1}^{0} {(f) ‖}_{L^{\infty}} \leq C ν^{1 / 2} {‖ f ‖}_{H_{q}^{1}}^{2}$ .
(iv)
$‖ \partial_{X} [P_{1}^{ν} (f) - P_{1}^{0} (f)] ‖_{L^{\infty}} \leq C ν^{1 / 2} {‖ f ‖}_{H_{q}^{1}}^{2}$ .
(v)
$‖ (P_{1}^{ν} (f) - P_{1}^{0} (f)) - (P_{1}^{ν} (\overset{`}{f}) - P_{1}^{0} (\overset{`}{f})) ‖_{L^{\infty}} \leq C ν^{1 / 2} ({‖ f ‖}_{H_{q}^{1}} + ‖ \overset{`}{f} ‖_{H_{q}^{1}}) {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}}$ .
(vi)
$‖ \partial_{X} [P_{1}^{ν} (f) - P_{1}^{0} (f)] - \partial_{X} [P_{1}^{ν} (\overset{`}{f}) - P_{1}^{0} (\overset{`}{f})] ‖_{L^{\infty}} \leq C ν^{1 / 2} ({‖ f ‖}_{H_{q}^{1}} + ‖ \overset{`}{f} ‖_{H_{q}^{1}}) {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}}$ .

Proof

As we mentioned earlier, in most cases we will conclude bounds in terms of $L^{\infty}$ - or $L^{1}$ -norms, which then immediately yield the $H_{q}^{1}$ -bounds stated above.

(i)
We have
$\begin{matrix} P_{1}^{ν} (f) (X) - P_{1}^{ν} (\overset{`}{f}) (X) = I_{1}^{ν} (f, \overset{`}{f}) (X) + I_{2}^{ν} (f, \overset{`}{f}) (X), \end{matrix}$
where
$\begin{matrix} I_{1}^{ν} (f, \overset{`}{f}) (X) : = \frac{α}{c_{0}} \int_{X}^{\infty} (E (ν^{1 / 2} S^{ν} f) (V, X) - E (ν^{1 / 2} S^{ν} \overset{`}{f}) (V, X)) f (V) d V \end{matrix}$
and
$\begin{matrix} I_{2}^{ν} (f, \overset{`}{f}) (X) : = \frac{α}{c_{0}} \int_{X}^{\infty} E (ν^{1 / 2} S^{ν} \overset{`}{f}) (V, X) (f (V) - \overset{`}{f} (V)) d V . \end{matrix}$

We use part (i) of Lemma B.2 to bound
$\begin{matrix} | E (ν^{1 / 2} S^{ν} f) (V, X) - E (ν^{1 / 2} S^{ν} \overset{`}{f}) (V, X) | \leq C ν^{1 / 2} {‖ S^{ν} f - S^{ν} \overset{`}{f} ‖}_{L^{1}} \\ = C ν^{1 / 2} {‖ f - \overset{`}{f} ‖}_{L^{1}} \end{matrix}$ B.1
for all V, $X \in R$ . Thus
$\begin{matrix} | I_{1}^{ν} (f, \overset{`}{f}) (X) | \leq C ν^{1 / 2} ‖ f - \overset{`}{f} ‖_{L^{1}} \int_{X}^{\infty} | f (V) | d V \leq C ν^{1 / 2} ‖ f - \overset{`}{f} ‖_{L^{1}} {‖ f ‖}_{L^{1}} \end{matrix}$
for all $X \in R$ .

Next, we use part (ii) of Lemma B.2 to bound
$\begin{matrix} | I_{2}^{ν} (f, \overset{`}{f}) (X) | \leq C \int_{X}^{\infty} | f (V) - \overset{`}{f} (V) | d V \leq C ‖ f - \overset{`}{f} ‖_{L^{1}} . \end{matrix}$
(ii)
Since $P_{1}^{ν} (0) = 0$ , this follows from part (i) by taking $\overset{`}{f} = 0$ .
(iii)
We have
$\begin{matrix} P_{1}^{ν} (f) (X) - P_{1}^{0} (f) (X) = \frac{α}{c_{0}} \int_{X}^{\infty} (E (ν^{1 / 2} S^{ν} f) (V, X) - 1) f (V) d V . \end{matrix}$ B.2
Since
$\begin{matrix} E (ν^{1 / 2} S^{ν} f) (V, X) - 1 = E (ν^{1 / 2} S^{ν} f) (V, X) - E (0) (V, X), \end{matrix}$
we may use part (i) of Lemma B.2 to bound
$\begin{matrix} | E (ν^{1 / 2} S^{ν} f) (V, X) - 1 | \leq C ν^{1 / 2} ‖ S^{ν} {f ‖}_{L^{1}} = C ν^{1 / 2} {‖ f ‖}_{L^{1}} . \end{matrix}$ B.3
Thus
$\begin{matrix} | P_{1}^{ν} (f) (X) - P_{1}^{0} (f) (X) | \leq C ν^{1 / 2} {‖ f ‖}_{L^{1}} \int_{X}^{\infty} | f (V) | d V \leq C ν^{1 / 2} {‖ f ‖}_{L^{1}}^{2} . \end{matrix}$
(iv)
We first differentiate under the integral and use the condition $E (g) (X, X) = 1$ , apparent from the definition of $E$ in (3.3) and valid for all integrable g and $X \in R$ , to calculate
$\begin{matrix} \partial_{X} [P_{1}^{ν} (f)] (X) = & - \frac{α}{c_{0}} f (X) + {(\frac{α}{c_{0}})}^{2} ν^{1 / 2} \\ \int_{X}^{\infty} E (ν^{1 / 2} S^{ν} f) (V, X) f (V + ν) f (V) d V . \end{matrix}$
Then
$\begin{matrix} \partial_{X} [P_{1}^{ν} (f) - P_{1}^{0} (f)] (X) = & {(\frac{α}{c_{0}})}^{2} ν^{1 / 2} \\ \int_{X}^{\infty} E (ν^{1 / 2} S^{ν} f) (V, X) f (V + ν) f (V) d V . \end{matrix}$ B.4
Part (ii) of Lemma B.2 then guarantees
$\begin{matrix} ‖ \partial_{X} [P_{1}^{ν} (f) - P_{1}^{0} (f)] ‖_{L^{\infty}} \leq C ν^{1 / 2} {‖ f ‖}_{L^{\infty}} {‖ f ‖}_{L^{1}} . \end{matrix}$
(v)
We use (B.2) to write
$\begin{matrix} (P_{1}^{ν} (f) - P_{1}^{0} (f)) - (P_{1}^{ν} (\overset{`}{f}) - P_{1}^{0} (\overset{`}{f})) = I_{3}^{ν} (f, \overset{`}{f}) + I_{4}^{ν} (f, \overset{`}{f}), \end{matrix}$
where
$\begin{matrix} I_{3}^{ν} (f, \overset{`}{f}) (X) : = \frac{α}{c_{0}} \int_{X}^{\infty} (E (ν^{1 / 2} S^{ν} f) (V, X) - E (ν^{1 / 2} S^{ν} \overset{`}{f}) (V, X)) f (V) d V \end{matrix}$
and
$\begin{matrix} I_{4}^{ν} (f, \overset{`}{f}) (X) : = \frac{α}{c_{0}} \int_{X}^{\infty} (E (ν^{1 / 2} S^{ν} \overset{`}{f}) (V, X) - 1) (f (V) - \overset{`}{f} (V)) d V . \end{matrix}$
We use (B.1) to estimate
$\begin{matrix} | I_{3}^{ν} (f, \overset{`}{f}) (X) | \leq C ν^{1 / 2} ‖ f - \overset{`}{f} ‖_{L^{1}} \int_{X}^{\infty} | f (V) | d V \leq C ν^{1 / 2} {‖ f ‖}_{L^{1}} {‖ f - \overset{`}{f} ‖}_{L^{1}} . \end{matrix}$
We use (B.3) to estimate
$\begin{matrix} | I_{4}^{ν} (f, \overset{`}{f}) (X) | & \leq C ν^{1 / 2} ‖ \overset{`}{f} ‖_{L^{1}} \int_{X}^{\infty} | f (V) - \overset{`}{f} (V) | d V \\ & \leq C ν^{1 / 2} ‖ \overset{`}{f} ‖_{L^{1}} {‖ f - \overset{`}{f} ‖}_{L^{1}} . \end{matrix}$
(vi)
Using (B.4), we have
$\begin{matrix} \partial_{X} [P_{1}^{ν} (f) - P_{1}^{0} (f)] (X) - \partial_{X} [P_{1}^{ν} (\overset{`}{f}) - P_{1}^{0} (\overset{`}{f})] (X) \\ = {(\frac{α}{c_{0}})}^{2} ν^{1 / 2} \int_{X}^{\infty} [E (ν^{1 / 2} S^{ν} f) (V, X) f (V + ν) f (V) \\ - E (ν^{1 / 2} S^{ν} \overset{`}{f}) (V, X) \overset{`}{f} (V + ν) \overset{`}{f} (V)] d V . \end{matrix}$
The estimate follows in a manner analogous to the proof of part (v) above, so we omit the details.

$□$

The next lemma guarantees that $R_{1}^{ν}$ maps $H_{q}^{1}$ to $W^{1, \infty}$ .

Lemma B.4

There exists $C > 0$ such that if $0 \leq ν < 1$ and ${‖ f ‖}_{H_{q}^{1}}$ , $‖ \overset{`}{f} ‖_{H_{q}^{1}} \leq 1$ , then the following hold.

(i)
$‖ R^{ν} {(f) ‖}_{L^{\infty}} \leq C {‖ f ‖}_{H_{q}^{1}}^{2}$ .
(ii)
$‖ \partial_{X} [R^{ν} (f)] ‖_{L^{\infty}} \leq C {‖ f ‖}_{H_{q}^{1}}^{2}$ .
(iii)
$‖ R^{ν} (f) - R^{ν} (\overset{`}{f}) ‖_{L^{\infty}} \leq C (ν^{1 / 2} + {‖ f ‖}_{H_{q}^{1}} + ‖ \overset{`}{f} ‖_{H_{q}^{1}}) {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}}$ .
(iv)
$‖ \partial_{X} [R^{ν} (f) - R^{ν} (\overset{`}{f})] ‖_{L^{\infty}} \leq C (ν^{1 / 2} + {‖ f ‖}_{H_{q}^{1}} + ‖ \overset{`}{f} ‖_{H_{q}^{1}}) {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}}$ .
(v)
$‖ R^{ν} (f) - R_{1}^{0} {(f) ‖}_{L^{\infty}} \leq C ν^{1 / 2} {‖ f ‖}_{H_{q}^{1}}^{2}$ .
(vi)
$‖ (R^{ν} (f) - R_{1}^{0} (f)) - (R^{ν} (\overset{`}{f}) - R_{1}^{0} (\overset{`}{f}) {) ‖}_{L^{\infty}} \leq C (ν^{1 / 2} + {‖ f ‖}_{H_{q}^{1}} + ‖ \overset{`}{f} ‖_{H_{q}^{1}}) {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}}$ .
(vii)
$‖ R^{ν} (f) - R_{1}^{0} {(f) ‖}_{L^{\infty}} \leq C ν^{1 / 2} {‖ f ‖}_{H_{q}^{1}}^{2}$ .

Proof

Throughout we will use the inequality

\begin{matrix} ‖ R^{ν} {(f) ‖}_{L^{\infty}} \leq {‖ P_{1}^{ν} (f) (S^{ν} f) ‖}_{L^{1}} . \end{matrix}

As before, we stop when we have bounds in terms of $L^{1}$ - or $L^{\infty}$ -norms.

(i)
We use part (ii) of Lemma B.3 to bound
$\begin{matrix} ‖ R^{ν} (f) ‖_{L^{\infty}} \leq C ‖ P_{1}^{ν} (f) (S^{ν} f) ‖_{L^{1}} \leq C ‖ P_{1}^{ν} (f) ‖_{L^{\infty}} ‖ S^{ν} f ‖_{L^{1}} \leq C {‖ f ‖}_{L^{1}}^{2} . \end{matrix}$
(ii)
We have
$\begin{matrix} \partial_{X} [R^{ν} (f)] = - \frac{α}{c_{0}^{2} τ_{1}} P_{1}^{ν} (f) (S^{ν} f), \end{matrix}$
thus
$\begin{matrix} ‖ \partial_{X} [R^{ν} (f)] ‖_{L^{\infty}} \leq C ‖ P_{1}^{ν} (f) (S^{ν} f) ‖_{L^{\infty}} \leq C ‖ P_{1}^{ν} (f) ‖_{L^{\infty}} {‖ f ‖}_{H_{q}^{1}} \leq C {‖ f ‖}_{H_{q}^{1}}^{2} \end{matrix}$
by the Sobolev embedding and part (ii) of Lemma B.3.

(iii)

We use parts (i) and (ii) of Lemma B.3 to bound

\begin{matrix} ‖ R^{ν} (f) - R^{ν} (\overset{`}{f}) ‖_{L^{\infty}} \leq C ‖ (P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f})) (S^{ν} f) ‖_{L^{1}} + C {‖ P_{1}^{ν} (\overset{`}{f}) (S^{ν} (f - \overset{`}{f})) ‖}_{L^{1}} \\ \leq C ‖ P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} {‖ S^{ν} f ‖}_{L^{1}} + C ‖ P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} {‖ S^{ν} (f - \overset{`}{f}) ‖}_{L^{1}} \\ \leq C (ν^{1 / 2} {‖ f ‖}_{L^{1}} + 1) {‖ f ‖}_{L^{1}} ‖ f - \overset{`}{f} ‖_{L^{1}} + C ‖ \overset{`}{f} ‖_{L^{1}} {‖ f - \overset{`}{f} ‖}_{L^{1}} . \end{matrix}

(iv)
We have
$\begin{matrix} \partial_{X} [R^{ν} (f) - R^{ν} (\overset{`}{f})] = \frac{α}{c_{0}^{2} τ_{1}} P_{1}^{ν} (\overset{`}{f}) (S^{ν} \overset{`}{f}) - \frac{α}{c_{0}^{2} τ_{1}} P_{1}^{ν} (f) (S^{ν} f), \end{matrix}$
thus
$\begin{matrix} ‖ \partial_{X} [R^{ν} (f) - R^{ν} (\overset{`}{f})] ‖_{L^{\infty}} \leq C {‖ (P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f})) (S^{ν} f) ‖}_{L^{\infty}} \\ + C ‖ P_{1}^{ν} (\overset{`}{f}) (S^{ν} (f - \overset{`}{f})) ‖_{L^{\infty}} . \end{matrix}$
We use part (i) of Lemma B.3 and the Sobolev embedding to estimate
$\begin{matrix} ‖ (P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f})) (S^{ν} f) ‖_{L^{\infty}} \leq C ‖ P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} {‖ f ‖}_{H_{q}^{1}} \\ \leq C (ν^{1 / 2} {‖ f ‖}_{L^{1}} + 1) {‖ f ‖}_{H_{q}^{1}} {‖ f - \overset{`}{f} ‖}_{L^{1}} \end{matrix}$
and part (ii) of Lemma B.3 and the Sobolev embedding to estimate
$\begin{matrix} ‖ P_{1}^{ν} (\overset{`}{f}) (S^{ν} (f - \overset{`}{f})) ‖_{L^{\infty}} \leq C ‖ P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} ‖ f - \overset{`}{f} ‖_{L^{\infty}} \leq C ‖ \overset{`}{f} ‖_{L^{1}} {‖ f - \overset{`}{f} ‖}_{H_{q}^{1}} . \end{matrix}$
(v)
We first estimate
$\begin{matrix} ‖ R^{ν} (f) - R_{1}^{0} {(f) ‖}_{L^{\infty}} \leq C {‖ P_{1}^{ν} (f) (S^{ν} f) - P_{1}^{0} (f) f ‖}_{L^{1}} \\ \leq C ‖ (P_{1}^{ν} (f) - P_{1}^{0} (f)) (S^{ν} f) ‖_{L^{1}} \\ + C ‖ P_{1}^{0} (f) (S^{ν} f - f) ‖_{L^{1}} . \end{matrix}$
Then part (iii) of Lemma B.3 gives
$\begin{matrix} ‖ (P_{1}^{ν} (f) - P_{1}^{0} (f)) (S^{ν} f) ‖_{L^{1}} \leq ‖ P_{1}^{ν} (f) - P_{1}^{0} (f) ‖_{L^{\infty}} ‖ S^{ν} {f ‖}_{L^{1}} \leq C ν^{1 / 2} {‖ f ‖}_{L^{1}}^{3} . \end{matrix}$
Next, part (ii) of Lemma B.3 implies
$\begin{matrix} ‖ P_{1}^{0} (f) (S^{ν} f - f) ‖_{L^{1}} \leq ‖ P_{1}^{0} (f) ‖_{L^{\infty}} ‖ S^{ν} {f - f ‖}_{L^{1}} \leq {C ‖ f ‖}_{L^{1}} {‖ (S^{ν} - 1) f ‖}_{L^{1}} . \end{matrix}$

Since $f \in H_{q}^{1}$ , we have
$\begin{matrix} ‖ (S^{ν} - 1) {f ‖}_{L^{1}} \leq C_{q} {‖ (S^{ν} - 1) f ‖}_{L_{q}^{2}} . \end{matrix}$
It follows from Faver and Wright (2018, Lem. A.11) that
$\begin{matrix} ‖ (S^{ν} - 1) {f ‖}_{L_{q}^{2}} \leq C ν {‖ f ‖}_{H_{q}^{1}} . \end{matrix}$

(vi)

We estimate

\begin{matrix} ‖ (R^{ν} (f) - R_{1}^{0} (f)) - (R^{ν} (\overset{`}{f}) - R_{1}^{0} (\overset{`}{f})) ‖_{L^{\infty}} \leq C ‖ (P_{1}^{ν} (f) (S^{ν} f) \\ - P_{1}^{0} (f) f) - (P_{1}^{ν} (\overset{`}{f}) (S^{ν} \overset{`}{f}) - P_{1}^{0} (\overset{`}{f}) \overset{`}{f}) ‖_{L^{1}} \\ \leq C ‖ (P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f})) (S^{ν} f) ‖_{L^{1}} + C {‖ P_{1}^{ν} (\overset{`}{f}) (S^{ν} (f - \overset{`}{f})) ‖}_{L^{1}} \\ + C ‖ (P_{1}^{0} (f) - P_{1}^{0} (\overset{`}{f})) \overset{`}{f} ‖_{L^{1}} + C {‖ P_{1}^{0} (f) (f - \overset{`}{f}) ‖}_{L^{1}} . \end{matrix}

We use part (ii) of Lemma B.3 to bound

\begin{matrix} ‖ P_{1}^{ν} (\overset{`}{f}) (S^{ν} (f - \overset{`}{f})) ‖_{L^{1}} + ‖ P_{1}^{0} (f) (f - \overset{`}{f}) ‖_{L^{1}} \leq ‖ P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} {‖ S^{ν} (f - \overset{`}{f}) ‖}_{L^{1}} \\ + ‖ P_{1}^{0} (f) ‖_{L^{\infty}} {‖ f - \overset{`}{f} ‖}_{L^{1}} \\ \leq C ({‖ f ‖}_{L^{1}} + ‖ \overset{`}{f} ‖_{L^{1}}) {‖ f - \overset{`}{f} ‖}_{L^{1}} . \end{matrix}

We use part (i) of Lemma B.3 to bound

\begin{matrix} ‖ (P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f})) (S^{ν} f) ‖_{L^{1}} + {‖ (P_{1}^{0} (f) - P_{1}^{0} (\overset{`}{f})) \overset{`}{f} ‖}_{L^{1}} \\ \leq ‖ P_{1}^{ν} (f) - P_{1}^{ν} (\overset{`}{f}) ‖_{L^{\infty}} ‖ S^{ν} {f ‖}_{L^{1}} + ‖ P_{1}^{0} (f) - P_{1}^{0} (\overset{`}{f}) ‖_{L^{\infty}} {‖ \overset{`}{f} ‖}_{L^{1}} \\ \leq C (ν^{1 / 2} {‖ f ‖}_{L^{1}} + {1) ‖ f ‖}_{L^{1}} ‖ f - \overset{`}{f} ‖_{L^{1}} + C ‖ \overset{`}{f} ‖_{L^{1}} {‖ f - \overset{`}{f} ‖}_{L^{1}} . \end{matrix}

(vii)
We use part (vi) with $\overset{`}{f} = 0$ .

$□$
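The shift estimate from Faver and Wright (2018, Lem. A.11) that closes part (v) can be illustrated numerically. The following Python sketch is not taken from the source: it assumes that $S^{ν}$ acts as the translation $(S^{ν} f) (X) = f (X + ν)$, it replaces the weighted norm on $L_{q}^{2}$ by the plain $L^{2}$ norm, and it uses the sample profile $f = \operatorname{sech}^{2}$; the helper names `l2_norm` and `shift_ratio` are hypothetical. Under these assumptions the ratio ${‖ (S^{ν} - 1) f ‖}_{L^{2}} / (ν {‖ f' ‖}_{L^{2}})$ should not exceed one and should approach one as $ν \downarrow 0$.

```python
import numpy as np


def l2_norm(values, dx):
    """Riemann-sum approximation of the L^2 norm on a uniform grid."""
    return float(np.sqrt(np.sum(values**2) * dx))


def shift_ratio(nu, half_width=40.0, n=80001):
    """Return ||(S^nu - 1) f||_{L^2} / (nu * ||f'||_{L^2}) for f = sech^2.

    Assumption: S^nu is the translation (S^nu f)(X) = f(X + nu).
    The weighted space L^2_q of the paper is replaced by plain L^2.
    """
    x = np.linspace(-half_width, half_width, n)
    dx = x[1] - x[0]

    def f(y):
        return 1.0 / np.cosh(y) ** 2

    df = -2.0 * np.tanh(x) / np.cosh(x) ** 2  # exact derivative of sech^2
    diff = f(x + nu) - f(x)                   # samples of (S^nu - 1) f
    return l2_norm(diff, dx) / (nu * l2_norm(df, dx))


ratios = {nu: shift_ratio(nu) for nu in (0.5, 0.1, 0.01)}
for nu, ratio in ratios.items():
    print(f"nu = {nu:5.2f}   ratio = {ratio:.6f}")
```

A bounded ratio for all sampled $ν$ is what an $O (ν)$ bound of this type predicts; the check says nothing about the weighted setting treated in the cited lemma.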

Finally, we present estimates on the operators $N^{ν}$ defined in (3.38) and $P_{2}^{ν}$ from (3.34).

Lemma B.5

There exist C, $ρ_{0} > 0$ such that if $0 \leq ν < 1$ , then the following hold.

(i)
If f, $\overset{`}{f} \in H_{q}^{1}$ and g, $\overset{`}{g} \in W^{1, \infty}$ with ${‖ f ‖}_{H_{q}^{1}} + {‖ g ‖}_{W^{1, \infty}} \leq ρ_{0}$ and $‖ \overset{`}{f} ‖_{H_{q}^{1}} + {‖ \overset{`}{g} ‖}_{W^{1, \infty}} \leq ρ_{0}$ , then
$\begin{matrix} ‖ N^{ν} (f, g) - N^{ν} (\overset{`}{f}, \overset{`}{g}) ‖_{L_{q}^{2}} + {‖ P_{2}^{ν} (f, g) - P_{2}^{ν} (\overset{`}{f}, \overset{`}{g}) ‖}_{W^{1, \infty}} \\ \leq C (ν^{1 / 2} + {‖ f ‖}_{H_{q}^{1}} + ‖ \overset{`}{f} ‖_{H_{q}^{1}} + {‖ g ‖}_{W^{1, \infty}} + {‖ \overset{`}{g} ‖}_{W^{1, \infty}}) \\ (‖ f - \overset{`}{f} ‖_{H_{q}^{1}} + {‖ g - \overset{`}{g} ‖}_{W^{1, \infty}}) . \end{matrix}$
(ii)
If $f \in H_{q}^{1}$ and $g \in W^{1, \infty}$ with ${‖ f ‖}_{H_{q}^{1}} + {‖ g ‖}_{W^{1, \infty}} \leq ρ_{0}$ , then
$\begin{matrix} ‖ N^{ν} {(f, g) ‖}_{L_{q}^{2}} + {‖ P_{2}^{ν} (f, g) ‖}_{W^{1, \infty}} \leq C . \end{matrix}$

Proof

Part (ii) follows from part (i) since $N^{ν} (0, 0) = P_{2}^{ν} (0, 0) = 0$. The proof of the Lipschitz estimates in part (i) follows exactly the strategies deployed above and requires no new ideas, so we omit the details. The one difference is that $N^{ν}$ and $P_{2}^{ν}$ incorporate the maps $Q_{1}^{ν}$ and $Q_{2}^{ν}$, which were defined in (3.32) and which are rational functions from $\mathbb{R}^{2}$ to $\mathbb{R}$. Inspecting the formulas for $Q_{1}^{ν}$ and $Q_{2}^{ν}$ provides $ρ_{Q} > 0$ such that if $0 < ν < 1$, then $Q_{1}^{ν}$ and $Q_{2}^{ν}$ are defined and smooth on the ball $\{(X, Y) \in \mathbb{R}^{2} \mid | X | + | Y | \leq ρ_{Q}\}$. By taking ${‖ f ‖}_{H_{q}^{1}} + {‖ g ‖}_{W^{1, \infty}} \leq ρ_{0}$ for some small $ρ_{0} > 0$, we can guarantee that the compositions involving $Q_{1}^{ν}$ and $Q_{2}^{ν}$ with f, g, and other operators acting on f and g are all defined and satisfy tame Lipschitz estimates. $□$

B.2. Lipschitz estimates

We first prove the Lipschitz estimates undergirding part (ii) of Lemma B.1, which we then use to prove the mapping estimates in part (i). From (6.7), we have $N^{ν} = (N_{1}^{ν}, N_{2}^{ν})$ , where $N_{1}^{ν}$ was defined in (6.5) and $N_{2}^{ν}$ in (6.6). Using these definitions and the boundedness of the operator $T^{- 1}$ from Proposition 5.1, we can prove part (ii) of Lemma B.1 if we show

\begin{matrix} \sum_{k = 1}^{5} ‖ V_{1 k}^{ν} (η) - V_{1 k}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} + {‖ V_{21}^{ν} (η) - V_{21}^{ν} (\overset{`}{η}) ‖}_{W^{1, \infty}} \\ + ‖ V_{23}^{ν} (η) - V_{23}^{ν} (\overset{`}{η}) ‖_{W^{1, \infty}} \leq C R_{⋆}^{ν} (η, \overset{`}{η}), \end{matrix}

where

\begin{matrix} R_{⋆}^{ν} (η, \overset{`}{η}) : = (ν^{1 / 3} + ‖ η_{1} ‖_{H_{q}^{1}} + ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}} + ‖ η_{2} ‖_{W^{1, \infty}} + {‖ {\overset{`}{η}}_{2} ‖}_{W^{1, \infty}}) \\ (‖ η_{1} - {\overset{`}{η}}_{1} ‖_{H_{q}^{1}} + {‖ η_{2} - {\overset{`}{η}}_{2} ‖}_{W^{1, \infty}}) . \end{matrix}

The terms $V_{1 k}^{ν}$ were defined in (6.3) and $V_{2 k}^{ν}$ in (6.4).

B.2.1. Lipschitz estimates on $V_{11}^{ν}$

We use the estimate on $M^{(ν)} - M^{(0)}$ from Proposition 4.1 to obtain

\begin{matrix} ‖ V_{11}^{ν} (η) - V_{11}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} \leq & C ν^{1 / 3} {‖ (R^{ν} (σ + η_{1}) - R^{ν} (σ + {\overset{`}{η}}_{1})) (σ + {\overset{`}{η}}_{1}) ‖}_{H_{q}^{1}} \\ + C ν^{1 / 3} {‖ R^{ν} (σ + η_{1}) (η_{1} - {\overset{`}{η}}_{1}) ‖}_{H_{q}^{1}} . \end{matrix}

We first estimate

\begin{matrix} ‖ (R^{ν} (σ + η_{1}) - R^{ν} (σ + {\overset{`}{η}}_{1})) (σ + {\overset{`}{η}}_{1}) ‖_{H_{q}^{1}} \leq ‖ \partial_{X} [R^{ν} (σ + η_{1}) - R^{ν} (σ + {\overset{`}{η}}_{1})] \\ (σ + {\overset{`}{η}}_{1}) ‖_{L_{q}^{2}} + {‖ (R^{ν} (σ + η_{1}) - R^{ν} (σ + {\overset{`}{η}}_{1})) \partial_{X} [σ + {\overset{`}{η}}_{1}] ‖}_{L_{q}^{2}}, \end{matrix}

where

\begin{matrix} ‖ \partial_{X} [R^{ν} (σ + η_{1}) - R^{ν} (σ + {\overset{`}{η}}_{1})] (σ + {\overset{`}{η}}_{1}) ‖_{L_{q}^{2}} \\ \leq ‖ \partial_{X} [R^{ν} (σ + η_{1}) - R^{ν} (σ + {\overset{`}{η}}_{1})] ‖_{L^{\infty}} {‖ σ + {\overset{`}{η}}_{1} ‖}_{L_{q}^{2}} \\ \leq C (ν^{1 / 2} + ‖ η_{1} ‖_{H_{q}^{1}} + ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}}) {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

by part (iv) of Lemma B.4 and

\begin{matrix} ‖ (R_{1}^{ν} (σ + η_{1}) - R_{1}^{ν} (σ + {\overset{`}{η}}_{1})) \partial_{X} [σ + {\overset{`}{η}}_{1}] ‖_{L_{q}^{2}} \\ \leq ‖ R_{1}^{ν} (σ + η_{1}) - R_{1}^{ν} (σ + {\overset{`}{η}}_{1}) ‖_{L^{\infty}} {‖ \partial_{X} [σ + {\overset{`}{η}}_{1}] ‖}_{L_{q}^{2}} \\ \leq C (ν^{1 / 2} + ‖ η_{1} ‖_{H_{q}^{1}} + ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}}) {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

by part (iii) of Lemma B.4.

Next we estimate

\begin{matrix} ‖ R_{1}^{ν} (σ + η_{1}) (η_{1} - {\overset{`}{η}}_{1}) ‖_{H_{q}^{1}} \leq {‖ \partial_{X} [R_{1}^{ν} (σ + η_{1})] (η_{1} - {\overset{`}{η}}_{1}) ‖}_{L_{q}^{2}} \\ + ‖ R_{1}^{ν} (σ + η_{1}) \partial_{X} [η_{1} - {\overset{`}{η}}_{1}] ‖_{L_{q}^{2}}, \end{matrix}

where

\begin{matrix} ‖ \partial_{X} [R_{1}^{ν} (σ + η_{1})] (η_{1} - {\overset{`}{η}}_{1}) ‖_{L_{q}^{2}} \leq ‖ \partial_{X} [R_{1}^{ν} (σ + η_{1}) {] ‖}_{L^{\infty}} {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{L_{q}^{2}} \\ \leq C ‖ σ + η_{1} ‖_{H_{q}^{1}}^{2} ‖ η_{1} - {\overset{`}{η}}_{1} ‖_{H_{q}^{1}} \leq C {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

by part (ii) of Lemma B.4 and

\begin{matrix} ‖ R_{1}^{ν} (σ + η_{1}) \partial_{X} [η_{1} - {\overset{`}{η}}_{1}] ‖_{L_{q}^{2}} \leq ‖ R_{1}^{ν} (σ + η_{1}) ‖_{L^{\infty}} {‖ \partial_{X} [η_{1} - {\overset{`}{η}}_{1}] ‖}_{L_{q}^{2}} \\ \leq C ‖ σ + η_{1} ‖_{H_{q}^{1}}^{2} ‖ η_{1} - {\overset{`}{η}}_{1} ‖_{H_{q}^{1}} \leq C {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

by part (i) of Lemma B.4.

B.2.2. Lipschitz estimates on $V_{12}^{ν}$

We use the smoothing property of $M^{(0)}$ from Lemma 3.1 to bound

\begin{matrix} ‖ V_{12}^{ν} (η) - V_{12}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} \leq C ‖ [(R_{1}^{ν} (η_{1}) \\ - R_{1}^{0} (η_{1})) - (R_{1}^{ν} ({\overset{`}{η}}_{1}) - R_{1}^{0} ({\overset{`}{η}}_{1}))] (σ + η_{1}) ‖_{L_{q}^{2}} \\ + C ‖ (R_{1}^{ν} ({\overset{`}{η}}_{1}) - R_{1}^{0} ({\overset{`}{η}}_{1})) (η_{1} - {\overset{`}{η}}_{1}) ‖_{L_{q}^{2}} . \end{matrix}

Call the two $L_{q}^{2}$ -norm terms above I and $I I$ . We estimate

\begin{matrix} I \leq & ‖ (R_{1}^{ν} (η_{1}) - R_{1}^{0} (η_{1})) - (R_{1}^{ν} ({\overset{`}{η}}_{1}) - R_{1}^{0} ({\overset{`}{η}}_{1}) {) ‖}_{L^{\infty}} {‖ σ + η_{1} ‖}_{L_{q}^{2}} \\ \leq & C (ν^{1 / 2} + ‖ η_{1} ‖_{H_{q}^{1}} + ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}}) {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} . \end{matrix}

by part (vi) of Lemma B.4 and

\begin{matrix} I I \leq ‖ R_{1}^{ν} ({\overset{`}{η}}_{1}) - R_{1}^{0} ({\overset{`}{η}}_{1}) ‖_{L^{\infty}} ‖ η_{1} - {\overset{`}{η}}_{1} ‖_{L_{q}^{2}} \leq C ν^{1 / 2} ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}} {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

by part (vii) of Lemma B.4.

B.2.3. Lipschitz estimates on $V_{13}^{ν}$

We use again the smoothing property of $M^{(0)}$ to bound

\begin{matrix} ‖ V_{13}^{ν} (η) - V_{13}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} \leq C {‖ (R_{1}^{0} (σ + η_{1}) - R_{1}^{0} (σ) - D R_{1}^{0} (σ) η_{1}) σ ‖}_{L_{q}^{2}} \\ \leq C ‖ R_{1}^{0} (σ + η_{1}) - R_{1}^{0} (σ) - D R_{1}^{0} (σ) η_{1} ‖_{L^{\infty}} {‖ σ ‖}_{L_{q}^{2}} . \end{matrix}

Next we will use the following ‘difference of squares’ estimate, which is proved using the fundamental theorem of calculus. We thank J. Douglas Wright for pointing out this lemma to us.

Lemma B.6

Let $X$ and $Y$ be Banach spaces with $Z \subseteq X$ open and convex and with $0 \in Z$ . Let $f \in C^{1} (Z, Y)$ with $D f (0) = 0$ , and suppose

\begin{matrix} \operatorname{Lip}_{Z} (D f) : = \sup_{\substack{x, \overset{`}{x} \in Z \\ x \neq \overset{`}{x}}} \frac{‖ D f (x) - D f (\overset{`}{x}) ‖_{B (X, Y)}}{‖ x - \overset{`}{x} ‖_{X}} < \infty . \end{matrix}

Then

\begin{matrix} ‖ f (x) - f (\overset{`}{x}) ‖_{Y} \leq \frac{1}{2} \operatorname{Lip}_{Z} (D f) ({‖ x ‖}_{X} + ‖ \overset{`}{x} ‖_{X}) {‖ x - \overset{`}{x} ‖}_{X} . \end{matrix}
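
A short sketch of the fundamental theorem of calculus argument, included for orientation and not quoted from a reference: since $Z$ is convex, the segment from $\overset{`}{x}$ to $x$ lies in $Z$, and since $0 \in Z$ with $D f (0) = 0$ we have $‖ D f (y) ‖_{B (X, Y)} = ‖ D f (y) - D f (0) ‖_{B (X, Y)} \leq \operatorname{Lip}_{Z} (D f) {‖ y ‖}_{X}$ for all $y \in Z$. Hence

\begin{matrix} ‖ f (x) - f (\overset{`}{x}) ‖_{Y} = {∥\int_{0}^{1} D f (\overset{`}{x} + s (x - \overset{`}{x})) (x - \overset{`}{x}) d s∥}_{Y} \\ \leq \operatorname{Lip}_{Z} (D f) {‖ x - \overset{`}{x} ‖}_{X} \int_{0}^{1} ((1 - s) ‖ \overset{`}{x} ‖_{X} + s {‖ x ‖}_{X}) d s \\ = \frac{1}{2} \operatorname{Lip}_{Z} (D f) ({‖ x ‖}_{X} + ‖ \overset{`}{x} ‖_{X}) {‖ x - \overset{`}{x} ‖}_{X} . \end{matrix}

For the scalar map $f (x) = x^{2}$ on $X = Y = \mathbb{R}$, where $\operatorname{Lip}_{Z} (D f) = 2$, the bound reads $| x^{2} - {\overset{`}{x}}^{2} | \leq (| x | + | \overset{`}{x} |) | x - \overset{`}{x} |$, which is the elementary estimate behind the name 'difference of squares'.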

We apply this lemma to $f (η_{1}) : = R_{1}^{0} (σ + η_{1}) - R_{1}^{0} (σ) - D R_{1}^{0} (σ) η_{1}$, which is infinitely differentiable as a map from $H_{q}^{1}$ to $W^{1, \infty}$ by Remark 3.4, to conclude

\begin{matrix} ‖ R_{1}^{0} (σ + η_{1}) - R_{1}^{0} (σ) - D R_{1}^{0} (σ) η_{1} ‖_{L^{\infty}} \leq C (‖ η_{1} ‖_{H_{q}^{1}} + ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}}) {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} . \end{matrix}

B.2.4. Lipschitz estimates on $V_{14}^{ν}$

We smooth with $M^{(0)}$ once more, and then we use the fundamental theorem of calculus and the smoothness of $R_{1}^{0}$ to rewrite

\begin{matrix} ‖ V_{14}^{ν} (η) - V_{14}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} \leq {C ‖ I ‖}_{L_{q}^{2}} + C {‖ I I ‖}_{L_{q}^{2}}, \end{matrix}

where

\begin{matrix} I : = (\int_{0}^{1} (D R_{1}^{0} (σ + s η_{1}) - D R_{1}^{0} (σ + s {\overset{`}{η}}_{1})) d s) η_{1}^{2} \end{matrix}

and

\begin{matrix} I I : = (\int_{0}^{1} D R_{1}^{0} (σ + s {\overset{`}{η}}_{1}) d s) (η_{1} + {\overset{`}{η}}_{1}) (η_{1} - {\overset{`}{η}}_{1}) . \end{matrix}

Then

\begin{matrix} {‖ I ‖}_{L_{q}^{2}} \leq {∥\int_{0}^{1} (D R_{1}^{0} (σ + s η_{1}) - D R_{1}^{0} (σ + s {\overset{`}{η}}_{1})) d s∥}_{L^{\infty}} {‖ η_{1} ‖}_{L_{q}^{2}} \end{matrix}

and

\begin{matrix} {‖ I I ‖}_{L_{q}^{2}} \leq {∥\int_{0}^{1} D R_{1}^{0} (σ + s {\overset{`}{η}}_{1}) d s∥}_{L^{\infty}} ‖ η_{1} + {\overset{`}{η}}_{1} ‖_{L^{\infty}} {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{L_{q}^{2}} . \end{matrix}

We conclude

\begin{matrix} {‖ I ‖}_{L_{q}^{2}} \leq C ‖ η_{1} ‖_{H_{q}^{1}} {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

via a Lipschitz estimate on $D R_{1}^{0}$ and

\begin{matrix} {‖ I I ‖}_{L_{q}^{2}} \leq C (‖ η_{1} ‖_{H_{q}^{1}} + ‖ {\overset{`}{η}}_{1} ‖_{H_{q}^{1}}) {‖ η_{1} - {\overset{`}{η}}_{1} ‖}_{H_{q}^{1}} \end{matrix}

via the boundedness of $D R_{1}^{0}$ .

B.2.5. Lipschitz estimates on $V_{15}^{ν}$

We smooth with $M^{(0)}$ to estimate

\begin{matrix} ‖ V_{15}^{ν} (η) - V_{15}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} \leq C ν^{1 / 2} {‖ N^{ν} (σ + η_{1}, ζ + η_{2}) - N^{ν} (σ + {\overset{`}{η}}_{1}, ζ + {\overset{`}{η}}_{2}) ‖}_{L_{q}^{2}} . \end{matrix}

The desired estimate then follows from part (i) of Lemma B.5.

B.2.6. Lipschitz estimates on $V_{21}^{ν}$

This is a direct application of parts (v) and (vi) of Lemma B.3.

B.2.7. Lipschitz estimates on $V_{22}^{ν} \circ N_{1}^{ν}$

We have

\begin{matrix} V_{22}^{ν} (N_{1}^{ν} (η)) (X) = P_{1}^{0} (σ + N_{1}^{ν} (η)) (X) - P_{1}^{0} (σ) (X) = \frac{α}{c_{0}} \int_{X}^{\infty} N_{1}^{ν} (η) (V) d V . \end{matrix}

(B.5)

The desired Lipschitz estimate on $V_{22}^{ν}$ then follows at once from the Lipschitz estimate

\begin{matrix} ‖ N_{1}^{ν} (η) - N_{1}^{ν} (\overset{`}{η}) ‖_{H_{q}^{1}} \leq C R_{⋆}^{ν} (η, \overset{`}{η}), \end{matrix}

which we proved in Appendices B.2.1 through B.2.5. Had we not substituted $N_{1}^{ν} (η)$ for $η_{1}$ when defining $N_{2}^{ν}$ in (6.6), we would have only a useless $O (1)$ estimate here.
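
To spell out why the estimate follows at once, here is a sketch under the convention ${‖ g ‖}_{W^{1, \infty}} = {‖ g ‖}_{L^{\infty}} + {‖ \partial_{X} g ‖}_{L^{\infty}}$, which is assumed for this illustration. By the integral representation of $V_{22}^{ν} \circ N_{1}^{ν}$ displayed above, the tail integral is bounded in $L^{\infty}$ by the $L^{1}$-norm of its integrand, and its derivative equals $- α / c_{0}$ times the integrand. Writing $Δ : = N_{1}^{ν} (η) - N_{1}^{ν} (\overset{`}{η})$, a symbol introduced only for this display, we obtain

\begin{matrix} ‖ V_{22}^{ν} (N_{1}^{ν} (η)) - V_{22}^{ν} (N_{1}^{ν} (\overset{`}{η})) ‖_{W^{1, \infty}} \leq | \frac{α}{c_{0}} | ({‖ Δ ‖}_{L^{1}} + {‖ Δ ‖}_{L^{\infty}}) \leq C {‖ Δ ‖}_{H_{q}^{1}}, \end{matrix}

where the last step uses the embeddings of $H_{q}^{1}$ into $L^{1}$ and $L^{\infty}$ that were already employed in the proof of Lemma B.4.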

B.2.8. Lipschitz estimates on $V_{23}^{ν}$

This is a direct application of part (i) of Lemma B.5.

B.3. Mapping estimates

We prove the mapping estimates that deliver part (i) of Lemma B.1 and rely mostly on the preceding Lipschitz estimates. Due to the boundedness of $T^{- 1}$ , it suffices to show

\begin{matrix} \sum_{k = 1}^{5} ‖ V_{1 k}^{ν} {(η) ‖}_{H_{q}^{1}} + ‖ V_{21}^{ν} {(η) ‖}_{W^{1, \infty}} + {‖ V_{23}^{ν} (η) ‖}_{W^{1, \infty}} \\ \leq C (ν^{1 / 3} + ‖ η_{1} ‖_{H_{q}^{1}}^{2} + {‖ η_{2} ‖}_{W^{1, \infty}}^{2}) . \end{matrix}

B.3.1. Mapping estimates on $V_{11}^{ν}$

We estimate

\begin{matrix} ‖ V_{11}^{ν} {(η) ‖}_{H_{q}^{1}} \leq ‖ V_{11}^{ν} (η) - V_{11}^{ν} {(0) ‖}_{H_{q}^{1}} + {‖ V_{11}^{ν} (0) ‖}_{H_{q}^{1}}, \end{matrix}

where

\begin{matrix} ‖ V_{11}^{ν} (η) - V_{11}^{ν} {(0) ‖}_{H_{q}^{1}} \leq C ν^{1 / 3} {‖ η_{1} ‖}_{H_{q}^{1}}^{2} \end{matrix}

by the Lipschitz estimates in Appendix B.2.1 and

\begin{matrix} ‖ V_{11}^{ν} {(0) ‖}_{H_{q}^{1}} = {‖ (M^{(ν)} - M^{(0)}) [R_{1}^{ν} (σ) σ] ‖}_{H_{q}^{1}} \leq C ν^{1 / 3} \end{matrix}

by Proposition 4.1.

B.3.2. Mapping estimates on $V_{12}^{ν}$

We estimate

\begin{matrix} ‖ V_{12}^{ν} {(η) ‖}_{L_{q}^{2}} \leq ‖ V_{12}^{ν} (η) - V_{12}^{ν} {(0) ‖}_{L_{q}^{2}} + {‖ V_{12}^{ν} (0) ‖}_{L_{q}^{2}}, \end{matrix}

where

\begin{matrix} ‖ V_{12}^{ν} (η) - V_{12}^{ν} {(0) ‖}_{L_{q}^{2}} \leq C \end{matrix}

by the Lipschitz estimates in Appendix B.2.2 and

\begin{matrix} ‖ V_{12}^{ν} {(0) ‖}_{L_{q}^{2}} = ‖ (R_{1}^{ν} (σ) - R_{1}^{0} {(σ)) σ ‖}_{L_{q}^{2}} \leq ‖ R_{1}^{ν} (σ) - R_{1}^{0} {(σ) ‖}_{L^{\infty}} {‖ σ ‖}_{L_{q}^{2}} \leq C ν^{1 / 2} \end{matrix}

by part (vii) of Lemma B.4.

B.3.3. Mapping estimates on $V_{13}^{ν}$

Because $V_{13}^{ν} (0) = 0$ , these follow from the Lipschitz estimates for $V_{13}^{ν}$ that we developed above in Appendix B.2.3.

B.3.4. Mapping estimates on $V_{14}^{ν}$

Because $V_{14}^{ν} (0) = 0$ , these follow from the Lipschitz estimates for $V_{14}^{ν}$ that we developed above in Appendix B.2.4.

B.3.5. Mapping estimates on $V_{15}^{ν}$

The estimates are analogous to those in Appendix B.3.1, except now we use Lemma B.5 instead of the Lipschitz estimates in Appendix B.2.1.

B.3.6. Mapping estimates on $V_{21}^{ν}$

These estimates follow directly from parts (iii) and (iv) of Lemma B.3.

B.3.7. Mapping estimates on $V_{22}^{ν} \circ N_{1}^{ν}$

We obtain these estimates by first rewriting $V_{22}^{ν} \circ N_{1}^{ν}$ via the identity (B.5) and then using the mapping estimates on $N_{1}^{ν}$ developed in Appendices B.3.1 through B.3.5.

B.3.8. Mapping estimates on $V_{23}^{ν}$

This estimate follows from part (ii) of Lemma B.5.

Data availability statement

The datasets generated during the current study are available from the corresponding author on reasonable request.

Declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Footnotes

For presentation purposes, the parameters L and r appearing in Merks et al. (2007) have been set to unity.

Here we use the abbreviation ${‖ A ‖}_{\infty} = \sup_{j, t} | A_{j} (t) |$ and its analogues for P and R.

We define the width of the auxin pulse as the distance between the two points where the pulse attains 5% of its maximum value.
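
The width convention in the preceding footnote can be evaluated on sampled data along the following lines. This Python sketch is illustrative only and is not the code used for the computations in the paper: the function name `pulse_width`, the linear interpolation between grid points, and the $\operatorname{sech}^{2}$ test profile are all assumptions made here.

```python
import numpy as np


def pulse_width(x, profile, fraction=0.05):
    """Distance between the outermost crossings of fraction * max(profile).

    Assumes a single non-negative pulse sampled on an increasing grid x,
    with the profile below the threshold at both ends of the grid.
    """
    x = np.asarray(x, dtype=float)
    profile = np.asarray(profile, dtype=float)
    threshold = fraction * profile.max()
    above = np.flatnonzero(profile >= threshold)
    left, right = above[0], above[-1]
    if left == 0 or right == len(x) - 1:
        raise ValueError("pulse is not resolved inside the sampling window")

    def crossing(i_out, i_in):
        # Linear interpolation between a sample below the threshold (i_out)
        # and the neighbouring sample at or above it (i_in).
        t = (threshold - profile[i_out]) / (profile[i_in] - profile[i_out])
        return x[i_out] + t * (x[i_in] - x[i_out])

    return crossing(right + 1, right) - crossing(left - 1, left)


# Hypothetical test profile: sech^2 attains 5% of its maximum at
# |x| = arccosh(sqrt(20)), so the width should be close to twice that value.
x = np.linspace(-20.0, 20.0, 4001)
width = pulse_width(x, 1.0 / np.cosh(x) ** 2)
exact = 2.0 * np.arccosh(np.sqrt(20.0))
print(width, exact)
```

For profiles with several local maxima the outermost crossings may overestimate the width of the main pulse, so a different convention could be needed there.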

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Bente Hilde Bakker, Email: b.h.bakker@math.leidenuniv.nl.

Timothy E. Faver, Email: tfaver1@kennesaw.edu

Hermen Jan Hupkes, Email: hhupkes@math.leidenuniv.nl.

Roeland M. H. Merks, Email: merksrmh@math.leidenuniv.nl

Jelle van der Voort, Email: jelvoort@live.nl.

References

Adamowski M, Friml J. PIN-dependent auxin transport: action, regulation, and evolution. Plant Cell. 2015;27:20–32. doi: 10.1105/tpc.114.134874.
Allen HR, Ptashnyk M. Mathematical modelling of auxin transport in plant tissues: flux meets signalling and growth. Bull Math Biol. 2020;82:1–35. doi: 10.1007/s11538-019-00685-y.
Althuis R (2021) Auxin waves in a two-dimensional grid. BSc thesis, Leiden University. https://pub.math.leidenuniv.nl/hupkeshj/scriptie_rosalie.pdf
Aronson DG, Weinberger HF (1975) Nonlinear diffusion in population genetics, combustion, and nerve pulse propagation, in Partial differential equations and related topics (Program, Tulane Univ., New Orleans, La., 1974). Lecture notes in mathematics, vol 446. Springer, Berlin, pp 5–49
Aronson DG, Weinberger HF. Multidimensional nonlinear diffusion arising in population genetics. Adv Math. 1978;30:33–76.
Autran D, Bassel GW, Chae E, Ezer D, Ferjani A, Fleck C, Hamant O, Hartmann FP, Jiao Y, Johnston IG, Kwiatkowska D, Lim BL, Mahönen AP, Morris RJ, Mulder BM, Nakayama N, Sozzani R, Strader LC, Tusscher Kt, Ueda M, Wolf S (2021) What is quantitative plant biology? Quant Plant Biol 2
Bayer EM, Smith RS, Mandel T, Nakayama N, Sauer M, Prusinkiewicz P, Kuhlemeier C. Integration of transport-based models for phyllotaxis and midvein formation. Genes Dev. 2009;23:373–384. doi: 10.1101/gad.497009.
Beale JT. Water waves generated by a pressure disturbance on a steady stream. Duke Math J. 1980;47:297–323.
Benítez M, Hernández-Hernández V, Newman SA, Niklas KJ. Dynamical patterning modules, biogeneric materials, and the evolution of multicellular plants. Front Plant Sci. 2018;9:871. doi: 10.3389/fpls.2018.00871.
Brillouin L. Wave propagation in periodic structures. New York: Dover Phoenix Editions; 1953.
Chen X, Guo J-S, Wu C-C. Traveling waves in discrete periodic media for bistable dynamics. Arch Ration Mech Anal. 2008;189:189–236.
Cieslak M, Owens A, Prusinkiewicz P. Computational models of auxin-driven patterning in shoots. Cold Spring Harb Perspect Biol. 2021;14:a040097. doi: 10.1101/cshperspect.a040097.
Dauxois T. Fermi, Pasta, Ulam, and a mysterious lady. Phys Today. 2008;61:55–57.
Draelants D, Avitabile D, Vanroose W. Localized auxin peaks in concentration-based transport models of the shoot apical meristem. J R Soc Interface. 2015;12:20141407. doi: 10.1098/rsif.2014.1407.
Emerenini BO, Hense BA, Kuttler C, Eberl HJ. A mathematical model of quorum sensing induced biofilm detachment. PLoS ONE. 2015;10:e0132385–e0132385. doi: 10.1371/journal.pone.0132385.
Faver TE (2018) Nanopteron-stegoton traveling waves in mass and spring dimer Fermi–Pasta–Ulam–Tsingou lattices. Ph.D. thesis, Drexel University, Philadelphia, PA, May
Faver TE, Wright JD. Exact diatomic Fermi–Pasta–Ulam–Tsingou solitary waves with optical band ripples at infinity. SIAM J Math Anal. 2018;50:182–250.
Fendrych M, Leung J, Friml J. TIR1/AFB-Aux/IAA auxin perception mediates rapid cell wall acidification and growth of Arabidopsis hypocotyls. Elife. 2016;5:e19048. doi: 10.7554/eLife.19048.
Fermi E, Pasta J, Ulam S. Studies of nonlinear problems. Lect Appl Math. 1955;12:143–56.
Friesecke G, Pego RL. Solitary waves on FPU lattices. I. Qualitative properties, renormalization and continuum limit. Nonlinearity. 1999;12:1601–1627.
Friesecke G, Pego RL. Solitary waves on FPU lattices. II. Linear implies nonlinear stability. Nonlinearity. 2002;15:1343–1359.
Friesecke G, Pego RL. Solitary waves on Fermi–Pasta–Ulam lattices. III. Howland-type Floquet theory. Nonlinearity. 2004;17:207–227.
Friesecke G, Pego RL. Solitary waves on Fermi–Pasta–Ulam lattices. IV. Proof of stability at low energy. Nonlinearity. 2004;17:229–251.
Friesecke G, Wattis JAD. Existence theorem for solitary waves on lattices. Commun Math Phys. 1994;161:391–418.
Ghasemi M, Sonner S, Eberl HJ. Time adaptive numerical solution of a highly non-linear degenerate cross-diffusion system arising in multi-species biofilm modelling. Eur J Appl Math. 2018;29:1035–1061.
Hajný J, Prát T, Rydza N, Rodriguez L, Tan S, Verstraeten I, Domjan D, Mazur E, Smakowska-Luzan E, Smet W, Mor E, Nolf J, Yang B, Grunewald W, Molnár G, Belkhadir Y, Rybel BD, Friml J. Receptor kinase module targets PIN-dependent auxin transport during canalization. Science. 2020;370:550–557. doi: 10.1126/science.aba3178.
Hajný J, Tan S, Friml J. Auxin canalization: from speculative models toward molecular players. Curr Opin Plant Biol. 2022;65:102174. doi: 10.1016/j.pbi.2022.102174.
Haskovec J, Jönsson H, Kreusser LM, Markowich P. Auxin transport model for leaf venation. Proc R Soc A. 2019;475:20190015. doi: 10.1098/rspa.2019.0015.
Heisler MG, Jonsson H. Modeling auxin transport and plant development. J Plant Growth Regul. 2006;25:302–312.
Herrmann M, Matthies K. Asymptotic formulas for solitary waves in the high-energy limit of FPU-type chains. Nonlinearity. 2015;28:2767–2789.
Hochstrasser D, Mertens F, Büttner H. Energy transport by lattice solitons in $α$-helical proteins. Phys Rev A. 1989;40:2602. doi: 10.1103/physreva.40.2602.
Hoffman A, Wright JD. Nanopteron solutions of diatomic Fermi–Pasta–Ulam–Tsingou lattices with small mass-ratio. Physica D. 2017;358:33–59.
Holloway DM, Wenzel CL. Polar auxin transport dynamics of primary and secondary vein patterning in dicot leaves. in silico Plants. 2021;3:diab030.
Hupkes HJ, Sandstede B. Travelling pulse solutions for the discrete FitzHugh–Nagumo system. SIAM J Appl Dyn Syst. 2010;9:827–882.
Johnson MA, Wright JD. Generalized solitary waves in the gravity-capillary Whitham equation. Stud Appl Math. 2020;144:102–130.
Johnston ST, Baker RE, McElwain DS, Simpson MJ. Co-operation, competition and crowding: a discrete framework linking Allee kinetics, nonlinear diffusion, shocks and sharp-fronted travelling waves. Sci Rep. 2017;7:1–19. doi: 10.1038/srep42134.
Jones C, Kopell N, Langer R. Construction of the FitzHugh–Nagumo pulse using differential forms. In: Aris R, Aronson DG, Swinney HL, editors. Patterns and dynamics in reactive media. New York: Springer; 1991. pp. 101–115.
Jönsson H, Heisler M, Shapiro B, Meyerowitz E, Mjolsness E. An auxin-driven polarized transport model for phyllotaxis. Proc Natl Acad Sci USA. 2006;103:1633–1638. doi: 10.1073/pnas.0509839103.
Julien JD, Pumir A, Boudaoud A. Strain- or stress-sensing in mechanochemical patterning by the phytohormone auxin. Bull Math Biol. 2019;81:3342–3361. doi: 10.1007/s11538-019-00600-5.
Keener JP. Propagation and its failure in coupled systems of discrete excitable cells. SIAM J Appl Math. 1987;47:556–572.
Kevrekidis PG. Non-linear waves in lattices: past, present, future. IMA J Appl Math. 2011;76:389–423.
Kneuper I, Teale W, Dawson JE, Tsugeki R, Katifori E, Palme K, Ditengou FA. Auxin biosynthesis and cellular efflux act together to regulate leaf vein patterning. J Exp Bot. 2020;72:1151–1165. doi: 10.1093/jxb/eraa501.
Li Y, van Heijster P, Simpson MJ, Wechselberger M. Shock-fronted travelling waves in a reaction–diffusion model with nonlinear forward–backward–forward diffusion. Physica D. 2021;423:132916.
Mallet-Paret J. The global structure of traveling waves in spatially discrete dynamical systems. J Dyn Differ Equ. 1999;11:49–128.
Merks RMH, Van de Peer Y, Inzé D, Beemster GTS. Canalization without flux sensors: a traveling-wave hypothesis. Trends Plant Sci. 2007;12:384–390. doi: 10.1016/j.tplants.2007.08.004.
Mitchison GJ. A model for vein formation in higher plants. Proc R Soc Lond Ser B Biol Sci. 1980;207:79–109.
Mitchison G. The polar transport of auxin and vein patterns in plants. Philos Trans R Soc Lond B Biol Sci. 1981;295:461–471.
Moser P (2021) The propagation of auxin waves and wave trains. B.Sc. thesis, Leiden University. https://hdl.handle.net/1887/3197145
Pankov A. Travelling waves and periodic oscillations in Fermi–Pasta–Ulam lattices. Singapore: Imperial College Press; 2005.
Paque S, Weijers D. Q&A: Auxin: the plant molecule that influences almost anything. BMC Biol. 2016;14:67. doi: 10.1186/s12915-016-0291-0.
Razavi MS, Shirani E, Kassab GS. Scaling laws of flow rate, vessel blood volume, lengths, and transit times with number of capillaries. Front Physiol. 2018;9:581. doi: 10.3389/fphys.2018.00581.
Reinhardt D, Pesce E-R, Stieger P, Mandel T, Baltensperger K, Bennett M, Traas J, Friml J, Kuhlemeier C. Regulation of phyllotaxis by polar auxin transport. Nature. 2003;426:255–260. doi: 10.1038/nature02081.
Rolland-Lagan A-G. Vein patterning in growing leaves: axes and polarities. Curr Opin Genet Dev. 2008;18:348–353. doi: 10.1016/j.gde.2008.05.002.
Rolland-Lagan A-G, Prusinkiewicz P. Reviewing models of auxin canalization in the context of leaf vein pattern formation in Arabidopsis. Plant J Cell Mol Biol. 2005;44:854–865. doi: 10.1111/j.1365-313X.2005.02581.x.
Sachs T. The induction of transport channels by auxin. Planta. 1975;127:201–206. doi: 10.1007/BF00380716.
Sandstede B. Stability of travelling waves. In: Fiedler B, editor. Handbook of dynamical systems. Amsterdam: Elsevier; 2002. pp. 983–1055.
Scarpella E, Marcos D, Friml J, Berleth T. Control of leaf vascular patterning by polar auxin transport. Genes Dev. 2006;20:1015–1027. doi: 10.1101/gad.1402406.
Schmidt-Nielsen K. Scaling: why is animal size so important? Cambridge: Cambridge University Press; 1984.
Shi B, Vernoux T. Patterning at the shoot apical meristem and phyllotaxis. Curr Top Dev Biol. 2018;131:81–107. doi: 10.1016/bs.ctdb.2018.10.003.
Shih Y-L, Huang L-T, Tu Y-M, Lee B-F, Bau Y-C, Hong CY, lin Lee H, Shih Y-P, Hsu M-F, Lu Z-X, Chen J-S, Chao L. Active transport of membrane components by self-organization of the Min proteins. Biophys J. 2019;116:1469–1482. doi: 10.1016/j.bpj.2019.03.011.
Smith RS, Guyomarc’h S, Mandel T, Reinhardt D, Kuhlemeier C, Prusinkiewicz P. A plausible model of phyllotaxis. Proc Natl Acad Sci USA. 2006;103:1301–1306. doi: 10.1073/pnas.0510457103.
Sonner S, Efendiev MA, Eberl HJ. On the well-posedness of a mathematical model of quorum-sensing in patchy biofilm communities. Math Methods Appl Sci. 2011;34:1667–1684.
Stefanov A, Wright JD. Small amplitude traveling waves in the full-dispersion Whitham equation. J Dyn Differ Equ. 2020;32:85–99.
van Berkel K, de Boer RJ, Scheres B, ten Tusscher K. Polar auxin transport: models and mechanisms. Development. 2013;140:2253–2268. doi: 10.1242/dev.079111.
Verna C, Ravichandran SJ, Sawchuk MG, Linh NM, Scarpella E. Coordination of tissue cell polarity by auxin transport and signaling. Elife. 2019;8:e51061. doi: 10.7554/eLife.51061.
Walke ML, Farcot E, Traas J, Godin C. The flux-based pin allocation mechanism can generate either canalyzed or diffuse distribution patterns depending on geometry and boundary conditions. PLoS ONE. 2013;8:e54802. doi: 10.1371/journal.pone.0054802.
West GB, Brown JH. Life’s universal scaling laws. Phys Today. 2004;57:36–43.

Associated Data

Supplementary Materials

Supplementary file 1 (mp4, 4441 KB)

Data Availability Statement

The datasets generated during the current study are available from the corresponding author on reasonable request.
