Skip to main content
Springer logoLink to Springer
. 2021 Feb 14;382(1):485–545. doi: 10.1007/s00220-021-03977-4

Phase Transitions for Nonlinear Nonlocal Aggregation-Diffusion Equations

José A Carrillo 1,, Rishabh S Gvalani 2
PMCID: PMC7921099  PMID: 33746234

Abstract

We are interested in studying the stationary solutions and phase transitions of aggregation equations with degenerate diffusion of porous medium-type, with exponent 1<m<. We first prove the existence of possibly infinitely many bifurcations from the spatially homogeneous steady state. We then focus our attention on the associated free energy, proving existence of minimisers and even uniqueness for sufficiently weak interactions. In the absence of uniqueness, we show that the system exhibits phase transitions: we classify values of m and interaction potentials W for which these phase transitions are continuous or discontinuous. Finally, we comment on the limit m and the influence that the presence of a phase transition has on this limit.

Introduction

In this work, we deal with the properties of the set of stationary states and long-time asymptotics for a general class of nonlinear aggregation-diffusion equations of the form

tρ=β-1Δρm+·(ρWρ)(x,t)Ω×(0,T]ρ(·,0)=ρ0L2(Ω)Lm(Ω)P(Ω)xΩ, 1.1

where 1<m< is the nonlinear diffusivity exponent of porous medium type [V07], β>0 measures the relative strength between repulsion (by nonlinear diffusion) and attraction-repulsion (by the nonlocal aggregation terms), and WC2(Ω) is the attractive-repulsive interaction potential. Here Ω denotes the d-dimensional torus Td having side length L>0, with P(Ω) being the set of Borel probability measures on Ω, and Lm(Ω) the set of m-power integrable functions on Ω. Notice that for m=1 we recover the linear diffusion case which is related to certain nonlocal Fokker–Planck equations, also referred to as McKean–Vlasov equations in the probability community. These equations also share the feature of being gradient flows of free energy functionals of the form

Fβm(ρ):=β-1m-1Ωρm(x)dx-β-1m-1+12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy,m>1β-1Ω(ρlogρ)(x)dx+12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy,m=1 1.2

for ρLm(Ω)P(Ω), as discussed extensively in the literature [JKO98, Ott01, Vil03, CMV03, AGS08]. We refer to [CCY19] for a recent survey of this active field of research. Note that although we have included the free energy for m=1 in (1.2), we will mostly be dealing with case m>1 in this article. We will only discuss the case m=1 as a limiting case of the energies Fβm as m1. The case m=1 is treated in more detail in [CGPS20].

Aggregation-diffusion equations such as (1.1) naturally appear in mathematical biology [BCM07, VS15, CMS+19, BDZ17, BCD+18] and mathematical physical contexts [Oel90, Phi07, FP08, BV13] as the typical mean-field limits of interacting particle systems of the form

dXti=-1NijNWN(Xti-Xtj)dt+2β2-1dBti,

where WN=1β1φN+W and φN(x)=Nξφ(Nξ/dx),forallxRd. Here, φ is a the typical localized repulsive potential, for instance a Gaussian, and 0<ξ<1. Notice that due to the choice of ξ, the shape of the potential gets squeezed to a Dirac Delta at 0 slower than the typical relative particle distance N-1/d. Also, β2-10 is the strength of the independent Brownian motions driving each particle. We refer to [Oel90, Phi07, BV13] for the case of quadratic diffusion m=2 with β1=β, ν=0, and to [FP08] for related particle approximations for different exponents m. The McKean–Vlasov equation m=1 is obtained for the particular case β1=+ and β2=β, being the inverse temperature of the system for the linear case, and its derivation is classical for regular interaction potentials W, see for instance [Szn91].

Analysing the set of stationary states of the aggregation-diffusion equation (1.1) and their properties depending on β, the relative strength of repulsion by local nonlinear diffusion and attraction-repulsion by nonlocal interactions, is a very challenging problem. As with the linear case, the flat state

ρ:=L-d=|Ω|-1, 1.3

is always a stationary solution of the system. The problem lies in constructing nontrivial stationary solutions and minimisers. In the linear diffusion case m=1, we refer to [CP10, CGPS20] where quite a complete picture of the appearance of bifurcations and of continuous and discontinuous phase transitions is present, under suitable assumptions on the interaction potential W. Bifurcations of stationary solutions depending on a parameter are usually referred in the physics literature as phase transitions [Daw83]. In this work we make a distinction between the two: referring to the existence of nontrivial stationary solutions as bifurcations and the existence of nontrivial minimisers of Fβm as phase transtions. Particular instances of phase transitions related to aggregation-diffusion equations with linear diffusion have been recently studied for the case of the Vicsek–Fokker–Planck equation on the sphere [DFL15, FL12] and the approximated homogeneous Cucker–Smale approximations in the whole space [Tug14, BCnCD16, ASBCD19]. We also refer to [Sch85] where the problem was studied on a bounded domain for the Newtonian interaction, and to [Tam84] where the problem was studied on the whole space with a confining potential.

However, there are no general results in the literature for the nonlinear diffusion case (1.1), m>1, except for the particular case of m=2, d=1, with W given by the fundamental solution of the Laplacian with no flux boundary conditions (the Newtonian interaction) recently studied in [CCW+20]. Despite the simplicity of the setting in [CCW+20], this example revealed how complicated phase transitions for nonlinear diffusion cases could be. The authors showed that infinitely many discontinuous phase transitions occur for that particular problem. Let us mention that the closer result in the periodic setting is [CKY13], where the authors showed that no phase transitions occur for small values of β, when the flat state is asymptotically stable, for m(1,2].

Our main goal is thus to develop a theory for the stationary solutions and phase transitions of (1.1) for general interactions WC2(Ω) and nonlinear diffusion in the periodic setting, something that has not been previously studied in the literature. This paper can be thought of as an extension of the results in [CGPS20] to the setting of nonlinear diffusion. Considering this, we need to define appropriately the notion of phase transition for the case m(1,), as done in [CP10] for the linear case m=1.

Note that, unlike in the linear setting, the L1(Ω) topology is not the natural topology to define phase transitions. It seems that for m>1 the correct topology to work in is L(Ω) (cf. Definition 5.10 and Remark 5.17 below). For our results we will often require compactness of minimisers in this topology. One possible way of obtaining this compactness is via control of the Hölder norms of the stationary solutions of (1.1). In Sect. 3 we briefly comment on the existence of solutions to (1.1) before proceeding to the proof of Hölder regularity. Since this is a key element of the subsequent results and the proof of Hölder regularity for such equations is not in the literature we include the proof in full detail in Sect. 3. It relies on the so-called method of intrinsic scaling introduced by DiBenedetto for the porous medium equation (cf. [DiB79]), which is a version of the De Giorgi–Nash–Moser iteration adapted to the setting of degenerate parabolic equations. We make modifications to the method to deal with the presence of the nonlocal drift term ·(ρWρ). We remark here that the proof of this result is completely independent of the rest of the paper. In a first reading, readers more interested in the properties of stationary solutions and phase transitions might choose to skip the proof and continue to Sect. 4. As a consequence of the proof of Hölder regularity, we also obtain uniform-in-time equicontinuity of the solutions away from the initial datum in Corollary 3.4.

After the proof of the Hölder regularity we proceed to Sect. 4, where we discuss the local bifurcations of stationary solutions from the flat state ρ. In Theorem 4.4, we provide conditions on the interaction potential W and on the parameter β=β, such that (ρ,β) is a bifurcation point using the Crandall–Rabinowitz theorem (cf. Theorem B.1). In fact for certain choices of W one can show that there exist infinitely many such bifurcation points. We then move on to Sect. 5, where we prove the existence and regularity of minimisers Fβm. We also show that, for β small enough, the flat state is the unique minimiser of the energy for m(1,], thus extending the result of [CKY13]. In Theorem 5.8, we use the uniform equicontinuity in time obtained in Corollary 3.4 to prove that solutions of (1.1) converge to ρ in L(Ω) whenever it is the unique stationary solution. We show that, as in the linear case, the notion of H-stability (cf. Definition 2.1), provides a sharp criterion for the existence or non-existence of phase transitions. We then proceed, in Lemmas 5.15 and 5.16 Proposition 5.18, to provide sufficient conditions for the existence of continuous or discontinuous phase transitions, where the proofs rely critically on the Hölder regularity obtained in Sect. 3. We also provide general conditions on W for the existence of discontinuous phase transitions. We conclude the section by showing that m[2,3] all non-H-stable potentials W are associated with discontinuous phase transitions of Fβm, while for m=4 we can construct a large class of W that lead to continuous phase transitions of Fβm. We summarise our results below:

  1. The proof of Hölder regularity of the weak solutions of (1.1) can be found in Theorem 3.3 and the preceding lemmas of Sect. 3.

  2. The result on the existence of local bifurcations of the stationary solutions is contained in Theorem 4.4.

  3. The results on phase transitions are spread out throughout Sect. 5. The result on the long-time behaviour of solutions before or in the absence of a phase transition can be found in Theorem 5.8. The main result on the existence of discontinuous transition points is Theorem 5.19 while the explicit conditions for a continuous transition point can be found in Theorem 5.24.

  4. In Sect. 6, we treat the mesa limit m. The Γ-convergence of the sequence of energies Fβm to some limiting free energy F as m can be found in Theorem 6.1. We then provide a characterisation of the minimisers of the limiting variational problem in terms of the size of the domain and the potential W in Theorem 6.2.

In Sect. 7, we display the results of some numerical experiments which we hope will shed further light on the theoretical results, while also providing us with some conjectures about the behaviour of the system in settings not covered by the theory.

Preliminaries and Notation

As mentioned earlier, we denote by P(Ω) the space of all Borel probability measures on Ω with ρ the generic element which we will often associate with its density ρ(x)L1(Ω), if it exists. We use the standard notation of Lp(Ω) and Hs(Ω) for the Lebesgue and periodic L2-Sobolev spaces, respectively. We denote by the Ck(Ω),C(Ω) the space of k-times (kN) continuously differentiable and smooth functions, respectively.

Given any function in fL2(Ω) we define its Fourier transform as

f^(k)=f,ekL2(Ω),kZd

where

ek(x)=Nki=1deki(xi),whereeki(xi)=cos2πkiLxiki>0,1ki=0,sin2πkiLxiki<0,

and Nk is defined as

Nk:=1Ld/2i=1d2-δki,012=:Θ(k)Ld/2. 2.1

Using this we have the following representation of the convolution of two functions W,fL2(Ω) where W is even along every coordinate

(Wf)(y)=kNdW^(k)1NkσSymk(Λ)f^(σ(k))eσ(k)(y).

where Symk(Λ)=Sym(Λ)/Hk. Sym(λ) represents the symmetric group of the product of two-point spaces, Λ={1,-1}d, which acts on Zd by pointwise multiplication, i.e. (σ(k))i=σiki,kZd,σSym(Λ). Hk is a normal subgroup of Sym(Λ) defined as follows

Hk:={σSym(Λ):σ(k)=k}.

We need to quotient out Hk as there might be some repetition of terms in the sum σSym(Λ) if kNd is such that ki=0 for some i{1,,d}. Another expression that we will use extensively in the sequel is the Fourier expansion of the following bilinear form

Ω×ΩW(x-y)f(x)f(y)dxdy=kNdW^(k)1NkσSymk(Λ)|f^(σ(k))|2. 2.2

The following notion will play an important role in the subsequent analysis.

Definition 2.1

A potential WL2(Ω) is said to be H-stable denoted by WHs if

W^(k)0,kZd,k0.

If this does not hold, we denote this by WHsc. The above condition is equivalent to the following inequality holding true for all ηL2(Ω) :

Ω×ΩW(x-y)η(x)η(y)dxdy0.

Furthermore, if η,W0, we have that

Ω×ΩW(x-y)η(x)η(y)dxdy>0.

Existence and Regularity of Solutions

We are interested in solutions of the following nonlinear-nonlocal PDE

tρ=β-1Δρm+·(ρWρ)(x,t)Ω×(0,T]ρ(·,0)=ρ0L2(Ω)Lm(Ω)P(Ω)xΩ, 3.1

where 1<m<, β>0, and WC2(Ω) is even along every co-ordinate and has mean zero. It is not immediately clear what the correct notion of solution for the above PDE is, as it need not possess classical solutions. We introduce the appropriate notion of solution in the following definition.

Definition 3.1

A weak solution of (3.1) is a bounded, measurable function

ρC([0,T];L2(Ω))

with

ρmL2([0,T];H1(Ω)),

such that

Ωρ(x,t)ϕ(x,t)dx0T+0TΩ(-ρ(x,t)ϕt(x,t)+β-1mρm-1(x,t)ρ(x,t)·ϕ(x,t)+ρ(Wρ(x,t))·ϕ(x,t))dxdt=0, 3.2

for all ϕH1([0,T];L2(Ω))L2([0,T];H01(Ω)) and ρ(·,0)=ρ0.

Theorem 3.2

Given ρ0L2(Ω)Lm(Ω)P(Ω), there exists a unique weak solution of (3.1). Furthermore ρ(·,t)P(Ω) for all t0.

The proof of this result is classical and we will not include it. It relies on regularisation techniques which remove the degeneracy in the problem. The meat of the matter is proving estimates uniform in the regularisation parameter. We refer to [BCL09, BS10] for proofs of this result with WC2(Ω).

We turn our attention to the regularity of solutions of (3.1). The proof is based on the method of intrinsic scaling introduced by DiBenedetto for the porous medium equation [DiB79, Urb08]. It is also similar in spirit to the proof in [KZ18] where regularity was proved for a degenerate diffusion equation posed on Rd with a potentially singular drift term. We also direct the readers to [HZ19] where Hölder regularity was proven for drift-diffusion equations with sharp conditions on the drift term using a different strategy of proof. Since we will mainly be concerned with stationary solutions we assume for the time being that there exists some universal constant M>0 such that ρL(ΩT)M, where ΩT is the parabolic domain ΩT:=Ω×[0,T] and Ω:=Ω×[0,). We first state the result regarding Hölder regularity.

Theorem 3.3

Let ρ be a weak solution of (3.1). with initial datum ρ0L(Ω)P(Ω), such that ρL(ΩT)M<. Then ρ is Hölder continuous with exponent a(0,1) dependent on the data, m, d, W, and β. Moreover, the Hölder exponent a depends continuously on β for β>0.

We also have the following consequence of the above result:

Corollary 3.4

Let ρ be a weak solution of (3.1) with initial datum ρ0L(Ω)P(Ω), such that ρL(Ω)M<. Then, for some C>0, it holds that

|ρ(y,t1)-ρ(x,t2)|Ch(dTd(x,y)+|t1-t2|1/2)a,

for all x,yTd and 0<C<t1<t2<. Note that the constants Ch and a are independent of xy and t1,t2.

We remind the reader that the above results are used to obtain the desired regularity and compactness of minimisers in Lemma 5.4 and the equicontinuity in time of solutions for the long-time behaviour result in Theorem 5.8, although they are of independent interest by themselves. The proof of Theorem 3.3 and Corollary 3.4 can be found in Sect. 8.

Characterisation of Stationary Solutions and Bifurcations

Now that we have characterised the notion of solution for (3.1) we study the associated stationary problem which is given by

β-1Δρm+·(ρWρ)=0,xΩ 4.1

with the notion of solution identical to the one defined in Theorem 3.1. One can immediately see that ρ (cf. (1.3)) is a solution to (4.1) for all β>0. As mentioned earlier, (3.1) and (4.1) are intimately associated to the free energy functional Fβm:P(Ω)(-,+] which is defined as

Fβm(ρ):=β-1m-1Ωρm(x)dx-β-1m-1+12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy,m>1β-1Ω(ρlogρ)(x)dx+12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy,m=1,

whenever the above quantities are finite and as + otherwise. We will often use the shorthand notation Sβm(ρ):=β-1m-1Ωρm(x)dx-β-1m-1 and Sβ:=β-1Ω(ρlogρ)(x)dx for the entropies and E(ρ):=12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy for the interaction energy. We will also drop the superscript m and just use Fβ(ρ) whenever m=1.

Another object that will play an important role in the analysis below is the following self-consistency equation

β-1mm-1ρm-1+Wρ=C,

for some constant C>0. We discuss how the above equation, solutions of (4.1), and Fβm(ρ) are related to each other for the case m>1 in the following proposition (the case m=1 is discussed in [CGPS20] and the proofs are essentially identical).

Proposition 4.1

Let ρP(Ω)Lm(Ω) and fix β>0,m>1. Then the following statements are equivalent

  1. ρ is a weak solution of  (4.1)

  2. ρ is a critical point of Fβm, i.e. the metric slope |Fβm|(ρ) is 0.

  3. For every connected component A of its support ρ satisfies the self-consistency equation, i.e.
    β-1mm-1ρm-1+Wρ=C(A,ρ) 4.2
    with C(A,ρ) given by
    C(A,ρ)=β-1m|A|(m-1)ρLm-1(A)m-1+1|A|AWρ(x)dx.

Remark 4.2

We have used the notation

ρLm-1(A)=(Aρm-1(x)dx)1m-1,

for 1<m<, even though this is not a norm for 1<m<2.

Remark 4.3

Note that if a stationary solution ρ is fully supported then the constant C(A,ρ)=C(Ω,ρ) reduces to

C(Ω,ρ)=β-1m|Ω|(m-1)ρLm-1(A)m-1,

where we have used the fact that W has mean zero. We can now formally pass to the limit m1 to obtain

β-1logρ+Wρ=β-1|Ω|Ωlogρdx.

The solutions of the above equation are studied in detail in [CGPS20].

Now that we have various equivalent characterisations of stationary solutions of (3.1), we proceed to state and prove the main result of this section regarding the existence of bifurcations from the uniform state ρ (cf. (1.3)). Before doing this however we need to introduce some relevant notions. We denote by H0n(Ω) the homogeneous Hn(Ω) space and by H0,sn(Ω) the closed subspace of H0n(Ω) consisting of functions which are even along every coordinate (pointwise a.e.). Note that the {ek}kNd,k0 form an orthogonal basis for H0,sn(Ω). We then introduce the following map F:H0,sn(Ω)×R+H0,sn(Ω) for n>d/2 which is given by

F(η,β):=β-1mm-1(ρ+η)m-1+Wη-β-1m|Ω|(m-1)ρ+ηLm-1(Ω)m-1. 4.3

Note that if F(η,β)=0 then the pair (ρ+η,β) satisfies (4.2) on all of Ω. If one can show that (ρ+η)(x)0,xΩ then we have found a bonafide stationary solution of (3.1) by the equivalency established in Proposition 4.1. Thus, we would like to study the bifurcations of the map F from its trivial branch (0,β) . To this order we compute its Fréchet derivatives around 0 as follows:

DηF(0,β)(e1)=β-1mρm-2e1+We1Dηβ2F(0,β)(e1)=-β-2mρm-2e1Dηη2F(0,β)(e1,e2)=β-1m(m-2)ρm-3e1e2-β-1m(m-2)|Ω|ρm-3Ωe1e2dxDηηη3F(0,β)(e1,e2,e3)=β-1m(m-2)(m-3)ρm-4e1e2e3-β-1m(m-2)(m-3)|Ω|ρm-4Ωe1e2e3dx,

for some e1,e2,e3H0,sn(Ω). We then have the following result:

Theorem 4.4

(Existence of bifurcations). Consider the map F:H0,sn(Ω)×R+H0,sn(Ω) for n>d/2 as defined in (4.3) with its trivial branch (0,β). Assume there exists kNd,k0 such that the following two conditions are satisfied

  1. W^(k)<0

  2. card{kNd,k0:W^(k)Θ(k)=W^(k)Θ(k)}=1 .

Then, (0,β) is a bifurcation point of (0,β) with

β=-mρm-3/2Θ(k)W^(k),

i.e. there exists a neigbourhood N of (0,β) and a curve (η(s),β(s))N,s(-δ,δ),δ>0 such that F(η(s),s)=0. The branch η(s) has the form

η(s)=sek+r(sek,β(s)),

where rH0,sn(Ω)=o(s) as s0. Additionally, we have that β(0)=0 and

β(0)=β(m-2)(m-3)3ρ2Ωek4dx.

Proof

The proof of this theorem relies on the Crandall–Rabinowitz theorem (cf. Theorem B.1). Note that FC2(H0,sn(Ω)×R+;H0,sn(Ω)). Thus, we need to show that: (a) DηF(0,β):H0,sn(Ω)H0,sn(Ω) is Fredholm with index zero and has a one-dimensional kernel and (b) for any eker(DηF(0,β)),e0 it holds that Dηβ2F(0,β)(e)Im(DηF(0,β)).

For (a) we first note that DηF(0,β) is a compact perturbation of the identity as the operator We is compact on H0,sn. It follows then that it is a Fredholm operator. Note that the functions {ek}kNd,k0 diagonalise the operator DηF(0,β). Indeed, we have

DηF(0,β)(ek)=(β-1mρm-2+1NkW^(k))ek=(β-1mρm-2+ρ-1/2W^(k)Θ(k))ek.

Note that if the conditions (1) and (2) in the statement of the theorem are satisfied it follows, using the expression for β, that DηF(0,β)(ek)=0 if and only if k=k. Thus, we have that ker(DηF(0,β))=span(ek). This completes the verification of the condition (1) in Theorem B.1.

For condition (2) in Theorem B.1, we note again by the diagonalisation of DηF(0,β) that Im(DηF(0,β))={span(ek)}. Thus, we have that

Dηβ2F(0,β)(ek)=-β-2mρm-2ekIm(DηF(0,β)).

We can now compute the derivatives of the branch. Using the identity [Kie12, I.6.3], it follows that β(0)=0 if Dηη2F(0,β)(ek,ek)Im(DηF(0,β)). Thus, it is sufficient to check that

Dηη2F(0,β)(ek,ek),ek=β-1m(m-2)ρm-3ek2,ek=0,

where the last inequality follows by using the expression for ek2 from Proposition 5.23 and orthogonality of the basis {ek}kNd. Here ·,· denotes the dual pairing in H0,sn. Thus, we have that β(0)=0. Finally we can compute β(0) by using [Kie12, I.6.11] to obtain

β(0)=-Dηηη3F(0,β)(ek,ek,ek),ek3Dηβ2F(0,β)(ek),ek=β-1m(m-2)(m-3)ρm-4Ωek4dx3β-2mρm-2=β(m-2)(m-3)3ρ2Ωek4dx.

This completes the proof of the theorem.

Remark 4.5

Since H0,sn(Ω) is continuously embedded in C0(Ω) it follows that for the branch of solutions ρ+η(s) found in Theorem 4.4 are in fact strictly positive for s sufficiently small and are thus stationary solutions by the result of Proposition 4.1. Any interaction potential W(x) such that infinitely many k satisfy the conditions of Theorem 4.4 will have infinitely many bifurcation points (0,βk) from the trivial branch. A typical example would a be a potential for which the map kW^(k) is strictly negative and injective.

Remark 4.6

Note that β(0)>0 for all m(1,2)(3,). This means that the branch turns to the right, i.e. it is supercritical. On the other hand if m(2,3), then β(0)<0. This means that the branch turns to the left, i.e. it is subcritical. If m{2,3} we have that β(0)=0. The relation of this phenomenon to the minimisers of the free energy will be discussed in Proposition 5.22.

Minimisers of the Free Energy and Phase Transitions

The nontrivial stationary solutions found as a result of the bifurcation analysis in the previous section need not correspond to minimisers of the free energy, Fβm(ρ). Indeed, we do not know yet if minimisers even exist. We start first by proving the existence of minimisers of Fβm. We then show that for β sufficiently small Fβm has a unique minimiser, namely ρ (cf. (1.3)).

The natural question to ask then is if this scenario changes for larger values of β. We provide a rigorous definition by which this change can be characterised via the notion of a transition point and define two possible kinds of transition points, continuous and discontinuous. We then provide necessary and sufficient conditions on W for the existence of a transition point and sufficient conditions for the existence of continuous and discontinuous transition points.

We start with a technical lemma that provides us with some useful a priori bounds on the minimisers of Fβm.

Lemma 5.1

(L(Ω)-bounds). Assume β>0,m>1. Then there exists some Bβ,m>0, such that if ρP(Ω) with ρL(Ω)>Bβ,m, then there exists ρ¯P(Ω) with ρ¯L(Ω)Bβ,m with

Fβm(ρ¯)<Fβm(ρ).

Proof

We start by noting that the following bounds hold

Sβm(ρ)β-1m-1(1|Ω|)m-1-β-1m-1 5.1
E(ρ)-12W-L(Ω). 5.2

We divide our analysis into two cases. For B>0 and ρP(Ω) let

BB:={xΩ:ρB},

and

εB=BBρdx.

Case 1: (ρ,B) s.t. εB12

We then have the following bounds on the entropy.

Sβm(ρ)=β-1m-1(BBρmdx+BBcρmdx)-β-1m-1β-1Bm-12(m-1)-β-1m-1.

It follows then that we have the following bound on the free energy.

Fβm(ρ)β-1Bm-12(m-1)-12W-L(Ω)-β-1m-1.

If we define a constant B1 as follows

B1(m,β):=(2|Ω|m-1+β(m-1)W-L(Ω))1/(m-1),

such that for B>B1, 1/|Ω| has a lower value of the free energy than ρ.

Case 2: (ρ,B) s.t. εB<12

We write ρ=ρB+ρr, where ρB:=ρ·χBB and ρr:=ρ-ρB. We then have the following bound on the entropy.

Sβm(ρ)Sβm(ρr)+β-1Bm-1m-1εBSβm(ρr).

We can assume without loss of generality that Fβm(ρ)<Fβm(ρ), otherwise the proof is complete. It follows then that

E(ρ)<E(ρ),Sβm(ρr)+β-1m-1Sβm(ρ)+β-1m-112W-L(Ω)+β-1(m-1)|Ω|m-1:=s(m,β).

By expanding E(ρ), the following estimate can be obtained

E(ρr)<E(ρ)+12W-L(Ω):=e,

where we have used the fact that εB<1/2. Define ρr¯:=(1-εB)-1ρrP(Ω). We have

Sβm(ρ)-Sβm(ρr¯)Sβm(ρr)+β-1Bm-1m-1εB-β-1m-1(1-εB)-mΩρrmdx+β-1m-1εB[β-1Bm-1m-1-((1-εB)-m-1εB)s(m,β)].

One can control the second term in the brackets as follows

((1-εB)-m-1εB)s(m,β)max(m+m(m+1)(1-δ)-m-2δ2,2m-1δ)s(m,β),

for any δ<1. Setting δ=12, we obtain

((1-εB)-m-1εB)s(m,β)m(1+(m+1)2m)s(m,β).

Similarly, for the interaction energy we can compute the difference as follows

E(ρ)-E(ρr¯)=E(ρ)-E(ρr)+E(ρr)-E(ρr¯)-12W-L(Ω)εB+E(ρr)(εB2-2εB(1-εB)2)εB[(εB-2(1-εB)2)E(ρr)-12W-L(Ω)].

Using the fact that εB<1/2 we can obtain

E(ρ)-E(ρr¯)εB[-8e-12W-L(Ω)].

Now, we can define a second constant as follows

B2(β,m):=[(m-1)β(m(1+2m(m+1))s(m,β)+8e+12W-L(Ω))]1/(m-1),

such that for B>B2, ρr¯ has a lower value of the free energy than ρ. We now set our constant as follows

Bβ,m:=max(B1(β,m),2B2(β,m)),

and set ρ¯ to either be (1/|Ω|) or ρr¯. The constant 2 in front of B2(β,m) follows from the fact that ρr¯ has been normalised.

The expression for the constant Bβ,m is explicit as a result of which we can even obtain some uniform control in m.

Corollary 5.2

Let (β,m)(0,C)×[1+ε,)=:A(0,)×(1,) for some C,ε>0. Then B:=supABβ,m<.

We now proceed to the existence result for minimisers of Fβm.

Theorem 5.3

(Existence of minimisers). Fix β>0 and m>1, then Fβm:P(Ω)(-,+] has a minimiser ρP(Ω)L(Ω). Additionally we have that

ρL(Ω)Bβ,m.

Proof

We note first that, from (5.1) and (5.2), Fβm is bounded below on P(Ω). Let {ρn}nN be a minimising sequence. Note that by Lemma 5.1 we can pick this sequence such that ρnL(Ω)Bβ,m. By the Banach–Alaoglu theorem we have a subsequence {ρnk}kN and measure ρL(Ω) such that

ρnkρin weak-*L(Ω).

Furthermore, we can find another subsequence (which we do not relabel), such that

ρnkρin weakL2(Ω).

Note that ρ is nonnegative a.e. and also has mass one. Thus, ρP(Ω)L(Ω). The proof would be complete if we can show lower semicontinuity of Fβm in weak L2(Ω). Note that for WC2(Ω), E(ρ) is continuous. On the other hand, Sβm(ρ) is convex and lower semicontinuous in the L2(Ω) topology. It follows from fairly classical results (cf. [Bre11, Theorem 3.7]) that Fβm is also weakly lower semicontinuous. This concludes the proof of existence of minimisers. The bound simply follows from the fact that norms are lower semicontinuous under weak- convergence.

Lemma 5.4

(Regularity and compactness of minimisers). Let ρβP(Ω) be a minimiser of Fβm(ρ). Then ρβ is Hölder continuous with exponent a(0,1) given by Theorem 3.3, where a depends continuously on β. Let {ρβ}βI be a family of such minimisers, where IR+ is some bounded interval. Then the family {ρβ}βI is relatively compact in C0(Ω).

Proof

The proof of the first statement follows simply by applying Proposition 4.1 and Theorem 3.3 with M=Bβ,m. For the second statement, let I¯ be the closure of I. Then applying (8.17) for some x,yTd, we have that

|ρβ(x)-ρβ(y)|ChdTd(x,y)a,

where a=a(β),Ch=Ch(β). Setting a=maxI¯a(β) and B to be as in Corollary 5.2, we have that

|ρβ(x)-ρβ(y)|ChdTd(x,y)a,

where Ch is some new constant depending on B, m, d, and W. Thus, the family {ρβ}βI is equicontinuous. It is clearly equibounded from Corollary 5.2. Applying the Arzelà–Ascoli theorem, the result follows.

Now that we have shown existence and regularity of minimisers we show that for β small or WHs minimisers of Fβm are unique and given by ρ. To show this we start with the following lemma which shows positivity of stationary solutions for β sufficiently small.

Lemma 5.5

There exists an δ>0 depending on m and W, such that for all β<δ it holds that if ρP(Ω)Lm(Ω) is a stationary solution of (3.1), then ρ(x)12|Ω| for all xΩ.

Proof

Note that if ρP(Ω)Lm(Ω) is stationary, then, by Proposition 4.1, it satisfies on each connected component A of its support

β-1mm-1ρm-1+Wρ=C(A,ρ)

with C(A,ρ) given by

C(A,ρ)=β-1m|A|(m-1)ρLm-1(A)m-1+1|A|AWρ(x)dx.

Thus, we have that ρL(Ω). Using a mollification argument and (4.2), one can then obtain the following bound

ρm-1L(Ω)βm-1mWρL(Ω)βm-1mWL(Ω).

By Theorem 3.3, it follows that ρ is a-Hölder continuous. Note further that we have that

maxxΩρ(x)|Ω|-1,

Thus, we can choose β to be small enough, dependent on m and W, and apply the bound to argue that

minxΩρm-121-m|Ω|1-m.

Thus, the result follows.

We can now use the positivity estimate of Lemma 5.5 to prove that for β sufficiently small stationary solutions of (3.1) (and thus minimisers of Fβm) are unique. This improves the result of [CKY13], in which uniqueness is proved only for 1<m2.

Lemma 5.6

For β1 and m(1,), ρ is the unique stationary solution of (3.1) and minimiser of the free energy, Fβm.

Proof

Assume ρP(Ω)Lm(Ω) is a stationary solution of (3.1). Then, we can apply the same argument as in the proof of Lemma 5.5 to obtain

ρm-1L(Ω)βm-1mWρL(Ω)βm-1mWL1(Ω)ρL(Ω).

It follows that

(m-1)ρm-2ρL(Ω)βm-1mWL1(Ω)ρL(Ω). 5.3

Let us now assume that β<δ, where δ is the constant from the statement of Lemma 5.5. Furthermore, if 1<m<2 the constant C(Ω,ρ) in Proposition 4.1 can be controlled as follows

C(Ω,ρ)β-1m|Ω|(m-1)Ωρm-1dxβ-1m|Ω|(m-1),

where in the last step we have applied Jensen’s inequality. Thus, we have

|ρ(x)|(βm-1mWL(Ω)+1|Ω|)1/(m-1)

for all xΩ. Thus, for 1<m<2, we can apply the above bound to (5.3) to obtain

ρL(Ω)βm(βm-1mWL(Ω)+1|Ω|)(2-m)/(m-1)WL1(Ω)ρL(Ω).

If β is sufficiently small, we have that ρL(Ω)=0. Thus, ρ=ρ for β sufficiently small. Similarly for 2m<, we can apply the bound from Lemma 5.5 to obtain

ρL(Ω)βm22-m|Ω|2-mWL1(Ω)ρL(Ω).

Applying a similar argument as before, we have that, for β1, ρ=ρ. Thus, for β1, ρ is the unique stationary solution of (3.1) and, by Proposition 4.1, the unique minimiser of Fβm.

We also have the following result on uniqueness of minimisers when WHs.

Theorem 5.7

Let WHs and m(1,). Then Fβm(ρ) has a unique minimiser ρ=ρ.

Proof

We first consider the case in which WHs. We write the linear interpolant as ρt=ρ0+tη where η=ρ1-ρ0 where ρ0,ρ1P(Ω) with Fβm(ρ0),Fβm(ρ1)<. Differentiating with respect to t twice we obtain that

d2dt2Fβm(ρt)=β-1Ωmρtm-2η2dx+Ω×ΩW(x-y)η(x)η(y)dxdy.

For WHs the above expression is strictly positive. Thus, Fβm(ρt) is a convex function, from which it follows that Fβm must have unique minimisers. We further argue that the minimiser must be ρ. Indeed, we have for any P(Ω)ρρ that

Fβm(ρ)=Sβm(ρ)+E(ρ)>Sβm(ρ)+E(ρ)Sβm(ρ)=Fβm(ρ),

where the first inequality follows from Jensen’s inequality and the second one from the fact that WHs and Definition 2.1.

We know now from Lemma 5.6, that for β1, ρ is the unique minimiser of Fβm and stationary solution of (3.1). We now present the following result on the long-time behaviour of (3.1) in this regime:

Theorem 5.8

(Long-time behaviour). Let ρ be a weak solution of (3.1) with initial datum ρ0L(Ω)P(Ω). Assume that β and W are such that ρ is the unique stationary solution of (3.1) (and, therefore, the unique minimiser of Fβm). Then, it holds that

limtρ(·,t)-ρL(Ω)=0.

Proof

We start by showing that if ρ0L(Ω)P(Ω), then ρL(Ω)M<. We choose as a test function in the weak formulation, ϕ=pρp-1, for some p>1. Note that we can justify this choice by mollifying ϕ and then passing to the limit. We then obtain from (3.2) the following expression

Ωρpdx0T+0TΩ(β-1mρm-1ρ(x,t)·ϕ(x,t)+ρ(x,t)(Wρ)(x,t)·ϕ(x,t))dxdt=0.

Plugging in the value of ϕ on the right hand side and integrating by parts, we obtain

ρ(·,T)Lp(Ω)p=ρ0Lp(Ω)p+0T(-4β-1pm(p-1)(m+p-1)2Ω|ρ(x,t)m+p-12|2dx)dt+0T((p-1)Ω(ΔWρ(x,t))ρ(x,t)pdx)dt

Applying the Lebesgue differentiation theorem, we obtain that for t a.e., it holds that

ddtρ(·,t)Lp(Ω)p=-4β-1pm(p-1)(m+p-1)2Ω|ρ(x,t)m+p-12|2dx+(p-1)Ω(ΔWρ(x,t))ρ(x,t)pdx-4β-1pm(p-1)(m+p-1)2Ω|ρ(x,t)m+p-12|2dx+(p-1)ΔWLΩρ(·,t)Lp(Ω)p. 5.4

Note that we can control the second term on the right hand side of the above expression as follows

ρ(·,t)Lp(Ω)pρ(·,t)L1(Ω)pθρ(·,t)L(m+p-1)dd-2(Ω)p(1-θ)=ρ(·,t)L(m+p-1)dd-2(Ω)p(1-θ),

where we have used the fact that 1<p<(m+p-1)dd-2 and the constant θ(0,1) is given by

θ=(m-1)d+2p((m+p-2)d+2).

We now apply the Sobolev inequality on the torus, to obtain

ρ(·,t)Lp(Ω)pρ(·,t)-1+1L(m+p-1)dd-2(Ω)p(1-θ)2p(1-θ)-1(ρ(·,t)-1L(m+p-1)dd-2(Ω)p(1-θ)+1)2p(1-θ)-1((Cdρ(·,t)m+p-12L2(Ω))2p(1-θ)m+p-1+1)=12((2m+p-12Cdρ(·,t)m+p-12L2(Ω))2p(1-θ)m+p-1+2p(1-θ)).

Note that the constant Cd in the above estimate depends only on dimension and is independent of p>1. We set q1:=(m+p-1)/(p(1-θ)) and q2:=q1/(q1-1). Note that from the definition of θ we have

q1=m+p-1p(1-θ)=(m+p-2)d+2d(p-1)>1.

Thus, we have that

q2=q1q1-1=(m+p-2)d+2(m-1)d+2.

We can thus apply Young’s inequality with q1,q2 to obtain

ρ(·,t)Lp(Ω)p12(Cp,m,βρ(·,t)m+p-12L2(Ω)2+2p(1-θ)q2Cd2q1-1q2Cp,m,βq2q1+2p(1-θ)) 5.5

where Cp,m,β>0 is given by

Cp,m,β:=4β-1pm(p-1)(m+p-1)2ΔWL(Ω)(p-1).

Multiplying through by ΔWL(Ω)(p-1), we can apply the estimate in (5.5) to (5.4) to obtain

ddtρ(·,t)Lp(Ω)p-(p-1)ΔWL(Ω)ρ(·,t)Lp(Ω)p+ΔWL(Ω)(p-1)(2p(1-θ)q2Cd2q1-1q2Cp,m,βq2q1+2p(1-θ)).

Applying Grönwall’s inequality, we obtain that

ρ(·,t)Lp(Ω)pe-(p-1)ΔWL(Ω)tρ0Lp(Ω)p+(2p(1-θ)q2Cd2q1-1q2Cp,m,βq2q1+2p(1-θ)).

It follows that

ρ(·,t)Lp(Ω)(e-(p-1)ΔWL(Ω)tρ0Lp(Ω)p+(2p(1-θ)q2Cd2q1-1q2Cp,m,βq2q1+2p(1-θ)))1/p31/pmax{ρ0L(Ω),2(1-θ)q2Cd2p-1q1-1q2Cp,m,β1/pq21/pq11/p,2(1-θ)}.

Note now that

2(1-θ)q2Cd2p-1q1-1q2Cp,m,β1/pq21/pq11/p1

as p. It follows then that we can find a constant M dependent on ρ0L(Ω), d, β, and m but independent of t and p such that

ρ(·,t)Lp(Ω)M,

for all t[0,). Passing to the limit as p, it follows that

ρL(Ω)M, 5.6

for all t[0,). We can now apply Theorem 3.3 to argue that the solution ρ is Hölder continuous with some exponent a(0,1). Furthermore, we can apply Corollary 3.4, to argue that

|ρ(y,t1)-ρ(x,t2)|Ch(dTd(x,y)+|t1-t2|1/2)a, 5.7

for all x,yTd and 0<C<t1<t2<. Consider now the solution semigroup St:ZEZE,t0 associated to the evolution in (3.1), where

ZE={ρP(Ω):Fβm(ρ)E},

for some ER. We make ZE into a complete metric space by equipping it with the d2(·,·) Wasserstein distance. The fact that it is complete follows from the fact that Fβm is lower semicontinuous with respect to convergence in d2(·,·). Note that the family of mappings {St}t0 forms a metric dynamical system in the sense of [CH98, Definition 9.1.1]. This follows from the fact (cf. [AGS08, Theorem 11.2.8]) the evolution defines a gradient flow ρC([0,);ZE0) in P(Ω) in the sense of [AGS08, Definition 11.1.1] where E0=Fβm(ρ0). We now define the ω-limit set associated to the initial datum ρ0L(Ω)P(Ω), as follows

ω(ρ0):={ρZE0:limnd2(Stn(ρ0),ρ)=0,tn}.

Since the metric space ZE0 is compact, it follows that the set t0St(ρ0) is relatively compact in ZE0. Applying [CH98, Theorem 9.1.8], we have that ω(ρ0) and

limtd2(ρ(·,t),ω(ρ0))=limtd2(St(ρ0),ω(ρ0))=0,

where ρ(·,t) is the unique solution of (3.1) with initial datum ρ0P(Ω)L(Ω). We now need to show that ω(ρ0) is contained in the set of stationary solutions of (3.1). Assume ρω(ρ0), then there exists a time-diverging sequence tn such that

limnd2(ρ(·,tn),ρ)=limnd2(Stn(ρ0),ρ)=0.

Since the solution ρ(·,t) is gradient flow of the free energy Fβm with respect to the d2(·,·) distance on P(Ω), it follows that the following energy-dissipation equality holds true for all t[0,) (cf. [AGS08, Theorem 11.1.3])

Fβm(ρ0)-Fβm(ρ(·,t))=0t|Fβm|2(ρ(·,s))ds, 5.8

where |Fβm|:P(Ω)(-,+] is the metric slope of Fβm and is given by

|Fβm|(ρ):=(Ω|β-1ρmρ+Wρ|2ρdx)1/2.

Bounding the energy from below and then passing to the limit as t in (5.8), we obtain

0|Fβm|2(ρ(·,s))ds-minρP(Ω)Fβm(ρ)+Fβm(ρ0)C. 5.9

We now consider the time-diverging sequence tn and the sequence of curves {ρn}nNC([0,1];ZE0) with ρn(·,t)=ρ(·,tn+t). For each nN, we have that

d2(ρn(·,t1),ρn(·,t2))L2ρn(·,t1)-ρn(·,t2)L1(Ω)1/2Ld2+12ρn(·,t1)-ρn(·,t2)L(Ω)1/2Ch1/2Ld2+12|t1-t2|a/4,

for all t1,t2[0,1], where in the last step we have used (5.7). We can thus apply the generalised Arzelá–Ascoli/Aubin–Lions compactness theorem (cf. [AGS08, Proposition 3.3.1]) to argue that there exists a curve μC([0,1];ZE0) such that ρn(·,t) converges to μ(·,t), in the sense of weak convergence of probability measures, for all t[0,1]. Furthermore, from the lower semicontinuity of |Fβm| (cf. [AGS08, Theorem 5.4.4]) and Fatou’s lemma, we have that

01|Fβm|2(μ(·,s))dslim infn01|Fβm|2(ρn(·,s))ds=lim infntntn+1|Fβm|2(ρ(·,s))ds=0,

where in the last step we have used (5.9). It follows that |Fβm|(μ(·,t))=0 for t a.e. Thus, since μ is continuous, we can find a sequence of times mN, tm0, such that |Fβm|(μ(·,tm))=0 and d2(μ(·,tm),μ(·,0))0 as m. Note further that μ(·,0)=limnρ(·,tn)=ρ. From the lower semicontinuity of |Fβm|(·) we have that

|Fβm|(ρ)=|Fβm|(μ(·,0))=0.

Applying Proposition 4.1, it follows that ρZE0P(Ω)Lm(Ω) is necessarily a stationary solution of (3.1). Since ρ is the unique stationary solution, it follows that

limtd2(ρ(·,t),ρ)=0. 5.10

However, from (5.6) and (5.7), we know that, for any time-diverging sequence tn, {ρ(·,tn)}nN has a convergent subsequence in L(Ω), whose limit must be ρ by (5.10). Since the limit is unique, it follows that

limtρ(·,t)-ρL(Ω)=0.

Remark 5.9

We remark that the technique used in the proof of Theorem 5.8 can be adapted to study the asymptotic properties of general gradient flows in the space of probability measures. These ideas have been expanded upon in [CGW20].

From Theorem 5.7, it is also immediately clear that WHsc is a necessary condition for the existence of a nontrivial minimiser at higher values of the parameter β. Indeed, Theorem 5.7 tells us that if WHs then minimisers of Fβm are unique and are given by ρ. Before we discuss this any further, we introduce a notion of transition point that allows us to capture a change in the set of minimisers.

Definition 5.10

(Transition point). A parameter value βc>0 is said to be a transition point of Fβm if the following conditions are satisfied.

  1. For β<βc, ρ is the unique minimiser of Fβm.

  2. At β=βc, ρ is a minimiser of Fβm.

  3. For β>βc, there exists P(Ω)ρβρ, such that ρβ is a minimiser of Fβm.

We further classify transition points into discontinuous and continuous transition points.

Definition 5.11

(Continuous and discontinuous transition points). A transition point βc of Fβm is said to be a continuous transition point if

  1. At β=βc, ρ is the unique minimiser of Fβm.

  2. For any family of minimisers {ρβ}β>βc it holds that
    lim supββc+ρβ-ρL(Ω)=0.

A transition point βc>0 of Fβm which is not continuous is said to be discontinuous.

It turns out that WHsc is in fact a sufficient condition for the existence of a transition point. This result is analogous to the result in case m=1 discussed in [GP70, CP10, CGPS20].

Proposition 5.12

Assume WHsc. Then there exists some parameter value 0<βcβm with βm defined as

βm:=-mρm-3/2minkNd,k0W^(k)Θ(k),

such that βc is a transition point of Fβm. Thus, WHsc is a necessary and sufficient condition for the existence of a transition point.

Proof

Consider the measure ρε=ρ+εekP(Ω) for 0<ε1 where kNd is defined as

k:=argminkNd,k0W^(k)Θ(k).

if it is defined uniquely. If not we pick any k that realises the minimum of the above expression. We now consider an expansion of the energy Fβm(ρε) around ρε which we will use repeatedly throughout the rest of this section. We Taylor expand around ρ to obtain

Fβm(ρε)=Fβm(ρ)+(β-1mρm-2+ρ-1/2W^(k)Θ(k))ε22ekL2(Ω)2+β-1m(m-2)ε36Ωfm-3ek3dx,

where the function f(x)(ρ,ρε(x)). For ε>0 small enough, the highest order term can be controlled as follows

Fβm(ρε)Fβm(ρ)+(β-1mρm-2+ρ-1/2W^(k)Θ(k))ε22ekL2(Ω)2+β-1m(m-2)Nk3ε36fL(Ω)m-3|Ω|=Fβm(ρ)+(β-1mρm-2+ρ-1/2W^(k)Θ(k))ε22ekL2(Ω)2+o(ε2).

For β>βm, the second order term in the above expression has a negative sign. Thus, for ε>0 sufficiently small we have that Fβm(ρε)<Fβm(ρ). Since, by Theorem 5.3, minimisers of Fβm exists for all β>0, it follows that for all β>βm there exist nontrivial minimisers of the free energy. Thus, there exists some βcβm which is a transition point of the free energy Fβm(ρ).

Remark 5.13

We note here that the βm defined in the statement of Proposition 5.12 corresponds exactly to the point of critical stability of the uniform state ρ, i.e. if the stationary problem is linearised about ρ, then βm corresponds to the value of the parameter at which the first eigenvalue of the linearised operator crosses the imaginary axis.

Before attempting to provide conditions for the existence of continuous and discontinuous transition points we define the function Fm:(0,)R

Fm(β):=minρP(Ω)Fβm.

Lemma 5.14

For all β>0, the function Fm is continuous. Assume further that there exists β>0 and P(Ω)ρβρ such that Fβm(ρβ)=Fm(β). Then for all β>β, Fβm(ρ)>Fm(β).

Proof

We note that for 0<ββc (where βc is possibly +) we have that Fm(β)=Fβm(ρ) which is clearly a continuous function of β. Let β2>β1>βc (if βc<, else we are done) and let ρβ1 be the minimiser of Fβ1m. Note however due to the structure of the free energy we have that

Fm(β2)Fβ2m(ρβ1)=Fβ1m(ρβ1)+1m-1(β2-1-β1-1)Ωρβ1m-ρβ1dx=Fm(β1)+1m-1(β2-1-β1-1)Ωρβ1m-ρβ1dx.

To obtain continuity of Fm, note that the steps of the above equation would still hold with β1 and β2 exchanged. Using that ρβ1 and ρβ2 are uniformly bounded by Theorem 5.3, one has the desired continuity.

Assume now that Fβm(ρβ)=Fβm(ρ) and let β>β. We then have that

Fm(β)Fβm(ρβ)=Fβm(ρβ)+1m-1(β-1-β-1)Ωρβm-ρβdxFβm(ρ)+1m-1(β-1-β-1)Ωρβm-ρβdx<Fβm(ρ)+1m-1(β-1-β-1)Ωρm-ρdx=Fβm(ρ).

We will now try and refine our descriptions of discontinuous and continuous transition points in analogy with the results in [CP10, CGPS20].

Lemma 5.15

If a transition point βc>0 is continuous, then βc=βm.

Proof

We know already from Proposition 5.12 that βcβm. Let us assume that βc<βm. We know from Definition 5.11 that ρ is the unique minimiser of Fβcm. Additionally for any sequence of minimisers {ρβ}β>βc we know that

lim supββc+ρβ-ρL(Ω)=0.

Consider such a sequence and set ηβ=ρβ-ρ. For β>βc, we expand the free energy about ρ as follows

Fβm(ρβ)=Fβm(ρ)+β-1mρm-2ηβL2(Ω)22+ρ-1/22kNdW^(k)Θ(k)σSymk(Λ)|ηβ^(σ(k))|2-β-1m(m-2)6Ωfm-3ηβ3dx.

where f(x)(ρ,ρβ(x)) and can be bounded by ρβL(Ω)Bβ,mB from the result of Theorem 5.3 and Corollary 5.2. Additionally we can control W^(k)Θ(k) to obtain the following bound

Fβm(ρβ)Fβm(ρ)+(β-1mρm-2+ρ-1/2minkNd,k0W^(k)Θ(k))ηβL2(Ω)22-β-1m(m-2)6Bm-3ηβL3(Ω)3.

Note that due to the fact that ηβL(Ω)0 as ββc+, we have that ηβL3(Ω)3 is o(ηβL2(Ω)2), i.e. ηβL3(Ω)3ηβL(Ω)ηβL2(Ω)2. This leaves us with

Fβm(ρβ)Fβm(ρ)+(β-1mρm-2+ρ-1/2minkNd,k0W^(k)Θ(k))ηβL2(Ω)22-o(ηβL2(Ω)2).

Since βc<βm, the term in the brackets is positive close to βc we obtain a contradiction as ρβ is a nontrivial minimiser of Fβm. Thus, we must have that βc=βm.

From Definition 5.11, we see that some βc>0 is a discontinuous transition point if it violates either (or both) of the conditions (1) and (2). In the following lemma, we will show that if (2) is violated then (1) is as well.

Lemma 5.16

Assume βc>0 is a discontinuous transition point of the energy Fβm and that for some family of minimisers {ρβ}β>βc it holds that

lim supββc+ρβ-ρL(Ω)0.

Then there exists P(Ω)ρβcρ such that:

  1. Fβcm=Fβcm(ρβc)=Fβcm(ρ).

  2. Sβcm(ρβc)>Sβcm(ρ) and E(ρβc)<E(ρ)=0.

Proof

Consider a sequence of points {βn}nN>βc and βnβc as n. We know that the set of minimisers {ρβn}nN is compact in C0(Ω)P(Ω) from Lemma 5.4. Thus, there exists a subsequence ρβn{ρβ}β>βc (which we do not relabel) and a limit ρβcP(Ω)C0(Ω) such that

limnρβn-ρβcC0(Ω)=0.

From the statement of the lemma we know that ρβcρ. All that remains is to show that ρβc is a minimiser of Fβcm. We first note that limnFβn(ρβn)=Fβc(ρβc). This follows from the fact that the interaction energy E is continuous on C0(Ω)P(Ω) for WC2(Ω) and the entropy Sβm is essentially an Lm-norm and is thus also controlled by the C0(Ω) topology. Finally we use the result of Lemma 5.14 to note that

Fβc(ρβc)=limnFβn(ρβn)=limnFm(βn)=Fm(βc),

which completes the proof of (1). The proof of (2) follows immediately from the fact that ρ is the unique minimiser of Sβm(ρ) on P(Ω) (which is a consequence of Jensen’s inequality).

Remark 5.17

The above lemma tells us that we have not lost much by defining discontinuous transition points with respect to the L(Ω) norm since the transition points obtained are discontinuous with respect to the Lp(Ω) norm as well for all p[1,]. Indeed if we consider the sequence constructed in the proof of Lemma 5.16{ρβn}nN it follows that

limnρβn-ρβcLp(Ω)|Ω|1/plimnρβn-ρβcC0(Ω)=0,

where ρβc is the limiting object btained in the proof of Lemma 5.16. Thus, lim supββc+ρβ-ρLp(Ω)0 for all p[1,].

In the following proposition we outline the strategy we will use to provide sufficient conditions for the existence of continuous and discontinuous transition points.

Proposition 5.18

Assume that WHsc so that there exists a transition point βc>0 of Fβm. Then:

  1. If ρ is the unique minimiser of Fβmm, then βc=βm is a continuous transition point.

  2. If ρ is not a minimiser of Fβmm, then βc<βm is a discontinuous transition point.

Proof

For the proof of Proposition 5.18(a) we note that βc already satisfies condition (1) of Definition 5.11. All we need to show is that it satisfies condition (2). Assume βc<βm, then by the very definition of a transition point we would have a contradiction since ρ is the unique minimiser of Fβm at β=βm. It follows then that βc=βm. Assume now that condition (2) of Definition 5.11 is violated, i.e. there exists a family of minimisers {ρβ}β>βm such that

limββm+ρβ-ρL(Ω)0.

By Lemma 5.16 it follows that there exists P(Ω)ρβmρ which minimises Fβmm. This is a contradiction.

For Proposition 5.18(b), we note that since ρ is not a minimiser at β=βm by Definition 5.10 and Proposition 5.12 it follows that βc<β. Thus, by Lemma 5.15, βc is a discontinuous transition point.

The next theorem provides conditions on the Fourier modes of W(x) for the existence of discontinuous transition points. It can be thought of as the analogue for the case of nonlinear diffusion.

Theorem 5.19

Assume WHsc and m2. Define, for some δ>0, the set Kδ as follows

Kδ:=kNd\{0}:W^(k)Θ(k)minkNd\{0}W^(k)Θ(k)+δ

We define δ to be the smallest value, if it exists, of δ for which the following condition is satisfied:

there existka,kb,kcKδ,such thatka=kb+kc. A1

We remark that two of the modes in the above expression can be repeated. For example, we could have ka=2,kb=1,kc=1. Then if δ is sufficiently small, Fβm exhibits a discontinuous transition point at some βc<β.

Proof

We know already from Proposition 5.12 that the system possesses a transition point βc. We are going to use Proposition 5.18(b) and construct a competitor ρP(Ω) which has a lower value of the free energy than ρ at β=βm. Define the function

γ(m):=1m<2-1m2

and let

ρε=ρ1+γ(m)εkKδekP(Ω),

for some ε>0, sufficiently small. We denote by |Kδ| the cardinality of Kδ, which is necessarily finite as WL2(Ω). Expanding about the free energy about ρ we obtain

Fβmm(ρε)Fβmm(ρ)+|Kδ|((βm)-1mρm-2+ρ-1/2minkNd\{0}W^(k)Θ(k)+ρ-1/2δ)ε22ekL2(Ω)2+(βm)-1γ(m)3m(m-2)ρm-3ε36Ω(kKδek)3dx+(βm)-1m(m-2)(m-3)ε424Ωfm-4(kKδek)4dx,

where the function f(x)(ρ,ρε(x)). We use the definition of βm and control the highest order term in the same manner as Proposition 5.12 to simplify the expansion as follows:

Fβmm(ρε)Fβmm(ρ)+|Kδ|(ρ-1/2δ)ε22+(βm)-1γ(m)3m(m-2)ρm-3ε36Ω(kKδek)3dx+o(ε3),

Setting ε=δ12 (if δ>0, otherwise we stop here), we obtain

Fβmm(ρε)Fβmm(ρ)+(βm)-1γ(m)3m(m-2)ρm-3δ3/23Ω(kKδek)3dx+|Kδ|ρ-1/2δ22+o(δ32).

One can now check that under condition (A1), it holds that

ΩkKδek3dx>a>0,

where the constant a is independent of δ. Indeed, the cube of the sum of n numbers ai, i=1,,n consists of only three types of terms, namely: ai3, ai2aj and aiajak. Setting the ai=ws(i), with s(i)Kδ, one can check that the first type of term will always integrate to zero. The sum of the other two will take nonzero and in fact positive values if and only if condition (A1) is satisfied. This follows from the fact that

-ππcos(x)cos(mx)cos(nx)dx=π2(δ+m,n+δm+n,+δn+,m).

Also the term γ(m)3m(m-2) is always negative. Thus, for δ sufficiently small, considering the fact that |Kδ|2 and is nonincreasing as δ decreases, ρε has smaller free energy and ρ is not a minimiser at β=βm.

Remark 5.20

The case m=2 is special, as transition points for any WHsc are necessarily discontinuous. This case will be treated in detail in Proposition 5.22.

The following lemma shows that discontinuous transitions are stable in m.

Lemma 5.21

Assume WHsc such that Fβm has a discontinuous transition point and βcm<βm. Then for m(m-ε,m+ε) (or m[1,1+ε) for m=1) for some ε>0 small enough, Fβm has a discontinuous transition point at some βcm<βm.

Proof

We start with the case m>1. Denote by ρC0(Ω)P(Ω) the nontrivial minimiser of Fβmm(ρ). We know that

Fβmm(ρ)-Fβmm(ρ)=δ>0.

It would be sufficient for the purposes of this proof to show that such a nontrivial minimiser exists for Fβmm for m close enough to m. Choosing ρ to be the competitor state, we have

Fβmm(ρ)-Fβmm(ρ)=Fβmm(ρ)-Fβmm(ρ)+(βm)-1m-11|Ω|m-1-(βm)-1m-1-(βm)-1m-11|Ω|m-1+(βm)-1m-1+((βm)-1m-1Ω(ρ)mdx-(βm)-1m-1-(βm)-1m-1Ωρmdx+(βm)-1m-1)=δ+(βm)-1m-11|Ω|m-1-(βm)-1m-1-(βm)-1m-11|Ω|m-1+(βm)-1m-1+((βm)-1m-1Ω(ρ)mdx-(βm)-1m-1-(βm)-1m-1Ωρmdx+(βm)-1m-1)

Since βmβm and (m-1)-1(am-1)(m-1)-1(am-1),a0 as mm, it follows, using the fact that ρC0(Ω), that we can choose m close enough to m so that the above term is strictly positive. We then have that for m(m-ε,m+ε) for some ε>0 small enough, ρ is not a minimiser of the free energy Fβmm(ρ). By Proposition 5.18(b), it follows that Fβm possesses a discontinuous transition point at some βcm<βm. The case m=1 can be treated similarly.

In the following proposition, we single out some special values of m at which one always finds a discontinuous transition point for WHsc.

Proposition 5.22

Assume WHsc such that βc is a transition point of Fβm. Then if m[2,3], βc is a discontinuous transition point. Specifically for the case m=2 we have that

  1. βc2=β2

  2. There exists a one parameter family of minimiser {ρα}α[0,|Ω|-1/2Θ(k)-1] of Fβ22 with ρ0=ρ.

Proof

We will try again to show that we have a competitor at βm. We start with the case 2<m<3. Consider the competitor

ρε=ρ+εek

for ε>0 and small and k:=argminkNd\{0}W^(k)/Θ(k) if it is uniquely defined or any one such k if it is not. Expanding the energy upto fifth order and noting that second order terms vanish we obtain

Fβmm(ρε)=Fβmm(ρ)+(βm)-1m(m-2)ρm-3ε33!Ωek3dx+(βm)-1m(m-2)(m-3)ρm-4ε44!Ωek4dx+(βm)-1m(m-2)(m-3)(m-4)ε55!Ωfm-5ek5dx,

where the function f(x)(ρ,ρε(x)). We again bound the highest order term as in Proposition 5.12 and use the fact that Ωek3dx=0 for any kNd\{0} to obtain

Fβmm(ρε)=Fβmm(ρ)+(βm)-1m(m-2)(m-3)ρm-4ε44!Ωek4dx+o(ε4).

Since m(m-2)(m-3) is negative for m(2,3), for ε>0 sufficiently small, we have shown that ρ is no longer the minimiser of Fβmm. The result follows by Proposition 5.18(b): we have a discontinuous transition point at some βc<βm.

We now consider the case m=2,3. Using the same expansion we have that

Fβ22(ρε)=Fβ22(ρ)Fβ33(ρε)=Fβ33(ρ).

Thus, ρ is not the unique minimiser of Fβmm for m=2,3. It then follows from Definition 5.10 that there must be a discontinuous transition point at βcmβm.

Consider now the convex interpolant ρt:=(1-t)ρ0+tρ1,t(0,1) for ρ0,ρ1P(Ω) such that Fβ2(ρ0),Fβ2(ρ1)<. We then have that

d2dt2Fβ2(ρt)=2β-1Ωη2dx+Ω×ΩW(x-y)η(x)η(y)dxdy(2β-1+minkNd\{0}W^(k)Θ(k))ηL2(Ω)2.

Note that the above expression is strictly positiove if β<β2. Thus, Fβ2 is strictly convex for β<β2 and has only one minimiser, namely, ρ. Since the function F is continuous (cf. Lemma 5.14), it follows that βc2=β2 for all WHsc. Furthermore, ρα=ρ+αek form a one-parameter family of minimisers of Fβ22 for α[0,|Ω|-1/2Θ(k)-1].

We conclude the section by discussing the existence of continuous transition points. We show that for m=4 one can construct a large class of potentials for which the transition point βc is continuous. We start with the following proposition.

Proposition 5.23

Let kNd be such that k0 and let kiN,i=1,,d be such that

k=k1kd.

Then we have:

ek2=jP2(k)cjej+c0e0,

where

P2(k):={jNd,j0,ji{2ki,0}},cj=ρNjandc0=ρN0.

Similarly

ek3=P3(k)ce+ckek

with

P3(k):={Nd,k,i{3ki,ki}},c=ρ2NkN(3)|{i:i=ki}|andck=ρ2NkN(3)d.

Note that P2(k)P3(k)=. Similarly, we have that

(σSymk(Λ)aσ(k)eσ(k))2=jP2(k)σ1,σ2Symk(Λ)aσ1(k)aσ2(k)cjσ1,σ2eσ1·σ2(j)+C0e0(σSymk(Λ)aσ(k)eσ(k))3=P3()σ1,σ2,σ3Symk(Λ)aσ1(k)aσ2(k)aσ3(k)cσ1,σ2,σ3eσ1·σ2·σ3()+σSymk(Λ)Ckσek

where the constants cjσ1,σ2,cσ1,σ2,σ3,C0,CkσR depend only on d, k, and ρ but are independent of the coefficients aσ(k)R.

Proof

The proof is simply a careful application of the trigonometric identities cos2(a)=2-1(1+cos(2a)), cos3(a)=4-1(cos(3a)+3cos(a)), and sin3(a)=4-1(3sin(a)-sin(3a)).

We now proceed to the result concerning continuous transition points for m=4.

Theorem 5.24

Let WHsc, such that βc< is a transition point of Fβ4. Assume that

k:=argminkNd,k0W^(k)Θ(k),

is uniquely defined. Furthermore, we assume that W^(k)0 for all kk and that

W^(j)>maxσ1,σ2Symk(Λ)6Θ(j)5(cjσ1,σ2)2|P2(k)P3(k)|ρΘ(k)|W^(k)|jP2(k) A2
W^()>maxσ1,σ2,σ3Symk(Λ)2Θ()9Θ(k)(cσ1,σ2,σ3)2|P2(k)P3(k)|3ρ2|W^(k)|P3(k), A3

where the sets P2,P3 and the constants cjσ1,σ2,cσ1,σ2,σ3 are as defined in Proposition 5.23. Then βc=β4 is a continuous transition point. Note that the constant Θ(k) for kNd is as defined in (2.1).

Proof

We will rely on Proposition 5.18(a) for the proof of this result. We need to show that, at β=β4, ρ is the unique minimiser of Fβ4. Let ρP(Ω)L(Ω) be any measure different from ρ. Then it is sufficient to show that Fβ44(ρ)>Fβ44(ρ) (it is sufficient to check bounded densities from the result of Lemma 5.1). We now define η:=ρ-ρ and note that η has the following properties

ηL(Ω),η-ρ,Ωηdx=0. 5.11

We can compute the free energy of ρ as follows

Fβ44(ρ)=(β4)-13Ωρ4dx-(β4)-13+12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy=(β4)-13(Ωρ4dx-1+4Ωρ3ηdx+6Ωρ2η2dx+4Ωρη3dx+Ωη4dx)+kNdW^(k)12NkσSymk(Λ)|η^(σ(k))|2,

where we have used (2.2). Simplifying further, by using the definition of β4 and the fact that η has mean zero, we obtain

Fβ44(ρ)=Fβ44(ρ)+kNd,kk(6(β4)-13ρ2+W^(k)12Nk)σSymk(Λ)|η^(σ(k))|2+4(β4)-13Ωρη3dxI1+(β4)-13Ωη4dxI2. 5.12

We define η2:=η-fη,k where fη,k=σSymk(Λ)η^(σ(k))eσ(k) and deal with the two terms I1 and I2 separately. We then have

I1=4(β4)-13Ωρη3dx=4(β4)-13ρΩ[fη,k3+3fη,k2η2]dx+Ω[3fη,kη22+η23]dx=4(β4)-13ρΩ[3fη,k2η2+3fη,kη22+η23]dx,

where we have used the fact that

Ωfη,k3dx=0.

We now use the fact that η has mean zero from (5.11) and Proposition 5.23 to obtain

I1=(β4)-13(4ρΩη23dx+12ρΩfη,kη22dx)+4(β4)-1ρjP2(k)σ1,σ2Symk(Λ)η^(σ1(k))η^(σ2(k))cjσ1,σ2η2^(σ1·σ2(j)). 5.13

For the second term we obtain

I2=(β4)-13Ωη4dx=(β4)-13Ω[fη,k4+4fη,k3η2+6fη,k2η22+4fη,kη23+η24]dx.

Applying Proposition 5.23 again, we obtain

I2=(β4)-13Ωη4dx=(β4)-13Ω[fη,k4+6fη,k2η22+4fη,kη23+η24]dx+4(β4)-13P3()σ1,σ2,σ3Symk(Λ)η^(σ1(k))η^(σ2(k))η^(σ3(k))cσ1,σ2,σ3η2^(σ1·σ2·σ3()), 5.14

where we have used the fact that η2^(σ(k))=0 for all σSymk(Λ). We now note that

kNd,kkσSymk(Λ)|η^(σ(k))|2=η2L2(Ω)2. 5.15

Putting (5.12), (5.13), (5.14), and (5.15), together we obtain

Fβ44(ρ)=Fβ44(ρ)+kNd,kk(W^(k)12Nk)σSymk(Λ)|η2^(σ(k))|2+4(β4)-1ρjP2(k)σ1,σ2Symk(Λ)η^(σ1(k))η^(σ2(k))cjσ1,σ2η2^(σ1·σ2(j))+4(β4)-13P3()σ1,σ2,σ3Symk(Λ)η^(σ1(k))η^(σ2(k))η^(σ3(k))cσ1,σ2,σ3η2^(σ1·σ2·σ3())+(β4)-13Ω6ρ2+12ρfη,k+4η2ρ+6fη,k2+4fη,kη2+η22η22dx+(β4)-13Ωfη,k4dx.

Note now that

6ρ2+12ρfη,k+4η2ρ+6fη,k2+4fη,kη2+η22=2(ρ+fη,k)2+(η2+2(ρ+fη,k))20.

Thus, it follows that

Fβ44(ρ)Fβ44(ρ)+kNd,kk(W^(k)12Nk)σSymk(Λ)|η2^(σ(k))|2+4(β4)-1ρjP2(k)σ1,σ2Symk(Λ)η^(σ1(k))η^(σ2(k))cjσ1,σ2η2^(σ1·σ2(j))+4(β4)-13P3()σ1,σ2,σ3Symk(Λ)η^(σ1(k))η^(σ2(k))η^(σ3(k))cσ1,σ2,σ3η2^(σ1·σ2·σ3())+(β4)-13Ωfη,k4dxFβ44(ρ)+kP2(k)P3(k)(W^(k)12Nk)σSymk(Λ)|η2^(σ(k))|2+4(β4)-1ρjP2(k)σ1,σ2Symk(Λ)η^(σ1(k))η^(σ2(k))cjσ1,σ2η2^(σ1·σ2(j))+4(β4)-13P3()σ1,σ2,σ3Symk(Λ)η^(σ1(k))η^(σ2(k))η^(σ3(k))cσ1,σ2,σ3η2^(σ1·σ2·σ3())+(β4)-13Ωfη,k4dx, 5.16

where in the last step we have simply used the fact that W^(k)0 for all kk. We now note that

σSymk(Λ)|η2^(σ(k))|2=Θ(k)-2σ1,σ2Symk(Λ)|η2^(σ1·σ2(k))|2=Θ(k)-4σ1,σ2,σ3Symk(Λ)|η2^(σ1·σ2·σ3(k))|2,

where we have used the fact that |Symk(Λ)|=Θ(k)2. Additionally, we have that

Ωfη,k4dx=(σSymk(Λ)|η^(σ(k))|2)2Ω(1(σSymk(Λ)|η^(σ(k))|2)1/2σSymk(Λ)η^(σ(k))eσ(k))4dxρ(σSymk(Λ)|η^(σ(k))|2)2,

where in the last step we applied Jensen’s inequality and used the fact that the integrand has unit L2(Ω) norm. For any kNd, we define the following quantity

|η^|k2=σSymk(Λ)|η^(σ(k))|2,

and note that

|η^|k4maxσ1,σ2Symk(Λ)i=12|η^(σ1(k))|2|η^(σ2(k))|2. 5.17

Finally, we can rewrite the inequality in (5.16) as

Fβ44(ρ)Fβ44(ρ)+jP2(k)σ1,σ2Symk(Λ)(Aj|η2^(σ1·σ2(j))|2+Bjσ1,σ2η2^(σ1·σ2(j))+Cj)+P3(k)σ1,σ2,σ3Symk(Λ)(A|η2^(σ1·σ2·σ3())|2+Bσ1,σ2,σ3η2^(σ1·σ2·σ3())+C), 5.18

where

Aj=W^(j)2NjΘ(j)-2A=W^()2NΘ()-4Bjσ1,σ2=4(β4)-1ρcjσ1,σ2i=12η^(σi(k))Bσ1,σ2,σ3=4(β4)-13cσ1,σ2,σ3i=13η^(σi(k))Cj=(β4)-13Θ(j)2|P2(k)P3(k)|ρ|η^|k4C=(β4)-13Θ()4|P2(k)P3(k)|ρ|η^|k4

Assume that |η^|k0. Then (.A2) and (A3) along with the expression for β4, (5.17), and the fact that |η^(k)|Nk, imply that the discriminants of the quadratic expressions in (5.18) are all negative, i.e. (Bjσ1,σ2)2-4AjCj<0,(Bσ1,σ2,σ3)2-4AC<0. Indeed, we have that

(Bjσ1,σ2)24AjCj=24(β4)-1ρΘ(j)4(cjσ1,σ2)2Nj|P2(k)P3(k)||η^|k4W^(j)i=12|η^(σi(k))|26|W^(k)|Θ(j)5(cjσ1,σ2)2|P2(k)P3(k)|ρΘ(k)W^(j)<(A2)1.

Similarly,

(Bσ1,σ2,σ3)24AC=8(β4)-1Θ()8(cσ1,σ2,σ3)2N|P2(k)P3(k)|3ρ|η^|k4W^()i=13|η^(σi(k))|22|W^(k)|Θ()9Θ(k)(cσ1,σ2,σ3)2|P2(k)P3(k)|3ρ2W^()<(A3)1.

Thus, it follows that Fβ44(ρ)>Fβ44(ρ). On the other hand if |η^|k0, the proof follows by noting that any contribution from the interaction energy is positive and that ρ is the unique minimiser of Sβ,4(ρ). The fact that βc=β4 is a consequence of Lemma 5.15.

Remark 5.25

Note that although the assumptions in Theorem 5.24 seem complicated, all they really require is that all Fourier coefficients of W, except the dominant negative mode W^(k) are nonnegative and that a finitely many of them “positive enough” compared to W^(k). Consider d=1, with W(x)=w1e1(x)+w2e2(x)+w3e3(x) with w1<0 and w2,w3>0. If, for some explicitly computable positive constants c2,c3>0, w2>c2|w1| and w3>c3|w1|, the conditions of Theorem 5.24 are satisfied and the transition point βc=β4 is continuous. In this setting, P2(1)={e2} and P3(1)={e3}.

The Mesa Limit m

A natural question to ask is ho w the sequence of free energies Fβm:P(Ω)(-,+] behave in the limit as m. We conjecture the following limit free energy, F:P(Ω)(-,+],

F(ρ)=12Ω×ΩW(x-y)ρ(x)ρ(y)dxdyρL(Ω)1+otherwise. 6.1

This is analogous to the so-called mesa limit of the porous medium equation considered by Caffarelli and Friedman [CF87]. It is also studied in [CKY18, CT20] for Newtonian interactions and [KPW19] for general drift-diffusion equations. We rederive the result in our setting.

Theorem 6.1

Consider the sequence of functionals {Fβm}m1 defined on P(Ω)L(Ω) equipped with the weak- topology. Then

F=Γ-limmFβm,

for any fixed β>0.

Proof

  1. Recovery sequence: For each ρP(Ω)L(Ω) we choose ρm=ρ as the recovery sequence. The interaction energy term remains unchanged as it is independent of m, while (m-1)-1 converges to 0 as m. Assume first that ρL(Ω)>1. It follows that there exists some ε>0 and a set A of positive measure susch that ρ|A>1+ϵ. Thus, we have
    β-1m-1Ωρmdxβ-1m-1|A|(1+ϵ)mm,
    and thus Fβm(ρ) for all ρL(Ω)>1. Now, let us assume that ρL(Ω)1. This gives us
    β-1m-1Ωρmdxβ-1m-1ρm-1m0,
    and thus completes the construction of the recovery sequence.
  2. Γ-lim inf: Assume that there exists {ρm}m1 such that ρmρ in L-weak-. For WC2(Ω), the interaction energy is continuous and so we can disregard its behaviour. We start with the case in which ρL(Ω)1. In this case the entropic term, Sβm(ρm), can be controlled from below by 0 and thus the Γ-lim inf holds trivially. The other case left to treat is when ρL(Ω)>1. This implies again that there exists some ε>0 and a set of positive measure A such that ρ|A>1+ε. It follows from the weak- convergence that
    limmAρmdx=(1+ϵ)|A|+δ,
    for some fixed positive constant δ>0 independent of m. We define the sets Am+:={xA:ρm>(1+ε)} and Am-:=A\Am+. There also exists NN such that for mN, Aρmdx(1+ϵ)|A|+δ/2. Thus, for mN we have that
    Am+ρmdx+Am-ρmdx(1+ϵ)|Am+|+(1+ϵ)|Am-|+δ/2
    from which it follows that
    Am+ρmdxδ/2.
    This gives us the estimate we need on the entropic term since
    β-1m-1Ωρmmdxβ-1m-1Am+ρmmdxβ-1m-1(1+ϵ)m-1Am+ρmdxβ-1m-1(1+ϵ)m-1δ/2.
    Passing to the limit as m, the result follows.

We would now like to understand how the presence of phase transitions for finite m affects the minimisers of F. This is discussed in the next result.

Theorem 6.2

(Minimisers of the mesa problem). Let F:P(Ω)(-,+] be as defined in (6.1). Then

  1. If |Ω|<1, F+.

  2. If |Ω|=1, F(ρ)<+ if and only if ρ=ρ. Thus, ρ is the unique minimiser of F.

  3. If |Ω|>1 and WHs and W0, ρ is the unique minimiser of F. On the other hand if WHsc there exists P(Ω)ρρ such that ρ is the minimiser of F with F(ρ)<F(ρ). Furthermore, there exists a sequence, {ρm}m1 of nontrivial minimisers of Fβm such that ρmρ in L-weak- as m.

Proof

The proof of Theorem 6.2(a) follows from the fact that if |Ω|<1, then for any ρP(Ω)L1(Ω) there exists a set A of positive measure such that ρ(x)>1 for all xA. Indeed, if this were not the case we would have that

Ωρdx|Ω|<1,

which would be a contradiction. Thus, we have that ρL(Ω)>1 for all ρP(Ω)L1(Ω) and so F.

The proof of Theorem 6.2(b) is similar. If ρρ, we can again find a set of positive measure A such that ρ(x)>1 for all xA. We then repeat the same argument as in the previous case.

Assume now that |Ω|>1 and WHs,W0 (if W is identically zero then clearly F0). Since W is mean-zero we have that

F(ρ)=0.

On the other hand if P(Ω)L(Ω)ρρ, we know from Definition 2.1, that

F(ρ)=12Ω×ΩW(x-y)ρ(x)ρ(y)dxdy>0.

Finally consider the case WHsc. Let β>0 be fixed and note that, since |Ω|>1, βm0 as m. Clearly for m large enough a nontrivial minimiser ρmP(Ω) exists for β>0 from the result of Proposition 5.12. Consider the measure ρε=ρ+εek where k is as defined previously. We then have the following bound

Fβm(ρm)Fβm(ρε)=Fβm(ρ)+(β-1mρm-2+ρ-1/2W^(k)Θ(k))ε22ekL2(Ω)2+β-1m(m-2)ε36Ωfm-3ek3dx,

where the function f(x)(ρ,ρε(x)). Note that |f|(ρ+εNk). Thus, we have the bound

Fβm(ρm)Fβm(ρε)Fβm(ρ)+(β-1mρm-2+ρ-1/2W^(k)Θ(k))ε22ekL2(Ω)2+β-1m(m-2)ε36(ρ+εNk)m-3Nk3|Ω|,

Additionally note that if ε is small enough and ρ<1, the last term tends to 0 as m. Also since WHsc, the second term in the above expression is negative for m large enough as mρm0 as m. It follows from this that, for m large enough, the following estimate holds

Fβm(ρm)Fβm(ρε)Fβm(ρ)-C1ε2+C2ε3, 6.2

where C1,C2>0 are independent of m. it hus follows from Theorem 6.1, (6.2), and the definition of Γ-convergence that

F(ρ)<F(ρ),

where ρP(Ω) is the minimiser of F. Thus, ρρ and the result follows.

Numerical Experiments

The numerical experiments in this section are meant to shed light on the qualitative features of the global bifurcation diagram of the system, while also serving as a source of possible conjectures that can be studied in future work. They were performed using a modified version on the numerical scheme in [CCH15].

Discontinuous bifurcations for m>2 and W=-cos(2πx/L)

Fig. 1 shows the branches of stationary solutions obtained in the long-time limit for m2 and W=-cos(2πx/L). The black dot denotes the point of linear stability βm while the red dot denotes the value of β at which the support of the stationary solution is a strict subset of T. Note that the diagram does not necessarily reflect the actual bifurcation diagram of the system as it is obtained from the long-time dynamics and thus will only see stable solutions. We already know that this choice of W satisfies the conditions of Theorem 4.4 and so there will a bifurcation at βm (the black points in Fig. 1). One would expect this branch to turn to the right for m(2,3) (cf. Remark 4.6) and then turn back. We conjecture that the red points are all saddle-node bifurcations and correspond to discontinuous phase transitions for m2 due to Lemma 5.15 and the fact that they lie ahead of the corresponding βm.

Fig. 1.

Fig. 1

Stationary solutions in the long-time limit for m2 and W=-cos(2πx/L). The black dot denotes the point of linear stability βm while the red dot denotes the value of β at which the support of the stationary solution is a strict subset of T. Note that ρsL=0.1 corresponds to the flat state ρ

The mesa minimisers

In Fig. 2, we plot the stationary solutions observed in the long-time limit for m large and β>βc. Since the stationary solutions are potentially minimisers of Fβm and the minimisers converge to the minimisers of F as m (cf. Theorem 6.1), the plots in Fig. 2 provide us with some information about the structure of the minimisers of the mesa problem. It seems to be that they converge to the indicator function of some fixed set. A natural next question one can ask is what happens to the continuity of phase transitions in the limit as m.

Fig. 2.

Fig. 2

Stationary solutions/minimisers for m large and |Ω|>1. The limiting object seems to be the indicator function of some interval

Proof of Hölder Regularity

We divide the proof into two parts. In Sect. 8.1, we derive some a priori estimates that will be useful in the proof of regularity. In Sect. 8.2, we perform the so-called reduction of oscillation scheme and complete the proof of Theorem 3.3. As mentioned earlier, readers interested only in bifurcations and phase transitions can skip directly to Sect. 4.

Before turning to the proof of Theorem 3.3, we introduce some notation. Since the Eq. (3.1) is invariant under translations of the co-ordinate axis, we define the parabolic cylinder

Q(τ,R)=[-R,R]d×[-τ,0],

centred at (0, 0) and note that we can move it to any point by adding (x0,t0). We also used KR as a shorthand for [-R,R]d. We denote the parabolic boundary by

pQ(τ,R)=KR×(-τ,0)KR×{-τ,0}.

We use the following shorthand notation:

w+=max(w,0),w-=-min(w,0),ρ+=min(ρ,),ρ-=-min(-ρ,-).

Additionally, we consider the cut-off functions ζ such that

0ζ1,|ζ|<+,ζ(x,t)=0,xKR.

Through the rest of this section we will also use f(xt) to denote Wρ(x,t). Note that

fL(Ω)WL(Ω),D2fL(Ω)D2WL(Ω)

The reader should note that proof of regularity holds for any fC2(Ω) that for which one can prove bounds of the kind shown above. We note before starting the proof that all estimates in the proof have constants that depend continuously on β>0. Thus, the Hölder exponent a and semi-norm |ρ|a also depend continuously on β>0.

A priori estimates

There are two a priori estimates that play a key role in the proof of Hölder regularity: a Cacciopoli-type energy estimate and a logarithmic estimate. The proof of the energy estimate is essentially the same as [Urb08, Proposition 2.4] and we state it without proof.

Lemma 8.1

(Energy estimates). Pick k,R+ and some cut-off function ζ, such that ζ=0 on pQ(τ,R). Then it holds for any weak solution of (3.1) that

12[esssupt[-τ,0]KR×{t}(ρ+-k)+2ζ2dx+Q(τ,R)(ρ+)m-1|(ρ+-k)+ζ|2dxdt]Q(τ,R)(ρ+-k)+2ζζtdxdt+2(-k)+Q(τ,R)(ρ-)+ζζtdxdt+2mβ-1Q(τ,R)(ρ+-k)+2(ρ+)m-1|ζ|2dxdt+2mβ-1(-k)+Q(τ,R)(ρsm-1ds)(|ζ|2+ζΔζ)χρdxdt+Q(τ,R)|f||ζ||ζ|(ρ+-k)+2dxdt+Q(τ,R)|Δf|(ρ+-k)+2ζ2dxdt.

Similarly we have,

12[esssupt[-τ,0]KR×{t}(ρ--k)-2ζ2dx+Q(τ,R)(ρ-)m-1|(ρ--k)-ζ|2dxdt]Q(τ,R)(ρ--k)-2ζζtdxdt+2(-k)-Q(τ,R)(ρ-)-ζζtdxdt+2mβ-1Q(τ,R)(ρ--k)-2(ρ-)m-1|ζ|2dxdt-2mβ-1(-k)-Q(τ,R)(ρsm-1ds)(|ζ|2+ζΔζ)χρdxdt+Q(τ,R)|f||ζ||ζ|(ρ--k)-2dxdt+Q(τ,R)|Δf|(ρ--k)-2ζ2dxdt. 8.1

We note that Urbano [Urb08, Proposition 2.4] proves the above energy estimate for the p-Laplace equation, tρ-Δpρ=0. The proof in our setting follows the same technique. We test the weak formulation in Theorem 8.3 (see page 33) against ϕ=((ρ±)h-k)±ζ2, for some cut-off function ζ supported in Q(τ,R) and integrate by parts. Applying similar bounds as in [Urb08, Proposition 2.4] and then passing to the limit as h0, we obtain the desired energy estimate. We also refer the reader to [Rod16, Proposition 2.7] where the proof of the energy estimate is carried out for the porous medium equation, tρ-Δρm=0, which is closer in structure to (3.1). We now move on to the logarithmic estimate. The proof of this needs to be adapted from the classical estimate in the presence of the drift term ·(fρ). Before stating and proving it, we introduce the following function

ψ±(s)=ψk,c±(s):=(ln(Hs,k±(Hs,k±+c)-(s-k)±))+,0<c<Hs,k±,

where s is a bounded, measurable function on Q(τ,R) and

Hs,k±=esssupQ(τ,R)|(s-k)±|.

The function has certain useful properties, namely,

0ψ±(s)(ψ+)(s)0,(ψ-)(s)0(ψ±)=((ψ±))2.

We also need to define the Steklov average for any ρL1(Ω×[0,T]) for any 0<h<T as follows

ρh:=h-1tt+hρ(·,τ)dτ0tT-h0otherwise.

The Steklov average has certain nice properties which we state without proving.

Lemma 8.2

[Urb08, Lemma 2.2]. Let ρLq([0,T];Lr(Ω)) then ρh converges to ρ in ρLq([0,T];Lr(Ω)) as h0 for q,r(1,). Additionally, if ρC([0,T];L2(Ω)), then ρh(·,t) converges to ρ(·,t) in Lq(Ω) for t[0,T].

Using this we have the following alternative notion of a weak solution of

Definition 8.3

A weak solution of (3.1) is a bounded measurable function

ρC([0,T];L2(Ω))

with

ρmL2([0,T];H1(Ω)),

such that

Ω×{t}t(ρh)ϕ+mβ-1(ρm-1ρ)h·ϕ+(ρWρ)h·ϕdx=0, 8.2

for all ϕH01(Ω), h(0,T), t(0,T] and ρ(x,0)=ρ0.

Proposition 8.4

[Urb08]. The notion of weak solution introduced in Theorem 3.1 and Theorem 8.3 are equivalent.

Lemma 8.5

(Logarithmic estimates). Let ρ be a nonnegative weak solution of (3.1) and ζ be a time-independent cut-off function, then it holds that

KR×{t}((ψ±)2)(ρ)ζ2dxKR×{-τ}((ψ±)2)(ρ)ζ2dx-2mβ-1-τtKR×{s}(ρm-1|ρ|2((ψ±)(ρ))2ζ2)dxds+2mβ-1-τtKR×{s}ρm-1ψ±(ρ)|ζ|2dxds+2-τtKR×{s}ρ|f||ρ||((ψ±)(ρ))2(1+(ψ±(ρ))|ζ2dxds+4-τtKR×{s}ρ|f||ζ||((ψ±)(ρ))ψ±(ρ)||ζ|dxds.

for any -τt0.

Proof

We start by testing (8.2) against ((ψ±)2)(ρh)ζ2 and integrating by parts to obtain

Ω×{t}t(ρh)((ψ±)2)(ρh)ζ2+mβ-1(ρm-1ρ)h·(((ψ±)2)(ρh)ζ2)+(ρf)h·(((ψ±)2)(ρh)ζ2)dx=0, 8.3

Consider the first term on the LHS and integrating from -τ to t

-τtΩ×{s}s(ρh)((ψ±)2)(ρh)ζ2dxds=-τtΩ×{s}s((ψ±)2)(ρh)ζ2dxds=Ω×{t}((ψ±)2)(ρh)ζ2dx-Ω×{-τ}((ψ±)2)(ρh)ζ2dx.

Passing to the limit as h0 we obtain that

-τtΩ×{s}s(ρh)((ψ±)2)(ρh)ζ2dxdsΩ×{t}((ψ±)2)(ρ)ζ2dx-Ω×{-τ}((ψ±)2)(ρ)ζ2dx.

Now consider the second term on the LHS of (8.3) (after passing to the limit as h0)

β-1-τtΩ×{s}m(ρm-1ρ)·(((ψ±)2)(ρ)ζ2)dxds=2mβ-1-τtΩ×{s}(ρm-1|ρ|2((ψ±)(ρ))2(1+(ψ±(ρ))ζ2)dxds+4mβ-1-τtΩ×{s}(ρm-1ρ(ψ±)(ρ)ψ±(ρ)ζ·ζ)dxds2mβ-1-τtΩ×{s}(ρm-1|ρ|2((ψ±)(ρ))2ζ2)dxds-2mβ-1-τtΩ×{s}ρm-1ψ±(ρ)|ζ|2dxds,

where the last expression follows from Youngs inequality. Finally we consider the last term on the LHS of (8.3) (after passing to the limit as h0)

-τtΩ×{s}(ρf)·(((ψ±)2)(ρ)ζ2)dxds=2-τtΩ×{s}ρf·ρ((ψ±)(ρ))2(1+(ψ±(ρ))ζ2dxds+4-τtΩ×{s}ρf·ζ((ψ±)(ρ))ψ±(ρ)ζdxds-2-τtΩ×{s}ρ|f||ρ||((ψ±)(ρ))2(1+(ψ±(ρ))|ζ2dxds-4-τtΩ×{s}ρ|f||ζ||((ψ±)(ρ))ψ±(ρ)||ζ|dxds.

Putting it all together we obtain

Ω×{t}((ψ±)2)(ρ)ζ2dxΩ×{-τ}((ψ±)2)(ρ)ζ2dx-2mβ-1-τtΩ×{s}(ρm-1|ρ|2((ψ±)(ρ))2ζ2)dxds+2mβ-1-τtΩ×{s}ρm-1ψ±(ρ)|ζ|2dxds+2-τtΩ×{s}ρ|f||ρ||((ψ±)(ρ))2(1+(ψ±(ρ))|ζ2dxds+4-τtΩ×{s}ρ|f||ζ||((ψ±)(ρ))ψ±(ρ)||ζ|dxds.

Taking into account the support of ζ, one obtains the result of the lemma.

Proof of Theorem 3.3

We now get to the meat of the regularity argument, i.e. the reduction of oscillation. We assume again that ρ is a nonnegative weak solution of (3.1). We pick a cylinder Q(4R2-ε,2R) that lies inside ΩT (shifted to (0, 0)) for 0<R<1. Then we can define

μ+=esssupQ(4R2-ε,2R)ρ,μ-=essinfQ(4R2-ε,2R)ρ,

along with

ω=μ+-μ-=essoscQ(4R2-ε,2R)ρ.

We then define the rescaled cylinder

Q(ω1-mR2,R)Q(4R2-ε,2R),

which holds true if

αωm-1>Rε. 8.4

For a fixed ε>0,α(0,1) if the above inequality does not hold true for any R that can be made arbitrarily small, it follows that ω is comparable to the radius of the cylinder and thus we have Hölder continuity already. The proof of this statement is by contradiction. Let ωR:=essoscQ(4R2-ε,2R)ρ. Then for any point (x,t)ΩT we set R:=dTd(x,0)+|t|1/2, the parabolic distance to the origin. Thus, we have

|ρ(x,t)-ρ(0,0)|ωRα-1m-1Rεm-1=α-1m-1(dTd(x,0)+|t|1/2)εm-1.

We will specify the value of α later. We thus have by this inclusion that

essoscQ(w1-mR2,R)ρω.

We will also assume throughout the remainder of this proof that μ-<ω/4, as otherwise the equation is uniformly parabolic in Q(4R2-ε,2R). Before we proceed we pick some ν0(0,1) and divide our analysis into two cases.

Case 1

|(x,t)Q(ω1-mR2,R):ρ<μ-+ω/2||Q(ω1-mR2,R)|ν0, 8.5

or

Case 2

|(x,t)Q(ω1-mR2,R):ρμ-+ω/2||Q(ω1-mR2,R)|<1-ν0,

or equivalently

|(x,t)Q(ω1-mR2,R):ρμ+-ω/2||Q(ω1-mR2,R)|<1-ν0. 8.6

We now treat the two cases independently.

Reduction of oscillation in case 1

In the first case, we start by proving the following result.

Lemma 8.6

Assume that μ-<ω/4 and that (8.5). holds for some ν0(to be chosen), then

ρ(x,t)>μ-+ω4a.e. inQ(ω1-m(R2)2,R2).
Proof

We start by considering the sequence

Rn=R2+R2n+1n=0,1,

such that R0=R and RnR/2 as n. We then construct a sequence of nested shrinking cylinders Q(ω1-mRn2,Rn) along with cut-off functions ζn satisfying

0ζn1,ζn=1inQ(ω1-mRn+12,Rn+1),ζn=0onpQ(ω1-mRn2,Rn),|ζn|2n+2R,0(ζn)t22n+2R2ωm-1,Δζn22n+5R2.

We now apply the energy estimate of Lemma 8.1 in Q(ω1-mRn2,Rn) with =μ-+ω/4, and kn=μ-+ω/4+ω/(2n+1) for the function (ρ--kn)-. We will bound the terms on the LHS and RHS separately. Considering first the terms on the LHS we have

12esssup-Rn2ω1-m<t<0KRn×{t}(ρ--kn)-2ζn2dx+Q(ω1-mRn2,Rn)(ρ-)m-1|(ρ--kn)-ζn|2dxdt21-2messsup-Rn2ω1-m<t<0KRn×{t}(ρ--kn)-2ζn2dx+ωm-1Q(ω1-mRn2,Rn)|(ρ--kn)-ζn|2dxdt,

where we have used the fact that ρ-=max(ρ,μ-+ω/4)μ-+ω/4ω/4. For the RHS we first note the following facts:

  1. 0μ-ω/4 which implies that ρ5ω/4, ω/2, and ρ-5ω/4 .

  2. =μ-+ω/4<kn which implies that χ[ρ]χ[ρkn]=χ[(ρ-kn)->0].

  3. If ρ-=ρ, then χ[(ρ--kn)->0]=χ[(ρ-kn)->0]. On the other hand if ρ-=, we have that ρ<kn we have that χ[(ρ-kn)->0]=0=χ[(-kn)->0]=χ[(ρ--kn)->0].

  4. (l-kn)-=ω/(2n+1)ω/2, (ρ--kn)-ω/2n+1ω/2, (ρ-)-ω/4 .

We now proceed to bound individual terms on the RHS of (8.1). For the first term we have:

Q(ω1-mRn2,Rn)(ρ--kn)-2ζn(ζn)tdxdt++2(-kn)-Q(ω1-mRn2,Rn)(ρ-)-ζn(ζn)tdxdtω22ωm-122n+2R2Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt.

For the second term:

2mβ-1Q(ω1-mRn2,Rn)(ρ--kn)-2(ρ-)m-1|ζn|2dxdtmβ-1(54)m-1ω2ωm-122n+3R2Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt.

For the third term:

-2mβ-1(-kn)-Q(ω1-mRn2,Rn)(ρsm-1ds)(|ζn|2+ζnΔζn)χρdxdtmβ-1ω24ωm-122n+5R2Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt.

For the final two terms we have:

Q(ω1-mRn2,Rn)|f||ζn||ζn|(ρ--kn)-2dxdt+Q(ω1-mRn2,Rn)|Δf|(ρ--kn)-2ζn2dxdt(2n+2RfL(Ω)+ΔfL(Ω))ω24Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt=22nR2ωm-1(ω1-mR2n-2fL(Ω)+ΔfL(Ω)ω1-mR22-2n)ω24Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt22nR2ωm-1(4L1-εfL(Ω)+ΔfL(Ω)L2-ε)ω24Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt,

where in the last step we have used the fact that Rεω1-m<α<1 and that R<L. Putting the bounds for the LHS and RHS of (8.1) together we obtain

[esssup-Rn2ω1-m<t<0KRn×{t}(ρ--k)-2ζn2dx+ωm-1Q(ω1-mRn2,Rn)|(ρ--k)-ζn|2dxdt]C(m,L,β,fL(Ω),ΔfL(Ω))22nR2ωm-1ω24Q(ω1-mRn2,Rn)χ[(ρ--kn)->0]dxdt.

Let t¯=ωm-1t and define the following rescaled functions

ρ¯-(·,t¯)=ρ-(·,t),ζn¯(·,t¯)=ζn(·,t).

In these new variables the inequality simplifies to

[esssup-Rn2<t¯<0KRn×{t¯}(ρ¯--kn)-2ζn¯2dx+Q(Rn2,Rn)|(ρ¯--kn)-ζn¯|2dxdt]C22nR2ω24An, 8.7

where

An:=Q(Rn2,Rn)χ[(ρ¯--kn)->0]dxdt.

Furthermore we have

122n+2ω24An+1=|kn-kn+1|2An+1=Q(Rn+12,Rn+1)|kn-kn+1|2χ[(ρ¯--kn+1)->0]dxdtQ(Rn+12,Rn+1)|kn-ρ¯-|2χ[(ρ¯--kn+1)->0]dxdt(kn-ρ¯-)-L2(Q(Rn+12,Rn+1))2CdAn2/(2+d)(kn-ρ¯-)-V2(Q(Rn+12,Rn+1))2,

where in the last step we have used the embedding into the parabolic space V2(cf. Lemma A.4). Thus, we have

122n+2ω24An+1Cd[esssup-Rn+12<t¯<0KRn+1×{t¯}(ρ¯--k)-2dx+Rn+12,Rn+1)|(ρ¯--k)-|2dxdt]CdAn2/(2+d)[esssup-Rn2<t¯<0KRn×{t¯}(ρ¯--k)-2ζn¯2dx+Q(Rn2,Rn)|(ρ¯--k)-ζn¯|2dxdt]C22nR2ω24An1+2/(d+2),

where we have used the fact that ζn¯=1 on Q(Rn+12,Rn+1) and have used (8.7). Thus, we have

An+1|Q(Rn+12,Rn+1)|C|Q(Rn+12,Rn+1)|2/(2+d)42n+1R2(An|Q(Rn+12,Rn+1)|)1+2/(d+2)C42n(|Q(Rn2,Rn)||Q(Rn+12,Rn+1)|An|Q(Rn2,Rn)|)1+2/(d+2)C42n(An|Q(Rn2,Rn)|)1+2/(d+2),

where we use the fact that |Q(Rn2,Rn)|=Rn+1d+2Rd+2 and Rn/Rn+12. Setting

Xn:=(An|Q(Rn2,Rn)|),

we have the recursive inequality

Xn+1C42nXn1+2/(2+d),

with the constant C independent of ω,R,n and dependent only d,m,β,f. Setting ν0=C-(d+2)/24-(d+2)2/2, we see that X0ν0 is equivalent (8.5) to being satisfied with constant ν0, since k0=ω/2. Thus, for this choice, Xn0 by the geometric convergence lemma (cf. Lemma A.2). It follows then, after changing variables, that ρ->μ-+ω/4 a.e. in Q(ω1-m(R2)2,R2). The result follows by noting that ρ->μ-+ω/4= implies that ρ-=ρ.

Corollary 8.7

(Reduction of oscillation in case 1). Assume that (8.5) holds with constant ν0 as specified in the proof of Lemma 8.6. Then there exists a σ1(0,1), independent of ω, R, such that

essoscQ(ω1-m(R2)2,R2)ρσ1ω.
Proof

We have by the result of the previous lemma that

essinfQ(ω1-m(R2)2,R2)ρμ-+ω/4.

Thus, we have that

essoscQ(ω1-m(R2)2,R2)ρ=esssupQ(ω1-m(R2)2,R2)ρ-essinfQ(ω1-m(R2)2,R2)ρμ+-μ--ω/434ω.

Thus, the result holds with σ1=34.

Reduction of oscillation in case 2

We now assume that (8.6) holds but with the constant ν0 fixed from the previous argument. We argue now that if (8.6) is satisfied then there exists some t0,

t0[-ω1-mR2,-ν02ω1-mR2],

such that

|{xKR:ρ(x,t0)>μ+-ω2}|1-ν01-ν0/2|KR|.

We prove this by contradiction. Assume this is not the case then

|{xQ(ω1-mR2,R):ρ(x,t)>μ+-ω2}|-ω1-mR2-ν02ω1-mR2|xKR:ρ(x,s)>μ+-ω/2|ds>(-ν02ω1-mR2+ω1-mR2)(1-ν01-ν0/2)|KR|=(1-ν0)|Q(ω1-mR2,R)|,

which contradicts (8.6). We now proceed to prove the following lemma.

Lemma 8.8

Assume that (8.6). holds. Then there exists a qN, depending only on the data, such that

|{xKR:ρ(x,t)>μ+-ω2q}|(1-(ν02)2)|KR|,

for all t[-ν02ω1-mR2,0] and α in (8.4) chosen to be small, depending only on ν0, m, d, β, W, M but independent of R and ω.

Proof

The proof of this lemma relies on the Lemma 8.5 with the function ψ+(u) on the cylinder Q(-t0,R). We choose

k=μ+-ω2,c=ω2n+1,

where the constant n>1 will be chosen later. It is fine to apply it to this function as we can assume that

Hρ,k+=esssupQ(-t0,R)|(ρ-μ++ω2)+|>ω4ω2n+1,

otherwise the proof of the lemma would be complete with q=2. Indeed, we would have for all t[t0,0]:

|{xKR:ρ(x,t)>μ+-ω4}|=|{xKR:ρ(x,t)-μ++ω2>ω4}|=0.

Before we write down the inequality, we need to further understand the properties of the function ψ+(ρ) defined on the cylinder Q(-t0,R). Note first that

ψ+(ρ)=ln(Hρ,k+Hρ,k+-ρ+k+ω2n+1)ρ>k+ω2n+10ρk+ω2n+1.

Furthermore in Q(-t0,R), we have that

ρ-kHρ,k+ω2.

Therefore

ψ+(ρ)ln(Hρ,k+Hρ,k+-ρ+k+ω2n+1)ln(2n)nln(2).

Furthermore, we need to study the properties of (ψ+)(ρ):

(ψ+)(ρ)=1Hρ,k+-ρ+k+ω2n+1ρ>k+ω2n+10ρk+ω2n+1.

Thus, we have

0(ψ+)(ρ)2n+1ω.

We now proceed to writing down the estimate

KR×{t}((ψ+)2)(ρ)ζ2dxKR×{t0}((ψ+)2)(ρ)ζ2dx-2mβ-1t0tKR×{s}(ρm-1|ρ|2((ψ+)(ρ))2ζ2)dxds+2mβ-1t0tKR×{s}ρm-1ψ+(ρ)|ζ|2dxds+2t0tKR×{s}ρ|f||ρ||((ψ+)(ρ))2(1+(ψ+(ρ))|ζ2dxds+4t0tKR×{s}ρ|f||ζ||((ψ+)(ρ))ψ+(ρ)||ζ|dxds. 8.8

for any t0t0. We choose a time-independent cut-off function 0ζ1 such that

ζ1,xK(1-δ)R,|ζ|(δR)-1.

Consider now the first term involving f on the RHS of (8.8)

2t0tKR×{s}ρ|f||ρ||((ψ+)(ρ))2(1+(ψ+(ρ))|ζ2dxdsλ2mβ-1t0tKR×{s}(ρm-1|ρ|2((ψ+)(ρ))2ζ2)dxds+12λmβ-1t0tKRρ3-m|f|2|((ψ+)(ρ))2(1+(ψ+(ρ))2|ζ2dxds,

where we have simply applied Young’s inequality and the constant λ(0,1/2). We derive a similar bound for the second term involving f as follows

4t0tKR×{s}ρ|f||ζ||((ψ+)(ρ))ψ+(ρ)||ζ|dxdsλ2mβ-1t0tKR×{s}(ρm-1|ρ|2((ψ+)(ρ))2ζ2)dxds+2λmβ-1t0tKRρ3-m|f|2(ψ+(ρ))2|ζ|2dxds.

Putting it all together we can get rid of the negative term in (8.8) and take the esssup to obtain:

esssupt[t0,0]KR×{t}((ψ+)2)(ρ)ζ2dxKR×{t0}((ψ+)2)(ρ)ζ2dx+2mβ-1t00KR×{s}ρm-1ψ+(ρ)|ζ|2dxds+12λmβ-1t00KRρ3-m|f|2|((ψ+)(ρ))2(1+(ψ+(ρ))2|ζ2dxds+2λmβ-1t00KRρ3-m|f|2(ψ+(ρ))2|ζ|2dxds. 8.9

We proceed to bound each of the terms individually. For the first term on the RHS of (8.9) we obtain:

KR×{t0}((ψ+)2)(ρ)ζ2dxn2ln(2)21-ν01-ν0/2|KR|.

For the second term we use the fact that ρ5ω/4 to obtain:

2mβ-1t00KR×{s}ρm-1ψ+(ρ)|ζ|2dxds2mβ-1(54)m-1ωm-1(δR)-2|t0|nln(2)|KR|2mβ-1(54)m-1ln(2)δ-2n|KR|.

For the third term we use the fact that 5/4ωρω/2 on the supports of ψ+(ρ) and (ψ+)(ρ) to obtain:

12λmβ-1t00KRρ3-m|f|2|((ψ+)(ρ))2(1+(ψ+(ρ))2|ζ2dxdsC12λmβ-1ω3-mω1-mR2fL(Ω)22n+1ω-2(1+nln(2))2|KR|=C2λmβ-1ω1-mω1-mR2fL(Ω)22n+1(1+nln(2))2|KR|.

Similarly for the final term we obtain

2λmβ-1t00KRρ3-m|f|2(ψ+(ρ))2|ζ|2dxds2Cλmβ-1ω2ω1-mω1-mR2fL(Ω)2n2ln(2)2|KR|.

For the LHS of (8.6), consider the set

St={xK(1-δ)R:ρ(x,t)>μ+-ω/2n+1}KR,t(t0,0).

It is clear that ζ=1 on St and, since -ρ+k+ω/2n+1<0, the function

Hρ,k+Hρ,k+-ρ+k+ω2n+1,

is decreasing in Hρ,k+. Thus, in St we have

Hρ,k+Hρ,k+-ρ+k+ω2n+1ω/2ω/2-ρ+k+ω2n+1ω/2ω/2+ω/2n+1-ω/2+ω/2n+1=2n-1.

Thus, we have

esssupt[t0,0]KR×{t}((ψ+)2)(ρ)ζ2dx(n-1)2ln(2)|St|.

Putting all the terms back together we obtain and bounding ω2 by M2,

|St|((nn-1)21-ν01-ν0/2+C(m,β)δ-2n(n-1)2)|KR|+(C1(m,β,λ,fL(Ω))ω1-mω1-mR22n+1(1+nln(2)n-1)2)|KR|+(C2(m,β,λ,fL(Ω),M)ω1-mω1-mR2(nn-1)2)|KR|.

Finally, we obtain the estimate we need

|{xKR:ρ(x,t)>μ+-ω2q}||St|+|KR\K(1-δ)R|((nn-1)21-ν01-ν0/2+C(m,β)δ-2n(n-1)2+dδ)|KR|+(C1(m,β,λ,fL(Ω))Rεω1-mRεω1-mL2-2ε2n+1(1+nln(2)n-1)2)|KR|+(C2(m,β,λ,fL(Ω),M)Rεω1-mRεω1-mL2-2ε(nn-1)2)|KR|,

where one should note that RL and the term Rεω1-m can be controlled by α through (8.4). Note that for the term in the first set of brackets we can choose dδ3ν02/16 and n large enough such that

(nn-1)2(1-ν0/2)(1+ν0),C(m,β)n(n-1)2δ-23ν02/16,

because (1-ν0/2)(1+ν0)>1. Now that n and δ have been fixed we note that the constant α in (8.4) can be made small enough (independent of ω and R) so that terms in the other two brackets are lesser that 3ν02/16. This gives us

|{xKR:ρ(x,t)>μ+-ω2q}|(1-ν02+3ν02/4)|KR|=(1-ν024)|KR|.

The proof follows by setting q=n+1 and noting that [t0,0][-ν02ω1-mR2,0].

We now proceed to prove that ρ is strictly lesser than its supremum in a smaller parabolic cylinder.

Lemma 8.9

Assume that (8.6). holds. Then there exists some s0N large enough, independent of ω, such that

ρ(x,t)<μ+-ω2s0a.e.(x,t)Q(ν02ω1-m(R2)2,R2).
Proof

The proof is similar to that of Lemma 8.6 and relies on the energy estimates in Lemma 8.1. We start by considering the sequence

Rn=R2+R2n+1n=0,1,

such that R0=R and RnR/2 as n. We then construct a sequence of nested shrinking cylinders Q(ν02-1ω1-mRn2,Rn) along with cut-off functions ζn satisfying

0ζn1,ζn=1inQ(ν02-1ω1-mRn+12,Rn+1),ζn=0onpQ(ν02-1ω1-mRn2,Rn),|ζn|2n-1R,0(ζn)t22n-2R2ωm-1,Δζn22n-2R2.

We now apply the energy estimate of Lemma 8.1 in Q(ν02-1ω1-mRn2,Rn) with =μ+-ω/2s0, and kn=μ+-ω/(2s0)-ω/(2n+s0) for the function (ρ+-kn)+. We will bound the terms on the LHS and RHS separately. Considering first the terms on the LHS we have

12esssup-Rn2ω1-mν02-1<t<0KRn×{t}(ρ+-kn)+2ζn2dx+Q(ν02-1ω1-mRn2,Rn)(ρ+)m-1|(ρ+-kn)+ζn|2dxdt2-messsup-ν02-1Rn2ω1-m<t<0KRn×{t}(ρ+-kn)+2ζn2dx+ωm-1Q(ν02-1ω1-mRn2,Rn)|(ρ+-kn)+ζn|2dxdt,

where we have used the fact that when |(ρ+-k)+ζn| is nonzero, ρ+knω/2. For the RHS we first note the following facts:

  1. 0μ-ω/4 which implies that ρ5ω/4, and ρ+5ω/4 .

  2. =μ--ω/2s0>kn which implies that χ[ρ]χ[ρkn]=χ[(ρ-kn)+>0].

  3. If ρ+=ρ, then χ[(ρ+-kn)+>0]=χ[(ρ-kn)+>0]. On the other hand if ρ+=, we have that ρkn. Thus, we have that χ[(ρ-kn)+>0]=χ[(ρ+-kn)+>0].

  4. (l-kn)+=ω/(2n+s0)ω/2s0-1, (ρ+-kn)+ω/2n+s0ω/2s0-1, (ρ-)+ω/2s0-1.

Applying, essentially the same bounds as Lemma 8.6, we obtain

[esssup-ν02-1Rn2ω1-m<t<0KRn×{t}(ρ+-k)+2ζn2dx+ωm-1Q(ν02-1ω1-mRn2,Rn)|(ρ+-k)+ζn|2dxdt]C(m,L,β,fL(Ω),ΔfL(Ω))22nR2ωm-1ω222s0-2Q(ν02-1ω1-mRn2,Rn)χ[(ρ+-kn)+>0]dxdt.

Let t¯=ν0-12ωm-1t and define the following rescaled functions

ρ¯+(·,t¯)=ρ+(·,t),ζn¯(·,t¯)=ζn(·,t).

In these new variables the inequality simplifies to

[esssup-Rn2<t¯<0KRn×{t¯}(ρ¯+-kn)+2ζn¯2dx+ν02Q(Rn2,Rn)|(ρ¯+-kn)+ζn¯|2dxdt]C22nR2ν02ω222s0-2An,

where

An:=Q(Rn2,Rn)χ[(ρ¯+-kn)+>0]dxdt.

Since ν0(0,1) it simplifies to,

[esssup-Rn2<t¯<0KRn×{t¯}(ρ¯+-kn)+2ζn¯2dx+Q(Rn2,Rn)|(ρ¯+-kn)+ζn¯|2dxdt]C22nR2ω222s0-2An.

Furthermore we have

122n+2ω222s0-2An+1=|kn-kn+1|2An+1=Q(Rn+12,Rn+1)|kn-kn+1|2χ[(ρ¯+-kn+1)+>0]dxdtQ(Rn+12,Rn+1)|kn-ρ¯+|2χ[(ρ¯+-kn+1)+>0]dxdt(kn-ρ¯+)+L2(Q(Rn+12,Rn+1))2CdAn2/(2+d)(kn-ρ¯+)+V2(Q(Rn+12,Rn+1))2,

where in the last step we have used the emebedding into the parabolic space V2 (cf. Lemma A.4). Thus, as in Lemma 8.6 we have

122n+2ω222s0-2An+1C22nR2ω222s0-2An1+2/(d+2).

This can be simplified to

Xn+1C42nXn1+2/(d+2),

where

Xn=An|Q(Rn2,Rn)|,

and the constant C independent of ω,R,n and dependent only d,m,β,f. Thus, if

X0C-(d+2)/24(d+2)2/2:=ν0, 8.10

by the geometric convergence lemma (cf. Lemma A.2), Xn0 and the result follows as in the proof of Lemma 8.6. Thus, all that remains to be shown is (8.10) holds. Before we do this we introduce the following notation

Bσ(t)={xKR:ρ(x,t)>μ+-ω2σ},

and

Bσ={(x,t)Q(ν02ω1-mR2,R):ρ(x,t)>μ+-ω2σ}.

In this notation (8.10) reads as

|Bs0-1|ν0|Q(ν02ω1-mR2,R)|.

The above inequality means that the subset of Q(ν02ω1-mR2,R) where ρ is close to its supremum can be made arbitrarily small. To show this, we apply the energy estimate of Lemma 8.1 to the function (ρ+μ+-k)+ with

k=μ+-ω2s,q<s<s0,

with a cut-off function ζ defined in Q(ν02ω1-mR2,2R) such that

ζ1,inQ(ν02ω1-mR2,R),ζ=0onpQ(ν02ω1-mR2,2R),|ζ|1R,0ζtωm-1R2.

We delete the first term on the LHS and bound the rest as follows:

12[esssup-R2ω1-mν02-1<t<0K2R×{t}(ρ-k)+2ζ2dx+Q(ν02-1ω1-mR2,2R)(ρ)m-1|(ρ-k)+ζ|2dxdt]2-mωm-1Q(ν02-1ω1-mR2,R)|(ρ-k)+ζ|2dxdt=2-mωm-1Q(ν02-1ω1-mR2,R)|(ρ-k)+|2dxdt,

where we have used the fact that when |(ρ-k)+ζ| is nonzero then ρ>k>ω/2. For the terms on the RHS we bound them as in Lemma 8.6 (note that two of the terms are zero because ρ=μ+ a.e. (xt)). Thus, we have the bound

2-mωm-1Q(ν02-1ω1-mR2,R)|(ρ-k)+|2dxdtC(m,L,β,fL(Ω),ΔfL(Ω))ωm-1R2ω222s-2Q(ν02-1ω1-mR2,2R)χ[(ρ-k)+>0]dxdtCωm-1R2ω222s-2|Q(ν02-1ω1-mR2,2R)|

Since |Q(ν02-1ω1-mR2,2R)|=2d+1|Q(ν02-1ω1-mR2,R)|, multiplying my ω1-m this reduces to

Q(ν02-1ω1-mR2,R)|(ρ-k)+|2dxdtCR2ω222s-2|Q(ν02-1ω1-mR2,R)|.

Note now that BsQ(ν02-1ω1-mR2,R) and, in Bs, |(ρ-k)+|=|(ρ-k)|=|ρ|. Thus, the above inequality gives us

Bs|ρ|2dxdtCR2ω222s-2|Q(ν02-1ω1-mR2,R)|. 8.11

We now apply the lemma of De Giorgi (cf. Lemma A.3) with k1=μ+-ω/2s and k2=μ+-ω/2s+1, to obtain that for all t[-ν02-1ω1-mR2,0]

ω2s+1|Bs+1(t)|CRd+1|KR\Bs(t)|Bs(t)\Bs+1(t)|ρ|dx. 8.12

Since qs-1, by Lemma 8.8, it follows that |Bs-1(t)||Bq(t)|(1-ν02/4)|KR| for all t[-ν02-1ω1-mR2,0]. Thus, for all such t it follows that

|KR\Bs(t)|=|{xKR:ρ(x,t)<μ+-ω2s}||{xKR:ρ(x,t)<μ+-ω2s-1}|=|KR|-|Bs-1(t)|ν024|KR|.

Thus, (8.12) can be rewritten as

ω2s+1|Bs+1(t)|CRd+1|KR|ν02Bs(t)\Bs+1(t)|ρ|dx.

for t[-ν02-1ω1-mR2,0]. We integrate the above inequality over [-ν02-1ω1-mR2,0] to obtain

ω2s+1|Bs+1|CR|KR|ν02Bs\Bs+1|ρ|dxdtCRν02(Bs\Bs+1|ρ|2dxdt)1/2|Bs\Bs+1|1/2Cν02ω2s|Q(ν02-1ω1-mR2,R)|1/2|Bs\Bs+1|1/2,

where in the last step we have applied (8.11). Squaring both sides we obtain

|Bs+1|2Cν04|Q(ν02-1ω1-mR2,R)||Bs\Bs+1|.

Since q<s<s0, we sum the above inequality for s=q+1,,s0-2 to obtain

s=q+1s0-2|Bs+1|2Cν04|Q(ν02-1ω1-mR2,R)|s=q+1s0-2|Bs\Bs+1|.

Note that s=q+1s0-2|Bs\Bs+1||Q(ν02-1ω1-mR2,R)|. Additionally, Bs0-1Bs for all s=q+1,,s0-2. Thus, we have

|Bs0-1|2Cν04((s0-q-3))|Q(ν02-1ω1-mR2,R)|2.

For s0N sufficiently large independent of ω, R, (8.10) is satisfied and the result follows.

Finally we can state the reduction of oscillation result in case 2.

Corollary 8.10

(Reduction of oscillation in case 2). Assume that (8.5) holds with constant ν0 as specified in the proof of Lemma 8.6. Then there exists a σ2(0,1), independent of ω, R, such that

essoscQ(ν02-1ω1-m(R2)2,R2)ρσ2ω.
Proof

We know from Lemma 8.9 that there exists some s0N such that

esssupQ(ν02-1ω1-m(R2)2,R2)ρμ+-ω2s0.

Thus

essoscQ(ν02-1ω1-m(R2)2,R2)ρμ+-ω2s0-μ-(1-12s0)ω.

Thus, for σ2=(1-12s0) the result follows.

We combine the two cases into one:

Lemma 8.11

(Total reduction of oscillation). Fix some 0<R<L such that Q(4R2-ε,2R)ΩT. Assume that essoscQ(4R2-ε,2R)ρω and αωm-1>Rε and that μ->ω/4. Then there exists a constant σ(0,1), depending only on the data (and continuously on β>0), and independent of ω and R, such that

essoscQ(ν02-1ω1-m(R2)2,R2)ρσω.
Proof

The proof follows from the fact that Q(ν02-1ω1-m(R2)2,R2)Q(ω1-m(R2)2,R2) and setting σ=max{σ1,σ2}.

We can now complete the proof of Theorem 3.3:

Proof of Theorem 3.3

We now show that there exist constants γ>1, a(0,1), depending only on the data (W, β, m, d, M), such that for all 0rL we have

essoscQ(ω1-mr2,r)ργω(2rL)a. 8.13

where ω=c1M and c1 is chosen to be large enough so that αωm-1>Lε. We choose as our starting point the cylinder Q(4(L/2)2-ε,L)ΩT. We start by defining

Rk=c0kL/2,c0=12σ(m-1)/εν02<12,ωk=σkω,

for k=0,1, and ε(m-1). We already have that αωm-1>R0ε for all 0rR. This implies that

ωk1-mRkε=σk(1-m)c0kεω1-mR0ε<α(ν04)kε<α.

Additionally, we also have that

σ=σ1+1-mεσm-1ε>c0.

It follows that

essoscQ(ω1-mR02,R0)ρessoscQ(4R02-ε,R0)ρMc1M=ω.

Furthermore, we have

essoscQ(ω1-mR12,R1)ρessoscQ(ω1-mν02-1(R/2)2,R/2)ρσω,

where we have applied Lemma 8.11. We can repeat the procedure starting at Rk with ωk=σkω and μk-:=essinfQ(ω1-mRk2,Rk)ρ assumed to be smaller than ωk/4. If this is not the case, then the equation is uniformly parabolic in Q(ω1-mRk2,Rk) and by parabolic regularity theory (cf. [LSU68]), (8.13) holds for some constants γ>1,a(0,1), depending only on the data. The dependence of the constants on β>0 is continuous.

Assuming μk->ωk/4 and applying the results of Lemma 8.11 to Rk+1 we obtain

essoscQ(ω1-mRk+12,Rk+1)ρ=essoscQ(σk(1-m)ω1-mσ(m-1)(k+2/ε)ν022-2(Rk/2)2,Rk/2)ρessoscQ(ωk1-mν02-1(Rk/2)2,Rk/2)ρσωk.

By induction it follows that

essoscQ(ω1-mRk2,Rk)ρσkω.

Additionally, for all 0rL we have that

c0k+1(L/2)rc0k(L/2),

for some k. Picking a=logc0σ>0, we derive

σk+1(2rL)a.

Thus, we have

essoscQ(ω1-mr2,r)ργω(2rL)a,

where γ=max{σ-1,γ}>1 and a=min{logc0σ,a}(0,1) since σ>σ1>1/2>c0. Note that (8.13) implies that ρ is continuous. One can see this by mollifying with some standard mollifier φε and applying Arzelà–Ascoli to show that the limit as ε0 is continuous.

Now that we have control on the oscillation of the solution we can proceed to the proof of Hölder regularity. Consider a weak solution ρ(x,t) defined on ΩT. We would like the Hölder regularity to be uniform in space and time so we consider only those points such that (x,t)+Q(4(L/2)2-ε,L)ΩT. The local regularity near pΩT can be derived in a similar manner. Fix two points (xt) and (yt) for some t large enough, and consider the recursive scheme starting from K:=(x,t)+Q(4(L/2)2-ε,L)ΩT. Setting r=dTd(x,y) and applying (8.13), we obtain

|ρ(x,t)-ρ(y,t)|essosc(x,t)+Q(ω1-mr2,r)ργω(2rL)aγ2ac1ML-adTd(x,y)a. 8.14

For the time regularity we consider two points (x,t1),(x,t2)ΩT,t1>t2 assuming that |t1-t2|1/2ω1-m(L/2)2. We consider the recursive scheme starting from K:=(x,t1)+Q(4(L/2)2-ε,L)ΩT. Setting r=ω(m-1)/2|t1-t2|1/2, we obtain

|ρ(x,t2)-ρ(x,t1)|essosc(x,t1)+Q(ω1-mr2,r)ργω(2rL)aγ2a(c1M)(2+a(m-1))/2L-a|t1-t2|a/2. 8.15

For |t1-t2|1/2>ω1-m(L/2)2, the proof is easier since

|ρ(x,t2)-ρ(x,t1)|2M2M|t1-t2|a/2(L/2)-a(c1M)(m-1)/2. 8.16

Combining (8.14), (8.15), and (8.16) together we have the required Hölder regularity away from the boundary:

|ρ(x,t1)-ρ(y,t2)|Ch(dTd(x,y)a+|t1-t2|a/2)Ch(dTd(x,y)+|t1-t2|1/2)a, 8.17

where a(0,1) depends continuously on β>0 and Ch depends on M, L, m, γ, and d. The regularity near the parabolic boundary can be derived in a similar manner.

Remark 8.12

We note that the proof of Corollary 3.4 follows from the fact that the constant Ch is uniform in time as long as we are far enough from the initial data ρ0, i.e. if 0<C<t1<t2< for some constant C>0.

Acknowledgements

The authors would like to thank Felix Otto and Yao Yao for useful discussions during the course of this work. We are also grateful to the reviewers for their careful reading of the manuscript and their useful suggestions.

Appendix A. Some useful results

In this section we state some useful lemmas and inequalities which we will use in the proof of Theorem 3.3.

Lemma A.2

(Geometric convergence lemma). Let {Xn}nN be a sequence of nonnegative real numbers satisfying the recurrence inequality

Xn+1CbnXn1+a,

for some C,b>1 and a>0. If X0C-1/ab-1/a2, then limnXn=0.

Let ΩTd be a smooth, convex, open subdomain. Then we have the following lemma due to De Giorgi [DG57]:

Lemma A.3

Given a function vW1,1(Ω) and real numbers k1<k2 we define

[vki]:={xΩ:v(x)ki}[k1<v<k2]:={xΩ:k1<v(x)<k2}.

Then there exists a constant C=C(d) such that

(k2-k1)|[v>k2]|CRd+1|[v<k1]|[k1<v<k2]|v|dx,

where R=diam(Ω).

Consider now the parabolic space V2(ΩT), equipped with the norm

ρV2(ΩT)2:=esssup0tTρL2(Ω)2(t)+ρL2(ΩT)2.

We then have the following embedding [DiB93, page 9]:

Lemma A.4

Let ρV2(ΩT). Then there exists a constant Cd depending only on d such that

ρL2(ΩT)2Cd|{|ρ|>0}|2/(2+d)ρV2(ΩT)2.

Appendix B. Bifurcation theory

We state here the Crandall–Rabinowitz theorem (cf. [Nir01, Kie12]) for bifurcations with a one-dimensional kernel.

Theorem B.1

Consider a separable Hilbert space X with UX an open neighbourhood of 0, and a nonlinear C2 map, F:U×VX, where V is an open subset of R+ such that F(0,κ)=0 for all κV. Assume the following conditions are satisfied for some κV:

  1. Dx(0,κ)F is a Fredholm operator with index zero and has a one-dimensional kernel.

  2. Dxκ2(0,κ)F[v0^]Im(Dx(0,κ)), where v0^ker(Dx(0,κ)),v0^=1 .

Then, there exists a nontrivial C1 curve through (0,κ) such that for some δ>0,

{(x(s),κ(s)):s(-δ,δ),x(0)=0,κ(0)=κ},

and F(x(s),κ(s))=0. Additionally, for some neighbourhood of (0,κ), this is the only such solution (apart from the trivial solution) and it has the following form:

x(s)=sv0^+Ψ(sv0^,ψ(s)),κ(s)=ψ(s),

where Ψ:ker(Dx(0,κ))×R+(ker(Dx(0,κ))) is a C1 map and ψ:(-δ,δ)V is a C1 function such that ψ(0)=κ. Furthermore if DκΨ(v0,κ)=0, we obtain a simplified expression of the form

x(s)=sv0^+r1(sv0^,ψ(s)),

such that lim|s|+|ψ(s)-κ|0r1(sv0^,ψ(s))|s|+|ψ(s)-κ|=0.

Footnotes

JAC was partially supported by EPSRC Grant number EP/P031587/1 and the Advanced Grant Nonlocal-CPD (Nonlocal PDEs for Complex Particle Dynamics: Phase Transitions, Patterns and Synchronization) of the European Research Council Executive Agency (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No. 883363). RSG was funded by an Imperial College President’s PhD Scholarship, partially through EPSRC Award Ref. 1676118. Part of this work was carried out at the Junior Trimester Programme in Kinetic Theory” held at the Hausdorff Research Institute for Mathematics, Bonn. RSG is grateful to the institute for its hospitality.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

José A. Carrillo, Email: carrillo@maths.ox.ac.uk

Rishabh S. Gvalani, Email: gvalani@mis.mpg.de

References

  • [AGS08].Ambrosio L, Gigli N, Savaré G. Gradient Flows in Metric Spaces and in the Space of Probability Measures. Basel: Birkhäuser Verlag; 2008. [Google Scholar]
  • [ASBCD19].Aceves-Sánchez P, Bostan M, Carrillo J-A, Degond P. Hydrodynamic limits for kinetic flocking models of Cucker–Smale type. Math. Biosci. Eng. 2019;16(6):7883–7910. doi: 10.3934/mbe.2019396. [DOI] [PubMed] [Google Scholar]
  • [BCD+18].Barré J, Carrillo JA, Degond P, Peurichard D, Zatorska E. Particle interactions mediated by dynamical networks: assessment of macroscopic descriptions. J. Nonlinear Sci. 2018;28(1):235–268. doi: 10.1007/s00332-017-9408-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [BCL09].Blanchet A, Carrillo JA, Laurençot P. Critical mass for a Patlak–Keller–Segel model with degenerate diffusion in higher dimensions. Calc. Var. Partial Differ. Equ. 2009;35(2):133–168. [Google Scholar]
  • [BCM07].Burger M, Capasso V, Morale D. On an aggregation model with long and short range interactions. Nonlinear Anal. Real World Appl. 2007;8(3):939–958. [Google Scholar]
  • [BCnCD16].Barbaro ABT, Cañizo JA, Carrillo JA, Degond P. Phase transitions in a kinetic flocking model of Cucker–Smale type. Multiscale Model. Simul. 2016;14(3):1063–1088. [Google Scholar]
  • [BDZ17].Barré J, Degond P, Zatorska E. Kinetic theory of particle interactions mediated by dynamical networks. Multiscale Model. Simul. 2017;15(3):1294–1323. [Google Scholar]
  • [Bre11].Brezis H. Functional Analysis, Sobolev Spaces and Partial Differential Equations. New York: Springer; 2011. [Google Scholar]
  • [BS10].Bertozzi AL, Slepčev D. Existence and uniqueness of solutions to an aggregation equation with degenerate diffusion. Commun. Pure Appl. Anal. 2010;9(6):1617–1637. [Google Scholar]
  • [BV13].Bodnar M, Velázquez J. Friction dominated dynamics of interacting particles locally close to a crystallographic lattice. Math. Methods Appl. Sci. 2013;36(10):1206–1228. [Google Scholar]
  • [CCH15].Carrillo JA, Chertock A, Huang Y. A finite-volume method for nonlinear nonlocal equations with a gradient flow structure. Commun. Comput. Phys. 2015;17(1):233–258. [Google Scholar]
  • [CCW+20].Carrillo JA, Chen X, Wang Q, Wang Z, Zhang L. Phase transitions and bump solutions of the Keller–Segel model with volume exclusion. SIAM J. Appl. Math. 2020;80(1):232–261. [Google Scholar]
  • [CCY19].Carrillo JA, Craig K, Yao Y. Aggregation-diffusion equations: dynamics, asymptotics, and singular limits. In: Bellomo N, Degond P, Tadmor E, editors. Active Particles. Cham: Birkhäuser/Springer; 2019. pp. 65–108. [Google Scholar]
  • [CF87].Caffarelli LA, Friedman A. Asymptotic behavior of solutions of ut=Δum as m. Indiana Univ. Math. J. 1987;36(4):711–728. [Google Scholar]
  • [CGPS20].Carrillo JA, Gvalani RS, Pavliotis GA, Schlichting A. Long-time behaviour and phase transitions for the Mckean–Vlasov equation on the torus. Arch. Ration. Mech. Anal. 2020;235(1):635–690. [Google Scholar]
  • [CGW20].Carrillo, J.A., Gvalani, R.S., Wu, J.: An invariance principle for gradient flows in the space of probability measures. arXiv e-prints arXiv:2010.00424 (2020)
  • [CH98].Cazenave, T., Haraux, A.: An Introduction to Semilinear Evolution Equations. Oxford Lecture Series in Mathematics and its Applications, vol. 13. The Clarendon Press, Oxford University Press, New York (1998). Translated from the 1990 French original by Yvan Martel and revised by the authors
  • [CKY13].Chayes L, Kim I, Yao Y. An aggregation equation with degenerate diffusion: qualitative property of solutions. SIAM J. Math. Anal. 2013;45(5):2995–3018. [Google Scholar]
  • [CKY18].Craig K, Kim I, Yao Y. Congested aggregation via Newtonian interaction. Arch. Ration. Mech. Anal. 2018;227(1):1–67. [Google Scholar]
  • [CMS+19].Carrillo JA, Murakawa H, Sato M, Togashi H, Trush O. A population dynamics model of cell–cell adhesion incorporating population pressure and density saturation. J. Theoret. Biol. 2019;474:14–24. doi: 10.1016/j.jtbi.2019.04.023. [DOI] [PubMed] [Google Scholar]
  • [CMV03].Carrillo JA, McCann RJ, Villani C. Kinetic equilibration rates for granular media and related equations: entropy dissipation and mass transportation estimates. Rev. Mat. Iberoam. 2003;19(3):971–1018. [Google Scholar]
  • [CP10].Chayes L, Panferov V. The McKean–Vlasov equation in finite volume. J. Stat. Phys. 2010;138(1–3):351–380. [Google Scholar]
  • [CT20].Craig K, Topaloglu I. Aggregation-diffusion to constrained interaction: minimizers & gradient flows in the slow diffusion limit. Ann. Inst. H. Poincaré Anal. Non Linéaire. 2020;37(2):239–279. [Google Scholar]
  • [Daw83].Dawson DA. Critical dynamics and fluctuations for a mean-field model of cooperative behavior. J. Stat. Phys. 1983;31(1):29–85. [Google Scholar]
  • [DFL15].Degond P, Frouvelle A, Liu J-G. Phase transitions, hysteresis, and hyperbolicity for self-organized alignment dynamics. Arch. Ration. Mech. Anal. 2015;216(1):63–115. [Google Scholar]
  • [DG57].De Giorgi E. Sulla differenziabilità e l’analiticità delle estremali degli integrali multipli regolari. Mem. Accad. Sci. Torino. Cl. Sci. Fis. Mat. Nat. 1957;3(3):25–43. [Google Scholar]
  • [DiB79].DiBenedetto E. Regularity results for the porous media equation. Ann. Mat. Pura Appl. 1979;4(121):249–262. [Google Scholar]
  • [DiB93].DiBenedetto E. Degenerate Parabolic Equations. New York: Universitext. Springer-Verlag; 1993. [Google Scholar]
  • [FL12].Frouvelle A, Liu J-G. Dynamics in a kinetic model of oriented particles with phase transition. SIAM J. Math. Anal. 2012;44(2):791–826. [Google Scholar]
  • [FP08].Figalli A, Philipowski R. Convergence to the viscous porous medium equation and propagation of chaos. ALEA Lat. Am. J. Probab. Math. Stat. 2008;4:185–203. [Google Scholar]
  • [GP70].Gates DJ, Penrose O. The van der Waals limit for classical systems. III. Deviation from the van der Waals–Maxwell theory. Commun. Math. Phys. 1970;17(3):194–209. [Google Scholar]
  • [HZ19].Hwang, S., Zhang, Y.P.: Continuity results for degenerate diffusion equations with LtpLxq drifts. arXiv e-prints arXiv:1906.04961 (2019)
  • [JKO98].Jordan R, Kinderlehrer D, Otto F. The variational formulation of the Fokker–Planck equation. SIAM J. Math. Anal. 1998;29(1):1–17. [Google Scholar]
  • [Kie12].Kielhöfer H. Bifurcation Theory: An Introduction with Applications to Partial Differential Equations. 2. New York: Springer; 2012. [Google Scholar]
  • [KPW19].Kim I, Požár N, Woodhouse B. Singular limit of the porous medium equation with a drift. Adv. Math. 2019;349:682–732. [Google Scholar]
  • [KZ18].Kim I, Zhang YP. Regularity properties of degenerate diffusion equations with drifts. SIAM J. Math. Anal. 2018;50(4):4371–4406. [Google Scholar]
  • [LSU68].Ladyženskaja, O.A., Solonnikov, V.A., Uraltceva, N.N.: Linear and Quasilinear Equations of Parabolic Type. Translated from the Russian by S. Smith. Translations of Mathematical Monographs, Vol. 23. American Mathematical Society, Providence, RI (1968)
  • [Nir01].Nirenberg, L.: Topics in Nonlinear Functional Analysis. Courant Lecture Notes in Mathematics. vol. 6. New York University, Courant Institute of Mathematical Sciences, New York; American Mathematical Society, Providence, RI. Chapter 6 by E. Zehnder, Notes by R. A. Artino, Revised reprint of the 1974 original (2001)
  • [Oel90].Oelschläger K. Large systems of interacting particles and the porous medium equation. J. Differ. Equ. 1990;88(2):294–346. [Google Scholar]
  • [Ott01].Otto F. The geometry of dissipative evolution equations: the porous medium equation. Commun. Partial Differ. Equ. 2001;26(1–2):101–174. [Google Scholar]
  • [Phi07].Philipowski R. Interacting diffusions approximating the porous medium equation and propagation of chaos. Stoch. Process. Appl. 2007;117(4):526–538. [Google Scholar]
  • [Rod16].Rodrigues, A.M.: Regularity properties for the porous medium equation. Master’s thesis, Unviersidade de Coimbra (2016)
  • [Sch85].Schaaf R. Stationary solutions of chemotaxis systems. Trans. Am. Math. Soc. 1985;292(2):531–556. [Google Scholar]
  • [Szn91].Sznitman A-S. Topics in propagation of chaos. In: Hennequin PL, editor. École d’Été de Probabilités de Saint-Flour XIX—1989. Berlin: Springer; 1991. pp. 165–251. [Google Scholar]
  • [Tam84].Tamura Y. On asymptotic behaviors of the solution of a nonlinear diffusion equation. J. Fac. Sci. Univ. Tokyo Sect. IA Math. 1984;31(1):195–221. [Google Scholar]
  • [Tug14].Tugaut J. Phase transitions of McKean–Vlasov processes in double-wells landscape. Stochastics. 2014;86(2):257–284. [Google Scholar]
  • [Urb08].Urbano JM. The Method of Intrinsic Scaling: A Systematic Approach to Regularity for Degenerate and Singular PDEs. Berlin: Springer-Verlag; 2008. [Google Scholar]
  • [V07].Vázquez JL. The Porous Medium Equation. Oxford University Press, Oxford, Mathematical theory: The Clarendon Press; 2007. [Google Scholar]
  • [Vil03].Villani C. Topics in Optimal Transportation. Providence, RI: American Mathematical Society; 2003. [Google Scholar]
  • [VS15].Volkening A, Sandstede B. Modelling stripe formation in zebrafish: an agent-based approach. J. R. Soc. Interface. 2015;12(112):20150812. doi: 10.1098/rsif.2015.0812. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Communications in Mathematical Physics are provided here courtesy of Springer

RESOURCES