Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1997 Jan 7;94(1):33–35. doi: 10.1073/pnas.94.1.33

Lp estimates for the bilinear Hilbert transform

Michael Lacey , Christoph Thiele
PMCID: PMC19231  PMID: 11038537

Abstract

For the bilinear Hilbert transform given by: Inline graphic we announce the inequality ∥H fgp3Kp1,p2fp1gp2, provided 2 < p1, p2 < ∞, 1/p3 = 1/p1 + 1/p2 and 1 < p3 < 2.


We announce a partial resolution to long standing conjectures concerning the operator known as the bilinear Hilbert transform, defined as follows:

graphic file with name M2.gif

This operation is initially defined only for certain functions f and g, for instance those in the Schwartz class on ℝ. The conjectures concern the extension of H to a bounded operator on Lp spaces. We have proved:

Theorem 1. H extends to a bounded operator on Lp1 × Lp2 into Lp3, provided 2 < p1, p2 < ∞ and 1 < p3 < 2, where 1/p3 = 1/p1 + 1/p2.

Some 30 years ago, in connection with the Cauchy integral on Lipschitz curves, Calderón (1) raised the question of H mapping L2 × L2 into L1; this inequality is true. Indeed, the bilinear Hilbert transform maps into Lp3  provided only that p3 > 2/3.

Study of the bilinear Hilbert transform is intimately related to Carleson’s theorem (2) asserting the pointwise convergence of Fourier series. A seminal result, it has received two proofs, with the alternative proof provided by Fefferman (3). These proofs have provided us with ingenious and complementary methods of time frequency analysis. A similar analysis seems necessary to understand H, and so our proof entails significant aspects of both Carleson’s and Fefferman’s proofs. We give a description of our proof, with details presented in their most concrete form. Complete proofs, which appear in ref. 4, require definitions and constructions somewhat more general than those presented here.

The bilinear Hilbert transform must be broken into scales and the frequency behavior of each scale understood. Hence we replace the kernel 1/y with Σj=−∞ 2jρ(2jy), where ρ is a Schwartz function with Fourier transform ρ̂(ξ) = ∫ e−2πixξρ(x) dx supported on [1/2, 2). For each j, consider:

graphic file with name M3.gif

which has bilinear symbol ρ̂(2j(ξ − θ)). More specifically,

graphic file with name M4.gif

Therefore, if f is supported in frequency on the interval [n2j, (n + 1)2j], then Hj fg(x) acts on the inverse Fourier transform of ĝ(ξ)1[(n + 1/2)2j, (n + 3)2j](ξ), and is supported in frequency on the interval [(2n + 1/2)2j, (2n + 4)2j]. The differing rates of translation make these three intervals distinct.

It is important to note that the location of the intervals is arbitrary, and therefore, for all j and j′, the inner product of Hj fg and Hj fg need not tend to zero as |jj′| tends to infinity. The analysis of H must be done in terms of both time and frequency.

Instead of proceeding with a decomposition of H, we define a model of it adapted to the combinatorics of the time–frequency plane. Let 𝒟 be a dyadic grid in ℝ. Call I × ω ∈ 𝒟 × 𝒟 a tile if |I|·|ω| = 1. The interval ω is a union of four dyadic subintervals of equal length, ω1, ω2, ω3, and ω4, which we list in ascending order. Thus, ξi < ξj for all 1 ≤ i < j ≤ 4 and ξj ∈ ωj. (We will only use ωj for j = 1, 2, 3.) We adopt the notation t = It × ωt and tj = It × ωtj for j = 1, 2, 3. Fix a Schwartz function φ with φ̂ supported on [−1/8, 1/8], in addition require that ∫ φ(x − 16n)φ(x) dx = 0 for all integers n. Set for all tiles t and j = 1, 2, 3,

graphic file with name M5.gif

where c(J) denotes the center of the interval J.

Then our model of the bilinear Hilbert transform is

graphic file with name M6.gif

which is initially defined only for Schwartz functions f1 and f2. We emphasize that the sum extends over all tiles, and hence all scales. The analogue of Theorem 1 is

Theorem 2. ℳ extends to a bounded operator on Lp1 × Lp2 into Lp′3, provided 2 < p1, p2 < ∞ and 1 < p′3 = (1/p1 + 1/p2)−1 < 2.

With more liberal notions of “grid,” “tile,” and “φtj,” the bilinear Hilbert transform is in the convex hull of terms like our model ℳ.

In the present situation we can give a proof by way of duality. Thus we take f3Lp3 and show that:

graphic file with name M7.gif

The sum is over positive quantities; namely, the decomposition above already captures all of the cancellation necessary for convergence of the sums. It also shows that the sum defining ℳ is unconditionally convergent in t. And as each fjLpj, where pj > 2, it follows that each function is locally square integrable. As it turns out, L2 arguments are decisive in proving Theorem 2.

We localize the sum above in the x variable by setting:

graphic file with name M8.gif

Certainly ∫ Ft(x) dx = |It|−1/2j=13 |〈fj, φtj〉|. And so we show that F(x) = ΣtFt(x) is integrable. This follows from a weak-type result: for pj as above, there is a δ > 0 so that for all |rjpj| < δ, 1 ≤ j ≤ 3, the operator F(x) maps Lr1 × Lr2 × Lr3 into Lr,∞, where 1/r = 1/r1 + 1/r2 + 1/r3. Then a variant of the Marcinkiewicz interpolation theorem due to Janson (5) implies the strong-type inequality.

A single instance of the weak-type inequality is:

graphic file with name M9.gif 1

for some constant K. But this inequality implies the weak-type result, because F commutes with dilations by powers of 2, and so it suffices to establish this last inequality. These observations are useful since some of our estimates begin to break down on exceptional sets of small measure. Due the localization of Ft in the time variable and the fact that we only aim for a distributional inequality, we can delete tiles t whose time coordinate falls in a set of bounded measure.

The combinatorics of the time frequency plane enter in by way of the partial order on the tiles given by t < t′ if ItIt and ω ⊃ ω′. Note that t and t′ are not comparable with respect to < if and only if tt′ = ∅. Being disjoint suggests orthogonality for the functions φtj and φtj, the dominant theme of Lemmas 1–3 we state below.

Call a collection of tiles T a Carleson–Fefferman (CF) set with top q if t < q for all tT. Thus ωq ∩ ωt ≠ ∅ for tT. Call T a j-CF set if T is a CF-set for which the intervals ωtj intersect for all tT. Notice that if T is a 1-CF set, say, then the intervals {ωtj|tT} are pairwise disjoint for j = 2, 3. Therefore, by application of Cauchy–Schwartz:

graphic file with name M10.gif 2
graphic file with name M11.gif

Notice that the last two square functions are Littlewood–Paley g functions, albeit conjugated by an exponential to account for the location of the CF set in frequency.

This estimate forms the motivation for Lemma 1 below, which formalizes a decomposition of the set of tiles that is fundamental to our argument.

Lemma 1. Fix pi > 2. There is a δ > 0 and an ɛ0 > 0 and a constant K so that for all |ri − pi| < δ and 0 < ɛ < ɛ0, the following holds. The collection of all tiles S is a union:

graphic file with name M12.gif

with these properties. First, S0 is trivial in that:

graphic file with name M13.gif 3

Then Sn,i,j is a union of disjoint i-CF sets Tq with tops q ∈ S*n,i,j, and:

graphic file with name M14.gif 4

Here, recall that 1/r = Σi 1/ri, which can be taken arbitrarily close to 1. And, most significantly, for t = mini{pi/2} − ɛ,

graphic file with name M15.gif 5

With Lemma 1 in place, we estimate:

graphic file with name M16.gif
graphic file with name M17.gif

The last sum is finite as r is arbitrarily close to one, while t + ɛ = min{pi/2} > 1 is a fixed distance from one. Therefore, with Eq. 3, Eq. 1 holds.

We cannot give the complete construction of the Sn,i,j, but rather the initial steps, in which the nearly orthogonal classes of φti are identified. First, we make an important comparison to a maximal function. If Tq is an i-CF set with top q, we have for ji,

graphic file with name M18.gif

Here M2g is the maximal function (M|g|2)1/2. Thus the set F = ∪i{M2fi > C−1} has bounded measure and we define S0 = {s|IsF}, making Eq. 4 trivial. For all i-CF sets T with top q, and TS/S0, we have Δ(T, j) ≤ 1, for ji.

The remaining construction is inductive. Assume that the Sm,i,j are defined for all m < n and all i, j, in such a way that for Sr = S/∪m<ni,jSm,i,j) we have:

graphic file with name M19.gif 6

and for any i-CF set TqSr with top q, Δ(Tq, j) ≤ 2n/rj+2, for ji. As the same inequality applies to each sub-CF set of T, we conclude that:

graphic file with name M20.gif 7

We define S*n,1,1 to be the set of maximal tiles q with |〈f1, φq,1〉| ≥ 2n/r1−1Inline graphic, and take Sn,1,1 to consist of all tiles t so that t1 < q for some qS*n,1,1. These tiles are removed, and then Sn,i,i is defined similarly for i = 2, 3. After the deletion of the tiles D0 = ∪i=13Sn,i,i, we have |〈fi, φti〉| ≤ 2n/ri−1Inline graphic for all tiles tSr = Sr/D0.

The set Sn,1,2 has a slightly different construction. Consider 1-CF sets TqSr with top q so that Δ(T, 2) ≥ 2n/r2+1. We take Tq to be the maximal 1-CF set with this property. Let q(1) be such a top, which is maximal with respect to <, and in addition sup{ξ|ξ ∈ ωq} is maximal. Remove the tiles Tq(1), and repeat this procedure to define Tq(2) and so on. Sn,1,2 is then ∪Tq(ℓ) and S*n,1,2 = {q(ℓ)|ℓ ≥ 1}. Observe that for any 1-CF set TSr/Sn,1,2, we have Δ(T, 2) ≤ 2n/r2+1. These procedures are repeated inductively to define the Sn,i,j for all n, i, j.

With the construction above it is elementary to check that these properties hold.

graphic file with name M23.gif 8

And in the case of ij, the collection Sn,i,j is a union of disjoint i-CF sets Tq, with qS*n,i,j, for which:

graphic file with name M24.gif 9

These last two bounds differ by a factor of 4, which is relevant below. See the comments concerning the minimal tiles immediately following Lemma 3 below. To achieve Eq. 4, one must delete some tiles t, using Eq. 2, the upper bounds Eqs. 6 and 7, and the control on the number of trees given in Eq. 5.

The essence of the matter lies in the control of the number of CF-sets, which is in the verification of Eq. 5. Eq. 5 relies upon the inequalities in the previous paragraph and Lemma 2 and 3 below, which address the issue of almost orthogonality.

Let us consider Sn,1,1, say. The tiles S*n,1,1 are maximal and therefore pairwise disjoint, which suggest weak orthogonality for the collection of functions {φq,1|qS*n,1,1}. If they were in fact orthogonal, Bessel’s inequality and Eq. 8 implies:

graphic file with name M25.gif

While f1 is not in L2, this inequality can be strengthened to an analogous form for Lr for r > 2.

However, disjointness of tiles does not imply orthogonality, because the functions φq,1 are not compactly supported in the x variable. Indeed, by our choice of φ, for two tiles t and s we have 〈φti, φsi〉 = 0 if ωti ∩ ωsi = ∅ or ωti = ωsi. But if ωti ωsi then for all n ≥ 0:

graphic file with name M26.gif 10

If we assume that a stronger separation of the tiles in the x variable, then we would expect orthogonality. And in this direction we have:

Lemma 2. For n ≥ 1 there are constants K and Kn so that the following holds for all A ≥ 1. Let S be any collection of tiles so that:

graphic file with name M27.gif 11

Here, for an interval I, AI denotes the interval with the same center as I and length A|I|. Set NS(x) = Σt∈S1It(x). Then:

graphic file with name M28.gif

A further combinatorial lemma asserts that if the tiles {ti|tS′} are merely disjoint, then after deleting tiles t for which It falls in an exceptional set of small measure, S′ is a union of O(A3) collections of tiles S that satisfy the stronger disjointness condition (Eq. 11).

The previous lemma is essential in obtaining Eq. 5 for the classes Sn,i,i. A corresponding lemma is necessary for the Sn,i,j, with ij, with Eq. 9 replacing the role of Eq. 8. It is:

Lemma 3. For n ≥ 1 there are constants K and Kn so that the following holds for all A ≥ 1. Let S be a union of j-CF sets Tq with tops q ∈ S*. Suppose that:

graphic file with name M29.gif

and for t ∈ Tq, q ∈ S* and i ≠ j fixed,

graphic file with name M30.gif 12

Set NS(x) = Σq∈S* 1Iq(x). Then:

graphic file with name M31.gif

Notice that in a j-CF set Tq, the tiles {ωti|tTq} are pairwise disjoint. Thus Eq. 12 is stronger than merely asserting that the tiles {ωti|tT} are pairwise disjoint. With the construction of the Sn,i,j for ij given above, Eq. 12 is true after deleting the minimal tiles Sn,i,jmin in Sn,i,j. The minimal tiles are controlled with the first half of Eq. 9 and the observation that ΣsSn,i,jmin 1Is(x) ≤ Nn,i,j(x) for all x.

The method of proof of both Lemmas 2 and 3 is similar. For instance, in Lemma 2, one considers the operator:

graphic file with name M32.gif

If S is finite, this is a compact self-adjoint operator, with maximal eigenvalue B. It suffices to estimate B, as for all fL2, ΣtS |〈f, φti〉|2 = 〈f, 𝒮Sf〉 ≤ Bf22. Consider a normalized extremal eigenfunction f of 𝒮S. One then estimates:

graphic file with name M33.gif

which is expanded in diagonal and off-diagonal terms. The diagonal term is ΣtS |〈f, φti〉|2 = 〈f, 𝒮Sf〉 ≤ B, which is an adequate estimate for B2. The off-diagonal term is by Cauchy–Schwarz,

graphic file with name M34.gif
graphic file with name M35.gif

The innermost sum is bounded by CnAnInline graphic infxItMMf(x). This is seen by invoking the estimate |〈f, φsi〉| ≤ KInline graphic infxIsMf(x), using Eq. 10 and carefully exploiting the geometry of the tiles via the assumption 11.

One then sees that the off-diagonal term is no more than:

graphic file with name M38.gif

This, with the diagonal estimate, proves that B2B + CnB1/2AnNS, whence follows Lemma 2.

The research outlined herein is the product of several years of effort (6, 7).

Acknowledgments

The success of our efforts is due to the guidance and encouragement we have received from R. Coifman. M.L. has been supported by the National Science Foundation. M.L. and C.T. acknowledge the support of a North Atlantic Treaty Organization travel grant.

Footnotes

Abbreviation: CF, Carleson–Fefferman.

References

  • 1.Calderón A P. Proc Natl Acad Sci USA. 1965;53:1092–1099. doi: 10.1073/pnas.53.5.1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Carleson L. Acta Math. 1966;116:135–157. [Google Scholar]
  • 3.Fefferman C. Ann Math. 1973;98:551–571. [Google Scholar]
  • 4.Lacey, M. & Thiele, C. (1997) Ann. Math., in press.
  • 5.Janson, S. (1986) in Lect. Notes Math., eds. 1302, 290–302.
  • 6.Lacey, M. (1997) Rev. Mat. Iberoamericana, in press.
  • 7.Thiele C. Ph.D. thesis. New Haven, CT: Yale University; 1995. [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES