Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1997 Oct 14;94(21):11143–11146. doi: 10.1073/pnas.94.21.11143

Congruences between modular forms: Raising the level and dropping Euler factors

Fred Diamond 1
PMCID: PMC34509  PMID: 11607760

Abstract

We discuss the relationship among certain generalizations of results of Hida, Ribet, and Wiles on congruences between modular forms. Hida’s result accounts for congruences in terms of the value of an L-function, and Ribet’s result is related to the behavior of the period that appears there. Wiles’ theory leads to a class number formula relating the value of the L-function to the size of a Galois cohomology group. The behavior of the period is used to deduce that a formula at “nonminimal level” is obtained from one at “minimal level” by dropping Euler factors from the L-function.


An example of a congruence between modular forms is provided by the newforms

graphic file with name M1.gif

of levels 11 and 77, respectively, whose first few Fourier coefficients are found in Table 1. One can show that, in fact, anbn mod 3 for all n not divisible by 7. (See Theorem 5.1 below.)

We shall discuss the relationship among the following three results concerning congruences to a newform ƒ of weight 2 and level N. We assume that K is a number field containing the coefficients of ƒ and restrict our attention to congruences mod powers of a prime λ dividing ℓ.

•  A formula of Hida (1) measuring congruences to ƒ in terms of the value of an L-function.

•  A result of Ribet (2) that establishes the existence of certain systematic congruences between ƒ and forms of level Np (such as the one above).

•  A theorem of Wiles (3), completed by his work with Taylor (4), which shows that all suitable deformations of Galois representations associated to ƒ actually arise from forms congruent to ƒ.

Hida’s formula, though not part of the logical structure of ref. 3, provides some insight into the role played in Wiles’ proof by a certain generalization of Ribet’s result. This generalization can be interpreted as the invariance of a period appearing in Hida’s formula. Using this invariance, one shows that Wiles’ theorem at minimal level implies the theorem at nonminimal level.

Remark 1.1: We are concerned here mainly with Ribet’s “raising the level” result, rather than his “lowering the level” result of ref. 5. We remark that Hida also found systematic congruences between ƒ and forms of level Nr. We shall not discuss these, but focus on congruences between ƒ and forms of level Nd with d not divisible by ℓ.

Notation and Review

We fix a prime ℓ and embeddings and C. Suppose that K is a number field contained in C and let λ denote the prime of 𝒪K determined by our choice of embeddings. Let 𝒪 denote the localization of 𝒪K at λ.

We suppose that ƒ is a newform of weight 2, level Nƒ and character χƒ with coefficients in K. The Eichler–Shimura construction associates to ƒ an ℓ-adic representation

graphic file with name M2.gif

such that if p does not divide Nƒℓ, then ρƒ is unramified at p and ρƒ(Frobp) has characteristic polynomial

graphic file with name M3.gif 1

We let ρ̄ƒ denote the semisimplification of the reduction of ƒ. If ƒ and g are newforms of weight 2, then we write ƒ ∼ g if ρ̄ƒ is equivalent to ρ̄g. By the Cebotarev density theorem and the Brauer–Nesbitt theorem, we have ƒ ∼ g if and only if ap(ƒ) ≡ ap(g) for all primes p not dividing NƒNgℓ, the congruence being modulo the maximal ideal of the integral closure of Z in .

We assume throughout that ℓ is odd, ℓ2 does not divide Nƒ, and ℓ does not divide the conductor of χƒ. We assume also that the restriction of ρ̄f to Gal (/F) is irreducible where F is the quadratic subfield of Q). It is convenient to distinguish two sets of primes which can create technical problems.

•  We let Sƒ denote the set of primes p such that ρƒ|dp is not minimally ramified in the sense of ref. 6.

•  We let Pƒ denote the set of primes ρ ≠ ℓ such that ρ̄ƒIp = 0, but ad0(ρ̄ƒ)Ip ≠ 0.

If p is not in Pƒ ∪ ℓ, then p is in Sƒ if and only if the powers of p differ in the conductors of ρƒ and ρ̄ƒ. In the introductory example, we have Sƒ = Pƒ = Pg = ∅, and Sg = {7}.

Counting Congruences

We assume that N is divisible by Nƒ but not by ℓ2 and let

graphic file with name M4.gif
graphic file with name M5.gif

Let TN denote the 𝒪-subalgebra of ∏g∈FN C generated by the set of Tp for p not dividing Nℓ, where Tp denotes (ap(g))g. Then TN is a local ring, free over 𝒪 of rank equal to the cardinality of FN.

Consider the homomorphism πƒ:TN → 𝒪 defined by projection to the ƒ coordinate. Define ideals of TN by

graphic file with name M6.gif
graphic file with name M7.gif

Then the ideal πƒ(Jƒ) has finite index in 𝒪, and is called a congruence ideal. This is a variant of the notion of a congruence module used in refs. 1 and 2.

To see how it measures congruences, consider again the above example with ƒ of level 11. We suppose that N = 77 and ℓ = 3. Then T77 can be identified with

graphic file with name M8.gif

and we find that the congruence ideal is 3𝒪.

We consider also some useful variants. Suppose that Σ is a finite set of primes containing Sƒ. We let FΣ denote the set

graphic file with name M9.gif
graphic file with name M10.gif

We then define TΣ as above, but using the set FΣ instead of FN. We denote the resulting congruence ideal Cƒ,Σ. If ƒ is replaced by the newform associated to a twist, then TΣ is replaced by a ring to which it is canonically isomorphic, and we obtain the same congruence ideal. So we suppose from now on that χƒ is of order not divisible by ℓ.

If Σ contains Pƒ, then FΣ can be identified with FNΣ for a certain integer NΣ. Assuming this holds, we shall also associate to ƒ and Σ a cohomology congruence ideal.

Let ΓH(NΣ) denote the maximal subgroup of Γ0(NΣ) in which Γ1(NΣ) has ℓ-power order. Let T denote the 𝒪-subalgebra of

graphic file with name M11.gif

generated by the Hecke operators Tn for n ≥ 1. We let ƒΣ denote the normalized T-eigenform characterized by

•  the newform associated to ƒΣ is ƒ;

•  apΣ) = 0 for primes p in Σ − {ℓ};

•  alΣ) is a unit in 𝒪 if ℓ divides NΣ;

where we have enlarged K if necessary. Consider the prime ideal θ in T defined as the kernel of the map T → 𝒪 arising from ƒΣ, and let m denote the maximal ideal generated by θ and λ. If ρ̄ƒ is irreducible, the completion of TΣ at its maximal ideal can be identified with the completion of T at m. (See section 4.2 of ref. 7.)

We now define a cohomology congruence ideal using the cohomology of the modular curve XΣ = XH(NΣ) = ΓH(NΣ)∖ℋ*. We have a natural action of T on

graphic file with name M12.gif

We choose a basis {x, y} for the rank two submodule M = H1(XΣ,𝒪)[θ], the intersection of the kernels of the elements of θ. We define the cohomology congruence ideal

graphic file with name M13.gif

where 〈,〉 is the perfect pairing on H1(XΣ,𝒪) gotten from xWy, where W is the Atkin–Lehner involution. One checks the following (see section 4.4 of ref. 7).

Lemma 3.1. The ideal Cƒ,Σ is contained in Cƒ,Σcoh. Furthermore if the completion H1(XΣ,𝒪)m is free over TmTΣ, then equality holds.

Remark 3.2: The freeness of H1(XΣ,𝒪)m is equivalent to H1(XΣ,k)[m] being two-dimensional over k, which is known under our hypotheses through work of Mazur et al. (see section 2.1 of ref. 3).

Relation with L-Functions

Hida’s formula relates Cƒ,Σcoh to the value of an L-function. We consider the L-function associated to the Galois representation ad0ρƒ. This L-function is defined by analytic continuation of the Euler product

graphic file with name M14.gif 2

where for primes p not dividing Nƒ, the Euler factor Lp(ad0ρƒ,s) is

graphic file with name M15.gif

αp and βp being the roots of Eq. 1. We shall not give here the recipe for the Euler factors at primes p dividing Nƒ. We remark, however, that L(ad0ƒ,s) remains the same if ƒ is replaced by the newform associated to a twist, and that if Nƒ is minimal among such newforms, then Lp(ad0ƒ,s) for p dividing Nƒ is one of the following:

graphic file with name M16.gif

If Σ is a finite set of primes, then we write LΣ(ad0ƒ,s) for the function obtained by omitting the Euler factors at the primes in Σ.

Suppose now that Σ contains PƒSƒ as at the end of preceding section. We let ω denote the class in H1(XΣ,C) associated to the holomorphic differential 2πiƒΣ(τ)dτ on XΣ. We let ω′ denote the class associated to the antiholomorphic differential Inline graphic where ωc is defined using ƒΣc = ∑ānΣ)einτ instead of ƒΣ.

Viewing M as contained in H1(XΣ,C), we find that the span of x and y coincides with that of ω and ω′. We write A for the matrix in GL2(C) such that

graphic file with name M18.gif

Define the period Ω to be the determinant of A. (Note that because we have chosen a basis for M, Ω is well defined only up to a unit in 𝒪.) Set δ = 3 if ℓ is in Σ, 1 if ℓ|Nƒ but ℓ ∉ Σ, and 0 otherwise. Hida’s formula can then be stated as follows:

Theorem 4.1. Cƒ,Σcoh is generated by

graphic file with name M19.gif

The proof uses results of Shimura to express the Petersson inner product of ƒ with itself in terms of the value of the L-function. In particular, the ratio is an element of 𝒪.

Recall that we have assumed here that Σ contains PƒSƒ, but the formula actually holds assuming only that Σ contains Sƒ. However, we have not explained how to define Ω in that situation. We shall see that in fact

Theorem 4.2. Ω is independent of Σ.

So we could use any Σ containing SƒPƒ to define Ω. From the theorem, we also see precisely how Cƒ,Σcoh varies with Σ: Adding primes other than ℓ to Σ simply corresponds to dropping the corresponding Euler factors from the L-function. Furthermore, we shall see that the congruences established by Ribet are related to the theorem, which is essentially a reformulation of Wiles’ generalization (3) of Ribet’s result.

Dropping Euler Factors

Ribet’s result (2) on “raising the level” is the following theorem:

Theorem 5.1. If p does not divide Nƒ then the following are equivalent: (a) There exists g such that ƒ ∼ g, χƒ = χg and Ng = dp for some divisor d of Nƒ.

(b) The congruence ap(ƒ)2 ≡ χƒ(p)(p + 1)2 mod λ holds.

The introductory example is a congruence as in the theorem. We take p = 7 and λ dividing 3. Because ap(ƒ) = −2, we see there must be a form g congruent to ƒ with Ng = 77 (because Ng = 7 is impossible).

The direction (a) ⇒ (b) of the theorem follows from consideration of the representation ρ̄ƒ. We give the idea of the proof in the case p ≠ ℓ: If there exists a g as in the theorem, then the ratio of the eigenvalues of ρ̄ƒ (Frobp) must be p±1mod λ. Then one applies the formula

graphic file with name M20.gif

The direction (b) ⇒ (a) is closely related to Theorem 4.2, which shows that

graphic file with name M21.gif

if p is not in Σ and does not divide Nƒ. Ribet’s proof relies on a comparison of cohomology congruence ideals, but his setup is slightly different from the one here. He compares cohomology congruence ideals at level Nƒ and Nƒp, with the result that the factor of p − 1 does not occur.

To prove Theorem 4.2, one defines a certain TΣ′-linear injection

graphic file with name M22.gif

It is defined so that φ(M)⊂M′ where ′ indicates we are using Σ′ instead of Σ. We may even normalize the map so that this restriction, tensored with C, sends ƒΣ to ƒΣ′, i.e., the map drops Euler factors. The key ingredient in the proof of independence is the following generalization by Wiles of a lemma of Ribet:

Lemma 5.2. φ has torsion-free cokernel.

This is proved using a result of Ihara whose role in the comparison of cohomology congruence ideals is identified in Ribet’s work.

It follows that φ induces an isomorphism M M′, and we conclude that A = A′ using φ(x),φ(y) as a basis for M′. From Theorem 4.2 we deduce:

Corollary 5.3. Suppose that Σ′⊃Σ⊃PƒSƒ. If ℓ is not in Σ′ − Σ, then let ɛ = 0. Otherwise let ɛ = 2 or 3 according to whether or not Nƒ is divisible by ℓ. Then

graphic file with name M23.gif
graphic file with name M24.gif
graphic file with name M25.gif 3

Relation with Selmer Groups

Using Mazur’s theory of deformations of Galois representations, one associates a ring RΣ and a universal deformation

graphic file with name M26.gif

of ρ̄ƒ minimally ramified outside Σ (see ref. 6). Here we work over the completion 𝒪 of 𝒪̂, which we view as contained in . Supposing that Σ contains Sƒ, we obtain a homomorphism πƒ,Σ: RΣ from ρƒ and the universal property. The 𝒪̂-module

graphic file with name M27.gif

can be described using Galois cohomology. In fact we have a canonical isomorphism

graphic file with name M28.gif 4

where L is gotten from ad0ρƒ. The group on the right is sometimes called a Selmer group. The subscript Σ indicates that for p∉Σ the cohomology classes are supposed to restrict to elements of Hƒ1(Gp,LZQ/Z) (as defined in ref. 8). There is also a possibly weaker condition imposed at p = ℓ if it is in Σ (3, 9). The universal property of the deformation also yields a surjective homomorphism φΣ from RΣ to the completion of TΣ. The key result of Wiles (3) and its generalization in (9) is that φΣ is an isomorphism (6, 7).

This result turns out to be related to the comparison of the congruence ideal Cƒ,Σ with the Fitting ideal of Φƒ,Σ, which we denote Dƒ,Σ. (Recall that if Φƒ,Σ has finite length d, then its Fitting ideal is generated by λd, and if the length is infinite than the Fitting ideal is trivial.) On the one hand, an easy commutative algebra argument shows that

graphic file with name M29.gif 5

On the other hand, a deeper commutative algebra argument shows that equality holds in Eq. 5 if and only if the following hold: (a) φΣ is an isomorphism, and (b) TΣ is a complete intersection.

One first proves the two assertions in the case Σ = ∅, so to get started one needs the existence of ƒ such that Sƒ = ∅. This existence is a version of Serre’s epsilon conjecture, and the most difficult step in the proof is Ribet’s theorem on lowering the level (5). Assuming that we also have Pƒ = ∅, Taylor and Wiles (4) show that T is a complete intersection, and using this fact Wiles (3) shows that φ is an isomorphism. Their proofs use the generalization of Mazur’s result discussed in Remark 3.2, and from which we also deduce

graphic file with name M30.gif 6

if Σ = Sƒ = Pƒ = ∅.

Combining the inclusion Eq. 3 with its counterpart

graphic file with name M31.gif

resulting from a Galois cohomology argument, we find that Eq. 6 holds for arbitrary Σ provided Sƒ = Pƒ = ∅. Hence we have (a) and (b), and therefore Dƒ,Σ = Ĉƒ,Σ, assuming only that Σ ⊃ Sƒ and Pƒ = ∅. Applying the result of remark 3.2, we get Eq. 6 as well in that case.

Remark 6.1: Improvements to these arguments, due to Faltings, Lenstra, Fujiwara, and the author (10) establish (a), (b), and Eq. 6 simultaneously (first for Σ = ∅, then in general) without appealing to Remark 3.2.

If Pƒ is not empty, then we can sometimes get empty Pƒ for a twist, but in general we appeal to ref. 9 to get (a) and (b) in the case of Σ = Sƒ = ∅, along with Eq. 3 if Σ′ = Pƒ. We conclude that

Theorem 6.2. Keep the above hypotheses and notation.

•  For arbitrary ∑, (a) and (b) hold.

•  If ∑ contains Sf, then Inline graphic is a generator for Df,∑.

•  If ∑ contains Sf ∪ Pf, then Eq. 6 holds.

Remark 6.3: Coates and Flach have pointed out that one can deduce form the theorem a formula relating the order of H1 (GQ,L⊗zQ/Z) to LΣ(ad0f,1). To relate the orders of H1 and HΣ1, one uses a variant of proposition 5.14 (ii) of ref. 8. In the case of f corresponding to an elliptic curve, see section 3 of ref. 11 for this variant and ref. 12 for a discussion of the relation with the Tamagawa number conjecture (8).

Table 1.

Fourier coefficients

n 1 2 3 4 5 6 7 8
an 1 −2 −1 2 1 2 −2 0
bn 1 1 2 −1 −2 2 −1 3

Acknowledgments

The author is grateful to M. Flach for comments on an earlier draft. This research was supported by the Engineering and Physical Sciences Research Council (Grant No. GR/J4761).

References

  • 1.Hida H. Inv Math. 1981;63:225–261. [Google Scholar]
  • 2.Ribet K A. In: Proceedings of the International Congress of Mathematicians, Vol. 1 (Warsaw, 1983) Ciesielski Z, Olech C, editors; Ciesielski Z, Olech C, editors. Warsaw: PWN; 1984. pp. 503–514. [Google Scholar]
  • 3.Wiles A. Ann Math. 1995;141:443–551. [Google Scholar]
  • 4.Taylor R, Wiles A. Ann Math. 1995;141:553–572. [Google Scholar]
  • 5.Ribet K A. Inv Math. 1990;100:431–476. [Google Scholar]
  • 6.Diamond F. In: Modular Forms and Fermat’s Last Theorem. Cornell G, Silverman J, Stevens G, editors; Cornell G, Silverman J, Stevens G, editors. New York: Springer; 1997. , in press. [Google Scholar]
  • 7.Darmon H, Diamond F, Taylor R. In: Current Developments in Mathematics, 1995. Bott R, Jaffe A, Hopkins M, Singer I, Stroock D, Yau S T, editors; Bott R, Jaffe A, Hopkins M, Singer I, Stroock D, Yau S T, editors. Cambridge, MA: International Press; 1996. pp. 1–154. [Google Scholar]
  • 8.Bloch S, Kato K. In: The Grothendieck Festschrift, Vol. I. Cartier P, Illusie L, Katz N M, Laumon G, Manin Yu, Ribet K A, editors; Cartier P, Illusie L, Katz N M, Laumon G, Manin Yu, Ribet K A, editors. Boston: Birkhauser; 1990. pp. 333–400. [Google Scholar]
  • 9.Diamond F. Ann Math. 1996;144:137–166. [Google Scholar]
  • 10.Diamond, F. (1997) Inv. Math., in press.
  • 11.Coates J, Sydenham A. In: Elliptic Curves, Modular Forms and Fermat’s Last Theorem. Coates J, Yau S T, editors; Coates J, Yau S T, editors. Cambridge, MA: International Press; 1995. pp. 2–21. [Google Scholar]
  • 12.Flach M. In: Séminaire de Théorie des Nombres, Paris, 1991–92. David S, editor; David S, editor. Boston: Birkhauser; 1993. pp. 23–36. [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES