Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1997 Sep 30;94(20):10520–10527. doi: 10.1073/pnas.94.20.10520

Ordinary representations and modular forms

C M Skinner 1, A J Wiles 1
PMCID: PMC23392  PMID: 11038578

1. Introduction

Let p be a prime, and fix an embedding of into p. Suppose that f is a newform of weight k ≥ 2, level N, and character ψ. For each prime ℓ ∤ N, let T be the Hecke operator associated to ℓ and suppose that Tf = c(ff. Eichler and Shimura (for k = 2) and Deligne (for k > 2) have shown that there is a continuous representation ρf : Gal(/Q) → GL2(p) that is unramified at the primes not dividing pN and such that

graphic file with name M1.gif

for each prime ℓ ∤ pN.

This paper establishes criteria for a representation ρ : Gal(/Q) → GL2(p) to be “modular” in the sense that there is a newform f such that ρ ≅ ρf. Fontaine and Mazur have conjectured that ρ is modular provided it is unramified outside of a finite set of primes, irreducible, has odd determinant, and the restriction to a decomposition group Dp at p satisfies certain conditions. The following conjecture is a special case of [FM95, ref. 1, Conjecture 3c].

Conjecture 1.1. Suppose ρ : Gal(/Q) → GL2(p) is continuous, irreducible, and unramified outside of a finite set of primes (including ∞). If

graphic file with name M2.gif 1.1

and if det ρ = ψɛk−1 is odd, where ɛ is the cyclotomic character, k ≥ 2 is an integer, and ψ is a finite character, then ρ is modular.

A representation satisfying condition 1.1 is said to be ordinary (at p).

Suppose that ρ is as in the Conjecture. From the compactness of Gal(/Q), it follows that after choosing a suitable basis, ρ takes values in GL2(𝒪) for some ring of integers 𝒪 of a finite extension of Qp. Let λ be a uniformizer of 𝒪, and let ρ̄ = ρ mod λ be the reduction of ρ (in general, this is only well-defined up to semisimplification). If ρ̄ is irreducible, then ρ is residually irreducible. If ρ̄ (more precisely, its semisimplification) is isomorphic to the reduction of some ρf, then ρ is residually modular. If ρ̄|Dp ≅ (χ1χ2), χ1 ≠ χ2, then ρ is Dp-distinguished. Recently, the above conjecture (and some more general conjectures) have been shown to be true provided p is odd and ρ is residually irreducible, residually modular, and Dp-distinguished ([Wil95, ref. 2], but see also [Dia96, ref. 3] and [Fuj, ref. 4]).

In this paper, we consider representations ρ which are residually reducible and ordinary. In this case, the semisimplification ρ̄ss of ρ̄ satisfies ρ̄ss ≅ χ1 ⊕ χ2 with χ2 unramified at p. We prove the following theorem, which establishes many new cases of the conjecture.

Theorem 1.2. Suppose p is an odd prime, and suppose ρ : Gal(/Q) → GL2(p) is a continuous, irreducible representation unramified away fromand a finite set Σ of primes. Suppose also that ρ̄ss ≅ χ1 ⊕ χ2 with χ2 unramified at p. If

(i)  χ = χ1χ2−1 is ramified at p and odd,

(ii)  the χ-eigenspace of the p-part of the class group of the splitting field of χ is trivial,

(iii)  for q ∈ Σ either χ is ramified at q or χ(q) ≠ q,

(iv)  det ρ = ψɛk−1, where k ≥ 2 is an integer and ψ has finite order, and

(v)  ρ is ordinary,

then ρ is modular.

This theorem is a consequence of a more general result, Theorem 6.1, identifying certain universal deformation rings with Hecke rings. In spirit, both the statement and proof of Theorem 6.1 resemble those of the main theorems of [Wil95, ref. 2]. Here, too, we first establish a “minimal case” of the theorem, subsequently deducing the general result from it. However, instead of resorting to the patching argument of [TW95, ref. 5] to prove this minimal case, we directly establish the numerical criteria of [Wil95, ref. 2, Appendix] and [Len95, ref. 6], proving that the order of a certain cohomology group is equal to the size of a congruence module for a certain Hecke ring (technical complications arise when including the prime 2 in Σ, but these are circumvented later). The Galois cohomology group that is computed is the Selmer group of the adjoint of a reducible represention. The relevant Galois module has a filtration whose Jordan-Hölder pieces are one-dimensional, and the computation of its cohomology boils down to class field theory and some simple consequences of the main conjecture of Iwasawa theory (for Q). The congruence module that arises measures congruences between cusp forms and an Eisenstein series and is closely related to the “Eisenstein ideals” studied in [MW84, ref. 7]. Indeed, the ideas of [MW84, ref. 7] provide the means to estimate the order of this module.

It is unfortunate that the theorem stated above places restrictions on the primes contained in Σ. Of course, this is merely a failure of our approach. In fact, the theorem is essentially the best that can be obtained from trying to identify deformation rings with Hecke rings. For if Σ contained some prime for which χ(q) = q, then, as is easily checked, the corresponding universal, ordinary deformation ring (see Section 2) sometimes contains a component having dimension greater than 3, coming from the various reducible deformations. However, the corresponding Hecke ring has dimension at most 2. Matters are made worse by the fact that in this case there may not be a natural map from the deformation ring to the Hecke ring; there may not be a representation defined over the Hecke ring.

The organization of this paper is as follows. Section 2 describes various deformation problems, each more restrictive than the last, culminating with the minimal cases. It also discusses the relations between these deformation problems and singles out some distinguished deformations. In Section 3 we define the Selmer group associated to a distinguished minimal deformation and estimate its order. Section 4 introduces the Hecke rings, and in Section 5 we estimate the size of a certain congruence module for the minimal Hecke rings. Finally, in Section 6, we prove our main result, Theorem 6.1, and deduce from it the theorem stated above.

2. Deformations and Deformation Rings

This section introduces the deformation problems with which this paper is concerned. It also includes some simple, but essential, observations about the corresponding universal deformation rings, relations between them, and certain distinguished deformations.

Let p be an odd prime. For Σ a finite set of primes including p, let QΣ be the maximal extension of Q unramified outside of Σ and ∞. We fix once and for all embeddings of into q for each rational prime q and into C. This fixes a choice of decomposition group Dq and inertia group Iq for each prime q and a choice of complex conjugation, hereafter denoted by c. Suppose that k is a finite field of characteristic p and that χ : Gal(QΣ/Q) → k× is an odd character ramified at p. Suppose also that

graphic file with name M3.gif 2.1

is a continuous representation satisfying

graphic file with name M4.gif 2.2

and having scalar centralizer (i.e., ρ0 is reducible, but not semisimple).

Henceforth χ and Σ satisfy the following conditions. The χ-eigenspace of the p-part of the class group of the splitting field of χ is trivial. This is always satisfied, for example, by χ = ω and by χ = ω−1, where ω is the character giving the action of Gal(/Q) on the pth roots of unity [Was80, ref. 8, Proposition 6.16]. The set Σ is such that if q ∈ Σ, then either χ is ramified at q or χ(q) ≠ q. (Note that Σ must contain all the primes at which χ is ramified.) For such χ and Σ, ρ0 is essentially unique, as we now explain. Let Q(χ) be the splitting field of χ, and let L0(Σ) be the maximal abelian p-extension of Q(χ) unramified outside Σ, having exponent p, and such that Gal(Q(χ)/Q) acts on Gal(L0(Σ)/Q(χ)) via the irreducible Fp-representation associated to χ. The hypotheses on χ and Σ imply that as a Gal(Q(χ)/Q)-module, Gal(L0(Σ)/Q(χ)) is isomorphic to exactly one copy of the Fp-representation associated to χ. Therefore, if ρ′ is any representation satisfying 2.1 and 2.2, then ρ′ ≅ ρ0. Furthermore, the extension L0(Σ)/Q(χ) is ramified at all places above p and nowhere else. We single out ρ0 by requiring that

graphic file with name M5.gif

and

graphic file with name M6.gif

for some fixed g0Ip.

Let 𝒪 be a local complete Noetherian ring with residue field k. An 𝒪-deformation of ρ0 is a local complete Noetherian 𝒪-algebra A with residue field k and maximal ideal 𝔪A together with an equivalence class of continuous representations ρ : Gal(QΣ/Q) → GL2(A) satisfying ρ0 = ρ mod 𝔪A. We often write “deformation” instead of “𝒪-deformation” when this will cause no confusion. We usually denote a deformation by a single member of its equivalence class. We require all of our deformations to satisfy

graphic file with name M7.gif

with χ2 unramified. Such a deformation is said to be ordinary. An ordinary deformation satisfying

graphic file with name M8.gif

is called Selmer, while one satisfying

graphic file with name M9.gif

is called strong. Here, χ̃ is the Teichmüller lift of χ, and ɛ is the cyclotomic character. Finally, a strong deformation satisfying

graphic file with name M10.gif

for all primes q ∈ Σ such that q is congruent to 1 modulo p is called Σ-minimal.

There exists a local complete Noetherian 𝒪-algebra RΣ,𝒪min and a universal Σ-minimal 𝒪-deformation

graphic file with name M11.gif

We omit the precise formulation of the universal property as well as the proof of existence as these are now standard (see [Maz89, ref. 9], [Ram93, ref. 10], and [Wil95, ref. 2]). Similarly, there exist universal ordinary, Selmer, and strong 𝒪-deformations

graphic file with name M12.gif
graphic file with name M13.gif

and

graphic file with name M14.gif

respectively.

We note that RΣ,𝒪Sel can be realized as a quotient of RΣ,𝒪ord. For if γ ∈ Ip is such that ɛ(γ) = 1 + p, and if 1 + T = det ρΣ,𝒪ord(γ)(1 + p)−1, then

graphic file with name M15.gif 2.3

There is also a simple relation between RΣ,𝒪str and RΣ,𝒪Sel. For q ∈ Σ let Δq be the Sylow p-subgroup of (Z/q)×, and let δq be a generator. We write ΔΣ for the product of the Δq’s. Let χq denote the character

graphic file with name M16.gif

and let χΣ = ∏ χq. The deformation

graphic file with name M17.gif

is Selmer, and, using the universal properties of RΣ,𝒪Sel and RΣ,𝒪str, one checks that

graphic file with name M18.gif 2.4

The relation between RΣ,𝒪str and RΣ,𝒪min is also easily described. If q ∈ Σ is a prime congruent to 1 modulo p, then arguing as in [TW95, ref. 5, Lemma, p. 569] shows that

graphic file with name M19.gif

where ϕq factors through χq (i.e., there is a unique map 𝒪[Δq] → RΣ,𝒪str taking χq to ϕq). Let 𝔞Σ be the ideal in RΣ,𝒪str generated by the set {δq − 1}, where q runs over all primes in Σ that are congruent to 1 modulo p. Then

graphic file with name M20.gif 2.5

Suppose now that 𝒪 is the ring of integers of a finite extension of Qp with residue field k. We consider various reducible ordinary 𝒪-deformations of ρ0. Suppose Ψ = (ψ1, ψ2) is a pair of p-valued characters of Gal(QΣ/Q) such that ψ2 is unramified at p and ψ1ψ2 = χ̃ω−1ψɛ with ψ a character of finite, p-power order. Let 𝒪Ψ be the 𝒪-algebra generated by the values of ψ1 and ψ2. This is a finite local 𝒪-algebra with residue field k and uniformizer, say, λ. Suppose also that ψ1 = χ mod λ and ψ2 = 1 mod λ. Let Q(Ψ) be the splitting field of the pair Ψ, and let LΨ(Σ) be the maximal abelian pro-p-extension of Q(Ψ) unramified outside Σ and such that Gal(Q(Ψ)/Q) acts on Gal(LΨ(Σ)/Q(Ψ)) via ψ1ψ2−1. The hypotheses on χ and Σ imply that Gal(LΨ(Σ)/Q(Ψ)) is a free Zp-module and that

graphic file with name M21.gif

This is essentially Kummer theory (cf. [Coa77, ref. 11, Theorem 1.8]). It follows from this together with our description of ρ0 that there is some τ ∈ Ip such that τ generates Gal(LΨ(Σ)/Q(Ψ)) as a Gal(Q(Ψ)/Q)-module and

graphic file with name M22.gif

Fix such a τ.

One can write down a reducible 𝒪-deformation ρΨ : Gal(QΣ/Q) → GL2(𝒪Ψ) of ρ0 as follows. First, project onto Gal(LΨ/(Σ)/Q), and then choose a lift H of Gal(Q(Ψ)/Q) to Gal(LΨ(Σ)/Q) containing c. Put

graphic file with name M23.gif

and put

graphic file with name M24.gif

Since H and τ topologically generate Gal(LΨ(Σ)/Q), this determines the representation. This representation is obviously ordinary. Corresponding to ρΨ is an ideal IΨ of RΣ,𝒪ord. If ρΨ is Selmer or strong, we also denote by IΨ the corresponding ideal of RΣ,𝒪Sel or RΣ,𝒪str. The pair Ψ = (χ̃ω−1ɛ, 1) is the unique pair such that ρΨ is Σ-minimal. We denote by IΣ the corresponding ideal of RΣ,𝒪min and refer to it as the Eisenstein ideal of RΣ,𝒪min. We write ρΣ,𝒪Eis for the corresponding representation.

We conclude this section with a brief analysis of the Eisenstein ideal of RΣ,𝒪min. Again, 𝒪 is the ring of integers of some finite extension of Qp with residue field k. Let S be any finite set of primes containing Σ. The Eisenstein ideal contains the ideal IS generated by the set

graphic file with name M25.gif

We claim that these ideals are equal. Choose a basis of ρΣ,𝒪min such that

graphic file with name M26.gif

where τ ∈ Ip is chosen for the pair Ψ = (χ̃ω−1ɛ, 1) as in the preceeding discussion. For each σ ∈ Gal(QΣ/Q) write

graphic file with name M27.gif

It is clear that

graphic file with name M28.gif
graphic file with name M29.gif
graphic file with name M30.gif

It follows from these identities that

graphic file with name M31.gif

so ρS = ρΣ,𝒪min mod IS satisfies

graphic file with name M32.gif

One sees that with respect to the chosen basis the matrix entries of ρS are in 𝒪, and this representation is in fact ρΣ,𝒪Eis. The universal property of RΣ,𝒪 now implies that IS = IΣ. For ease of reference we record this as a proposition.

Proposition 2.1. If S is any finite set of primes containing Σ, then the Eisenstein ideal IΣ is generated by the set

graphic file with name M33.gif

Finally, let RS,𝒪min,tr be the closed 𝒪-subalgebra of RΣ,𝒪min generated by the elements {trace(ρΣ,𝒪min(Frob)) : ℓ ∉ S}. We define RS,𝒪ord,tr, RS,𝒪Sel,tr, and RS,𝒪str,tr similarly.

Corollary 2.2. For · = min, str, Sel, or ord,

graphic file with name M34.gif
graphic file with name M35.gif
graphic file with name M36.gif

It follows from Proposition 2.1 that

graphic file with name M37.gif

One easily deduces from this that RS,𝒪min,tr = RΣ,𝒪min (see [Mat86, ref. 12, Theorem 8.4]. The remaining cases are proved similarly using the relations 2.3, 2.4, and 2.5. □

3. Some Galois Cohomology

In this section we give an upper bound for the size of the 𝒪-module IΣ/IΣ2, where IΣ is the Eisenstein ideal of RΣ,𝒪min defined in the previous section. We maintain the notation of Section 2 with the restriction that 𝒪 is always the ring of integers of some finite extension K of Qp with residue field k. We often write GΣ for Gal(QΣ/Q).

Let ϕ = χ̃ω−1, and let U be the representation space for ρΣ,𝒪Eis. Then U is a free 𝒪-module of rank two having a filtration 0 ⊆ U1 ⊆ U, where U1 is the rank one, free 𝒪-submodule on which Gal(QΣ/Q) acts via ϕɛ. The quotient U2 = U/U1 is a rank one, free 𝒪-module on which Gal(QΣ/Q) acts trivially. Let V = Hom𝒪(U, U) be the adjoint representation, and let

graphic file with name M38.gif

We write W and WSel for V𝒪 K/𝒪 and VSel𝒪 K/𝒪, respectively. Let

graphic file with name M39.gif

and for those q ∈ Σ different from p let

graphic file with name M40.gif

We define the Selmer group to be

graphic file with name M41.gif

Following [Wil95, ref. 2, Proposition 2.1] one proves that

graphic file with name M42.gif

Therefore, an upper bound for #HΣ1(Q, W) yields an upper bound for #(IΣ/IΣ2).

Let Σ1 ⊆ Σ comprise those primes in Σ that are congruent to 1 modulo p together with p. Let W1 = Hom𝒪(U2, U) ⊗𝒪 K/𝒪, and let W2 = Hom𝒪(U1, U) ⊗𝒪 K/𝒪. There is a commutative diagram of Gal(QΣ/Q)-modules

graphic file with name M43.gif

having exact rows and inducing the following commutative diagram of cohomology groups:

graphic file with name M44.gif

where

graphic file with name M45.gif

The rows in this diagram are exact, so there is an exact sequence

graphic file with name M46.gif

An upper bound for #HΣ1(Q, W) therefore follows from upperbounds for #ker(α) and #ker(γ):

graphic file with name M47.gif 3.1

Now, W1 fits into the short exact sequence

graphic file with name M48.gif

The associated long exact cohomology sequence yields the exact sequence

graphic file with name M49.gif

Since WSelK/𝒪(ϕɛ), one easily checks that the hypotheses on χ and Σ imply that the second arrow is surjective. It follows that

graphic file with name M50.gif 3.2

Similarly, W2 fits into the short exact sequence

graphic file with name M51.gif

The associated long exact cohomology sequence yields the commutative diagram

graphic file with name M52.gif

having exact exact rows. It follows that

graphic file with name M53.gif 3.3

Class field theory alone shows that #ker(f1) = 1 and together with the “main conjecture” of Iwasawa theory [MW84, ref. 7, Theorem, p. 214] and [MW84, ref. 7, Proposition 1, p. 193] implies that

graphic file with name M54.gif 3.4

where

graphic file with name M55.gif 3.5

Here, B2(ϕ) is the second generalized Bernoulli number for ϕ. Substituting 3.4 into 3.3 and combining the result with 3.2 and 3.1 yields the following proposition.

Proposition 3.1. Write ϕ = χ̃ω−1. Let η(ϕ, Σ) be as in 3.5. Then

graphic file with name M56.gif

4. Hecke Rings

In this section we introduce the Hecke rings that we will later relate to the deformation rings of the second section. We keep the notation of the previous sections. In particular, 𝒪 is the ring of integers of some finite extension of Qp with residue field k and uniformizer λ.

As before, let ϕ = χ̃ω−1. Let Σ2 be the set of primes q ∈ Σ / {p} such that either χ is ramified at q, or χ|Dq = ω−1, or q is congruent to 1 modulo p. If χ ≠ ω or ω−1, then let r be a prime not contained in Σ and such that r is greater than 4, r is not congruent to 1 modulo p, and χ|Dr ≠ ω, ω−1, or 1. This is always possible. If χ = ω or ω−1, then put r = 1. For each prime q, let

graphic file with name M57.gif

Put

graphic file with name M58.gif

We identify ΔΣ with the Sylow p-subgroup of (Z/pNΣ)×. Let ΓΣ be the inverse image of ΔΣ under the usual homomorphism Γ0(pNΣ) → (Z/pNΣ)×. Also, let ΓΣ,1 be Γ1(pNΣ). We denote by TΣ) and TΣ,1) the finite 𝒪-algebras generated by the Hecke operators {T, 〈ℓ〉 : ℓ ∉ Σ ∪ {r}} acting on the spaces of weight 2 modular forms invariant under the standard action of ΓΣ and ΓΣ,1, respectively. We write 𝔪Σ for the maximal ideal of TΣ) generated by λ (a uniformizer of 𝒪) and by T − 1 − χ̃(ℓ) for all primes ℓ ∉ Σ ∪ {r}. Let E2,ϕ be the Eisenstein series whose associated L-series is ζ(s)L(s − 1, ϕ). Then 𝔪Σ is the maximal ideal of TΣ) associated to E2,ϕ. The inverse image of 𝔪Σ under the surjection TΣ,1) → TΣ) is also denoted by 𝔪Σ. Let

graphic file with name M59.gif

Now put

graphic file with name M60.gif

let ΓΣ,str be the inverse image of ΔΣ under the usual map Γ0(pNΣ) → (Z/pNΣ)×, and let ΓΣ,Sel be Γ1(pNΣ). We denote by TΣ,str) and TΣ,Sel) the 𝒪-algebras generated by the Hecke operators {T, 〈ℓ〉 : ℓ ∉ Σ ∪ {r}} acting on the spaces of weight 2 modular forms invariant under the standard action of ΓΣ,str and ΓΣ,Sel, respectively. We also denote by 𝔪Σ the maximal ideal of TΣ,str) and of TΣ,Sel) associated to the modular form E2,ϕ. Let

graphic file with name M61.gif

We denote by TΣ,𝒪min,0, TΣ,𝒪1,0, TΣ,𝒪str,0, and TΣ,𝒪Sel,0 the quotient algebras obtained by restricting the Hecke operators to the corresponding spaces of cusp forms. Note that these rings may be trivial.

Remark 4.1: When χ ≠ ω or ω−1, we have introduced the auxiliary prime r to ensure that ΓΣ and ΓΣ,str have no elliptic points. It is easy to see that TΣ,𝒪min, TΣ,𝒪1, etc., would not be different if we omitted r.

Proposition 4.2. rank𝒪TΣ,𝒪Sel = rank𝒪TΣ,𝒪str·#ΔΣ.

Proof: If χ = ω, then Σ = {p}, #ΔΣ = 1, and ΓΣ,str = ΓΣ,Sel, so the proposition is obvious. Assume χ ≠ ω. A simple analysis of the possible Eisenstein series associated to the maximal ideal 𝔪Σ yields

graphic file with name M62.gif 4.1

and

graphic file with name M63.gif 4.2

For each prime q dividing pNΣ, we denote by Uq the usual Atkin-Lehner operator. Let YΣstr and YΣSel be the open curves over C corresponding to the quotients of the complex upper half-plane by the congruence subgroups ΓΣ,str and ΓΣ,Sel, respectively. Let XΣstr and XΣSel be the respective compactifications, obtained by adjoining the cusps. For · = XΣstr, YΣstr, etc., the singular cohomology group H1(·, 𝒪) is acted upon by the relevant Hecke operators. Let HΣ1(·, 𝒪) be the maximal direct summand of the localized cohomology group H1(·, 𝒪)𝔪Σ such that Uq acts nilpotently on HΣ1(·, 𝒪) for all qp or r, and Up − 1 and Ur − 1 act nilpotently on HΣ1(·, 𝒪)/λ. Using the correspondence between spaces of cusp forms and cohomology groups (cf. [Shi71, ref. 13, Chapter 8]), it is straightforward to check that

graphic file with name M64.gif 4.3

and

graphic file with name M65.gif 4.4

Here, and in what follows, the superscript minus sign denotes the −1 eigenspace for the action of (  0−110) on the indicated cohomology group.

The excision sequence for singular cohomology gives rise to the exact sequences

graphic file with name M66.gif 4.5
graphic file with name M67.gif

and

graphic file with name M68.gif 4.6
graphic file with name M69.gif

where the subscript Σ on DivΣ0(·, cusps, 𝒪)𝔪Σ has the same meaning as it does for the other terms in the sequences. A simple analysis of the cuspidal divisor groups, using that the cover XΣSelXΣstr is unramified at the cusps, shows that

graphic file with name M70.gif 4.7

Arguing as in the proof of [TW95, ref. 5, Proposition 1] shows that HΣ1(YΣSel, 𝒪) is a free 𝒪[ΔΣ]-module of rank equal to the 𝒪-rank of HΣ1(YΣstr, 𝒪). This, together with 4.7 and 4.5, 4.6, implies that

graphic file with name M71.gif

which combined with 4.3, 4.4 and 4.5, 4.6 yields the proposition. □

Proposition 4.3. rank𝒪TΣ,𝒪str = rank𝒪TΣ,𝒪min·#ΔΣ.

Proof: Again, the proposition is obvious if χ = ω, so assume otherwise. Let YΣ and YΣ1 be the open curves over C corresponding to ΓΣ and ΓΣ,1, respectively, and let XΣ and XΣ1 be the respective compactifications. For · = YΣ, YΣ1, XΣ, or XΣ1, let HΣ1(·, 𝒪) be the maximal direct summand of H1(·, 𝒪)𝔪Σ such that on HΣ1(·, 𝒪)/λ, if χ|Dq = ω−1, then Uq − ϕ(q)q acts nilpotently, and otherwise Uq − 1 acts nilpotently. One has

graphic file with name M72.gif

and

graphic file with name M73.gif

The comparisons with the ranks of cohomology groups are proved just as are 4.3, 4.4. That the ranks of TΣ,𝒪str,0 and TΣ,𝒪1,0 are equal follows from the fact that the modular forms on which the one acts are just twists of the forms on which the other acts. Considering the excision sequences for YΣ and YΣ1 and arguing as in the proof of Proposition 4.2 shows that

graphic file with name M74.gif

This, combined with 4.1 and the simple observation that

graphic file with name M75.gif

yields the proposition. □

Finally, for each positive integer m, let T(m) denote the 𝒪-algebra generated by the Hecke operators {T, 〈ℓ〉 : ℓ ∉ Σ ∪ {r}} acting on the space of weight 2 modular forms that are invariant under the usual action of Γ1(pmNΣ) and that are ordinary at p in the sense of [Hid85, ref. 14]. Note that E2,ϕ is such a form and therefore defines a maximal ideal of T(m), also denoted 𝔪Σ. Put

graphic file with name M76.gif

Note that TΣ,𝒪(1) = TΣ,𝒪Sel.

Let Λ = 𝒪[[T]]. By the work of Hida [Hid85, ref. 14, Hid88, ref. 15], TΣ,𝒪ord is a finite, torsion-free Λ-algebra via T ↦ γ0 − 1, where

graphic file with name M77.gif

Hida has also shown that if k ≥ 2 is an integer, and if 𝔭 is a height one prime ideal of TΣ,𝒪ord containing (1 + T)pm − (1 + p)pm(k−2), then 𝔭 corresponds to a weight k newform f that is ordinary at p, has level dividing pmNΣ, and has Nebentypus character χ̃ω−1ψ, where ψ has order dividing pm. This correspondence is given by (ℓth Fourier coefficient of f) = T mod 𝔭.

Proposition 4.4. The canonical surjection TΣ,𝒪ordTΣ,𝒪Sel factors through TΣ,𝒪ord/T.

Later we shall see that TΣ,𝒪Sel = TΣ,𝒪ord/T (see 6.4).

5. The Eisenstein Ideal

One can define an “Eisenstein ideal” of TΣ,𝒪min analogous to the Eisenstein ideal IΣ of RΣ,𝒪min. Let IEis be the ideal of TΣ,𝒪min generated by the set {T − 1 − ϕɛ(ℓ), 〈ℓ〉 − ϕ(ℓ) : ℓ ∉ Σ ∪ {r}}. This is just the minimal prime ideal of TΣ,𝒪min corresponding to the Eisenstein series E2,ϕ. The following proposition is the main result of this section.

Proposition 5.1. Write ϕ = χ̃ω−1. Let η(ϕ, Σ) be as in 3.5. Suppose that if 2 ∈ Σ, then χ(2) ≠ 2−1. Then

graphic file with name M78.gif

Our proof of this proposition relies heavily on ideas and results of [MW84, ref. 7], and for many definitions and details in the following arguments we refer to this paper. In particular, we adopt the conventions of [MW84, ref. 7] regarding models of modular curves and their cusps.

The proposition is obvious if ϕ is the trivial character or if ϕ = ω−2 and Σ = {p}, so throughout the rest of this section we assume otherwise. Let XΣ and XΣ1 be the curves defined in the proof of Proposition 4.2. Let X/Zpp]1 be the regular model for XΣ1 as described in [MW84, ref. 7, Chapter 2, §8]. The special fiber X/Fp1 consists of two smooth curves Σét and Σμ (in the notation of [MW84, ref. 7]) intersecting transversally at the supersingular points. The quotient X/Zpp] = X1Σ is a regular model for XΣ (cf. [MW84, ref. 7, Chapter 2, §7]). The covering X1 → X is étale, and the normalization of the special fiber X/Fp consists of the two smooth curves Σ0ét = ΣétΣ and Σ0μ = ΣμΣ. That the cover is étale away from the cusps is a consequence of ΓΣ being contained in Γ1(ℓ) for some prime ℓ > 4. That it is étale at the cusps is a simple computation. The cover π : Σét → Σ0ét is étale with Galois group ΔΣ. The usual actions of the Hecke operators Tn and 〈n〉, (n, pNΣ) = 1, extend to actions on Pic00ét) and Pic0ét).

Let R(1) = Zp[(Z/pNΣ)×/± 1], and let 𝔪 be the component of R(1) corresponding to ϕ (see [MW84, ref. 7, Chapter 1, §3]). Let [10] be the zero cusp of XΣ1 and consider ℭ𝔪(1) = R𝔪(1)·[10] as on [MW84, ref. 7, p. 298] (the action of R(1) is via the diamond operators). Similarly, let R0(1) = Zp[(Z/pNΣ)×/± ΔΣ] and let ℭ0,𝔪(1) = R0,𝔪(1)·[10] (here [10] is the zero cusp of XΣ). The map κ : R0(1)R(1) given by κ(x) = Σδ∈ΔΣ [δ]x is compatible with the canonical imbedding π* : Pic00ét) ↪ Pic0ét) in the sense that π*(h·[10]) = κ(h)·[10].

Let k0 be the minimal field of definition for χ (i.e., the subfield of k generated by the values of χ), and let W(k0) be the Witt vectors of k0. By the choice of 𝔪, R𝔪(1) and R0,𝔪(1) are natural W(k0)-algebras. Put

graphic file with name M79.gif 5.1

and put

graphic file with name M80.gif 5.2

Clearly, TΣ,𝒪min acts on ℭ0. Let

graphic file with name M81.gif
graphic file with name M82.gif

Proof of Lemma 5.2: The action of TΣ,𝒪min on ℭ0 factors through TΣ,𝒪min,0, so I0 contains Icusp. Furthermore, by the choice of 𝔪, 〈ℓ〉 acts as ϕ(ℓ) on ℭ0, and since the action of T on the cuspidal group ℭ0,𝔪(1) is via 1 + 〈ℓ〉ℓ (see [MW84, ref. 7, p. 238]), T acts on ℭ0 as 1 + ϕɛ(ℓ). This proves that I0 contains IEis. □

Now, let

graphic file with name M83.gif

It follows from Lemma 5.2 that

graphic file with name M84.gif 5.3

Combining 5.3 with the following lemma yields the proposition.

Lemma 5.3. #(R0/I) ≥ #(𝒪/η(ϕ, Σ)).

Proof of Lemma 5.3: Let I′ = AnnR0,𝔪(1)(ℭ0,𝔪(1)). Since 𝒪 is a flat W(k0)-module, it follows from 5.1 and 5.2 that

graphic file with name M85.gif 5.4

Suppose that the conductor of ϕ is pNΣ/m and suppose that hI′. We can find gR0,𝔪(1) such that gh (mod pM) for M arbitrarily large and such that g·[10] ∼ 0 (hence κ(g)·[10] ∼ 0). By the arguments in the last paragraph on [MW84, ref. 7, p. 299], there is an integer e, coprime to p, such that e·κ(g)·[10] = div(f) for some f ∈ 𝔉(m) (notation as in [MW84, ref. 7, Chapter 4]). Moreover, f is invariant under the action of ΔΣ (as div(f) is). The proofs of [MW84, ref. 7, Propositions 3 and 3(m), p. 298] show that if f = c·∏ f0,sℓ(s) is invariant under ΔΣ, then ℓ(s) + ℓ(−s) = ℓ(δs) + ℓ(−δs) for all δ ∈ ΔΣ. It follows from [MW84, ref. 7, (8), p. 298] that

graphic file with name M86.gif

where the second sum is over a complete set of representatives for Z/pNΣ modulo the equivalence

graphic file with name M87.gif

and where ϑ̂(m)(s; pNΣ) is the “hatted” Stickelberger element defined in [MW84, ref. 7, Chapter 1, §1]. As ℓ(s) = 0 if (p, s) ≠ 1, this shows that κ(h) is contained in the ideal

graphic file with name M88.gif

By 5.4, κ(I) is contained in the same ideal.

Now, consider the homomorphism ρϕ : R1 → 𝒪 induced by ϕ. Arguing as in [MW84, ref. 7, Propositions 2 and 4, pp. 201–205] shows that if (p, s) = 1, then

graphic file with name M89.gif 5.5

where B2(ϕ) is the second generalized Bernoulli number for ϕ. Here, we note that B2(ϕ) is 𝒪-integral if ϕ ≠ 1 or ω−2 and that B2−2)·(1 − ω−2(q)q2) is 𝒪-integral for any prime q distinct from p (recall that if ϕ = ω−2, then we have assumed that Σ ≠ {p}). Since ρϕ(κ(R0)) = #ΔΣ·𝒪, and since

graphic file with name M90.gif

by 5.5, it follows that # (R0/I) ≥ [ρϕ(κ(R0)) : ρϕ(κ(I))] ≥ # (𝒪/η(ϕ, Σ)). □

6. The Main Theorem

In this section we prove our main results, relating the deformation rings of Section 2 to the Hecke rings of Section 4. In particular, we show that any deformation of ρ0 that “looks modular” is modular in the sense that its semisimplification is equivalent to a representation associated to a modular form.

Each minimal prime ideal of TΣ,𝒪min corresponds to a newform of weight 2 and level dividing pNΣ. Let G be the set of all newforms corresponding to prime ideals of TΣ,𝒪min, let F be the subset consisting of cusp forms, and let E be the Eisenstein series (E = {E2,ϕ}). For fG, we denote by Af the normalization of TΣ,𝒪min/𝔭f (𝔭f being the prime ideal corresponding to f). This is a local complete finite 𝒪-algebra with residue field k, maximal ideal 𝔪f, and fraction field Ff. It is well known that there is a semisimple representation ρf : Gal(/Q) → GL2(Ff) such that ρf is unramified at all primes not dividing pNΣ, det ρf = ϕɛ, and

graphic file with name M91.gif 6.1

Let LFf2 be a Gal(/Q)-stable Af-lattice. Choosing a suitable basis for L yields a representation ρL : Gal(/Q) → GL2(Af) whose trace satisfies 6.1 and such that ρL(c) = (−11). Reducing the trace modulo 𝔪f shows that ρ̄L = ρLmod 𝔪f satisfies

graphic file with name M92.gif

where the superscript ss denotes the corresponding semisimplification. If fF, then ρL is irreducible, and there are three possibilities for ρ̄L; either

graphic file with name M93.gif

where ∗ is not identically zero. It is not hard to see that the latter two possibilities always occur for some choice of L. Fix a lattice Lf for which the last possibility occurs. Abusing notation, we call the corresponding representation ρf. As remarked in Section 2, since ρ̄f satisfies 2.1 and 2.2, ρ̄f is equivalent to ρ0, so, after possibly choosing a new basis for Lf, we may assume that ρ̄f = ρ0. The work of Carayol, Deligne, Langlands, and Wiles (see [Car86, ref. 16] and [Wil88, ref. 17]) relating the level and Hecke eigenvalues of f to the representation ρf shows that ρf is a Σ-minimal 𝒪-deformation. (In particular, if χ ≠ ω or ω−1, then ρf is unramified at the auxiliary prime r and the level of f is coprime to r.) There is therefore a homomorphism πf : RΣ,𝒪minAf inducing ρf. If fE, then

graphic file with name M94.gif

and we take for πf the map RΣ,𝒪min → 𝒪 = Af given by reduction modulo IΣ. It follows from Corollary 2.2 that the image of the map RΣ,𝒪min → ∏fG Af given by x ↦ (πf(x)) is TΣ,𝒪min. Thus, there is a (local) surjection πmin : RΣ,𝒪minTΣ,𝒪min of 𝒪-algebras such that πf = πminmod 𝔭f.

In the same manner, one shows that there are similar surjections πstr : RΣ,𝒪strTΣ,𝒪str, πSel : RΣ,𝒪SelTΣ,𝒪Sel, and π(m) : RΣ,𝒪ordTΣ,𝒪(m), only now to define πf for f an Eisenstein series one uses the reducible deformations described in Section 2. Let πord : RΣ,𝒪ordTΣ,𝒪ord be

graphic file with name M95.gif

This is a homomorphism of Λ-algebras (Λ = 𝒪[[T]]).

Theorem 6.1. Let χ, Σ, and ρ0 be as in Section 2. If · = min, str, Sel, or ord, then

graphic file with name M96.gif

is an isomorphism.

Proof: Assume at first that 2 ∉ Σ if χ(2) = 2−1. Let π : TΣ,𝒪min → 𝒪 be the homomorphism given by reduction modulo IEis. Since TΣ,𝒪min is reduced, AnnTΣ,𝒪min ker π = AnnTΣ,𝒪minIEis is just the intersection of all minimal prime ideals of TΣ,𝒪min distinct from IEis. In other words, the annihilator of the kernel of π is just the kernel of the projection TΣ,𝒪minTΣ,𝒪min,0. Letting (η) denote the ideal π(AnnTΣ,𝒪minker π), it follows from Proposition 5.1 that

graphic file with name M97.gif 6.2

By Proposition 2.1, IEis = πmin(IΣ), so, upon considering Fitting ideals (see [Wil95, ref. 2, Appendix] or [Len95, ref. 6]),

graphic file with name M98.gif

Combining this with 6.2 and the estimate for #(IΣ/IΣ2) provided by Proposition 3.1 shows that

graphic file with name M99.gif 6.3

The main result of [Len95, ref. 6] now applies, yielding RΣ,𝒪min = TΣ,𝒪min and that these rings are complete intersections over 𝒪. Suppose now that 2 ∈ Σ and that χ(2) = 2−1. Let Σ0 = Σ / {2}. As just shown, RΣ0,𝒪minTΣ0,𝒪min. Fix a lift σ of Frob2 and a generator τ of the pro-p-part of tame inertia at 2. By the hypotheses on χ and Σ, there is a basis for ρΣ,𝒪min such that ρ(σ) = (  βα) and ρ(τ) = (c 11). Clearly, (c) = ker{RΣ,𝒪minRΣ0,𝒪min}, so TΣ,𝒪min/c = TΣ0,𝒪min. Put x = (β/α − 2) + c. The map πmin induces an isomorphism RΣ,𝒪min/xTΣ,𝒪min/x. As it is easily checked that x is not a zero-divisor in TΣ,𝒪min, it follows that πmin is an isomorphism. By 2.5, RΣ,𝒪min = RΣ,𝒪str/𝔞Σ. Since RΣ,𝒪min is a finite 𝒪-algebra, as we have just seen, RΣ,𝒪str is a finite 𝒪-algebra generated by at most

graphic file with name M100.gif

elements, the second equality coming from Proposition 4.3. Since TΣ,𝒪str is a finite free 𝒪-module and a quotient of RΣ,𝒪str, it follows that RΣ,𝒪str = TΣ,𝒪str. The proof for the Selmer case is similar. In 2.4 we observed that RΣ,𝒪Sel = RΣ,𝒪str𝒪 𝒪[ΔΣ], so it follows from the strong case of the theorem that RΣ,𝒪Sel is a finite free 𝒪-module and that

graphic file with name M101.gif

the last equality coming from Proposition 4.3. Since TΣ,𝒪Sel is a quotient of RΣ,𝒪Sel, equality of ranks implies that the rings are the same. Finally, the Selmer case of the theorem together with 2.3, Proposition 4.5, and the surjection πord : RΣ,𝒪ordTΣ,𝒪ord yields

graphic file with name M102.gif 6.4

Therefore, RΣ,𝒪ord and TΣ,𝒪ord are finite Λ-modules, generated by rank𝒪TΣ,𝒪Sel elements. Since TΣ,𝒪ord is a torsion-free Λ-module, and since TΣ,𝒪Sel = TΣ,𝒪ord/T is a free 𝒪-module, it is not hard to see that TΣ,𝒪ord is a free Λ-module, necessarily having Λ-rank equal to the 𝒪-rank of TΣ,𝒪Sel. It now follows, just as in the previous cases, that RΣ,𝒪ord = TΣ,𝒪ord. This completes the proof of the theorem. □

Theorem 1.2 is an immediate consequence of Theorem 6.1. Suppose ρ is a representation satisfying the conditions of the theorem. Consider ρ1 = ρχ̃2−1, and let F be a finite extension of Qp such that ρ1 takes values in GL2(F). Let A be the ring of integers of F, and let k be the residue field of A. Using the hypotheses on χ and Σ, arguing as in the second paragraph of this section shows that there is a Gal(QΣ/Q)-stable A-lattice L such that ρL has the same trace and determinant as ρ1 and ρL is an ordinary A-deformation of a representation ρ0 : Gal(QΣ/Q) → GL2(k) satisfying 2.2. Theorem 6.1 implies that there is a homomorphism

graphic file with name M103.gif

such that

graphic file with name M104.gif

Let 𝔭 be the kernel of α. Then 𝔭 is a height one prime ideal of TΣ,Aord containing (1 + T)pm − (1 + p)pm(k−2) for some m. As mentioned in Section 4, Hida [Hid85, ref. 14, Hid88, ref. 15] has shown that 𝔭 corresponds to a newform of weight k. That is, there is a weight k newform f such that the ℓth Fourier coefficient of f is α(T). As the representation ρf satisfies

graphic file with name M105.gif

it must be that ρf ≅ ρ1, and therefore ρf ⊗ χ2 ≅ ρ.

Acknowledgments

This work was supported by a National Science Foundation Graduate Fellowship and a Sloan Foundation Doctoral Dissertation Fellowship (C.M.S.) and a National Science Foundation grant.

References

  • 1.Fontaine J-M, Mazur B. In: Elliptic Curves, Modular Forms, and Fermat’s Last Theorem. Coates J, Yau S T, editors. Boston: International Press; 1995. [Google Scholar]
  • 2.Wiles A. Ann Math. 1995;141:443–551. [Google Scholar]
  • 3.Diamond D. Ann Math. 1996;144:136–166. [Google Scholar]
  • 4.Fujiwara K. Deformation rings and Hecke rings in the totally real case. 1996. p. preprint. [Google Scholar]
  • 5.Taylor R, Wiles A. Ann Math. 1995;142:553–572. [Google Scholar]
  • 6.Lenstra H W. In: Elliptic Curves, Modular Forms, and Fermat’s Last Theorem. Coates J, Yau S T, editors. Boston: International Press; 1995. [Google Scholar]
  • 7.Mazur B, Wiles A. Invent Math. 1984;76:179–330. [Google Scholar]
  • 8.Washington, L. (1980) Introduction to Cyclotomic Fields, Graduate Texts in Math. 83 (Springer, New York).
  • 9.Mazur B. Galois Groups over Q. New York: Springer; 1989. [Google Scholar]
  • 10.Ramakrishna R. Compositio Math. 1993;87:269–286. [Google Scholar]
  • 11.Coates J. In: Algebraic Number Fields. Fröhlich A, editor. New York: Academic; 1977. [Google Scholar]
  • 12.Matsumura, H. (1986) Commutative Ring Theory, Camb. Studies in Adv. Math., 8 (Cambridge Univ. Press, Cambridge, U.K.).
  • 13.Shimura G. Introduction to the Arithmetic Theory of Automorphic Functions. Princeton: Iwanami Shoten and Princeton Univ.; 1971. [Google Scholar]
  • 14.Hida H. Invent Math. 1985;85:545–613. [Google Scholar]
  • 15.Hida H. Ann Math. 1988;128:295–384. [Google Scholar]
  • 16.Carayol, H. (1986) Ann. Sci. Ec. Nor. Sup., IV, Ser. 19, 409–468.
  • 17.Wiles A. Invent Math. 1988;94:529–573. [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES