Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2025 Jul 11;122(28):e2500729122. doi: 10.1073/pnas.2500729122

The Eisenstein ideal at prime-square level has constant rank

Jaclyn Lang a,1,2, Preston Wake b,1,2
PMCID: PMC12280971  PMID: 40643979

Significance

In his proof of Fermat’s Last Theorem, Wiles pioneered a method for relating two disparate collections of objects, namely Galois representations and modular forms. He parameterizes each collection by a certain space, and then the parameterizing spaces are shown to match, thereby associating a modular form to each Galois representation. However, the shape of the underlying parameterizing space typically remains somewhat mysterious. In this paper, the parameterizing space is explicitly determined under certain hypotheses, and the description turns out to be strikingly uniform when varying the data upon which the parameterizing space depends.

Keywords: number theory, modular forms, Galois representations

Abstract

Let N and p be prime numbers with p5 such that p(N+1). In a previous paper, we showed that there is a cuspform f of weight 2 and level Γ0(N2) whose -th Fourier coefficient is congruent to +1 modulo a prime above p for all primes . In this paper, we prove that this form f is unique up to Galois conjugacy, and the extension of Zp generated by the coefficients of f is exactly Zp[ζp+ζp1]. We also prove similar results when a higher power of p divides N+1.

1. Introduction

Let p be a prime number and let ρ¯:Gal(Q¯/Q)GL2(F¯p) be a modular residual Galois representation. How many different Hecke eigenforms f give rise to ρ¯, and what can be said about the p-adic field Qp(f) generated by the Hecke eigenvalues of f? One can fine-tune this question by constraining the various parameters involved. For instance, if one fixes the level of f but allows the weight to vary, then Buzzard, motivated by conjectures about slopes of modular forms, asked whether the degrees [Qp(f):Qp] are bounded (1, question 4.4). In the case p=2 with level 1, Buzzard even suggested a bound of 2 on [Q2(f):Q2]. We know of very little progress on this question in the twenty years since Buzzard asked it. (While preparing this article, we learned of recent work by Kimball Martin and Anna Medvedovsky giving examples of level-one f, where [Q2(f):Q2]>2. There does not seem to be a consensus among experts about the question of boundedness.)

In this paper, we consider a question orthogonal to Buzzard’s: We fix the representation ρ¯=ω1, where ω:GQFp× is the mod-p cyclotomic character. We are interested in Hecke eigenforms of fixed weight 2 that give rise to ρ¯, but we allow the level to vary among certain prime powers. There is another way to phrase this in terms of Hecke algebras: There is a localization T of the Hecke algebra whose minimal prime ideals correspond to Galois-conjugacy classes of such eigenforms, and we are interested in the number and degree of these minimal primes. For primes N,p5, in order to have a cuspidal eigenform of weight 2 and N-power level with mod-p residual representation ρ¯, one must have p(N21) (2, theorem 2.8), so the two cases of interest are when N±1modp.

If the level is constrained to be prime, Mazur asked about the rank of the Hecke algebra T (3, p. 140). [Calegari and Emerton (4) first pointed out the parallels between Mazur’s question and Buzzard’s.] In this prime-level case, the degrees of Qp(f) (and their sum—the rank of T) have a great deal of arithmetic significance. They have been studied using modular symbols by Merel (5) and Lecouturier (6), where they are shown to be related to special values of equivariant L-functions. Using Galois representations, Calegari–Emerton (4) and Wake–Wang-Erickson (7) show that these ranks are related to class groups and Massey products in Galois cohomology. In numerical examples, the most common scenario is that there is a unique cusp form giving rise to ρ¯ and its Hecke field is Qp, but this is certainly not always the case. See the tables in refs. 3, p. 40 and 7, § 1.6 for data about the rank and irreducible components of T in small prime level. It is not known whether the degrees of the Qp(f) are bounded independently of the level N. Heuristics given in ref. 7 suggest that, given a prime N with p(N1), the probability that there is a form f of level N such that [Qp(f):Qp]=d is p1pd. This accounts for the numerical evidence that the degrees [Qp(f):Qp] are usually small but suggests that they are unbounded.

In this paper, we consider the same representation ρ¯=ω1 and same weight 2, but we vary the level over squares of primes N such that p(N+1). For such primes N, Mazur’s results imply that there are no newforms f of level Γ0(N) giving rise to ρ¯. However, in our previous work, we show that there is a newform of level Γ0(N2) with

residual representation ρ¯ (2, theorem B). Our main result in this paper is that in this case, we can compute the Hecke algebra T explicitly. More precisely we have the following theorem.

Theorem 1.1.

Let N,p5 be prime numbers such that N1modp, and let r1 be the p-adic valuation of N+1. Let T be the Hecke algebra parameterizing modular forms of level Γ0(N2) and weight 2 with mod-p residual representation ω1. Let Δ=FN2×,p-part, a cyclic group of order pr, let Λ=Zp[Δ], and let Λ+Λ be the subring fixed under the involution given by inversion on Δ. Then there is a canonical isomorphism of Zp-algebras Λ+T sending the augmentation ideal of Λ+ to the Eisenstein ideal of T.

Since minimal prime ideals of T correspond to (Galois-conjugacy classes of) eigenforms, the theorem allows us to answer all of the questions asked at the start of this introduction: The fields Qp(f) correspond to the fraction fields of minimal primes of Λ+. The situation is particularly simple when p(N+1) (i.e. when r=1), in which case there is only one minimal prime of Λ+ other than the augmentation ideal, and this prime has residue ring Zp[ζp+ζp1], so the theorem yields the following.

Corollary 1.2.

Let N,p5 be prime numbers such that p(N+1). Then there is a cuspidal eigenform f of level Γ0(N2) and weight 2 with coefficients in Zp[ζp+ζp1] such that

a(f)1+(modp) [1]

for all prime numbers , where p is the maximal ideal of Zp[ζp+ζp1]. Moreover, this form is the unique (up to Galois-conjugacy) cuspform satisfying Eq. 1.

Note that this implies that the fields Qp(f) are independent of N. Contrast this with the prime-level case, where the heuristic suggests that the degrees [Qp(f):Q] are unbounded as one varies over primes N with p(N1).

Of course, there is a variant of Corollary 1.2 when r>1: In that case, there are r Galois-conjugacy classes, and the coefficient rings are Zp[ζpi+ζpi1] for i=1,,r.

The proof of Theorem 1.1 uses Galois deformation theory, as pioneered by Mazur (8), to bound the size of T by showing, essentially, that the possible ways to deform the residual Galois representation are limited. There are many previous works about deforming reducible residual representations, including refs. 915, that influenced our thinking, but we believe the idea to use deformation theory to understand ranks of Hecke algebras originates with ref. 4. We use the theory of pseudorepresentations with deformation conditions as developed in refs. 7 and 16.

1.1. Outline of the Paper.

The proof of Theorem 1.1 takes up most of the paper. We sketch the proof here, indicating in which sections the steps take place. Let R be the pseudodeformation ring of ρ¯ parameterizing deformations that have fixed determinant and that are unramified outside Np and finite-flat at p. (The theory of pseudodeformations is reviewed in Section 2.) As usual, there is a surjection RT. We define the “pseudo-minimal” quotient Rpseudo-min of R corresponding to deformations whose trace equals the trace of the trivial representation on inertia-at-N. In other words, Rpseudo-min parameterizes representations for which the semisimplification of the restriction to inertia-at-N is trivial. This includes representations that are unramified at N, but also representations that are Steinberg at N. However, Mazur’s results imply that there are no cuspforms of level Γ0(N) that are congruent to the Eisenstein series, so one would expect that there are no representations that are Steinberg at N. In Section 3, we prove that this is true: Rpseudo-min=Zp. This key result shows that R is entirely determined by the local behavior at N. In Section 4, we define a local-at-N pseudodeformation ring RN, and prove that all local deformations come from inducing a character of GQN2, which gives an isomorphism RNΛ+. Together with Rpseudo-min=Zp, this gives surjections Λ+RT. To complete the proof, in Section 5, we show that these surjections are isomorphisms using Wiles’s numerical criterion, applying our previous results (2) to understand the congruence number. Finally, in Section 6, we indicate how our results are related to the Massey-products method of ref. 7.

2. Pseudodeformations

In this section, we review the aspects of deformation theory of pseudorepresentations that we will need in the next section. There are no new results in this section; it is a digest of material from many sources, including refs. 10, 12, 14, 17, and 18.

2.1. Pseudorepresentations.

The concept of a pseudorepresentation came about to codify the formal properties of the trace (or, more generally, the characteristic polynomial) of a representation. The first definition of pseudorepresentation was made by Wiles (19) for 2-dimensional representations and was later generalized by Taylor (20), Rouquier (21), and Chenevier (12). We will use Chenevier’s version, which he calls “determinants.”

Chenevier’s notion of pseudorepresentation mimics the properties of the determinant. For a commutative ring A, the determinant map det:Mn(A)A has many well-known properties: It is multiplicative, in that det(xy)= det (x)det(y) and det(1)=1, and has degree n, in that det(ax)=andet(x) for aA and x,yMn(A). It is also a polynomial function in the entries of the matrix. In particular, if B is a commutative A-algebra, then one can also apply det to an element of the tensor product Mn(A)AB and obtain, in a natural way, an element of B. In particular, taking B=A[t] allows one to define the characteristic polynomial det(tx)A[t] of a matrix x.

Definition 2.1:

Let A be a commutative ring and E an A-algebra. A pseudorepresentation of E of degree d, written D:EA, is a collection of maps DB:EABB, one for each commutative A-algebra B, that are natural in B and satisfy:

  • DB(xy)=DB(x)DB(y) and DB(1)=1,

  • DB(bx)=bdDB(x)

for all x,yEAB and all bB. The map DA is abbreviated to D. The characteristic polynomial of eE for a fixed pseudorepresentation D:EA is defined to be DA[t](te)A[t].

If G is a group, then a pseudorepresentation of G of degree d over A, written D:GA, is a pseudorepresentation of A[G].

In this paper, we will be interested exclusively in degree-two pseudorepresentations and only in the case where 2 is invertible in the ring A. In this case, pseudorepresentations have a simpler description (12, example 1.8), as we now recall. A d-dimensional pseudorepresentation D:EA is determined by the coefficients of DA[t1,,td](x1t1++xdtd) for xiE, which is a homogeneous polynomial of degree d. In particular, when d=2 these coefficients are determined by DA (22, proposition II.1):

DA[t1,t2](x1t1+x2t2)=D(x1)t12+(D(x1+x2)D(x1)D(x2))t1t2+D(x2)t22.

Taking x1=1 and specializing t2 to 1, we find that the characteristic polynomial of xE is

DA[t](tx)=t2(D(1+x)D(x)1)t+D(x).

Therefore we define the trace TrD:EA of D by the formula

TrD(x)=D(x+1)D(x)1.

For a degree-two pseudorepresentation D:GA of G, the trace satisfies relations

  1. TrD(xy)=TrD(yx) and TrD(1)=2, and

  2. D(x)TrD(x1y)TrD(x)TrD(y)+TrD(xy)=0

for all x,yG (12, lemma 7.7).

Conversely, if D:GA× is a homomorphism and T:GA is a function such that the pair (D,T) satisfy (1) and (2), then the formula

DB(bx+cy)=D(x)b2+(T(x)T(y)T(xy))bc+D(y)c2

for x,yG and b,cB, defines a pseudorepresentation D:GA. In this way, one can think of a pseudorepresentation as the data of the functions D and TrD. Moreover, if 2 is invertible in A, then one can recover DA from TrD using formula (2) as D(x)=TrD(x)2TrD(x2)2.

Remark 2.2:

Thus far in the discussion, we have considered discrete groups and rings. For topological groups and rings, one considers continuous pseudorepresentations D:GA, which amounts to requiring that the functions D:GA and TrD:GA are continuous (see ref. 12, section 2.30). If D:GA is a continuous pseudorepresentation and HG is a dense subgroup, then D is determined by its restriction to H (12, example 2.31). To simplify the discussion below, we will always assume that pseudorepresentations are continuous if we are using topological groups.

Example 2.3:

Let G be a group and A be a commutative ring. If ρ:GGL2(A) is a homomorphism, then the pair of functions (D,T)=(detρ,Trρ) is, of course, a pseudorepresentation. Moreover, if there is a subring AA such that D and T both have images in A, then (D,T) defines a pseudorepresentation D:GA.

This example can be seen as one of the major advantages of pseudorepresentations and is the purpose for which Wiles first used them. Note that it may not be true that there is a conjugate ρ of ρ such that ρ has values in GL2(A). For instance, let G be the subgroup of GL2(C) generated by (i00i) and (0110), which is isomorphic to the quaternion group of order 8; the trace and determinant of all elements of G are in R, but the inclusion GGL2(C) cannot be conjugated to land in GL2(R).

2.2. Cayley–Hamilton Algebras and Generalized Matrix Algebras.

Not every pseudorepresentation D:GA comes from a true representation ρ:A[G]M2(A) as in Example 2.3. However, Chenevier has defined a generalization of representations, called Cayley-Hamilton representations, that provide a natural substitute (12, remark 7.19). In good situations, these Cayley-Hamilton representations are valued in a generalized matrix algebra, which have many useful properties in common with usual matrix algebras.

Definition 2.4:

Let A be a commutative ring and let E be an A-algebra. A degree-two pseudorepresentation D:EA is called Cayley–Hamilton if for all xEAB,

x2TrDB(x)x+DB(x)=0.

A pair (E,D) of an A-algebra E and a Cayley–Hamilton pseudorepresentation D is called a Cayley–Hamilton algebra.

If G is a group, then a Cayley–Hamilton representation of G is a triple (E,D,ρ), where (E,D) is a Cayley–Hamilton algebra and ρ:GE× is a group homomorphism. The composition Dρ:GA defines a pseudorepresentation ψ(ρ) of G over A called the associated pseudorepresentation.

For example, the algebra E=M2(A) with the pseudorepresentation given by the determinant is Cayley–Hamilton, by the Cayley–Hamilton Theorem (whence the name). A representation ρ:A[G]M2(A) gives rise to a Cayley–Hamilton representation, just as in Example 2.3.

Definition 2.5:

Let A be a commutative ring and let E be an A-algebra that is finitely generated as an A-module. A (2-dimensional) generalized matrix algebra structure on E is the data of

  • an idempotent element eE,

  • A-algebra isomorphisms ϕ:eEeA and ϕ:eEeA, where e=1e,

such that the function Tr:EA defined by

Tr(x)=ϕ(exe)+ϕ(exe)

satisfies Tr(xy)=Tr(yx) for all x,yE.

An A-algebra E together with a generalized matrix algebra structure is called an A-GMA.

An example of an A-GMA is the matrix algebra E=M2(A) with e=(1000) and the obvious isomorphisms eM2(A)eA and eM2(A)eA. In general, an A-GMA can be written in the form

E=ABCA,

where B=eEe and C=eEe are sub-A-modules of E. The multiplication can be written as

abcdabcd=aa+m(b,c)ab+bdca+dcdd+m(c,b), [2]

where m:B×CA is the map m(exe,eye)=ϕ(exeye).

Conversely, if B and C are two finitely generated A-modules and m:BACA is an A-linear map satisfying certain properties, then defining EABCA with the multiplication as in Eq. 2 defines an A-GMA (see refs. 10, section 1.3 and 18, example 3.1.7 for more precise statements).

Example 2.6:

If A=k[x]/(xn) for a field k, then there is a GMA E given by

E=AxAxAA,

where m:xA×xAA is m(ax,bx)=abx.

2.3. Pseudodeformation Rings.

Let G be a group and F be a finite field of characteristic p and let D¯:GF be a pseudorepresentation. In this section, we discuss deformations of D¯. We assume that G is profinite and satisfies Mazur’s finiteness condition: For every open normal subgroup HG, there are only finitely many continuous group homomorphisms HZ/pZ. For instance, G could be the absolute Galois group of a local field or the Galois group of the maximal extension of a number field that is unramified outside a finite set.

Let C be the category of complete local Noetherian W(F)-algebras (A,mA) with residue field F. For an object A in C, a deformation of D¯ to A is a pseudorepresentation D:GA such that DAF=D¯. The set-valued functor on C sending A to the set of deformations of D¯ to A is representable by a ring (RD¯,mD¯) in C (12, proposition E). The resulting pseudorepresentation Du:GRD¯ is called the universal pseudodeformation.

A Cayley–Hamilton representation (E,D,ρ) of G is said to have residual representation D¯ if the associated pseudorepresentation ψ(ρ) of G is a deformation of D¯. The collection of Cayley–Hamilton representations with residual representation D¯ forms a category in a natural way, and this category has a universal object (ED¯,DEE¯u,ρu), which is a Cayley–Hamilton algebra over RD¯ and whose associated pseudorepresentation is the universal pseudodeformation (14, proposition 3.6).

Now assume that D¯=χ1χ2 for two distinct characters χ1,χ2:GF×. In this case, there is a natural generalized matrix algebra structure on ED¯, written as

ED¯=RD¯BD¯CD¯RD¯ [3]

with the property that, if ρu:GED¯ is written as ρu(g)=(a(g)b(g)c(g)d(g)), then a(g)χ1(g)(modmRD¯). See refs. 10, lemma 1.4.3 and 12, theorem 2.22 for more details.

2.4. Tangent Spaces.

The (equicharacteristic) tangent space to a deformation functor is the set of first-order deformations (that is, deformations with values in the dual numbers); this set is naturally a vector space over the residue field. For a representation ρ¯:GGL2(F), this means looking at deformations ρ:GGL2(F[ϵ]/(ϵ2)). It is well known that, in this case, the tangent space can be identified with the group cohomology H1(G,ad(ρ¯)) (see (23, proposition 1, pg. 284), for instance). The identification sends a cocycle ϕZ1(G,ad(ρ¯)) to the deformation

ρϕ=(1+ϕϵ)ρ¯:GGL2(F[ϵ]/(ϵ2)).

The computation of the tangent space of a pseudodeformation ring is similar to this but is complicated by the fact that not all of these deformations alter the pseudorepresentation. For instance, if ρ¯=(χ100χ2) for distinct characters χ1 and χ2, then there is an isomorphism of G-modules ad(ρ¯)(FF(χ1χ21)F(χ11χ2)F). If ϕZ1(G,(F00F)), then the deformation ρϕ amounts to deforming the two characters χ1 and χ2 separately and does change the pseudorepresentation. However, if bZ1(G,F(χ1χ21))Z1(G,ad(ρ¯)), then ρb=(χ1χ2bϵ0χ2). This is a nontrivial deformation of ρ¯, but, since the trace and determinant are unchanged, it is a trivial pseudodeformation.

To get a nontrivial pseudodeformation out of cocycles bZ1(G,F(χ1χ21)) and cZ1(G,F(χ11χ2)) one has to assume more. Namely, if the cup product bc vanishes in H2(G,F), then there is a cochain ϕ:GF such that dϕ=bc. There is also a cochain ϕ:GF such that dϕ=cb, namely ϕ=bcϕ, where bc:GF is the function (bc)(g)=b(g)c(g). If, in addition, there is a cochain b1:GF such that db1=bϕ+ϕb, then one can define a representation using these data by

ρb,c,ϕ=χ1+χ1ϕϵχ2(b+b1ϵ)χ1cϵχ2+χ2ϕϵ. [4]

Note that this is not a deformation of ρ¯ as a representation, since its residual representation is (χ1χ2b0χ2), but it is a pseudodeformation. Let

Db,c,ϕ=Tr(ρb,c,ϕ)=χ1+χ2+ϵ(χ1ϕ+χ2ϕ) [5]

be the associated pseudorepresentation, and note that it involves ϕ, b, and c, but not b1. In fact, one can prove that Db,c,ϕ defines a pseudodeformation without assuming the existence of the cochain b1 (this can be proven using the GMA of Example 2.6).

An exact description of the tangent space of a pseudodeformation ring has been worked out beautifully by Bellaïche in ref. 17 and generalized by Wang-Erickson in ref. 24, section 3.3. Let D¯:GF be D¯=χ1χ2 for distinct characters χ1 and χ2. Let mD¯ be the maximal ideal of RD¯ and let tD¯=HomF(mD¯/(p,mD¯2),F) be the tangent space. By ref. 17, theorem A), there is an exact sequence

0H1(G,F)H1(G,F)tD¯H1(G,χ1χ21)FH1(G,χ11χ2)H2(G,F)H2(G,F). [6]

The subspace H1(G,F)H1(G,F) corresponds to the reducible deformations that deform χ1 and χ2 separately. For bH1(G,F(χ1χ21)) and cH1(G,F(χ11χ2)) such that bc=0, the corresponding element of tD¯ is exactly Eq. 5.

2.5. Reducibility Ideal.

We now return to the situation of Eq. 3, so D¯=χ1χ2 for distinct characters χ1,χ2:GF×. We say that a deformation D of D¯ is reducible if D=χ~1χ~2 for deformations χ~i of χi. The reducible deformations define a subfunctor of the pseudodeformation functor that is represented by a quotient RD¯red of RD¯. The kernel of the map is called the ideal of reducibilityJD¯=ker(RD¯RD¯red).

The ring RD¯red is fairly easy to understand: It can be identified with the completed tensor product of deformation rings of the characters χi (see ref. 18, proposition 4.3.4). The ideal of reducibility is related to the GMA-structure on ED¯ by a theorem of Bellaïche and Chenevier: JD¯ is the image of the map BD¯RD¯CD¯RD¯ defined by the GMA-structure Eq. 3 (see ref. 10, section 1.5.1). In particular, there is a surjective map

BD¯RD¯CD¯JD¯. [7]

Moreover, certain quotients of the modules BD¯ and CD¯ can be understood using group cohomology. Let RD¯A be a morphism in C, and let χ1,A,χ2,A:GA× be the corresponding deformations of χ1 and χ2. Then there is an isomorphism

HomA(BD¯RD¯A,A)H1(G,χ1,Aχ2,A1) [8]

by ref. 10, theorem 1.5.6 and a similar isomorphism for CD¯ with the roles of χ1,A and χ2,A reversed.

Taken together, these results can give a fairly clear picture of the structure of RD¯, especially when the cohomology groups H1(G,χ1,Aχ2,A1) and H1(G,χ1,A1χ2,A) are small.

Example 2.7:

Suppose that H1(G,χ1χ21) and H1(G,χ11χ2) are both 1-dimensional F-vector spaces. Then Eq. 8 and Nakayama’s lemma imply that BD¯ and CD¯ are both cyclic RD¯-modules. By Eq. 7, this implies that JD¯ is a principal ideal.

To see how this compares to the tangent space sequence Eq. 6, consider the reduction of Eq. 7 modulo mD¯:

BD¯/mD¯BD¯FCD¯/mD¯CD¯JD¯/mD¯JD¯(JD¯+(p,mD¯2))/(p,mD¯2)mD¯/(p,mD¯2).

Taking the F-dual of this composite map and using Eq. 8 gives a map

tD¯H1(G,χ1χ21)FH1(G,χ11χ2)

that equals the map in Eq. 6.

2.6. Deformation Conditions.

For applications to number theory, often one wants to consider deformations that satisfy certain conditions rather than the universal deformations considered thus far. For instance, one often wants to understand Galois representations that “come from geometry,” a condition that is usually expressed in terms of ramification and p-adic Hodge theory. For deformations of representations, Ramakrishna worked out a theory for deformations with conditions (25), and this theory has been generalized to pseudodeformations (18).

A deformation condition on representations of a group G is a full subcategory c of the category of finite Zp[G]-modules that is closed under isomorphisms, submodules, quotient modules, and finite direct sums. We think of this as a condition on modules, so we say that a module “has c” if it is in c. By definition, a pseudorepresentation D:GA of G with values in a finite ring A in Chas c if there is a Cayley–Hamilton representation (E,DE,ρ) over A such that the Zp[G]-module E has c and such that D=DEρ. A general ring A in C is a limit of finite rings, so the definition is extended to A by taking limits.

With this definition, the constructions and properties carried out in this section extend to pseudorepresentations with c. In particular, there are quotients RD¯,c and ED¯,c of RD¯ and ED¯ that parameterize deformations and having property c (18, section 2.5). Moreover, the analogs of Eqs. 7 and 8 hold with the c-versions, except that, in Eq. 8, the group cohomology H1(G,χ1,Aχ2,A1) needs to be replaced by the group Hc1(G,χ1,Aχ2,A1) of extensions

0χ1,AEχ2,A0,

where E has c (18, section 4.3). [This group Hc1 is a natural generalization of the Bloch–Kato cohomology groups He1, Hf1, and Hg1 (26, section 3, pg. 352).]

3. Reduction to a Local Problem

In this section, we prove some important reductions toward the proof of Theorem 1.1. First, we define the Hecke algebra T and the pseudodeformation ring R that are relevant to the problem and prove that there is a surjection RT. Then we analyze the tangent space of R and use this to prove the key result: The pseudo-minimal quotient Rpseudo-min of R is equal to Zp.

3.1. The Hecke Algebra.

Denote by M2(Γ0(N2)) the space of modular forms of weight 2 and level Γ0(N2) with integral coefficients and S2(Γ0(N2)) the submodule of cusp forms. Let T~ be the subring of EndZ(M2(Γ0(N2)) generated by the Hecke operators Tn for all n. Let I, called the Eisenstein ideal, be the ideal in T generated by TN and T1 for all N. Let mρ¯ be the maximal ideal generated by I and p, and let T be the completion of T~ at mρ¯. Finally, T0 denotes the maximal quotient of T that acts faithfully on S2(Γ0(N2))mρ¯. The quotient M2(Γ0(N2))mρ¯/S2(Γ0(N2))mρ¯ is generated by a single Eisenstein series E, which is an eigenform for all TT. Its TN-eigenvalue is 0, and its T-eigenvalue is +1 for all primes N (2, theorem 2.8).

3.2. The Pseudodeformation Ring.

Let GQ,Np be the Galois group of the maximal extension of Q that is unramified outside , N, and p. Fix embeddings of Q¯ into Q¯p and Q¯N, and let GN,GpGQ,Np be the corresponding decomposition groups at N and p. Let INGN and IpGp be their respective inertia groups. Let D¯:GQ,NpFp be the pseudorepresentation ω1. Let c be the “finite-flat” condition; that is, c is the category of finite Zp[GQ,Np]-modules M such that there is a finite-flat group scheme G over Zp such that MG(Q¯p) as Gp-modules. This is a deformation condition by ref. 25, section 2. Let R be the quotient of RD¯,c parameterizing deformations that have determinant equal to the p-adic cyclotomic character, which we denote by ϵ. That is, R is the quotient by the ideal generated by Du(σ)ϵ(σ) for all σGQ,Np. Abusing notation slightly, let Du:GR denote the composition of the universal deformation with RD¯R.

Lemma 3.1.

There is a surjective Zp-algebra homomorphism RT.

Proof: Since T is known to be reduced, there is an injection TpT/p, where p ranges over the minimal primes of T. There is one minimal prime given by the action of T on the Eisenstein series E. The other minimal primes are the kernels of the maps TQp(f) for eigenforms f in S2(Γ0(N2))mρ¯. Such forms f have all of their Hecke eigenvalues congruent to those of E; in particular, each such f is ordinary at p (since ap(E)=1+p is a unit) and has ρ¯f=ω1. (In fact, using Katz’s result on the injectivity of the theta operator on weight 2 forms, one can show that every cuspform f with ρ¯f=ω1 is ordinary at p.) Let S be the set of such cuspidal eigenforms. Then there is an injection

TZp×fSQp(f),

sending Tn to (an(E),(an(f))fS). We will identify T with the image of this injection and construct a homomorphism RZp×fSQp(f) whose image is T.

For each fS, the Galois representation ρf defines a pseudorepresentation Df:GQ,NpOf that deforms D¯. Since the level of f is prime to p, Df satisfies the finite flat condition; indeed, the Galois representation of f comes from that of an abelian variety with good reduction at p. Also, the determinant of Df is ϵ since f has weight 2 and trivial Nebentypus. Hence Df defines a map RQp(f). There is also a map RZp given by the pseudorepresentation ϵ1. This defines a map

Φ:RZp×fSQp(f).

We have to show that the image of Φ is T. For a prime Np, since Tr(ρf(Frob))=a(f), the map Φ sends TrDu(Frob) to the image of T. Since the elements TrDu(Frob) topologically generate R by Chebotarov density, this implies that the image of Φ equals the Zp-subalgebra of T generated by {T:Npprime}. It remains to show that this subalgebra contains TN and Tp. Since every fS is new at N2, it follows that aN(f)=0. But aN(E)=0 as well, so TN=0 in T. Finally, Tp is in the subalgebra generated by {T:Npprime} by the ordinary property. To see this, let αf be the unique unit root of X2ap(f)X+p. The fact that f is ordinary at p implies that, for σGp,

Tr(ρf)(σ)=ϵ(σ)λ(αf)(σ)+λ(αf)1(σ),

where λ(x) is the unramified character of Gp sending Frobp to x. If τIp is an element such that ϵ(τ)1modp, then

Tr(ρf)(τFrobp)Tr(ρf)(Frobp)=(ϵ(τ)1)ϵ(Frobp)αf,

so the (unique) unit root α of X2TpX+p in T equals the image of TrDu(τFrobp)TrDu(Frobp)(ϵ(τ)1)ϵ(Frobp). Since the nonunit root of X2TpX+p is pα1, it follows that Tp=α+pα1 is in the image of R, as desired.

Let Rred=RRD¯RD¯red be the quotient of R that parameterizes reducible deformations.

Lemma 3.2.

The homomorphism RredZp given by the reducible deformation ϵ1 is an isomorphism.

Proof: The ring RD¯red is the completed tensor product of the finite flat deformation rings of ω and 1, with universal deformation χωχ1, where χω and χ1 are the universal finite flat deformations and ω and 1, respectively (18, proposition 4.3.4). Fixing the determinant to be ϵ gives χω=ϵχ11 in Rred, so it suffices to show that χ1=1 in Rred. A deformation of 1 factors through the maximal abelian pro-p quotient of GQ,Np, which, by the Kronecker–Weber theorem, is the pro-p quotient of Gal(Q(ζNp)/Q). Since p(N1), the maximal pro-p quotient is unramified at N. Recall that finite flat characters of Gp are an unramified character times either the trivial character or the p-adic cyclotomic character. In particular, the projections of χω and χ1 in Rred are both of this form, which forces χ1 to be unramified at p. Thus χ1 is unramified everywhere and hence trivial.

Let J=ker(RRred) be the reducibility ideal of R; Lemma 3.2 implies that R/J=Zp. Let B=BD¯,cRD¯,cR and C=CD¯,cRD¯,cR. By Eq. 7, there is a surjective map

BRCJ.

Lemma 3.3.

The R-modules B and C are cyclic and J is a principal ideal.

Proof: By ref. 18, theorem 4.3.5 (which is the analog of Eq. 8 with deformation conditions), there are isomorphisms

HomR(B,Fp)Hc1(GQ,Np,Fp(1)),HomR(C,Fp)=Hc1(GQ,Np,Fp(1)).

These groups have been computed to be one-dimensional in ref. 7, proposition 6.3.2 and lemma 6.3.6, respectively.* Note that the same reference shows that the groups Hc1(GQ,p,Fp(±1)), with no ramification at N, are trivial; we will use this fact in the proof of the next proposition.

Since HomR(B,Fp) and HomR(C,Fp) are one dimensional, Nakayama’s lemma implies that B and C are cyclic R-modules. Then the surjection BRCJ of Eq. 7 implies that J is principal.

Proposition 3.4.

There is an isomorphism

R/(p,mD¯2)Fp[ϵ]/(ϵ2)

given by a pseudorepresentation Db,c,ϕ of the form Eq. 5 with χ1=ω and χ2=1, where b and c are cocycles representing generators of the groups Hc1(GQ,Np,Fp(1)) and Hc1(GQ,Np,Fp(1)), respectively. Moreover, b and c are ramified at N.

Proof: By Lemma 3.3, there is an element xJ that generates J. By Lemma 3.2, R/J=Zp. This implies that mD¯=(p,x), and that the maximal ideal of R/pR is principal. There is a surjection RT by Lemma 3.1, and T is a free Zp-module of rank at least 2 by ref. 2, theorem B, so R/pRFp. Hence R/(p,mD¯2) is isomorphic to Fp[ϵ]/(ϵ2). This isomorphism defines an element of the tangent space tD¯ of RD¯. This element cannot be a reducible deformation by Lemma 3.2, so it must be of the claimed form. Finally, the last statement follows from the fact, mentioned in the proof of Lemma 3.3, that the groups Hc1(GQ,p,Fp(±1)), with no ramification at N, are trivial.

Let Rpseudo-min be the quotient of R by the ideal generated by TrDu(σ)2 for all σIN. This is called the pseudo-minimal quotient as it parameterizes pseudorepresentations that equal the trivial pseudorepresentation on IN. A pseudorepresentation is called minimal if it comes from a Cayley–Hamilton representation (E,D,ρ) such that ρ|IN=1. A pseudo-minimal pseudorepresentation need not be minimal: A Steinberg-at-N representation is pseudo-minimal but not minimal.

Under the surjection RT of Lemma 3.1, the quotient Rpseudo-min should correspond to quotient of T that acts on forms of level Γ0(N). Since p(N1), results of Mazur (3, proposition II.9.7) imply that there are no cuspforms f of weight 2 and level Γ0(N) such that ρ¯f=ω1. Thus, if RT then one expects that Rpseudo-min=Rred=Zp. Indeed, this is the case.

Lemma 3.5.

The map Rpseudo-minZp given by the deformation ϵ1 is an isomorphism.

Proof: The deformation ϵ1 is obviously pseudo-minimal (in fact, minimal), so it defines a surjective homomorphism Rpseudo-minZp. To show it is an isomorphism, it is enough to show that the tangent space of Rpseudo-min/pRpseudo-min is trivial. Since the tangent space of R/pR is one-dimensional and generated by Db,c,ϕ by Proposition 3.4, it is enough to show that Db,c,ϕ is not pseudo-minimal. Recall the formula Eq. 5

Db,c,ϕ(x)=ω(x)+1+ϵ(ω(x)ϕ(x)+b(x)c(x)ϕ(x)).

Since ω is unramified at N, for σIN this equation simplifies to

Db,c,ϕ(σ)=2+b(σ)c(σ)ϵ.

Since b and c are ramified at N, there is σIN such that b(σ)c(σ)0. This implies that ϵ is in the kernel of the map

Fp[ϵ]/(ϵ2)R/(p,m2)Rpseudo-min/(p,m2),

completing the proof.

4. Computation of the Local Deformation Ring

In this section, we define a local deformation ring RN that is naturally augmented over Zp with augmentation ideal I. The global deformation ring R is an RN-algebra in a natural way, and the extension IR of I to R is the kernel of the map RRpseudo-min. In particular, Lemma 3.5 implies that R/IR=Zp. This says that the global deformations are completely controlled by the local deformations; indeed, by Nakayama’s lemma, it says that R is a cyclic RN-module. Finally, we completely characterize the local deformations, proving that they all come from inducing a character of GQN2, and deduce an isomorphism RNΛ+.

4.1. The Deformation Ring of the Supercuspidal Character.

One way to construct a deformation ρ:GNGL2(A) of D¯|GN with unramified determinant is to induce a character from GN2. As a preliminary to considering such inductions, we recall some properties of the universal such character.

Let Λ~ be the universal deformation ring of the trivial character GN2Fp, where GN2=Gal(Q¯N/QN2). By ref. 8, section 1.4, there is an isomorphism

Λ~=Zp[[GN2ab, pro-p]],

where GN2ab, pro-p is the maximal abelian pro-p quotient and the universal character is the tautological one. Fix a choice of Frobenius element FrobN2GN2. The local Artin map induces an isomorphism GN2ab, pro-pQN2×,pro-p that sends FrobN2 to N. Let Λ denote the quotient of Λ~ given by identifying FrobN2 with N. Using the local Artin isomorphism as an identification, Λ is identified with Zp[Δ], where Δ=ZN2×,pro-p=FN2×,pro-p is a cyclic group of order pr, where r=vp(N+1). (Here vp is the p-adic valuation normalized such that vp(p)=1.) Denote the universal character GN2Λ× by []. Let δΔ be a generator.

Consider the Galois representation

ρN:=IndGN2GN[]:GNGL2(Λ) [9]

given by inducing []. This is a deformation of D¯|GN and it satisfies det (ρN)=ϵ. For σIN, the trace of ρN(σ) is given by

Tr(ρN(σ))=[σ]+[σ]1. [10]

This lands in the subring Λ+Λ fixed by the involution ι that acts as inversion on group-like elements. For later use, we recall the structure of the ring Λ+.

Lemma 4.1.

There is an isomorphism Zp[x](xΨ(x))Λ+ given by x[δ]+[δ1]2, where Ψ(x) is a distinguished polynomial of degree pr12 with vp(Ψ(0))=r.

Proof: First, note that Λ+ is equal to the subring of Λ generated by [δ]+[δ1]. Indeed, every element of Λ+ can be represented by a symmetric polynomial in [δ] and [δ1], and every such polynomial is a polynomial in [δ]+[δ1].

Next note that the map

ΛZp×i=1rZp[ζpi] [11]

sending [δ] to (1,ζp,,ζpr) is injective with p-torsion cokernel. Taking ι-fixed parts gives a map

Λ+Zp×i=1rZp[ζpi+ζpi1],

again injective with p-torsion cokernel. Hence the surjective map Zp[x]Λ+ given by x[δ]+[δ1]2 sends xΨ(x) to zero, where Ψ(x) is the product of the minimal polynomials Ψi(x) of ζpi+ζpi2. The induced map Zp[x]/(xΨ(x))Λ+ is a surjective homomorphism of free Zp-modules of the same finite rank, so it is an isomorphism. Since each ring Zp[ζpi+ζpi1] is totally ramified over Zp, the polynomials Ψi(x) are Eisenstein, so vp(Ψ(0))=r.

4.2. A Computation of an Inertial Pseudodeformation Ring.

We now define a kind of local deformation ring RN. Roughly speaking, it is the ring parameterizing “deformations on inertia that extend to the decomposition group.” The main result of this section is Proposition 4.4, which states that all inertia deformations that extend to the decomposition group are supercuspidal, in the sense that they arise from an induction construction.

We first recall some properties of local Galois groups. There is an exact sequence

0INGNGal(QNnr/QN)0,

where QNnr is the maximal unramified extension. The group Gal(QNnr/QN) is isomorphic to GFN and hence is topologically generated by FrobNGN. The group IN is complicated, but its maximal pro-p quotient IN(p) is procyclic. Let τIN be an element that topologically generates IN(p). Frobenius acts on the image of τ in IN(p) by

FrobNτFrobN1=τN.

If ρ is a representation of IN(p) that extends to a representation of GN, then ρ(τ) and ρ(τN) must be conjugate and thus have the same traces and determinants. This motivates the following definition.

Definition 4.2:

Let RN be the quotient of RD¯|IN by the ideal generated by

  • Du(σ)1 for all σIN, and

  • TrDu(τ)TrDu(τN).

The pseudorepresentation associated to the trivial representation INGL2(Zp) defines a map RNZp, making RN into an augmented Zp-algebra. Let I=ker(RNZp) be the augmentation ideal; explicitly, it is the ideal generated by TrDu(σ)2 for all σIN.

Of course, if Δ:GNA is a deformation of D¯|GN with unramified determinant, then Δ|IN defines a map RNA. Thus restricting the universal pseudodeformation Du:GQ,NpR to IN induces a ring homomorphism RNR. The following lemma shows we are in the unusual situation that this map is surjective.

Lemma 4.3.

The natural map RNR is surjective.

Proof: Let IRN be the augmentation ideal of RN as in Definition 4.2. The ideal IR is generated by TrDu(σ)2 for all σIN, which is exactly the kernel of RRpseudo-min, so R/IR=Rpseudo-min. By Lemma 3.5, this implies R/IR=Zp. Then by Nakayama’s lemma, the map RNR is surjective.

There is a quotient of RN that parameterizes supercuspidal (that is, induced) deformations. Indeed, the pseudorepresentation ρN:GNGL2(Λ) constructed in the previous section is the universal induced representation. By Eq. 10, its pseudorepresentation on inertia has values in the subring Λ+ of Λ. This defines a surjective homomorphism RNΛ+. The following proposition shows that, in fact, all deformations are supercuspidal. (Note that such deformations are allowed to be reducible; for instance 1ϵ is supercuspidal as it is the induction of the trivial character.)

Proposition 4.4.

The map RNΛ+ is an isomorphism of augmented Zp-algebras.

Proof: Let R~N be the quotient of RD¯|IN by the ideal generated by Du(σ)1 for all σIN. That is, R~N is the universal deformation ring of the trivial 2-dimensional pseudorepresentation on IN having trivial determinant. Let D~:INR~N be the universal deformation. Consider the representation

ρ~N:INGL2(Zp[[x]])

obtained as the composite

ININ(p)=ττ(1+x1x1)GL2(Zp[[x]]).

The pseudorepresentation of ρ~N defines a map

ψ:R~NZp[[x]]

We claim that ψ is an isomorphism, with inverse given by the map

ϕ:Zp[[x]]R~N,xTrD~(τ)2.

Since ψ(TrD~(τ))=Tr(ρ~N(τ))=x+2, the composition ψϕ is the identity. On the other hand, the map

ϕψ:R~NR~N

defines a pseudorepresentation D~:INR~N. To see that ϕψ is the identity, it is enough to show that D~=D~. Since they both have trivial determinant, it suffices to show TrD~=TrD~. By construction,

TrD~(τ)=TrD~(τ).

Then, by the pseudorepresentation identity Item 2, this implies that for all n,

TrD~(τn)=TrD~(τn).

Since TrD~ and TrD~ are continuous and agree on a dense subgroup of τ, they are equal on τ (Remark 2.2). Finally, both TrD~ and TrD~ send every element σker(ININ(p)) to 2. Indeed, for any Cayley–Hamilton representation ρ:INE× inducing either one, since ρ1(modmE), the image of ρ is pro-p, so ρ factors through IN(p). Since TrD~ and TrD~ factor through IN(p) and agree on τ, they agree on IN.

Now, since RN is the quotient of R~N by the relation TrD~(τ)=TrD~(τN), the map ψ induces an isomorphism RNZp[[x]]/(f(x)), where f(x)=Tr(ρ~N(τ))Tr(ρ~N(τN)). To compute f(x) more explicitly, it is convenient to pass to an overring of Zp[[x]] that contains the eigenvalues of ρ~N(τ), which are the roots of the polynomial λ2(2+x)λ+1. Over the ring

Zp[[x]][λ]λ2(2+x)λ+1=Zp[[x]][λ](λ1)2xλZp[[λ1]],x(λ1)2λ,

the eigenvalues of ρ~N(τ) are λ and λ1, so f(x)=λ+λ1(λN+λN). Since Zp[[λ1]]/Zp[[x]] is a torsion-free Zp[[x]]-module, it follows that f(x)Zp[[λ1]]Zp[[x]]=f(x)Zp[[x]], so that the map

Zp[[x]](f(x))Zp[[λ1]](f(x))

induced by x(λ1)2λ is injective. Thus RN is isomorphic to the subring generated by (λ1)2λ in the ring A defined by

A=Zp[[λ1]]λ+λ1λNλN.

This presentation of A can be simplified by factoring:

λ+λ1λNλN=λN(λN+11)(λN11).

Noting that λNZp[[λ1]] is a unit and that, since p(N1) and pr(N+1) the ratios

λN11λ1,λN+11λpr1Zp[[λ1]]

are units, the ring A can be written as

A=Zp[[λ1]](λ1)(λpr1).

Note that there is a surjective homomorphism AΛ given by λ[δ], which gives the presentation ΛZp[[λ1]]/(λpr1). In this presentation, Λ+ is the subring generated by [δ]+[δ1]2=(λ1)2λ. Factoring λpr1 into a product of irreducible polynomials yields an embedding

AZp[λ1](λ1)2i=1rZp[ζpi],

with λ mapping to ζpi in the rightmost factors. This map induces the map Eq. 11 on the quotient Λ of A. Hence there is a commutative diagram

4.2.

in which the leftmost vertical arrow is the map induced on the subrings generated by (λ1)2λ in the middle vertical arrow. The rightmost vertical arrow is just the identity on the summands indexed by 1ir. To see that the map RNΛ+ is injective, it is enough to show that the kernel of the rightmost vertical arrow has trivial intersection with the subring of A generated by (λ1)2λ. Since the kernel of this arrow is contained in the Zp[λ1](λ1)2 factor, and (λ1)2λ maps to zero in this factor, this is clear.

5. Proof of Theorem 1.1

Combining Lemmas 3.1, 4.3, and Proposition 4.4, we find that there is a chain of surjective ring homomorphisms

Λ+RNRT. [12]

Letting φ:Λ+T denote the composition of these maps, we have a commutative diagram

5. [13]

as in the set up of Wiles’s numerical criterion (27, appendix), as improved by Lenstra (28) (see also ref. 29). Let J=ker(π) be the augmentation ideal in Λ+ and I=ker(πT) the Eisenstein ideal in T. It is well known and easy to see that AnnT(I) is the kernel of the quotient TT0 of T that acts faithfully on cuspforms.

Theorem 5.1.

The surjective maps in Eq. 12 are all isomorphisms.

Proof: Let η=pt be a generator of the ideal πT(AnnT(I))Zp. By the numerical criterion (29, criterion I), it is enough to show #J/J2η. It follows immediately from Lemma 4.1 that #J/J2=pr. Since AnnT(I)=ker(TT0), to show that ηpr, it is enough to show that the composite map

TπTZpZ/prZ

factors through T0. In other words, it is enough to show that the Eisenstein series E is a cuspform modulo pr. This follows from ref. 2, corollary 2.6, completing the proof.

Remark 5.2:

Since the ring Λ+ is monogenic, there is an alternative argument that does not use the numerical criterion (but still uses the fact that prη). For this, note that the surjection φ and Lemma 4.1 imply that T has a presentation TZp[x]/(xF(x)), where F(x) is a monic divisor of Ψ(x). Then η can be interpreted as the constant term F(0) (up to a p-adic unit). But since prη, this implies that Ψ(0)F(0). Since F(x)Ψ(x), this implies F(x)=Ψ(x), as desired.

This completes the proof of Theorem 1.1.

6. Complement: Relation to Massey Products

We have proven an isomorphism Λ+R by identifying R with a local deformation ring. Since the local deformation ring is so explicit, this gives us complete understanding of R, and in particular, its rank. In ref. 7, another method for studying the rank of R is introduced, using obstructions in Galois cohomology that come from Massey products. One might hope to use Theorem 1.1 and reverse the arguments of ref. 7 to obtain nontrivial arithmetic results about vanishing of Massey products. In this section, we indicate how our results are related to Massey products and conclude that the Massey products involved are particularly simple and not arithmetically interesting. (This explains why the rank is basically constant in our case, rather than varying in an arithmetically interesting way as in ref. 7.) We do not give another complete proof of Theorem 1.1 (although there is little doubt that one could give a proof along these lines). Instead we attempt to illustrate why Theorem 1.1 is reasonable from the point of view of ref. 7.

6.1. The Strategy for Relating Massey Products to Ranks.

It follows from Proposition 3.4 that the tangent space of R/pR is one-dimensional and spanned by a tangent vector Db,c,ϕ. So, there is an isomorphism R/pRFp[[ϵ]]/(ϵd) for some d>1 (including possibly d= if R/pRFp[[ϵ]]). This d is then the Fp-dimension of R/pR, which is an upper bound on the rank of R. One can study d one step at a time: d>2 if and only if the tangent vector RFp[ϵ]/(ϵ2) given by Db,c,ϕ lifts to a map RFp[ϵ]/(ϵ3). If d>2, then d>3 if and only if the map RFp[ϵ]/(ϵ3) lifts to a map RFp[ϵ]/(ϵ4), and so on.

Given this interpretation of the Fp-dimension of R/pR in terms of lifts of the tangent vector, we now sketch how this is related to the vanishing of certain elements, called Massey products, in Galois cohomology H2(Q,). For this, consider the problem of lifting RFp[ϵ]/(ϵ2) to a map RFp[ϵ]/(ϵ3). Recall that Db,c,ϕ comes from the trace of a representation ρb,c,ϕ Eq. 4 that can be written as

ρb,c,ϕ=ω(1+ϕ1ϵ)b+b1ϵωcϵ1+ϕ1ϵ:GQGL2(Fp[ϵ]/(ϵ2)),

where ϕ1 is a choice of 1-cochain satisfying dϕ1=bc, ϕ=bcϕ, and b1 is a cochain satisfying db1=bϕ1+ϕ1b. A map RFp[ϵ]/(ϵ3) lifting Db,c,ψ might come from a deformation ρ2 of the form

ρ2=ω(1+ϕ1ϵ+ϕ2ϵ2)b+b1ϵ+b2ϵ2ω(cϵ+c2ϵ2)1+ϕ1ϵ+ϕ2ϵ2:GQGL2(Fp[ϵ]/(ϵ3)).

(See Remark 6.1 below for pseudodeformations that may not arise from such a ρ2.) Here the functions ϕ2, b2, c2, and ϕ2 are 1-cochains that, in order for ρ2 to be a homomorphism, must satisfy conditions on their coboundaries. For instance, ϕ2 satisfies

dϕ2=ϕ1ϕ1+bc2+b1c.

The right-hand side of this equation is a 2-cocycle, and the equation expresses the fact that this 2-cocycle is a 2-coboundary (i.e. it vanishes in cohomology). A similar thing is true for the equations governing b2, c2, and ϕ2. Conversely, without knowing that ρ2 exists, if one knew that the relevant 2-cocycles were 2-coboundaries, one could define ρ2 using these equations. The cohomology classes of these 2-cocycles are examples of Massey products. This shows that Massey products are obstructions: The representation ρ2 exists if and only if the Massey products vanish.

Remark 6.1:

We have glossed over several points in this sketch. First, it is not clear that a pseudodeformation of Db,c,ϕ must come from a true representation like ρ2. This can be remedied by instead looking for deformations in the universal Cayley–Hamilton algebra. The results of Section 3 imply that this Cayley–Hamilton algebra is a GMA with a particularly simple form, so the true representations considered above are not too different from the universal case. The main caveat is that, just as one does not need the cochain b1 in order to define Db,c,ϕ, one also does not need the cochain b2 in order to define the pseudorepresentation associated to ρ2. Second, in order for the deformation ρ2 to define a map RFp[ϵ]/(ϵ3), the pseudorepresentation must satisfy the local conditions required in the definition of R. This can be resolved by working with Galois cohomology with restricted ramification H2(GQ,Np,), and working carefully with the finite-flat condition.

6.2. Computation of the Relevant Massey Products.

The relevant Massey products can be computed explicitly in this case. There are two main reasons for this. First, the restriction maps

resN:H2(GQ,Np,Fp(i))H2(QN,Fp(i))

for i=0,1,1 are injective. This is a kind of local-to-global principle: to compute the whether or not the global Massey-product classes vanish, it is enough to consider their restriction to local cohomology at N. Second, (and this is the most crucial difference with ref. 7), the cohomology group H1(QN,Fp(1)) is one-dimensional. Since N1(modp), there is an isomorphism Fp(1)Fp(1) of GQN modules, so resN(b) and resN(c) are both nonzero classes in the same one-dimensional space H1(QN,Fp(1)). Up to rescaling, we can assume that resN(b)=resN(c).

We will now sketch an argument that uses Massey products to explain why dimFp(R/pR)p+12 when N1(modp2). From this point on, we work exclusively with cohomology of GQN and drop the resN from the notation, which is justified by the local-to-global principle. Since b=c, a particularly simple cochain ϕ satisfying dϕ=bc=bb is ϕ=12b2. Similarly, to find a cochain b1 such that

db1=bϕ+ϕb=b12b2+12b2b

take b1=16b3. To simplify notation, for n<p, let b[n]=1n!bn. Then ρb,c,ϕ takes the simple form

ρb,c,ϕ=ω(1+b[2]ϵ)b+b[3]ϵωbϵ1+b[2]ϵ.

To deform this, one can take

ρ2=ω(1+b[2]ϵ+b[4]ϵ2)b+b[3]ϵ+b[5]ϵ2ω(bϵ+b[3]ϵ2)1+b[2]ϵ+b[4]ϵ2.

The obvious pattern continues: If 2n+1<p, then there is a deformation ρn:GQNGL2(Fp[ϵ]/(ϵn+1)) defined by

ρn=ω(1+b[2]ϵ++b[2n]ϵn)b+b[2]ϵ++b[2n+1]ϵnω(bϵ++b[2n1]ϵn)1+b[2]ϵ++b[2n]ϵn.

Just as the pseudorepresentation Db,c,ϕ does not require the cochain b1 to be defined, the pseudorepresentation associated to ρn does not require b[2n+1] to be defined. In other words, the pseudorepresentation associated to ρn can be defined as long as 2n<p. This defines a pseudorepresentation Dp12:GNFp[ϵ]/(ϵp+12). The obstruction to deforming Dp12 is the 2-cocycle

i=1p1b[i]b[pi]. [14]

This is the Massey pth-power bp of b, defined by Kraines (30, definition 11).§ By a variant of ref. 30, theorem 14 for nontrivial coefficients, bp is equal to (b), where is the connecting map in the exact sequence

0Fp(1)(Z/p2Z)(ω)Fp(1)0,

where (Z/p2Z)(ω) is the unramified GQN-module where FrobN acts by ω(N)=1. If N1(modp2), then a simple calculation shows that (b)=bx for a nontrivial class xH1(QN,Fp). By Tate duality, this implies that (b)0. In other words, the 2-cocycle Eq. 14 is not a coboundary, and this gives an obstruction to deforming Dp12. If dimFp(R/pR) were greater than p+12, there would be a surjective homomorphism RFp[ϵ]/(ϵp+32), from which one could construct a deformation of Dp12, a contradiction.

The inequality dimFpR/pRp+12 goes most of the way to proving Corollary 1.2. Indeed, it is not difficult to show that every cuspform f satisfying Eq. 1 must be supercuspidal at N. Taking the trace of the supercuspidal representation shows that the coefficient ring of f contains Zp[ζp+ζp1], so that rankZp(T)p+12. Then dimFpR/pRp+12 and the surjection RT imply that these containments and inequalities are all equalities. Of course, to complete the above sketch, one would have to deal with the issues mentioned in Remark 6.1.

Acknowledgments

We thank Shaunak Deo, Robert Pollack, Alice Pozzi, and Carl Wang-Erickson for helpful conversations as well as the referees for thoughtful comments that improved this paper. J.L. was supported by NSF grant DMS-2301738 and P.W. was supported by NSF CAREER grant DMS-2337830.

Author contributions

J.L. and P.W. designed research; performed research; and wrote the paper.

Competing interests

The authors declare no competing interest.

Footnotes

This article is a PNAS Direct Submission.

*See also ref. 4, especially lemma 3.9 and proposition 5.4, for an earlier proof of the same result, in slightly different terms.

In fact, ρ~N(τ) is conjugate to the matrix (λ10λ1) in GL2(Zp[[λ1]]).

With coefficients in Fp, Fp(1), Fp(1), and Fp, respectively.

§Kraines actually only considers trivial coefficients, but the generalization to nontrivial coefficients is straightforward.

Contributor Information

Jaclyn Lang, Email: jaclyn.lang@temple.edu.

Preston Wake, Email: wakepres@msu.edu.

Data, Materials, and Software Availability

There are no data underlying this work.

References

  • 1.Buzzard K., Questions about slopes of modular forms. Astérisque 298, 1–15 (2005). [Google Scholar]
  • 2.Lang J., Wake P., A modular construction of unramified p-extensions of Q(N1/p). Proc. Amer. Math. Soc. Ser. B 9, 415–431 (2022). [Google Scholar]
  • 3.Mazur B., Modular curves and the Eisenstein ideal. Inst. Hautes Études Sci. Publ. Math. 47, 33–168 (1977). [Google Scholar]
  • 4.Calegari F., Emerton M., On the ramification of Hecke algebras at Eisenstein primes. Invent. Math. 160, 97–144 (2005). [Google Scholar]
  • 5.Merel L., L’accouplement de Weil entre le sous-groupe de Shimura et le sous-groupe cuspidal de J0(p). J. Reine Angew. Math. 477, 71–115 (1996). [Google Scholar]
  • 6.Lecouturier E., Higher Eisenstein elements, higher Eichler formulas and rank of Hecke algebras. Invent. Math. 223, 485–595 (2021). [Google Scholar]
  • 7.Wake P., Wang-Erickson C., The rank of Mazur’s Eisenstein ideal. Duke Math. J. 169, 31–115 (2020). [Google Scholar]
  • 8.B. Mazur, “Deforming Galois representations” in Galois Groups Over Q, Y. Ihara, K. Ribet and J.-P. Serre, Eds. (Berkeley, CA, 1987) (Springer, New York, 1989), vol. 16, pp. 385–437.
  • 9.Skinner C. M., Wiles A. J., Ordinary representations and modular forms. Proc. Nat. Acad. Sci. U.S.A. 94, 10520–10527 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bellaïche J., Chenevier G., Families of Galois representations and Selmer groups. Astérisque 324, xii+314 (2009). [Google Scholar]
  • 11.Berger T., Klosin K., On deformation rings of residually reducible Galois representations and R = T theorems. Math. Ann. 355, 481–518 (2013). [Google Scholar]
  • 12.G. Chenevier, “The p-adic analytic space of pseudocharacters of a profinite group and pseudorepresentations over arbitrary rings” in Automorphic Forms and Galois Representations. Vol. 1 F. Diamond, P. L. Kassaei, M. Kim, Eds.(Cambridge Univ. Press, Cambridge, 2014), vol. 414, pp. 221–285.
  • 13.Calegari F., Eisenstein deformation rings. Compos. Math. 142, 63–83 (2006). [Google Scholar]
  • 14.Wang-Erickson C., Algebraic families of Galois representations and potentially semi-stable pseudodeformation rings. Math. Ann. 371, 1615–1681 (2018). [Google Scholar]
  • 15.S. V. Deo, Non-optimal levels of some reducible mod p modular representations. Adv. Math. 461, 110074 (2025).
  • 16.Wake P., Wang-Erickson C., The Eisenstein ideal with squarefree level. Adv. Math. 380, 107543 (2021). [Google Scholar]
  • 17.Bellaïche J., Pseudodeformations. Math. Z. 270, 1163–1180 (2012). [Google Scholar]
  • 18.Wake P., Wang-Erickson C., Deformation conditions for pseudorepresentations. Forum Math. Sigma 7, e20 (2019). [Google Scholar]
  • 19.Wiles A., On ordinary λ-adic representations associated to modular forms. Invent. Math. 94, 529–573 (1988). [Google Scholar]
  • 20.Taylor R., Galois representations associated to Siegel modular forms of low weight. Duke Math. J. 63, 281–332 (1991). [Google Scholar]
  • 21.Rouquier R., Caractérisation des caractères et pseudo-caractères. J. Algebra 180, 571–586 (1996). [Google Scholar]
  • 22.Roby N., Lois polynomes et lois formelles en théorie des modules. Ann. Sci. École Norm. Sup. 80, 213–348 (1963). [Google Scholar]
  • 23.B. Mazur, “An introduction to the deformation theory of Galois representations” in Modular Forms and Fermat’s Last Theorem (Boston, MA, 1995) G. Cornell, J. H. Silverman, G. Stevens, Eds. (Springer, New York, 1997), pp. 243–311.
  • 24.C. Wang-Erickson, Presentations of non-commutative deformation rings via A-algebras and applications to deformations of Galois representations and pseudorepresentations arXiv [Preprint] (2020). 10.48550/arXiv.1809.02484 (Accessed 11 October 2024). [DOI]
  • 25.Ramakrishna R., On a variation of Mazur’s deformation functor. Compos. Math. 87, 269–286 (1993). [Google Scholar]
  • 26.S. Bloch, K. Kato, “L-functions and Tamagawa numbers of motives” in The Grothendieck Festschrift, Vol. I (Birkhäuser Boston, Boston, MA, 1990), vol. 86, P. Cartier, L. Illusie, N. M. Katz, G. Laumon, Yu. Manin, K. A. Ribet, Eds. pp. 333–400.
  • 27.Wiles A., Modular elliptic curves and Fermat’s last theorem. Ann. Math. 141, 443–551 (1995). [Google Scholar]
  • 28.H. W. Lenstra Jr., “Complete intersections and Gorenstein rings” in Elliptic Curves, Modular Forms,& Fermat’s Last Theorem (Hong Kong, 1993) J. Coates, S.-T. Yau, Eds. (Int. Press, Cambridge, MA, 1995), vol. I, pp. 99–109.
  • 29.B. de Smit, K. Rubin, R. Schoof, “Criteria for complete intersections” in Modular Forms and Fermat’s Last Theorem (Boston, MA, 1995) G. Cornell, J. H. Silverman, G. Stevens, Eds. (Springer, New York, 1997), pp. 343–356.
  • 30.Kraines D., Massey higher products. Trans. Am. Math. Soc. 124, 431–449 (1966). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

There are no data underlying this work.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES