The Eisenstein ideal at prime-square level has constant rank

Jaclyn Lang; Preston Wake

doi:10.1073/pnas.2500729122

. 2025 Jul 11;122(28):e2500729122. doi: 10.1073/pnas.2500729122

The Eisenstein ideal at prime-square level has constant rank

Jaclyn Lang ^a,^1,², Preston Wake ^b,^1,²

PMCID: PMC12280971 PMID: 40643979

Significance

In his proof of Fermat’s Last Theorem, Wiles pioneered a method for relating two disparate collections of objects, namely Galois representations and modular forms. He parameterizes each collection by a certain space, and then the parameterizing spaces are shown to match, thereby associating a modular form to each Galois representation. However, the shape of the underlying parameterizing space typically remains somewhat mysterious. In this paper, the parameterizing space is explicitly determined under certain hypotheses, and the description turns out to be strikingly uniform when varying the data upon which the parameterizing space depends.

Keywords: number theory, modular forms, Galois representations

Abstract

Let N and p be prime numbers with $p \geq 5$ such that $p ∣ ∣ (N + 1)$ . In a previous paper, we showed that there is a cuspform f of weight 2 and level $Γ_{0} (N^{2})$ whose ℓ-th Fourier coefficient is congruent to $ℓ + 1$ modulo a prime above p for all primes ℓ. In this paper, we prove that this form f is unique up to Galois conjugacy, and the extension of $Z_{p}$ generated by the coefficients of f is exactly $Z_{p} [ζ_{p} + ζ_{p}^{- 1}]$ . We also prove similar results when a higher power of p divides $N + 1$ .

1. Introduction

Let p be a prime number and let $\bar{ρ} : Gal (\bar{Q} / Q) \to {GL}_{2} ({\bar{F}}_{p})$ be a modular residual Galois representation. How many different Hecke eigenforms f give rise to $\bar{ρ}$ , and what can be said about the p-adic field $Q_{p} (f)$ generated by the Hecke eigenvalues of f? One can fine-tune this question by constraining the various parameters involved. For instance, if one fixes the level of f but allows the weight to vary, then Buzzard, motivated by conjectures about slopes of modular forms, asked whether the degrees $[Q_{p} (f) : Q_{p}]$ are bounded (1, question 4.4). In the case $p = 2$ with level 1, Buzzard even suggested a bound of 2 on $[Q_{2} (f) : Q_{2}]$ . We know of very little progress on this question in the twenty years since Buzzard asked it. (While preparing this article, we learned of recent work by Kimball Martin and Anna Medvedovsky giving examples of level-one f, where $[Q_{2} (f) : Q_{2}] > 2$ . There does not seem to be a consensus among experts about the question of boundedness.)

In this paper, we consider a question orthogonal to Buzzard’s: We fix the representation $\bar{ρ} = ω \oplus 1$ , where $ω : G_{Q} \to F_{p}^{\times}$ is the mod-p cyclotomic character. We are interested in Hecke eigenforms of fixed weight 2 that give rise to $\bar{ρ}$ , but we allow the level to vary among certain prime powers. There is another way to phrase this in terms of Hecke algebras: There is a localization $T$ of the Hecke algebra whose minimal prime ideals correspond to Galois-conjugacy classes of such eigenforms, and we are interested in the number and degree of these minimal primes. For primes $N, p \geq 5$ , in order to have a cuspidal eigenform of weight 2 and N-power level with mod-p residual representation $\bar{ρ}$ , one must have $p ∣ (N^{2} - 1)$ (2, theorem 2.8), so the two cases of interest are when $N \equiv \pm 1 mod p$ .

If the level is constrained to be prime, Mazur asked about the rank of the Hecke algebra $T$ (3, p. 140). [Calegari and Emerton (4) first pointed out the parallels between Mazur’s question and Buzzard’s.] In this prime-level case, the degrees of $Q_{p} (f)$ (and their sum—the rank of $T$ ) have a great deal of arithmetic significance. They have been studied using modular symbols by Merel (5) and Lecouturier (6), where they are shown to be related to special values of equivariant L-functions. Using Galois representations, Calegari–Emerton (4) and Wake–Wang-Erickson (7) show that these ranks are related to class groups and Massey products in Galois cohomology. In numerical examples, the most common scenario is that there is a unique cusp form giving rise to $\bar{ρ}$ and its Hecke field is $Q_{p}$ , but this is certainly not always the case. See the tables in refs. 3, p. 40 and 7, § 1.6 for data about the rank and irreducible components of $T$ in small prime level. It is not known whether the degrees of the $Q_{p} (f)$ are bounded independently of the level N. Heuristics given in ref. 7 suggest that, given a prime N with $p ∣ ∣ (N - 1)$ , the probability that there is a form f of level N such that $[Q_{p} (f) : Q_{p}] = d$ is $\frac{p - 1}{p^{d}}$ . This accounts for the numerical evidence that the degrees $[Q_{p} (f) : Q_{p}]$ are usually small but suggests that they are unbounded.

In this paper, we consider the same representation $\bar{ρ} = ω \oplus 1$ and same weight 2, but we vary the level over squares of primes N such that $p ∣ (N + 1)$ . For such primes N, Mazur’s results imply that there are no newforms f of level $Γ_{0} (N)$ giving rise to $\bar{ρ}$ . However, in our previous work, we show that there is a newform of level $Γ_{0} (N^{2})$ with

residual representation $\bar{ρ}$ (2, theorem B). Our main result in this paper is that in this case, we can compute the Hecke algebra $T$ explicitly. More precisely we have the following theorem.

Theorem 1.1.

Let $N, p \geq 5$ be prime numbers such that $N \equiv - 1 m o d p$ , and let $r \geq 1$ be the p-adic valuation of $N + 1$ . Let $T$ be the Hecke algebra parameterizing modular forms of level $Γ_{0} (N^{2})$ and weight 2 with mod-p residual representation $ω \oplus 1$ . Let $Δ = F_{N^{2}}^{\times, p -part}$ , a cyclic group of order $p^{r}$ , let $Λ = Z_{p} [Δ]$ , and let $Λ^{+} \subset Λ$ be the subring fixed under the involution given by inversion on Δ. Then there is a canonical isomorphism of $Z_{p}$ -algebras $Λ^{+} \tilde{\to} T$ sending the augmentation ideal of $Λ^{+}$ to the Eisenstein ideal of $T$ .

Since minimal prime ideals of $T$ correspond to (Galois-conjugacy classes of) eigenforms, the theorem allows us to answer all of the questions asked at the start of this introduction: The fields $Q_{p} (f)$ correspond to the fraction fields of minimal primes of $Λ^{+}$ . The situation is particularly simple when $p ∣ ∣ (N + 1)$ (i.e. when $r = 1$ ), in which case there is only one minimal prime of $Λ^{+}$ other than the augmentation ideal, and this prime has residue ring $Z_{p} [ζ_{p} + ζ_{p}^{- 1}]$ , so the theorem yields the following.

Corollary 1.2.

Let $N, p \geq 5$ be prime numbers such that $p ∣ ∣ (N + 1)$ . Then there is a cuspidal eigenform f of level $Γ_{0} (N^{2})$ and weight 2 with coefficients in $Z_{p} [ζ_{p} + ζ_{p}^{- 1}]$ such that

$\begin{matrix} a_{ℓ} (f) \equiv 1 + ℓ (m o d p) \end{matrix}$ [1]

for all prime numbers ℓ, where $p$ is the maximal ideal of $Z_{p} [ζ_{p} + ζ_{p}^{- 1}]$ . Moreover, this form is the unique (up to Galois-conjugacy) cuspform satisfying Eq. 1.

Note that this implies that the fields $Q_{p} (f)$ are independent of N. Contrast this with the prime-level case, where the heuristic suggests that the degrees $[Q_{p} (f) : Q]$ are unbounded as one varies over primes N with $p ∣ ∣ (N - 1)$ .

Of course, there is a variant of Corollary 1.2 when $r > 1$ : In that case, there are r Galois-conjugacy classes, and the coefficient rings are $Z_{p} [ζ_{p^{i}} + ζ_{p^{i}}^{- 1}]$ for $i = 1, \dots, r$ .

The proof of Theorem 1.1 uses Galois deformation theory, as pioneered by Mazur (8), to bound the size of $T$ by showing, essentially, that the possible ways to deform the residual Galois representation are limited. There are many previous works about deforming reducible residual representations, including refs. 9–15, that influenced our thinking, but we believe the idea to use deformation theory to understand ranks of Hecke algebras originates with ref. 4. We use the theory of pseudorepresentations with deformation conditions as developed in refs. 7 and 16.

1.1. Outline of the Paper.

The proof of Theorem 1.1 takes up most of the paper. We sketch the proof here, indicating in which sections the steps take place. Let R be the pseudodeformation ring of $\bar{ρ}$ parameterizing deformations that have fixed determinant and that are unramified outside Np and finite-flat at p. (The theory of pseudodeformations is reviewed in Section 2.) As usual, there is a surjection $R ↠ T$ . We define the “pseudo-minimal” quotient $R^{pseudo-min}$ of R corresponding to deformations whose trace equals the trace of the trivial representation on inertia-at-N. In other words, $R^{pseudo-min}$ parameterizes representations for which the semisimplification of the restriction to inertia-at-N is trivial. This includes representations that are unramified at N, but also representations that are Steinberg at N. However, Mazur’s results imply that there are no cuspforms of level $Γ_{0} (N)$ that are congruent to the Eisenstein series, so one would expect that there are no representations that are Steinberg at N. In Section 3, we prove that this is true: $R^{pseudo-min} = Z_{p}$ . This key result shows that R is entirely determined by the local behavior at N. In Section 4, we define a local-at-N pseudodeformation ring $R_{N}$ , and prove that all local deformations come from inducing a character of $G_{Q_{N^{2}}}$ , which gives an isomorphism $R_{N} ≅ Λ^{+}$ . Together with $R^{pseudo-min} = Z_{p}$ , this gives surjections $Λ^{+} ↠ R ↠ T$ . To complete the proof, in Section 5, we show that these surjections are isomorphisms using Wiles’s numerical criterion, applying our previous results (2) to understand the congruence number. Finally, in Section 6, we indicate how our results are related to the Massey-products method of ref. 7.

2. Pseudodeformations

In this section, we review the aspects of deformation theory of pseudorepresentations that we will need in the next section. There are no new results in this section; it is a digest of material from many sources, including refs. 10, 12, 14, 17, and 18.

2.1. Pseudorepresentations.

The concept of a pseudorepresentation came about to codify the formal properties of the trace (or, more generally, the characteristic polynomial) of a representation. The first definition of pseudorepresentation was made by Wiles (19) for 2-dimensional representations and was later generalized by Taylor (20), Rouquier (21), and Chenevier (12). We will use Chenevier’s version, which he calls “determinants.”

Chenevier’s notion of pseudorepresentation mimics the properties of the determinant. For a commutative ring A, the determinant map $det : M_{n} (A) \to A$ has many well-known properties: It is multiplicative, in that $det (x y) = det (x) det (y)$ and $det (1) = 1$ , and has degree n, in that $det (a x) = a^{n} det (x)$ for $a \in A$ and $x, y \in M_{n} (A)$ . It is also a polynomial function in the entries of the matrix. In particular, if B is a commutative A-algebra, then one can also apply $det$ to an element of the tensor product $M_{n} (A) \otimes_{A} B$ and obtain, in a natural way, an element of B. In particular, taking $B = A [t]$ allows one to define the characteristic polynomial $det (t - x) \in A [t]$ of a matrix x.

Definition 2.1:

Let A be a commutative ring and E an A-algebra. A pseudorepresentation of E of degree d, written $D : E \to A$ , is a collection of maps $D_{B} : E \otimes_{A} B \to B$ , one for each commutative A-algebra B, that are natural in B and satisfy:

$D_{B} (x y) = D_{B} (x) D_{B} (y)$ and $D_{B} (1) = 1$ ,

$D_{B} (b x) = b^{d} D_{B} (x)$

for all $x, y \in E \otimes_{A} B$ and all $b \in B$ . The map $D_{A}$ is abbreviated to D. The characteristic polynomial of $e \in E$ for a fixed pseudorepresentation $D : E \to A$ is defined to be $D_{A [t]} (t - e) \in A [t]$ .

If G is a group, then a pseudorepresentation of G of degree d over A, written $D : G \to A$ , is a pseudorepresentation of $A [G]$ .

In this paper, we will be interested exclusively in degree-two pseudorepresentations and only in the case where 2 is invertible in the ring A. In this case, pseudorepresentations have a simpler description (12, example 1.8), as we now recall. A d-dimensional pseudorepresentation $D : E \to A$ is determined by the coefficients of $D_{A [t_{1}, \dots, t_{d}]} (x_{1} t_{1} + \dots + x_{d} t_{d})$ for $x_{i} \in E$ , which is a homogeneous polynomial of degree d. In particular, when $d = 2$ these coefficients are determined by $D_{A}$ (22, proposition II.1):

D_{A [t_{1}, t_{2}]} (x_{1} t_{1} + x_{2} t_{2}) = D (x_{1}) t_{1}^{2} + (D (x_{1} + x_{2}) - D (x_{1}) - D (x_{2})) t_{1} t_{2} + D (x_{2}) t_{2}^{2} .

Taking $x_{1} = 1$ and specializing $t_{2}$ to $- 1$ , we find that the characteristic polynomial of $x \in E$ is

D_{A [t]} (t - x) = t^{2} - (D (1 + x) - D (x) - 1) t + D (x) .

Therefore we define the trace ${Tr}_{D} : E \to A$ of D by the formula

{Tr}_{D} (x) = D (x + 1) - D (x) - 1 .

For a degree-two pseudorepresentation $D : G \to A$ of G, the trace satisfies relations

${Tr}_{D} (x y) = {Tr}_{D} (y x)$ and ${Tr}_{D} (1) = 2$ , and
$D (x) {Tr}_{D} (x^{- 1} y) - {Tr}_{D} (x) {Tr}_{D} (y) + {Tr}_{D} (x y) = 0$

for all $x, y \in G$ (12, lemma 7.7).

Conversely, if $D^{'} : G \to A^{\times}$ is a homomorphism and $T : G \to A$ is a function such that the pair $(D^{'}, T)$ satisfy (1) and (2), then the formula

D_{B} (b x + c y) = D^{'} (x) b^{2} + (T (x) T (y) - T (x y)) b c + D^{'} (y) c^{2}

for $x, y \in G$ and $b, c \in B$ , defines a pseudorepresentation $D : G \to A$ . In this way, one can think of a pseudorepresentation as the data of the functions D and ${Tr}_{D}$ . Moreover, if 2 is invertible in A, then one can recover $D_{A}$ from ${Tr}_{D}$ using formula (2) as $D (x) = \frac{{Tr}_{D} {(x)}^{2} - {Tr}_{D} (x^{2})}{2}$ .

Remark 2.2:

Thus far in the discussion, we have considered discrete groups and rings. For topological groups and rings, one considers continuous pseudorepresentations $D : G \to A$ , which amounts to requiring that the functions $D : G \to A$ and ${Tr}_{D} : G \to A$ are continuous (see ref. 12, section 2.30). If $D : G \to A$ is a continuous pseudorepresentation and $H \subseteq G$ is a dense subgroup, then D is determined by its restriction to H (12, example 2.31). To simplify the discussion below, we will always assume that pseudorepresentations are continuous if we are using topological groups.

Example 2.3:

Let G be a group and A be a commutative ring. If $ρ : G \to {GL}_{2} (A)$ is a homomorphism, then the pair of functions $(D, T) = (det \circ ρ, Tr \circ ρ)$ is, of course, a pseudorepresentation. Moreover, if there is a subring $A^{'} \subseteq A$ such that D and T both have images in $A^{'}$ , then $(D, T)$ defines a pseudorepresentation $D : G \to A^{'}$ .

This example can be seen as one of the major advantages of pseudorepresentations and is the purpose for which Wiles first used them. Note that it may not be true that there is a conjugate $ρ^{'}$ of ρ such that $ρ^{'}$ has values in ${GL}_{2} (A^{'})$ . For instance, let G be the subgroup of ${GL}_{2} (C)$ generated by $(\begin{matrix} i & 0 \\ 0 & - i \end{matrix})$ and $(\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix})$ , which is isomorphic to the quaternion group of order 8; the trace and determinant of all elements of G are in $R$ , but the inclusion $G \subset {GL}_{2} (C)$ cannot be conjugated to land in ${GL}_{2} (R)$ .

2.2. Cayley–Hamilton Algebras and Generalized Matrix Algebras.

Not every pseudorepresentation $D : G \to A$ comes from a true representation $ρ : A [G] \to M_{2} (A)$ as in Example 2.3. However, Chenevier has defined a generalization of representations, called Cayley-Hamilton representations, that provide a natural substitute (12, remark 7.19). In good situations, these Cayley-Hamilton representations are valued in a generalized matrix algebra, which have many useful properties in common with usual matrix algebras.

Definition 2.4:

Let A be a commutative ring and let E be an A-algebra. A degree-two pseudorepresentation $D : E \to A$ is called Cayley–Hamilton if for all $x \in E \otimes_{A} B$ ,

$x^{2} - {Tr}_{D_{B}} (x) x + D_{B} (x) = 0 .$

A pair $(E, D)$ of an A-algebra E and a Cayley–Hamilton pseudorepresentation D is called a Cayley–Hamilton algebra.

If G is a group, then a Cayley–Hamilton representation of G is a triple $(E, D, ρ)$ , where $(E, D)$ is a Cayley–Hamilton algebra and $ρ : G \to E^{\times}$ is a group homomorphism. The composition $D \circ ρ : G \to A$ defines a pseudorepresentation $ψ (ρ)$ of G over A called the associated pseudorepresentation.

For example, the algebra $E = M_{2} (A)$ with the pseudorepresentation given by the determinant is Cayley–Hamilton, by the Cayley–Hamilton Theorem (whence the name). A representation $ρ : A [G] \to M_{2} (A)$ gives rise to a Cayley–Hamilton representation, just as in Example 2.3.

Definition 2.5:

Let A be a commutative ring and let E be an A-algebra that is finitely generated as an A-module. A (2-dimensional) generalized matrix algebra structure on E is the data of

an idempotent element $e \in E$ ,

A-algebra isomorphisms $ϕ : e E e ≅ A$ and $ϕ^{'} : e^{'} E e^{'} ≅ A$ , where $e^{'} = 1 - e$ ,

such that the function $Tr : E \to A$ defined by

$Tr (x) = ϕ (e x e) + ϕ^{'} (e^{'} x e^{'})$

satisfies $Tr (x y) = Tr (y x)$ for all $x, y \in E$ .

An A-algebra E together with a generalized matrix algebra structure is called an A-GMA.

An example of an A-GMA is the matrix algebra $E = M_{2} (A)$ with $e = (\begin{matrix} 1 & 0 \\ 0 & 0 \end{matrix})$ and the obvious isomorphisms $e M_{2} (A) e ≅ A$ and $e^{'} M_{2} (A) e^{'} ≅ A$ . In general, an A-GMA can be written in the form

E = (\begin{matrix} A & B \\ C & A \end{matrix}),

where $B = e E e^{'}$ and $C = e^{'} E e$ are sub-A-modules of E. The multiplication can be written as

\begin{matrix} (\begin{matrix} a & b \\ c & d \end{matrix}) (\begin{matrix} a^{'} & b^{'} \\ c^{'} & d^{'} \end{matrix}) = (\begin{matrix} a a^{'} + m (b, c^{'}) & a b^{'} + b d^{'} \\ c a^{'} + d c^{'} & d d^{'} + m (c, b^{'}) \end{matrix}), \end{matrix}

[2]

where $m : B \times C \to A$ is the map $m (e x e^{'}, e^{'} y e) = ϕ (e x e^{'} y e)$ .

Conversely, if B and C are two finitely generated A-modules and $m : B \otimes_{A} C \to A$ is an A-linear map satisfying certain properties, then defining $E (\begin{matrix} A & B \\ C & A \end{matrix})$ with the multiplication as in Eq. 2 defines an A-GMA (see refs. 10, section 1.3 and 18, example 3.1.7 for more precise statements).

Example 2.6:

If $A = k [x] / (x^{n})$ for a field k, then there is a GMA E given by

$E = (\begin{matrix} A & xA \\ x A & A \end{matrix}),$

where $m : x A \times x A \to A$ is $m (a x, b x) = a b x$ .

2.3. Pseudodeformation Rings.

Let G be a group and $F$ be a finite field of characteristic p and let $\bar{D} : G \to F$ be a pseudorepresentation. In this section, we discuss deformations of $\bar{D}$ . We assume that G is profinite and satisfies Mazur’s finiteness condition: For every open normal subgroup $H \subseteq G$ , there are only finitely many continuous group homomorphisms $H \to Z / p Z$ . For instance, G could be the absolute Galois group of a local field or the Galois group of the maximal extension of a number field that is unramified outside a finite set.

Let $C$ be the category of complete local Noetherian $W (F)$ -algebras $(A, m_{A})$ with residue field $F$ . For an object A in $C$ , a deformation of $\bar{D}$ to A is a pseudorepresentation $D : G \to A$ such that $D \otimes_{A} F = \bar{D}$ . The set-valued functor on $C$ sending A to the set of deformations of $\bar{D}$ to A is representable by a ring $(R_{\bar{D}}, m_{\bar{D}})$ in $C$ (12, proposition E). The resulting pseudorepresentation $D^{u} : G \to R_{\bar{D}}$ is called the universal pseudodeformation.

A Cayley–Hamilton representation $(E, D, ρ)$ of G is said to have residual representation $\bar{D}$ if the associated pseudorepresentation $ψ (ρ)$ of G is a deformation of $\bar{D}$ . The collection of Cayley–Hamilton representations with residual representation $\bar{D}$ forms a category in a natural way, and this category has a universal object $(E_{\bar{D}}, D_{E_{\bar{E}}}^{u}, ρ^{u})$ , which is a Cayley–Hamilton algebra over $R_{\bar{D}}$ and whose associated pseudorepresentation is the universal pseudodeformation (14, proposition 3.6).

Now assume that $\bar{D} = χ_{1} \oplus χ_{2}$ for two distinct characters $χ_{1}, χ_{2} : G \to F^{\times}$ . In this case, there is a natural generalized matrix algebra structure on $E_{\bar{D}}$ , written as

\begin{matrix} E_{\bar{D}} = (\begin{matrix} R_{\bar{D}} & B_{\bar{D}} \\ C_{\bar{D}} & R_{\bar{D}} \end{matrix}) \end{matrix}

[3]

with the property that, if $ρ^{u} : G \to E_{\bar{D}}$ is written as $ρ^{u} (g) = (\begin{matrix} a (g) & b (g) \\ c (g) & d (g) \end{matrix})$ , then $a (g) \equiv χ_{1} (g) (mod m_{R_{\bar{D}}})$ . See refs. 10, lemma 1.4.3 and 12, theorem 2.22 for more details.

2.4. Tangent Spaces.

The (equicharacteristic) tangent space to a deformation functor is the set of first-order deformations (that is, deformations with values in the dual numbers); this set is naturally a vector space over the residue field. For a representation $\bar{ρ} : G \to {GL}_{2} (F)$ , this means looking at deformations $ρ : G \to {GL}_{2} (F [ϵ] / (ϵ^{2}))$ . It is well known that, in this case, the tangent space can be identified with the group cohomology $H^{1} (G, ad (\bar{ρ}))$ (see (23, proposition 1, pg. 284), for instance). The identification sends a cocycle $ϕ \in Z^{1} (G, ad (\bar{ρ}))$ to the deformation

ρ_{ϕ} = (1 + ϕ ϵ) \bar{ρ} : G \to {GL}_{2} (F [ϵ] / (ϵ^{2})) .

The computation of the tangent space of a pseudodeformation ring is similar to this but is complicated by the fact that not all of these deformations alter the pseudorepresentation. For instance, if $\bar{ρ} = (\begin{matrix} χ_{1} & 0 \\ 0 & χ_{2} \end{matrix})$ for distinct characters $χ_{1}$ and $χ_{2}$ , then there is an isomorphism of G-modules $ad (\bar{ρ}) ≅ (\begin{matrix} F & F (χ_{1} χ_{2}^{- 1}) \\ F (χ_{1}^{- 1} χ_{2}) & F \end{matrix})$ . If $ϕ \in Z^{1} (G, (\begin{matrix} F & 0 \\ 0 & F \end{matrix}))$ , then the deformation $ρ_{ϕ}$ amounts to deforming the two characters $χ_{1}$ and $χ_{2}$ separately and does change the pseudorepresentation. However, if $b \in Z^{1} (G, F (χ_{1} χ_{2}^{- 1})) \subset Z^{1} (G, ad (\bar{ρ}))$ , then $ρ_{b} = (\begin{matrix} χ_{1} & χ_{2} b ϵ \\ 0 & χ_{2} \end{matrix}) .$ This is a nontrivial deformation of $\bar{ρ}$ , but, since the trace and determinant are unchanged, it is a trivial pseudodeformation.

To get a nontrivial pseudodeformation out of cocycles $b \in Z^{1} (G, F (χ_{1} χ_{2}^{- 1}))$ and $c \in Z^{1} (G, F (χ_{1}^{- 1} χ_{2}))$ one has to assume more. Namely, if the cup product $b \cup c$ vanishes in $H^{2} (G, F)$ , then there is a cochain $ϕ : G \to F$ such that $d ϕ = b \cup c$ . There is also a cochain $ϕ^{'} : G \to F$ such that $d ϕ^{'} = c \cup b$ , namely $ϕ^{'} = b c - ϕ$ , where $b c : G \to F$ is the function $(b c) (g) = b (g) c (g)$ . If, in addition, there is a cochain $b_{1} : G \to F$ such that $d b_{1} = b \cup ϕ + ϕ^{'} \cup b$ , then one can define a representation using these data by

\begin{matrix} ρ_{b, c, ϕ} = (\begin{matrix} χ_{1} + χ_{1} ϕ ϵ & χ_{2} (b + b_{1} ϵ) \\ χ_{1} c ϵ & χ_{2} + χ_{2} ϕ^{'} ϵ \end{matrix}) . \end{matrix}

[4]

Note that this is not a deformation of $\bar{ρ}$ as a representation, since its residual representation is $(\begin{matrix} χ_{1} & χ_{2} b \\ 0 & χ_{2} \end{matrix})$ , but it is a pseudodeformation. Let

\begin{matrix} D_{b, c, ϕ} = Tr (ρ_{b, c, ϕ}) = χ_{1} + χ_{2} + ϵ (χ_{1} ϕ + χ_{2} ϕ^{'}) \end{matrix}

[5]

be the associated pseudorepresentation, and note that it involves ϕ, b, and c, but not $b_{1}$ . In fact, one can prove that $D_{b, c, ϕ}$ defines a pseudodeformation without assuming the existence of the cochain $b_{1}$ (this can be proven using the GMA of Example 2.6).

An exact description of the tangent space of a pseudodeformation ring has been worked out beautifully by Bellaïche in ref. 17 and generalized by Wang-Erickson in ref. 24, section 3.3. Let $\bar{D} : G \to F$ be $\bar{D} = χ_{1} \oplus χ_{2}$ for distinct characters $χ_{1}$ and $χ_{2}$ . Let $m_{\bar{D}}$ be the maximal ideal of $R_{\bar{D}}$ and let $t_{\bar{D}} = {Hom}_{F} (m_{\bar{D}} / (p, m_{\bar{D}}^{2}), F)$ be the tangent space. By ref. 17, theorem A), there is an exact sequence

\begin{matrix} 0 \to H^{1} (G, F) \oplus H^{1} (G, F) \to t_{\bar{D}} \to H^{1} (G, χ_{1} χ_{2}^{- 1}) \otimes_{F} H^{1} (G, χ_{1}^{- 1} χ_{2}) \overset{\cup}{\to} H^{2} (G, F) \oplus H^{2} (G, F) . \end{matrix}

[6]

The subspace $H^{1} (G, F) \oplus H^{1} (G, F)$ corresponds to the reducible deformations that deform $χ_{1}$ and $χ_{2}$ separately. For $b \in H^{1} (G, F (χ_{1} χ_{2}^{- 1}))$ and $c \in H^{1} (G, F (χ_{1}^{- 1} χ_{2}))$ such that $b \cup c = 0$ , the corresponding element of $t_{\bar{D}}$ is exactly Eq. 5.

2.5. Reducibility Ideal.

We now return to the situation of Eq. 3, so $\bar{D} = χ_{1} \oplus χ_{2}$ for distinct characters $χ_{1}, χ_{2} : G \to F^{\times}$ . We say that a deformation D of $\bar{D}$ is reducible if $D = {\tilde{χ}}_{1} \oplus {\tilde{χ}}_{2}$ for deformations ${\tilde{χ}}_{i}$ of $χ_{i}$ . The reducible deformations define a subfunctor of the pseudodeformation functor that is represented by a quotient $R_{\bar{D}}^{red}$ of $R_{\bar{D}}$ . The kernel of the map is called the ideal of reducibility $J_{\bar{D}} = ker (R_{\bar{D}} \to R_{\bar{D}}^{red})$ .

The ring $R_{\bar{D}}^{red}$ is fairly easy to understand: It can be identified with the completed tensor product of deformation rings of the characters $χ_{i}$ (see ref. 18, proposition 4.3.4). The ideal of reducibility is related to the GMA-structure on $E_{\bar{D}}$ by a theorem of Bellaïche and Chenevier: $J_{\bar{D}}$ is the image of the map $B_{\bar{D}} \otimes_{R_{\bar{D}}} C_{\bar{D}} \to R_{\bar{D}}$ defined by the GMA-structure Eq. 3 (see ref. 10, section 1.5.1). In particular, there is a surjective map

\begin{matrix} B_{\bar{D}} \otimes_{R_{\bar{D}}} C_{\bar{D}} ↠ J_{\bar{D}} . \end{matrix}

[7]

Moreover, certain quotients of the modules $B_{\bar{D}}$ and $C_{\bar{D}}$ can be understood using group cohomology. Let $R_{\bar{D}} \to A$ be a morphism in $C$ , and let $χ_{1, A}, χ_{2, A} : G \to A^{\times}$ be the corresponding deformations of $χ_{1}$ and $χ_{2}$ . Then there is an isomorphism

\begin{matrix} {Hom}_{A} (B_{\bar{D}} \otimes_{R_{\bar{D}}} A, A) ≅ H^{1} (G, χ_{1, A} χ_{2, A}^{- 1}) \end{matrix}

[8]

by ref. 10, theorem 1.5.6 and a similar isomorphism for $C_{\bar{D}}$ with the roles of $χ_{1, A}$ and $χ_{2, A}$ reversed.

Taken together, these results can give a fairly clear picture of the structure of $R_{\bar{D}}$ , especially when the cohomology groups $H^{1} (G, χ_{1, A} χ_{2, A}^{- 1})$ and $H^{1} (G, χ_{1, A}^{- 1} χ_{2, A})$ are small.

Example 2.7:

Suppose that $H^{1} (G, χ_{1} χ_{2}^{- 1})$ and $H^{1} (G, χ_{1}^{- 1} χ_{2})$ are both 1-dimensional $F$ -vector spaces. Then Eq. 8 and Nakayama’s lemma imply that $B_{\bar{D}}$ and $C_{\bar{D}}$ are both cyclic $R_{\bar{D}}$ -modules. By Eq. 7, this implies that $J_{\bar{D}}$ is a principal ideal.

To see how this compares to the tangent space sequence Eq. 6, consider the reduction of Eq. 7 modulo $m_{\bar{D}}$ :

B_{\bar{D}} / m_{\bar{D}} B_{\bar{D}} \otimes_{F} C_{\bar{D}} / m_{\bar{D}} C_{\bar{D}} ↠ J_{\bar{D}} / m_{\bar{D}} J_{\bar{D}} ↠ (J_{\bar{D}} + (p, m_{\bar{D}}^{2})) / (p, m_{\bar{D}}^{2}) \subseteq m_{\bar{D}} / (p, m_{\bar{D}}^{2}) .

Taking the $F$ -dual of this composite map and using Eq. 8 gives a map

t_{\bar{D}} \to H^{1} (G, χ_{1} χ_{2}^{- 1}) \otimes_{F} H^{1} (G, χ_{1}^{- 1} χ_{2})

that equals the map in Eq. 6.

2.6. Deformation Conditions.

For applications to number theory, often one wants to consider deformations that satisfy certain conditions rather than the universal deformations considered thus far. For instance, one often wants to understand Galois representations that “come from geometry,” a condition that is usually expressed in terms of ramification and p-adic Hodge theory. For deformations of representations, Ramakrishna worked out a theory for deformations with conditions (25), and this theory has been generalized to pseudodeformations (18).

A deformation condition on representations of a group G is a full subcategory $c$ of the category of finite $Z_{p} [G]$ -modules that is closed under isomorphisms, submodules, quotient modules, and finite direct sums. We think of this as a condition on modules, so we say that a module “has $c$ ” if it is in $c$ . By definition, a pseudorepresentation $D : G \to A$ of G with values in a finite ring A in $C$ has $c$ if there is a Cayley–Hamilton representation $(E, D_{E}, ρ)$ over A such that the $Z_{p} [G]$ -module E has $c$ and such that $D = D_{E} \circ ρ$ . A general ring A in $C$ is a limit of finite rings, so the definition is extended to A by taking limits.

With this definition, the constructions and properties carried out in this section extend to pseudorepresentations with $c$ . In particular, there are quotients $R_{\bar{D}, c}$ and $E_{\bar{D}, c}$ of $R_{\bar{D}}$ and $E_{\bar{D}}$ that parameterize deformations and having property $c$ (18, section 2.5). Moreover, the analogs of Eqs. 7 and 8 hold with the $c$ -versions, except that, in Eq. 8, the group cohomology $H^{1} (G, χ_{1, A} χ_{2, A}^{- 1})$ needs to be replaced by the group $H_{c}^{1} (G, χ_{1, A} χ_{2, A}^{- 1})$ of extensions

0 \to χ_{1, A} \to E \to χ_{2, A} \to 0,

where $E$ has $c$ (18, section 4.3). [This group $H_{c}^{1}$ is a natural generalization of the Bloch–Kato cohomology groups $H_{e}^{1}$ , $H_{f}^{1}$ , and $H_{g}^{1}$ (26, section 3, pg. 352).]

3. Reduction to a Local Problem

In this section, we prove some important reductions toward the proof of Theorem 1.1. First, we define the Hecke algebra $T$ and the pseudodeformation ring R that are relevant to the problem and prove that there is a surjection $R ↠ T$ . Then we analyze the tangent space of R and use this to prove the key result: The pseudo-minimal quotient $R^{pseudo-min}$ of R is equal to $Z_{p}$ .

3.1. The Hecke Algebra.

Denote by $M_{2} (Γ_{0} (N^{2}))$ the space of modular forms of weight 2 and level $Γ_{0} (N^{2})$ with integral coefficients and $S_{2} (Γ_{0} (N^{2}))$ the submodule of cusp forms. Let $\tilde{T}$ be the subring of ${End}_{Z} (M_{2} (Γ_{0} (N^{2}))$ generated by the Hecke operators $T_{n}$ for all n. Let I, called the Eisenstein ideal, be the ideal in $T$ generated by $T_{N}$ and $T_{ℓ} - ℓ - 1$ for all $ℓ ∤ N$ . Let $m_{\bar{ρ}}$ be the maximal ideal generated by I and p, and let $T$ be the completion of $\tilde{T}$ at $m_{\bar{ρ}}$ . Finally, $T^{0}$ denotes the maximal quotient of $T$ that acts faithfully on $S_{2} {(Γ_{0} (N^{2}))}_{m_{\bar{ρ}}}$ . The quotient $M_{2} {(Γ_{0} (N^{2}))}_{m_{\bar{ρ}}} / S_{2} {(Γ_{0} (N^{2}))}_{m_{\bar{ρ}}}$ is generated by a single Eisenstein series E, which is an eigenform for all $T \in T$ . Its $T_{N}$ -eigenvalue is 0, and its $T_{ℓ}$ -eigenvalue is $ℓ + 1$ for all primes $ℓ \neq N$ (2, theorem 2.8).

3.2. The Pseudodeformation Ring.

Let $G_{Q, N p}$ be the Galois group of the maximal extension of $Q$ that is unramified outside ∞, N, and p. Fix embeddings of $\bar{Q}$ into ${\bar{Q}}_{p}$ and ${\bar{Q}}_{N}$ , and let $G_{N}, G_{p} \subset G_{Q, N p}$ be the corresponding decomposition groups at N and p. Let $I_{N} \subset G_{N}$ and $I_{p} \subset G_{p}$ be their respective inertia groups. Let $\bar{D} : G_{Q, N p} \to F_{p}$ be the pseudorepresentation $ω \oplus 1$ . Let $c$ be the “finite-flat” condition; that is, $c$ is the category of finite $Z_{p} [G_{Q, N p}]$ -modules M such that there is a finite-flat group scheme $G$ over $Z_{p}$ such that $M ≅ G ({\bar{Q}}_{p})$ as $G_{p}$ -modules. This is a deformation condition by ref. 25, section 2. Let R be the quotient of $R_{\bar{D}, c}$ parameterizing deformations that have determinant equal to the p-adic cyclotomic character, which we denote by ϵ. That is, R is the quotient by the ideal generated by $D^{u} (σ) - ϵ (σ)$ for all $σ \in G_{Q, N p}$ . Abusing notation slightly, let $D^{u} : G \to R$ denote the composition of the universal deformation with $R_{\bar{D}} ↠ R$ .

Lemma 3.1.

There is a surjective $Z_{p}$ -algebra homomorphism $R ↠ T$ .

Proof: Since $T$ is known to be reduced, there is an injection $T \to \prod_{p} T / p$ , where $p$ ranges over the minimal primes of $T$ . There is one minimal prime given by the action of $T$ on the Eisenstein series E. The other minimal primes are the kernels of the maps $T \to Q_{p} (f)$ for eigenforms f in ${S_{2} (Γ_{0} (N^{2}))}_{m_{\bar{ρ}}}$ . Such forms f have all of their Hecke eigenvalues congruent to those of E; in particular, each such f is ordinary at p (since $a_{p} (E) = 1 + p$ is a unit) and has ${\bar{ρ}}_{f} = ω \oplus 1$ . (In fact, using Katz’s result on the injectivity of the theta operator on weight 2 forms, one can show that every cuspform f with ${\bar{ρ}}_{f} = ω \oplus 1$ is ordinary at p.) Let S be the set of such cuspidal eigenforms. Then there is an injection

T \to Z_{p} \times \prod_{f \in S} Q_{p} (f),

sending $T_{n}$ to $(a_{n} (E), {(a_{n} (f))}_{f \in S})$ . We will identify $T$ with the image of this injection and construct a homomorphism $R \to Z_{p} \times \prod_{f \in S} Q_{p} (f)$ whose image is $T$ .

For each $f \in S$ , the Galois representation $ρ_{f}$ defines a pseudorepresentation $D_{f} : G_{Q, N p} \to O_{f}$ that deforms $\bar{D}$ . Since the level of f is prime to p, $D_{f}$ satisfies the finite flat condition; indeed, the Galois representation of f comes from that of an abelian variety with good reduction at p. Also, the determinant of $D_{f}$ is ϵ since f has weight 2 and trivial Nebentypus. Hence $D_{f}$ defines a map $R \to Q_{p} (f)$ . There is also a map $R \to Z_{p}$ given by the pseudorepresentation $ϵ \oplus 1$ . This defines a map

Φ : R \to Z_{p} \times \prod_{f \in S} Q_{p} (f) .

We have to show that the image of Φ is $T$ . For a prime $ℓ ∤ N p$ , since $Tr (ρ_{f} ({Frob}_{ℓ})) = a_{ℓ} (f)$ , the map Φ sends ${Tr}_{D^{u}} ({Frob}_{ℓ})$ to the image of $T_{ℓ}$ . Since the elements ${Tr}_{D^{u}} ({Frob}_{ℓ})$ topologically generate R by Chebotarov density, this implies that the image of Φ equals the $Z_{p}$ -subalgebra of $T$ generated by ${T_{ℓ} : ℓ ∤ N p prime}$ . It remains to show that this subalgebra contains $T_{N}$ and $T_{p}$ . Since every $f \in S$ is new at $N^{2}$ , it follows that $a_{N} (f) = 0$ . But $a_{N} (E) = 0$ as well, so $T_{N} = 0$ in $T$ . Finally, $T_{p}$ is in the subalgebra generated by ${T_{ℓ} : ℓ ∤ N p prime}$ by the ordinary property. To see this, let $α_{f}$ be the unique unit root of $X^{2} - a_{p} (f) X + p$ . The fact that f is ordinary at p implies that, for $σ \in G_{p}$ ,

Tr (ρ_{f}) (σ) = ϵ (σ) λ (α_{f}) (σ) + λ {(α_{f})}^{- 1} (σ),

where $λ (x)$ is the unramified character of $G_{p}$ sending ${Frob}_{p}$ to x. If $τ \in I_{p}$ is an element such that $ϵ (τ) ≢ 1 mod p$ , then

Tr (ρ_{f}) (τ {Frob}_{p}) - Tr (ρ_{f}) ({Frob}_{p}) = (ϵ (τ) - 1) ϵ ({Frob}_{p}) α_{f},

so the (unique) unit root α of $X^{2} - T_{p} X + p$ in $T$ equals the image of $\frac{{Tr}_{D^{u}} (τ {Frob}_{p}) - {Tr}_{D^{u}} ({Frob}_{p})}{(ϵ (τ) - 1) ϵ ({Frob}_{p})}$ . Since the nonunit root of $X^{2} - T_{p} X + p$ is $p α^{- 1}$ , it follows that $T_{p} = α + p α^{- 1}$ is in the image of R, as desired.

Let $R^{red} = R \otimes_{R_{\bar{D}}} R_{\bar{D}}^{red}$ be the quotient of R that parameterizes reducible deformations.

Lemma 3.2.

The homomorphism $R^{red} \to Z_{p}$ given by the reducible deformation $ϵ \oplus 1$ is an isomorphism.

Proof: The ring $R_{\bar{D}}^{red}$ is the completed tensor product of the finite flat deformation rings of ω and 1, with universal deformation $χ_{ω} \oplus χ_{1}$ , where $χ_{ω}$ and $χ_{1}$ are the universal finite flat deformations and ω and 1, respectively (18, proposition 4.3.4). Fixing the determinant to be ϵ gives $χ_{ω} = ϵ χ_{1}^{- 1}$ in $R^{red}$ , so it suffices to show that $χ_{1} = 1$ in $R^{red}$ . A deformation of 1 factors through the maximal abelian pro-p quotient of $G_{Q, N p}$ , which, by the Kronecker–Weber theorem, is the pro-p quotient of $Gal (Q (ζ_{N^{\infty} p^{\infty}}) / Q)$ . Since $p ∤ (N - 1)$ , the maximal pro-p quotient is unramified at N. Recall that finite flat characters of $G_{p}$ are an unramified character times either the trivial character or the p-adic cyclotomic character. In particular, the projections of $χ_{ω}$ and $χ_{1}$ in $R^{red}$ are both of this form, which forces $χ_{1}$ to be unramified at p. Thus $χ_{1}$ is unramified everywhere and hence trivial.

Let $J = ker (R \to R^{red})$ be the reducibility ideal of R; Lemma 3.2 implies that $R / J = Z_{p}$ . Let $B = B_{\bar{D}, c} \otimes_{R_{\bar{D}, c}} R$ and $C = C_{\bar{D}, c} \otimes_{R_{\bar{D}, c}} R$ . By Eq. 7, there is a surjective map

B \otimes_{R} C ↠ J .

Lemma 3.3.

The R-modules B and C are cyclic and J is a principal ideal.

Proof: By ref. 18, theorem 4.3.5 (which is the analog of Eq. 8 with deformation conditions), there are isomorphisms

{Hom}_{R} (B, F_{p}) ≅ H_{c}^{1} (G_{Q, N p}, F_{p} (1)), {Hom}_{R} (C, F_{p}) = H_{c}^{1} (G_{Q, N p}, F_{p} (- 1)) .

These groups have been computed to be one-dimensional in ref. 7, proposition 6.3.2 and lemma 6.3.6, respectively.^* Note that the same reference shows that the groups $H_{c}^{1} (G_{Q, p}, F_{p} (\pm 1))$ , with no ramification at N, are trivial; we will use this fact in the proof of the next proposition.

Since ${Hom}_{R} (B, F_{p})$ and ${Hom}_{R} (C, F_{p})$ are one dimensional, Nakayama’s lemma implies that B and C are cyclic R-modules. Then the surjection $B \otimes_{R} C ↠ J$ of Eq. 7 implies that J is principal.

Proposition 3.4.

There is an isomorphism

$R / (p, m_{\bar{D}}^{2}) \tilde{\to} F_{p} [ϵ] / (ϵ^{2})$

given by a pseudorepresentation $D_{b, c, ϕ}$ of the form Eq. 5 with $χ_{1} = ω$ and $χ_{2} = 1$ , where b and c are cocycles representing generators of the groups $H_{c}^{1} (G_{Q, N p}, F_{p} (1))$ and $H_{c}^{1} (G_{Q, N p}, F_{p} (- 1))$ , respectively. Moreover, b and c are ramified at N.

Proof: By Lemma 3.3, there is an element $x \in J$ that generates J. By Lemma 3.2, $R / J = Z_{p}$ . This implies that $m_{\bar{D}} = (p, x)$ , and that the maximal ideal of $R / p R$ is principal. There is a surjection $R ↠ T$ by Lemma 3.1, and $T$ is a free $Z_{p}$ -module of rank at least 2 by ref. 2, theorem B, so $R / p R \neq F_{p}$ . Hence $R / (p, m_{\bar{D}}^{2})$ is isomorphic to $F_{p} [ϵ] / (ϵ^{2})$ . This isomorphism defines an element of the tangent space $t_{\bar{D}}$ of $R_{\bar{D}}$ . This element cannot be a reducible deformation by Lemma 3.2, so it must be of the claimed form. Finally, the last statement follows from the fact, mentioned in the proof of Lemma 3.3, that the groups $H_{c}^{1} (G_{Q, p}, F_{p} (\pm 1))$ , with no ramification at N, are trivial.

Let $R^{pseudo-min}$ be the quotient of R by the ideal generated by ${Tr}_{D^{u}} (σ) - 2$ for all $σ \in I_{N}$ . This is called the pseudo-minimal quotient as it parameterizes pseudorepresentations that equal the trivial pseudorepresentation on $I_{N}$ . A pseudorepresentation is called minimal if it comes from a Cayley–Hamilton representation $(E, D, ρ)$ such that ${ρ |}_{I_{N}} = 1$ . A pseudo-minimal pseudorepresentation need not be minimal: A Steinberg-at-N representation is pseudo-minimal but not minimal.

Under the surjection $R ↠ T$ of Lemma 3.1, the quotient $R^{pseudo-min}$ should correspond to quotient of $T$ that acts on forms of level $Γ_{0} (N)$ . Since $p ∤ (N - 1)$ , results of Mazur (3, proposition II.9.7) imply that there are no cuspforms f of weight 2 and level $Γ_{0} (N)$ such that ${\bar{ρ}}_{f} = ω \oplus 1$ . Thus, if $R ≅ T$ then one expects that $R^{pseudo-min} = R^{red} = Z_{p}$ . Indeed, this is the case.

Lemma 3.5.

The map $R^{pseudo-min} \to Z_{p}$ given by the deformation $ϵ \oplus 1$ is an isomorphism.

Proof: The deformation $ϵ \oplus 1$ is obviously pseudo-minimal (in fact, minimal), so it defines a surjective homomorphism $R^{pseudo-min} ↠ Z_{p}$ . To show it is an isomorphism, it is enough to show that the tangent space of $R^{pseudo-min} / p R^{pseudo-min}$ is trivial. Since the tangent space of $R / p R$ is one-dimensional and generated by $D_{b, c, ϕ}$ by Proposition 3.4, it is enough to show that $D_{b, c, ϕ}$ is not pseudo-minimal. Recall the formula Eq. 5

D_{b, c, ϕ} (x) = ω (x) + 1 + ϵ (ω (x) ϕ (x) + b (x) c (x) - ϕ (x)) .

Since ω is unramified at N, for $σ \in I_{N}$ this equation simplifies to

D_{b, c, ϕ} (σ) = 2 + b (σ) c (σ) ϵ .

Since b and c are ramified at N, there is $σ \in I_{N}$ such that $b (σ) c (σ) \neq 0$ . This implies that ϵ is in the kernel of the map

F_{p} [ϵ] / (ϵ^{2}) \tilde{\to} R / (p, m^{2}) ↠ R^{pseudo-min} / (p, m^{2}),

completing the proof.

4. Computation of the Local Deformation Ring

In this section, we define a local deformation ring $R_{N}$ that is naturally augmented over $Z_{p}$ with augmentation ideal I. The global deformation ring R is an $R_{N}$ -algebra in a natural way, and the extension IR of I to R is the kernel of the map $R \to R^{pseudo-min}$ . In particular, Lemma 3.5 implies that $R / I R = Z_{p}$ . This says that the global deformations are completely controlled by the local deformations; indeed, by Nakayama’s lemma, it says that R is a cyclic $R_{N}$ -module. Finally, we completely characterize the local deformations, proving that they all come from inducing a character of $G_{Q_{N^{2}}}$ , and deduce an isomorphism $R_{N} \tilde{\to} Λ^{+}$ .

4.1. The Deformation Ring of the Supercuspidal Character.

One way to construct a deformation $ρ : G_{N} \to {GL}_{2} (A)$ of $\bar{D} |_{G_{N}}$ with unramified determinant is to induce a character from $G_{N^{2}}$ . As a preliminary to considering such inductions, we recall some properties of the universal such character.

Let $\tilde{Λ}$ be the universal deformation ring of the trivial character $G_{N^{2}} \to F_{p}$ , where $G_{N^{2}} = Gal ({\bar{Q}}_{N} / Q_{N^{2}})$ . By ref. 8, section 1.4, there is an isomorphism

\tilde{Λ} = Z_{p} [[G_{N^{2}}^{ab, pro- p}]],

where $G_{N^{2}}^{ab, pro- p}$ is the maximal abelian pro-p quotient and the universal character is the tautological one. Fix a choice of Frobenius element ${Frob}_{N^{2}} \in G_{N^{2}}$ . The local Artin map induces an isomorphism $G_{N^{2}}^{ab, pro- p} ≅ Q_{N^{2}}^{\times, pro- p}$ that sends ${Frob}_{N^{2}}$ to N. Let Λ denote the quotient of $\tilde{Λ}$ given by identifying ${Frob}_{N^{2}}$ with $- N$ . Using the local Artin isomorphism as an identification, Λ is identified with $Z_{p} [Δ]$ , where $Δ = Z_{N^{2}}^{\times, pro- p} = F_{N^{2}}^{\times, pro- p}$ is a cyclic group of order $p^{r}$ , where $r = v_{p} (N + 1)$ . (Here $v_{p}$ is the p-adic valuation normalized such that $v_{p} (p) = 1$ .) Denote the universal character $G_{N^{2}} \to Λ^{\times}$ by $[-]$ . Let $δ \in Δ$ be a generator.

Consider the Galois representation

\begin{matrix} ρ_{N} : = Ind G_{N^{2}}^{G_{N}} [-] : G_{N} \to {GL}_{2} (Λ) \end{matrix}

[9]

given by inducing $[-]$ . This is a deformation of ${\bar{D} |}_{G_{N}}$ and it satisfies $det (ρ_{N}) = ϵ$ . For $σ \in I_{N}$ , the trace of $ρ_{N} (σ)$ is given by

\begin{matrix} Tr (ρ_{N} (σ)) = [σ] + {[σ]}^{- 1} . \end{matrix}

[10]

This lands in the subring $Λ^{+} \subset Λ$ fixed by the involution ι that acts as inversion on group-like elements. For later use, we recall the structure of the ring $Λ^{+}$ .

Lemma 4.1.

There is an isomorphism $\frac{Z_{p} [x]}{(x Ψ (x))} \tilde{\to} Λ^{+}$ given by $x \mapsto [δ] + [δ^{- 1}] - 2$ , where $Ψ (x)$ is a distinguished polynomial of degree $\frac{p^{r} - 1}{2}$ with $v_{p} (Ψ (0)) = r$ .

Proof: First, note that $Λ^{+}$ is equal to the subring of Λ generated by $[δ] + [δ^{- 1}]$ . Indeed, every element of $Λ^{+}$ can be represented by a symmetric polynomial in $[δ]$ and $[δ^{- 1}]$ , and every such polynomial is a polynomial in $[δ] + [δ^{- 1}]$ .

Next note that the map

\begin{matrix} Λ \to Z_{p} \times \prod_{i = 1}^{r} Z_{p} [ζ_{p^{i}}] \end{matrix}

[11]

sending $[δ]$ to $(1, ζ_{p}, \dots, ζ_{p^{r}})$ is injective with p-torsion cokernel. Taking ι-fixed parts gives a map

Λ^{+} \to Z_{p} \times \prod_{i = 1}^{r} Z_{p} [ζ_{p^{i}} + ζ_{p^{i}}^{- 1}],

again injective with p-torsion cokernel. Hence the surjective map $Z_{p} [x] ↠ Λ^{+}$ given by $x \mapsto [δ] + [δ^{- 1}] - 2$ sends $x Ψ (x)$ to zero, where $Ψ (x)$ is the product of the minimal polynomials $Ψ_{i} (x)$ of $ζ_{p^{i}} + ζ_{p^{i}} - 2$ . The induced map $Z_{p} [x] / (x Ψ (x)) ↠ Λ^{+}$ is a surjective homomorphism of free $Z_{p}$ -modules of the same finite rank, so it is an isomorphism. Since each ring $Z_{p} [ζ_{p^{i}} + ζ_{p^{i}}^{- 1}]$ is totally ramified over $Z_{p}$ , the polynomials $Ψ_{i} (x)$ are Eisenstein, so $v_{p} (Ψ (0)) = r$ .

4.2. A Computation of an Inertial Pseudodeformation Ring.

We now define a kind of local deformation ring $R_{N}$ . Roughly speaking, it is the ring parameterizing “deformations on inertia that extend to the decomposition group.” The main result of this section is Proposition 4.4, which states that all inertia deformations that extend to the decomposition group are supercuspidal, in the sense that they arise from an induction construction.

We first recall some properties of local Galois groups. There is an exact sequence

0 \to I_{N} \to G_{N} \to Gal (Q_{N}^{nr} / Q_{N}) \to 0,

where $Q_{N}^{nr}$ is the maximal unramified extension. The group $Gal (Q_{N}^{nr} / Q_{N})$ is isomorphic to $G_{F_{N}}$ and hence is topologically generated by ${Frob}_{N} \in G_{N}$ . The group $I_{N}$ is complicated, but its maximal pro-p quotient $I_{N}^{(p)}$ is procyclic. Let $τ \in I_{N}$ be an element that topologically generates $I_{N}^{(p)}$ . Frobenius acts on the image of τ in $I_{N}^{(p)}$ by

{Frob}_{N} τ {Frob}_{N}^{- 1} = τ^{N} .

If ρ is a representation of $I_{N}^{(p)}$ that extends to a representation of $G_{N}$ , then $ρ (τ)$ and $ρ (τ^{N})$ must be conjugate and thus have the same traces and determinants. This motivates the following definition.

Definition 4.2:

Let $R_{N}$ be the quotient of $R_{{\bar{D} |}_{I_{N}}}$ by the ideal generated by

$D^{u} (σ) - 1$ for all $σ \in I_{N}$ , and

${Tr}_{D^{u}} (τ) - {Tr}_{D^{u}} (τ^{N})$ .

The pseudorepresentation associated to the trivial representation $I_{N} \to {GL}_{2} (Z_{p})$ defines a map $R_{N} ↠ Z_{p}$ , making $R_{N}$ into an augmented $Z_{p}$ -algebra. Let $I = ker (R_{N} \to Z_{p})$ be the augmentation ideal; explicitly, it is the ideal generated by ${Tr}_{D^{u}} (σ) - 2$ for all $σ \in I_{N}$ .

Of course, if $Δ : G_{N} \to A$ is a deformation of ${\bar{D} |}_{G_{N}}$ with unramified determinant, then ${Δ |}_{I_{N}}$ defines a map $R_{N} \to A$ . Thus restricting the universal pseudodeformation $D^{u} : G_{Q, N p} \to R$ to $I_{N}$ induces a ring homomorphism $R_{N} \to R$ . The following lemma shows we are in the unusual situation that this map is surjective.

Lemma 4.3.

The natural map $R_{N} \to R$ is surjective.

Proof: Let $I \subset R_{N}$ be the augmentation ideal of $R_{N}$ as in Definition 4.2. The ideal IR is generated by ${Tr}_{D^{u}} (σ) - 2$ for all $σ \in I_{N}$ , which is exactly the kernel of $R \to R^{pseudo-min}$ , so $R / I R = R^{pseudo-min}$ . By Lemma 3.5, this implies $R / I R = Z_{p}$ . Then by Nakayama’s lemma, the map $R_{N} \to R$ is surjective.

There is a quotient of $R_{N}$ that parameterizes supercuspidal (that is, induced) deformations. Indeed, the pseudorepresentation $ρ_{N} : G_{N} \to {GL}_{2} (Λ)$ constructed in the previous section is the universal induced representation. By Eq. 10, its pseudorepresentation on inertia has values in the subring $Λ^{+}$ of Λ. This defines a surjective homomorphism $R_{N} ↠ Λ^{+}$ . The following proposition shows that, in fact, all deformations are supercuspidal. (Note that such deformations are allowed to be reducible; for instance $1 \oplus ϵ$ is supercuspidal as it is the induction of the trivial character.)

Proposition 4.4.

The map $R_{N} ↠ Λ^{+}$ is an isomorphism of augmented $Z_{p}$ -algebras.

Proof: Let ${\tilde{R}}_{N}$ be the quotient of $R_{{\bar{D} |}_{I_{N}}}$ by the ideal generated by $D^{u} (σ) - 1$ for all $σ \in I_{N}$ . That is, ${\tilde{R}}_{N}$ is the universal deformation ring of the trivial 2-dimensional pseudorepresentation on $I_{N}$ having trivial determinant. Let $\tilde{D} : I_{N} \to {\tilde{R}}_{N}$ be the universal deformation. Consider the representation

{\tilde{ρ}}_{N} : I_{N} \to {GL}_{2} (Z_{p} [[x]])

obtained as the composite

I_{N} ↠ I_{N}^{(p)} = ⟨ τ ⟩ \overset{τ \mapsto (\begin{matrix} 1 + x & 1 \\ x & 1 \end{matrix})}{\to} {GL}_{2} (Z_{p} [[x]]) .

The pseudorepresentation of ${\tilde{ρ}}_{N}$ defines a map

ψ : {\tilde{R}}_{N} \to Z_{p} [[x]]

We claim that ψ is an isomorphism, with inverse given by the map

ϕ : Z_{p} [[x]] \to {\tilde{R}}_{N}, x \mapsto {Tr}_{\tilde{D}} (τ) - 2 .

Since $ψ ({Tr}_{\tilde{D}} (τ)) = Tr ({\tilde{ρ}}_{N} (τ)) = x + 2$ , the composition $ψ \circ ϕ$ is the identity. On the other hand, the map

ϕ \circ ψ : {\tilde{R}}_{N} \to {\tilde{R}}_{N}

defines a pseudorepresentation ${\tilde{D}}^{'} : I_{N} \to {\tilde{R}}_{N}$ . To see that $ϕ \circ ψ$ is the identity, it is enough to show that ${\tilde{D}}^{'} = \tilde{D}$ . Since they both have trivial determinant, it suffices to show ${Tr}_{{\tilde{D}}^{'}} = {Tr}_{\tilde{D}}$ . By construction,

{Tr}_{{\tilde{D}}^{'}} (τ) = {Tr}_{\tilde{D}} (τ) .

Then, by the pseudorepresentation identity Item 2, this implies that for all n,

{Tr}_{{\tilde{D}}^{'}} (τ^{n}) = {Tr}_{\tilde{D}} (τ^{n}) .

Since ${Tr}_{{\tilde{D}}^{'}}$ and ${Tr}_{\tilde{D}}$ are continuous and agree on a dense subgroup of $⟨ τ ⟩$ , they are equal on $⟨ τ ⟩$ (Remark 2.2). Finally, both ${Tr}_{{\tilde{D}}^{'}}$ and ${Tr}_{\tilde{D}}$ send every element $σ \in ker (I_{N} \to I_{N}^{(p)})$ to 2. Indeed, for any Cayley–Hamilton representation $ρ : I_{N} \to E^{\times}$ inducing either one, since $ρ \equiv 1 (mod m E)$ , the image of ρ is pro-p, so ρ factors through $I_{N}^{(p)}$ . Since ${Tr}_{{\tilde{D}}^{'}}$ and ${Tr}_{\tilde{D}}$ factor through $I_{N}^{(p)}$ and agree on $⟨ τ ⟩$ , they agree on $I_{N}$ .

Now, since $R_{N}$ is the quotient of ${\tilde{R}}_{N}$ by the relation ${Tr}_{\tilde{D}} (τ) = {Tr}_{\tilde{D}} (τ^{N})$ , the map ψ induces an isomorphism $R_{N} ≅ Z_{p} [[x]] / (f (x))$ , where $f (x) = Tr ({\tilde{ρ}}_{N} (τ)) - Tr ({\tilde{ρ}}_{N} (τ^{N}))$ . To compute $f (x)$ more explicitly, it is convenient to pass to an overring of $Z_{p} [[x]]$ that contains the eigenvalues of ${\tilde{ρ}}_{N} (τ)$ , which are the roots of the polynomial $λ^{2} - (2 + x) λ + 1$ . Over the ring

\frac{Z_{p} [[x]] [λ]}{(λ^{2} - (2 + x) λ + 1)} = \frac{Z_{p} [[x]] [λ]}{({(λ - 1)}^{2} - x λ)} ≅ Z_{p} [[λ - 1]], x \mapsto \frac{{(λ - 1)}^{2}}{λ},

the eigenvalues^† of ${\tilde{ρ}}_{N} (τ)$ are λ and $λ^{- 1}$ , so $f (x) = λ + λ^{- 1} - (λ^{N} + λ^{- N})$ . Since $Z_{p} [[λ - 1]] / Z_{p} [[x]]$ is a torsion-free $Z_{p} [[x]]$ -module, it follows that $(f (x) Z_{p} [[λ - 1]]) \cap Z_{p} [[x]] = f (x) Z_{p} [[x]]$ , so that the map

\frac{Z_{p} [[x]]}{(f (x))} \to \frac{Z_{p} [[λ - 1]]}{(f (x))}

induced by $x \mapsto \frac{{(λ - 1)}^{2}}{λ}$ is injective. Thus $R_{N}$ is isomorphic to the subring generated by $\frac{{(λ - 1)}^{2}}{λ}$ in the ring A defined by

A = \frac{Z_{p} [[λ - 1]]}{(λ + λ^{- 1} - λ^{N} - λ^{- N})} .

This presentation of A can be simplified by factoring:

λ + λ^{- 1} - λ^{N} - λ^{- N} = - λ^{- N} (λ^{N + 1} - 1) (λ^{N - 1} - 1) .

Noting that $- λ^{- N} \in Z_{p} [[λ - 1]]$ is a unit and that, since $p ∤ (N - 1)$ and $p^{r} ∣ ∣ (N + 1)$ the ratios

\frac{λ^{N - 1} - 1}{λ - 1}, \frac{λ^{N + 1} - 1}{λ^{p^{r}} - 1} \in Z_{p} [[λ - 1]]

are units, the ring A can be written as

A = \frac{Z_{p} [[λ - 1]]}{(λ - 1) (λ^{p^{r}} - 1)} .

Note that there is a surjective homomorphism $A \to Λ$ given by $λ \mapsto [δ]$ , which gives the presentation $Λ ≅ Z_{p} [[λ - 1]] / (λ^{p^{r}} - 1)$ . In this presentation, $Λ^{+}$ is the subring generated by $[δ] + [δ^{- 1}] - 2 = \frac{{(λ - 1)}^{2}}{λ}$ . Factoring $λ^{p^{r}} - 1$ into a product of irreducible polynomials yields an embedding

A ↪ \frac{Z_{p} [λ - 1]}{{(λ - 1)}^{2}} \oplus ⨁_{i = 1}^{r} Z_{p} [ζ_{p^{i}}],

with λ mapping to $ζ_{p^{i}}$ in the rightmost factors. This map induces the map Eq. 11 on the quotient Λ of A. Hence there is a commutative diagram

in which the leftmost vertical arrow is the map induced on the subrings generated by $\frac{{(λ - 1)}^{2}}{λ}$ in the middle vertical arrow. The rightmost vertical arrow is just the identity on the summands indexed by $1 \leq i \leq r$ . To see that the map $R_{N} ↠ Λ^{+}$ is injective, it is enough to show that the kernel of the rightmost vertical arrow has trivial intersection with the subring of A generated by $\frac{{(λ - 1)}^{2}}{λ}$ . Since the kernel of this arrow is contained in the $\frac{Z_{p} [λ - 1]}{{(λ - 1)}^{2}}$ factor, and $\frac{{(λ - 1)}^{2}}{λ}$ maps to zero in this factor, this is clear.

5. Proof of Theorem 1.1

Combining Lemmas 3.1, 4.3, and Proposition 4.4, we find that there is a chain of surjective ring homomorphisms

\begin{matrix} Λ^{+} \tilde{\to} R_{N} ↠ R ↠ T . \end{matrix}

[12]

Letting $φ : Λ^{+} ↠ T$ denote the composition of these maps, we have a commutative diagram

[13]

as in the set up of Wiles’s numerical criterion (27, appendix), as improved by Lenstra (28) (see also ref. 29). Let $J = ker (π)$ be the augmentation ideal in $Λ^{+}$ and $I = ker (π_{T})$ the Eisenstein ideal in $T$ . It is well known and easy to see that ${Ann}_{T} (I)$ is the kernel of the quotient $T \to T^{0}$ of $T$ that acts faithfully on cuspforms.

Theorem 5.1.

The surjective maps in Eq. 12 are all isomorphisms.

Proof: Let $η = p^{t}$ be a generator of the ideal $π_{T} ({Ann}_{T} (I)) \subseteq Z_{p}$ . By the numerical criterion (29, criterion I), it is enough to show $# J / J^{2} \leq η$ . It follows immediately from Lemma 4.1 that $# J / J^{2} = p^{r}$ . Since ${Ann}_{T} (I) = ker (T \to T^{0})$ , to show that $η \geq p^{r}$ , it is enough to show that the composite map

T \overset{π_{T}}{\to} Z_{p} \to Z / p^{r} Z

factors through $T^{0}$ . In other words, it is enough to show that the Eisenstein series E is a cuspform modulo $p^{r}$ . This follows from ref. 2, corollary 2.6, completing the proof.

Remark 5.2:

Since the ring $Λ^{+}$ is monogenic, there is an alternative argument that does not use the numerical criterion (but still uses the fact that $p^{r} ∣ η$ ). For this, note that the surjection φ and Lemma 4.1 imply that $T$ has a presentation $T ≅ Z_{p} [x] / (x F (x))$ , where $F (x)$ is a monic divisor of $Ψ (x)$ . Then η can be interpreted as the constant term $F (0)$ (up to a p-adic unit). But since $p^{r} ∣ η$ , this implies that $Ψ (0) ∣ F (0)$ . Since $F (x) ∣ Ψ (x)$ , this implies $F (x) = Ψ (x)$ , as desired.

This completes the proof of Theorem 1.1.

6. Complement: Relation to Massey Products

We have proven an isomorphism $Λ^{+} \tilde{\to} R$ by identifying R with a local deformation ring. Since the local deformation ring is so explicit, this gives us complete understanding of R, and in particular, its rank. In ref. 7, another method for studying the rank of R is introduced, using obstructions in Galois cohomology that come from Massey products. One might hope to use Theorem 1.1 and reverse the arguments of ref. 7 to obtain nontrivial arithmetic results about vanishing of Massey products. In this section, we indicate how our results are related to Massey products and conclude that the Massey products involved are particularly simple and not arithmetically interesting. (This explains why the rank is basically constant in our case, rather than varying in an arithmetically interesting way as in ref. 7.) We do not give another complete proof of Theorem 1.1 (although there is little doubt that one could give a proof along these lines). Instead we attempt to illustrate why Theorem 1.1 is reasonable from the point of view of ref. 7.

6.1. The Strategy for Relating Massey Products to Ranks.

It follows from Proposition 3.4 that the tangent space of $R / p R$ is one-dimensional and spanned by a tangent vector $D_{b, c, ϕ}$ . So, there is an isomorphism $R / p R ≅ F_{p} [[ϵ]] / (ϵ^{d})$ for some $d > 1$ (including possibly $d = \infty$ if $R / p R ≅ F_{p} [[ϵ]]$ ). This d is then the $F_{p}$ -dimension of $R / p R$ , which is an upper bound on the rank of R. One can study d one step at a time: $d > 2$ if and only if the tangent vector $R ↠ F_{p} [ϵ] / (ϵ^{2})$ given by $D_{b, c, ϕ}$ lifts to a map $R ↠ F_{p} [ϵ] / (ϵ^{3})$ . If $d > 2$ , then $d > 3$ if and only if the map $R ↠ F_{p} [ϵ] / (ϵ^{3})$ lifts to a map $R ↠ F_{p} [ϵ] / (ϵ^{4})$ , and so on.

Given this interpretation of the $F_{p}$ -dimension of $R / p R$ in terms of lifts of the tangent vector, we now sketch how this is related to the vanishing of certain elements, called Massey products, in Galois cohomology $H^{2} (Q, -)$ . For this, consider the problem of lifting $R ↠ F_{p} [ϵ] / (ϵ^{2})$ to a map $R ↠ F_{p} [ϵ] / (ϵ^{3})$ . Recall that $D_{b, c, ϕ}$ comes from the trace of a representation $ρ_{b, c, ϕ}$ Eq. 4 that can be written as

ρ_{b, c, ϕ} = (\begin{matrix} ω (1 + ϕ_{1} ϵ) & b + b_{1} ϵ \\ ω c ϵ & 1 + ϕ_{1}^{'} ϵ \end{matrix}) : G_{Q} \to {GL}_{2} (F_{p} [ϵ] / (ϵ^{2})),

where $ϕ_{1}$ is a choice of 1-cochain satisfying $- d ϕ_{1} = b \cup c$ , $ϕ^{'} = b c - ϕ$ , and $b_{1}$ is a cochain satisfying $- d b_{1} = b \cup ϕ_{1} + ϕ_{1}^{'} \cup b$ . A map $R ↠ F_{p} [ϵ] / (ϵ^{3})$ lifting $D_{b, c, ψ}$ might come from a deformation $ρ_{2}$ of the form

ρ_{2} = (\begin{matrix} ω (1 + ϕ_{1} ϵ + ϕ_{2} ϵ^{2}) & b + b_{1} ϵ + b_{2} ϵ^{2} \\ ω (c ϵ + c_{2} ϵ^{2}) & 1 + ϕ_{1}^{'} ϵ + ϕ_{2}^{'} ϵ^{2} \end{matrix}) : G_{Q} \to {GL}_{2} (F_{p} [ϵ] / (ϵ^{3})) .

(See Remark 6.1 below for pseudodeformations that may not arise from such a $ρ_{2}$ .) Here the functions $ϕ_{2}$ , $b_{2}$ , $c_{2}$ , and $ϕ_{2}^{'}$ are 1-cochains^‡ that, in order for $ρ_{2}$ to be a homomorphism, must satisfy conditions on their coboundaries. For instance, $ϕ_{2}$ satisfies

- d ϕ_{2} = ϕ_{1} \cup ϕ_{1} + b \cup c_{2} + b_{1} \cup c .

The right-hand side of this equation is a 2-cocycle, and the equation expresses the fact that this 2-cocycle is a 2-coboundary (i.e. it vanishes in cohomology). A similar thing is true for the equations governing $b_{2}$ , $c_{2}$ , and $ϕ_{2}^{'}$ . Conversely, without knowing that $ρ_{2}$ exists, if one knew that the relevant 2-cocycles were 2-coboundaries, one could define $ρ_{2}$ using these equations. The cohomology classes of these 2-cocycles are examples of Massey products. This shows that Massey products are obstructions: The representation $ρ_{2}$ exists if and only if the Massey products vanish.

Remark 6.1:

We have glossed over several points in this sketch. First, it is not clear that a pseudodeformation of $D_{b, c, ϕ}$ must come from a true representation like $ρ_{2}$ . This can be remedied by instead looking for deformations in the universal Cayley–Hamilton algebra. The results of Section 3 imply that this Cayley–Hamilton algebra is a GMA with a particularly simple form, so the true representations considered above are not too different from the universal case. The main caveat is that, just as one does not need the cochain $b_{1}$ in order to define $D_{b, c, ϕ}$ , one also does not need the cochain $b_{2}$ in order to define the pseudorepresentation associated to $ρ_{2}$ . Second, in order for the deformation $ρ_{2}$ to define a map $R ↠ F_{p} [ϵ] / (ϵ^{3})$ , the pseudorepresentation must satisfy the local conditions required in the definition of R. This can be resolved by working with Galois cohomology with restricted ramification $H^{2} (G_{Q, N p}, -)$ , and working carefully with the finite-flat condition.

6.2. Computation of the Relevant Massey Products.

The relevant Massey products can be computed explicitly in this case. There are two main reasons for this. First, the restriction maps

{res}_{N} : H^{2} (G_{Q, N p}, F_{p} (i)) \to H^{2} (Q_{N}, F_{p} (i))

for $i = 0, 1, - 1$ are injective. This is a kind of local-to-global principle: to compute the whether or not the global Massey-product classes vanish, it is enough to consider their restriction to local cohomology at N. Second, (and this is the most crucial difference with ref. 7), the cohomology group $H^{1} (Q_{N}, F_{p} (1))$ is one-dimensional. Since $N \equiv - 1 (mod p)$ , there is an isomorphism $F_{p} (1) ≅ F_{p} (- 1)$ of $G_{Q_{N}}$ modules, so ${res}_{N} (b)$ and ${res}_{N} (c)$ are both nonzero classes in the same one-dimensional space $H^{1} (Q_{N}, F_{p} (1))$ . Up to rescaling, we can assume that ${res}_{N} (b) = {res}_{N} (c)$ .

We will now sketch an argument that uses Massey products to explain why ${dim}_{F_{p}} (R / p R) \leq \frac{p + 1}{2}$ when $N ≢ - 1 (mod p^{2})$ . From this point on, we work exclusively with cohomology of $G_{Q_{N}}$ and drop the ${res}_{N}$ from the notation, which is justified by the local-to-global principle. Since $b = c$ , a particularly simple cochain ϕ satisfying $- d ϕ = b \cup c = b \cup b$ is $ϕ = \frac{1}{2} b^{2}$ . Similarly, to find a cochain $b_{1}$ such that

- d b_{1} = b \cup ϕ + ϕ^{'} \cup b = b \cup \frac{1}{2} b^{2} + \frac{1}{2} b^{2} \cup b

take $b_{1} = \frac{1}{6} b^{3}$ . To simplify notation, for $n < p$ , let $b^{[n]} = \frac{1}{n!} b^{n}$ . Then $ρ_{b, c, ϕ}$ takes the simple form

ρ_{b, c, ϕ} = (\begin{matrix} ω (1 + b^{[2]} ϵ) & b + b^{[3]} ϵ \\ ω b ϵ & 1 + b^{[2]} ϵ \end{matrix}) .

To deform this, one can take

ρ_{2} = (\begin{matrix} ω (1 + b^{[2]} ϵ + b^{[4]} ϵ^{2}) & b + b^{[3]} ϵ + b^{[5]} ϵ^{2} \\ ω (b ϵ + b^{[3]} ϵ^{2}) & 1 + b^{[2]} ϵ + b^{[4]} ϵ^{2} \end{matrix}) .

The obvious pattern continues: If $2 n + 1 < p$ , then there is a deformation $ρ_{n} : G_{Q_{N}} \to {GL}_{2} (F_{p} [ϵ] / (ϵ^{n + 1}))$ defined by

ρ_{n} = (\begin{matrix} ω (1 + b^{[2]} ϵ + \dots + b^{[2 n]} ϵ^{n}) & b + b^{[2]} ϵ + \dots + b^{[2 n + 1]} ϵ^{n} \\ ω (b ϵ + \dots + b^{[2 n - 1]} ϵ^{n}) & 1 + b^{[2]} ϵ + \dots + b^{[2 n]} ϵ^{n} \end{matrix}) .

Just as the pseudorepresentation $D_{b, c, ϕ}$ does not require the cochain $b_{1}$ to be defined, the pseudorepresentation associated to $ρ_{n}$ does not require $b^{[2 n + 1]}$ to be defined. In other words, the pseudorepresentation associated to $ρ_{n}$ can be defined as long as $2 n < p$ . This defines a pseudorepresentation $D_{\frac{p - 1}{2}} : G_{N} \to F_{p} [ϵ] / (ϵ^{\frac{p + 1}{2}})$ . The obstruction to deforming $D_{\frac{p - 1}{2}}$ is the 2-cocycle

\begin{matrix} \sum_{i = 1}^{p - 1} b^{[i]} \cup b^{[p - i]} . \end{matrix}

[14]

This is the Massey pth-power ${⟨ b ⟩}^{p}$ of b, defined by Kraines (30, definition 11).^§ By a variant of ref. 30, theorem 14 for nontrivial coefficients, ${⟨ b ⟩}^{p}$ is equal to $\partial (b)$ , where ∂ is the connecting map in the exact sequence

0 \to F_{p} (1) \to (Z / p^{2} Z) (ω) \to F_{p} (1) \to 0,

where $(Z / p^{2} Z) (ω)$ is the unramified $G_{Q_{N}}$ -module where ${Frob}_{N}$ acts by $ω (N) = - 1$ . If $N ≢ - 1 (mod p^{2})$ , then a simple calculation shows that $\partial (b) = b \cup x$ for a nontrivial class $x \in H^{1} (Q_{N}, F_{p})$ . By Tate duality, this implies that $\partial (b) \neq 0$ . In other words, the 2-cocycle Eq. 14 is not a coboundary, and this gives an obstruction to deforming $D_{\frac{p - 1}{2}}$ . If ${dim}_{F_{p}} (R / p R)$ were greater than $\frac{p + 1}{2}$ , there would be a surjective homomorphism $R ↠ F_{p} [ϵ] / (ϵ^{\frac{p + 3}{2}})$ , from which one could construct a deformation of $D_{\frac{p - 1}{2}}$ , a contradiction.

The inequality ${dim}_{F_{p}} R / p R \leq \frac{p + 1}{2}$ goes most of the way to proving Corollary 1.2. Indeed, it is not difficult to show that every cuspform f satisfying Eq. 1 must be supercuspidal at N. Taking the trace of the supercuspidal representation shows that the coefficient ring of f contains $Z_{p} [ζ_{p} + ζ_{p}^{- 1}]$ , so that ${rank}_{Z_{p}} (T) \geq \frac{p + 1}{2}$ . Then ${dim}_{F_{p}} R / p R \leq \frac{p + 1}{2}$ and the surjection $R ↠ T$ imply that these containments and inequalities are all equalities. Of course, to complete the above sketch, one would have to deal with the issues mentioned in Remark 6.1.

Acknowledgments

We thank Shaunak Deo, Robert Pollack, Alice Pozzi, and Carl Wang-Erickson for helpful conversations as well as the referees for thoughtful comments that improved this paper. J.L. was supported by NSF grant DMS-2301738 and P.W. was supported by NSF CAREER grant DMS-2337830.

Author contributions

J.L. and P.W. designed research; performed research; and wrote the paper.

Competing interests

The authors declare no competing interest.

Footnotes

This article is a PNAS Direct Submission.

^*See also ref. 4, especially lemma 3.9 and proposition 5.4, for an earlier proof of the same result, in slightly different terms.

^†In fact, ${\tilde{ρ}}_{N} (τ)$ is conjugate to the matrix $(\begin{matrix} λ & 1 \\ 0 & λ^{- 1} \end{matrix})$ in ${GL}_{2} (Z_{p} [[λ - 1]])$ .

^‡With coefficients in $F_{p}$ , $F_{p} (1)$ , $F_{p} (- 1)$ , and $F_{p}$ , respectively.

^§Kraines actually only considers trivial coefficients, but the generalization to nontrivial coefficients is straightforward.

Contributor Information

Jaclyn Lang, Email: jaclyn.lang@temple.edu.

Preston Wake, Email: wakepres@msu.edu.

Data, Materials, and Software Availability

There are no data underlying this work.

References

1.Buzzard K., Questions about slopes of modular forms. Astérisque 298, 1–15 (2005). [Google Scholar]
2.Lang J., Wake P., A modular construction of unramified p-extensions of $Q (N^{1 / p})$ . Proc. Amer. Math. Soc. Ser. B 9, 415–431 (2022). [Google Scholar]
3.Mazur B., Modular curves and the Eisenstein ideal. Inst. Hautes Études Sci. Publ. Math. 47, 33–168 (1977). [Google Scholar]
4.Calegari F., Emerton M., On the ramification of Hecke algebras at Eisenstein primes. Invent. Math. 160, 97–144 (2005). [Google Scholar]
5.Merel L., L’accouplement de Weil entre le sous-groupe de Shimura et le sous-groupe cuspidal de J₀(p). J. Reine Angew. Math. 477, 71–115 (1996). [Google Scholar]
6.Lecouturier E., Higher Eisenstein elements, higher Eichler formulas and rank of Hecke algebras. Invent. Math. 223, 485–595 (2021). [Google Scholar]
7.Wake P., Wang-Erickson C., The rank of Mazur’s Eisenstein ideal. Duke Math. J. 169, 31–115 (2020). [Google Scholar]
8.B. Mazur, “Deforming Galois representations” in Galois Groups Over Q, Y. Ihara, K. Ribet and J.-P. Serre, Eds. (Berkeley, CA, 1987) (Springer, New York, 1989), vol. 16, pp. 385–437.
9.Skinner C. M., Wiles A. J., Ordinary representations and modular forms. Proc. Nat. Acad. Sci. U.S.A. 94, 10520–10527 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Bellaïche J., Chenevier G., Families of Galois representations and Selmer groups. Astérisque 324, xii+314 (2009). [Google Scholar]
11.Berger T., Klosin K., On deformation rings of residually reducible Galois representations and R = T theorems. Math. Ann. 355, 481–518 (2013). [Google Scholar]
12.G. Chenevier, “The p-adic analytic space of pseudocharacters of a profinite group and pseudorepresentations over arbitrary rings” in Automorphic Forms and Galois Representations. Vol. 1 F. Diamond, P. L. Kassaei, M. Kim, Eds.(Cambridge Univ. Press, Cambridge, 2014), vol. 414, pp. 221–285.
13.Calegari F., Eisenstein deformation rings. Compos. Math. 142, 63–83 (2006). [Google Scholar]
14.Wang-Erickson C., Algebraic families of Galois representations and potentially semi-stable pseudodeformation rings. Math. Ann. 371, 1615–1681 (2018). [Google Scholar]
15.S. V. Deo, Non-optimal levels of some reducible mod p modular representations. Adv. Math. 461, 110074 (2025).
16.Wake P., Wang-Erickson C., The Eisenstein ideal with squarefree level. Adv. Math. 380, 107543 (2021). [Google Scholar]
17.Bellaïche J., Pseudodeformations. Math. Z. 270, 1163–1180 (2012). [Google Scholar]
18.Wake P., Wang-Erickson C., Deformation conditions for pseudorepresentations. Forum Math. Sigma 7, e20 (2019). [Google Scholar]
19.Wiles A., On ordinary λ-adic representations associated to modular forms. Invent. Math. 94, 529–573 (1988). [Google Scholar]
20.Taylor R., Galois representations associated to Siegel modular forms of low weight. Duke Math. J. 63, 281–332 (1991). [Google Scholar]
21.Rouquier R., Caractérisation des caractères et pseudo-caractères. J. Algebra 180, 571–586 (1996). [Google Scholar]
22.Roby N., Lois polynomes et lois formelles en théorie des modules. Ann. Sci. École Norm. Sup. 80, 213–348 (1963). [Google Scholar]
23.B. Mazur, “An introduction to the deformation theory of Galois representations” in Modular Forms and Fermat’s Last Theorem (Boston, MA, 1995) G. Cornell, J. H. Silverman, G. Stevens, Eds. (Springer, New York, 1997), pp. 243–311.
24.C. Wang-Erickson, Presentations of non-commutative deformation rings via A_∞-algebras and applications to deformations of Galois representations and pseudorepresentations arXiv [Preprint] (2020). 10.48550/arXiv.1809.02484 (Accessed 11 October 2024). [DOI]
25.Ramakrishna R., On a variation of Mazur’s deformation functor. Compos. Math. 87, 269–286 (1993). [Google Scholar]
26.S. Bloch, K. Kato, “L-functions and Tamagawa numbers of motives” in The Grothendieck Festschrift, Vol. I (Birkhäuser Boston, Boston, MA, 1990), vol. 86, P. Cartier, L. Illusie, N. M. Katz, G. Laumon, Yu. Manin, K. A. Ribet, Eds. pp. 333–400.
27.Wiles A., Modular elliptic curves and Fermat’s last theorem. Ann. Math. 141, 443–551 (1995). [Google Scholar]
28.H. W. Lenstra Jr., “Complete intersections and Gorenstein rings” in Elliptic Curves, Modular Forms,& Fermat’s Last Theorem (Hong Kong, 1993) J. Coates, S.-T. Yau, Eds. (Int. Press, Cambridge, MA, 1995), vol. I, pp. 99–109.
29.B. de Smit, K. Rubin, R. Schoof, “Criteria for complete intersections” in Modular Forms and Fermat’s Last Theorem (Boston, MA, 1995) G. Cornell, J. H. Silverman, G. Stevens, Eds. (Springer, New York, 1997), pp. 343–356.
30.Kraines D., Massey higher products. Trans. Am. Math. Soc. 124, 431–449 (1966). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

There are no data underlying this work.

[r1] 1.Buzzard K., Questions about slopes of modular forms. Astérisque 298, 1–15 (2005). [Google Scholar]

[r2] 2.Lang J., Wake P., A modular construction of unramified p-extensions of $Q (N^{1 / p})$ . Proc. Amer. Math. Soc. Ser. B 9, 415–431 (2022). [Google Scholar]

[r3] 3.Mazur B., Modular curves and the Eisenstein ideal. Inst. Hautes Études Sci. Publ. Math. 47, 33–168 (1977). [Google Scholar]

[r4] 4.Calegari F., Emerton M., On the ramification of Hecke algebras at Eisenstein primes. Invent. Math. 160, 97–144 (2005). [Google Scholar]

[r5] 5.Merel L., L’accouplement de Weil entre le sous-groupe de Shimura et le sous-groupe cuspidal de J₀(p). J. Reine Angew. Math. 477, 71–115 (1996). [Google Scholar]

[r6] 6.Lecouturier E., Higher Eisenstein elements, higher Eichler formulas and rank of Hecke algebras. Invent. Math. 223, 485–595 (2021). [Google Scholar]

[r7] 7.Wake P., Wang-Erickson C., The rank of Mazur’s Eisenstein ideal. Duke Math. J. 169, 31–115 (2020). [Google Scholar]

[r8] 8.B. Mazur, “Deforming Galois representations” in Galois Groups Over Q, Y. Ihara, K. Ribet and J.-P. Serre, Eds. (Berkeley, CA, 1987) (Springer, New York, 1989), vol. 16, pp. 385–437.

[r9] 9.Skinner C. M., Wiles A. J., Ordinary representations and modular forms. Proc. Nat. Acad. Sci. U.S.A. 94, 10520–10527 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r10] 10.Bellaïche J., Chenevier G., Families of Galois representations and Selmer groups. Astérisque 324, xii+314 (2009). [Google Scholar]

[r11] 11.Berger T., Klosin K., On deformation rings of residually reducible Galois representations and R = T theorems. Math. Ann. 355, 481–518 (2013). [Google Scholar]

[r12] 12.G. Chenevier, “The p-adic analytic space of pseudocharacters of a profinite group and pseudorepresentations over arbitrary rings” in Automorphic Forms and Galois Representations. Vol. 1 F. Diamond, P. L. Kassaei, M. Kim, Eds.(Cambridge Univ. Press, Cambridge, 2014), vol. 414, pp. 221–285.

[r13] 13.Calegari F., Eisenstein deformation rings. Compos. Math. 142, 63–83 (2006). [Google Scholar]

[r14] 14.Wang-Erickson C., Algebraic families of Galois representations and potentially semi-stable pseudodeformation rings. Math. Ann. 371, 1615–1681 (2018). [Google Scholar]

[r15] 15.S. V. Deo, Non-optimal levels of some reducible mod p modular representations. Adv. Math. 461, 110074 (2025).

[r16] 16.Wake P., Wang-Erickson C., The Eisenstein ideal with squarefree level. Adv. Math. 380, 107543 (2021). [Google Scholar]

[r17] 17.Bellaïche J., Pseudodeformations. Math. Z. 270, 1163–1180 (2012). [Google Scholar]

[r18] 18.Wake P., Wang-Erickson C., Deformation conditions for pseudorepresentations. Forum Math. Sigma 7, e20 (2019). [Google Scholar]

[r19] 19.Wiles A., On ordinary λ-adic representations associated to modular forms. Invent. Math. 94, 529–573 (1988). [Google Scholar]

[r20] 20.Taylor R., Galois representations associated to Siegel modular forms of low weight. Duke Math. J. 63, 281–332 (1991). [Google Scholar]

[r21] 21.Rouquier R., Caractérisation des caractères et pseudo-caractères. J. Algebra 180, 571–586 (1996). [Google Scholar]

[r22] 22.Roby N., Lois polynomes et lois formelles en théorie des modules. Ann. Sci. École Norm. Sup. 80, 213–348 (1963). [Google Scholar]

[r23] 23.B. Mazur, “An introduction to the deformation theory of Galois representations” in Modular Forms and Fermat’s Last Theorem (Boston, MA, 1995) G. Cornell, J. H. Silverman, G. Stevens, Eds. (Springer, New York, 1997), pp. 243–311.

[r24] 24.C. Wang-Erickson, Presentations of non-commutative deformation rings via A_∞-algebras and applications to deformations of Galois representations and pseudorepresentations arXiv [Preprint] (2020). 10.48550/arXiv.1809.02484 (Accessed 11 October 2024). [DOI]

[r25] 25.Ramakrishna R., On a variation of Mazur’s deformation functor. Compos. Math. 87, 269–286 (1993). [Google Scholar]

[r26] 26.S. Bloch, K. Kato, “L-functions and Tamagawa numbers of motives” in The Grothendieck Festschrift, Vol. I (Birkhäuser Boston, Boston, MA, 1990), vol. 86, P. Cartier, L. Illusie, N. M. Katz, G. Laumon, Yu. Manin, K. A. Ribet, Eds. pp. 333–400.

[r27] 27.Wiles A., Modular elliptic curves and Fermat’s last theorem. Ann. Math. 141, 443–551 (1995). [Google Scholar]

[r28] 28.H. W. Lenstra Jr., “Complete intersections and Gorenstein rings” in Elliptic Curves, Modular Forms,& Fermat’s Last Theorem (Hong Kong, 1993) J. Coates, S.-T. Yau, Eds. (Int. Press, Cambridge, MA, 1995), vol. I, pp. 99–109.

[r29] 29.B. de Smit, K. Rubin, R. Schoof, “Criteria for complete intersections” in Modular Forms and Fermat’s Last Theorem (Boston, MA, 1995) G. Cornell, J. H. Silverman, G. Stevens, Eds. (Springer, New York, 1997), pp. 343–356.

[r30] 30.Kraines D., Massey higher products. Trans. Am. Math. Soc. 124, 431–449 (1966). [Google Scholar]

PERMALINK

The Eisenstein ideal at prime-square level has constant rank

Jaclyn Lang

Preston Wake

Significance

Abstract

1. Introduction

Theorem 1.1.

Corollary 1.2.

1.1. Outline of the Paper.

2. Pseudodeformations

2.1. Pseudorepresentations.

Definition 2.1:

Remark 2.2:

Example 2.3:

2.2. Cayley–Hamilton Algebras and Generalized Matrix Algebras.

Definition 2.4:

Definition 2.5:

Example 2.6:

2.3. Pseudodeformation Rings.

2.4. Tangent Spaces.

2.5. Reducibility Ideal.

Example 2.7:

2.6. Deformation Conditions.

3. Reduction to a Local Problem

3.1. The Hecke Algebra.

3.2. The Pseudodeformation Ring.

Lemma 3.1.

Lemma 3.2.

Lemma 3.3.

Proposition 3.4.

Lemma 3.5.

4. Computation of the Local Deformation Ring

4.1. The Deformation Ring of the Supercuspidal Character.

Lemma 4.1.

4.2. A Computation of an Inertial Pseudodeformation Ring.

Definition 4.2:

Lemma 4.3.

Proposition 4.4.

5. Proof of Theorem 1.1

Theorem 5.1.

Remark 5.2:

6. Complement: Relation to Massey Products

6.1. The Strategy for Relating Massey Products to Ranks.

Remark 6.1:

6.2. Computation of the Relevant Massey Products.

Acknowledgments

Author contributions

Competing interests

Footnotes

Contributor Information

Data, Materials, and Software Availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases