Adjoint modular Galois representations and their Selmer groups
Abstract
In the last 15 years, many class number formulas and main conjectures have been proven. Here, we discuss such formulas on the Selmer groups of the three-dimensional adjoint representation ad(φ) of a two-dimensional modular Galois representation φ. We start with the p-adic Galois representation φ0 of a modular elliptic curve E and present a formula expressing in terms of L(1, ad(φ0)) the intersection number of the elliptic curve E and the complementary abelian variety inside the Jacobian of the modular curve. Then we explain how one can deduce a formula for the order of the Selmer group Sel(ad(φ0)) from the proof of Wiles of the Shimura–Taniyama conjecture. After that, we generalize the formula in an Iwasawa theoretic setting of one and two variables. Here the first variable, T, is the weight variable of the universal p-ordinary Hecke algebra, and the second variable is the cyclotomic variable S. In the one-variable case, we let φ denote the p-ordinary Galois representation with values in GL 2(Zp[[T]]) lifting φ0, and the characteristic power series of the Selmer group Sel(ad(φ)) is given by a p-adic L-function interpolating L(1, ad(φk)) for weight k + 2 specialization φk of φ. In the two-variable case, we state a main conjecture on the characteristic power series in Zp[[T, S]] of Sel(ad(φ) ⊗ ν−1), where ν is the universal cyclotomic character with values in Zp[[S]]. Finally, we describe our recent results toward the proof of the conjecture and a possible strategy of proving the main conjecture using p-adic Siegel modular forms.
The talk at the conference on Elliptic Curves and Modular Forms at the National Academy of Sciences was presented by H.H. The purpose of the talk was to describe formulas giving the characteristic ideal of the Selmer group of the Galois representations as in the title in terms of their L-values. We fix a prime p ≥ 5. Although we can treat the general case, allowing ramification at finitely many primes and ∞, to keep the paper short, we assume that the ramification is concentrated on {p, ∞}.
1. Selmer Groups
Let G be the Galois group of the maximal extension Q (p)/Q unramified outside {p, ∞}. Let 𝒪 be a valuation ring finite flat over Z p with residue field F. We start with a two-dimensional continuous representation φ : G → GL 2(A) for a complete (noetherian) local 𝒪-algebra A with residue field F = A/m A. The power series ring 𝒪[[T 1, … , T r]] is an example of such A. We let G act on V = A 2 via φ and on End(V) by conjugation: φ ⊗ φ∨ (σ)x = φ(σ)xφ(σ)−1. We look at its three-dimensional factor ad(φ) : G → GL 3(A) acting on trace zero subspace V(ad(φ)) in End(V). Thus φ ⊗ φ∨ = ad(φ) ⊕ 1. Let φ̄ = φ mod m A. We assume the following three conditions:
(AI) The restriction of
φ̄ to Gal
(Q
(p)/Q(
is absolutely irreducible;
(Ord) For each decomposition group D over p, φ|D ≅ (0 δ ɛ ∗) with unramified δ;
(Reg) δ mod mA ≠ ɛ mod mA.
Condition AI is equivalent to the absolute irreducibility of
ad(φ̄) over G. We write V(δ) ⊂ V
for the δ-eigen subspace, and for each A-submodule
X of V(ad(φ)), let X* = X
⊗A
A* for the Pontryagin dual A* =
Hom𝒪(A,
Q
p/Z
p) of
A. We put V
+ = {ξ ∈ V(ad(φ))
⊂ End(V) | ξ(V(δ)) = 0}. Then we define
the Selmer group for ad(φ), as a special case of Greenberg’s
definition (ref. 1; see also ref. 2):
for the inertia subgroup I of D. This is a
generalization of the class group; for example, taking a quadratic
character χ of G,
is the χ-part of the p-class group of the quadratic
extension F fixed by ker(χ). Thus if A =
𝒪 and if L(1, ad(φ)) ≠ 0, a naive guess is that
Sel(ad(φ)) is finite and that its order is the p-part of
L(1, ad(φ)) up to a transcendental factor. The finiteness
is first shown by Flach (3) and then by Wiles (4). We discuss later
some good cases where this guess works well. We generalize the above
definition to a tensor product ad(φ) ⊗ ɛ with a character
ɛ : G → B
× for a complete noetherian
𝒪-algebra B, replacing A by
A⊗̂𝒪
B and
V
+ by V
+(ad(φ) ⊗ ɛ)
= V
+⊗̂ B:
which is a discrete module over
A⊗̂𝒪
B.
2. Elliptic Curves over Q
For simplicity, we suppose that φ0 is the Galois
representation on
H
1(E
/Q̄,
Zp) for a modular elliptic curve
E
/Q inside the Jacobian
J = J
0(p) of the modular curve
X
0(p). Thus E has multiplicative
reduction at p and has good reduction outside p.
Taking the dual of the inclusion E ⊂ J, we have a quotient
map π : J → E. Then J = E + A for
A = ker(π), and E ∩ A is a finite group of
square order. For a Néron differential ω on the Néron
model E
/Z, by a result of Mazur (5)
corollary 4.1, we may assume that π*ω =
2e(2πif
0(z)dz) for a primitive form
f
0 ∈ S
2(Γ0(p)) and
e ∈ Z. Choosing a base c
± of
±-eigenspace of H
1(E(C), Z)
under complex conjugation, we define Ω± by
∫c± ω after normalizing
c
± as described below. The following formula
was proven 15 years ago in ref. 6 (see also ref. 7):
where C = 2a+2e
p(p − 1) for
2a = [H
1(E(C), Z)
: Z
c
+ ⊕ Zc
−].
We define the canonical period U(f
0) of
f
0 by
C
−1(2πi)Ω+Ω−. In
ref. 6, to get formula IN1, we used the period determinant
for a Z-base {c
1,
c
2} of H
1(E(C),
Z) in place of Ω+Ω− (see ref.
6, formula 6.20b). Writing ω± =
(ω ± ω̄)/2, we see
∫c± ω = ±
∫c± ω±, and thus
Ω+ ∈ R and
Ω−
∈ R. Replacing c
± by their
negative if necessary, we may assume that Ω+ > 0
and √
Ω− > 0. Under this
normalization, formula IN1 is correct. Then by definition,
2a
u =
Ω+Ω−, and we can deduce
formula IN1 from ref. 6, theorem 6.1, by just remarking that
L*f0/L
f0 ≅ E
∩ A under the notation of the theorem quoted.
Actually, a formula similar to formula IN1 is proven in ref. 6 for the Galois representation attached to any holomorphic primitive form of weight ≥2. The formula is generalized later to cohomological cusp forms on GL(2) over imaginary quadratic fields in ref. 8.
Let H be the subalgebra of End(J) generated by
Hecke operators T(n). Then π induces the projection
λ : H → Z ⊂ End(E) and another
projection λ′ : H → End(A). Then we define
two finite modules:
It is proven in ref. 7 (equation 5.8b) that
as H modules. Note that Spec(C
0)
is the scheme theoretic intersection of Spec(Im(λ)) and
Spec(Im(λ′)) in Spec(H). Thus we get
Recently, Taylor and Wiles (4, 9) have shown that
|C
0,p| = |C
1,p|, and
Wiles (4) has shown
This formula is a key to Wiles’ proof of Fermat’s last theorem.
The fact that Sel(ad(φ0)) has a natural map into
C1,p was first discovered by Mazur through his
deformation theory of Galois representations (10). The above formula is
conjectured in ref. 11 after proving the surjectivity of the map
besides other relevant results.
Anyway, under the various assumptions on p that we made, we
finally get a formula for the order of Sel(ad(φ0)):
3. One-Variable Case
The cusp form f 0 ∈ S 2(Γ0(p)) can be lifted to a p-adic family of p-ordinary common eigenforms f k = Σn=1 ∞ a(n; f k)q n ∈ S k+2(Γ0(p), ω−k) (k ≥ 0) for the Teichmüller character ω (cf. ref. 12, chapter 7, theorem 7.3.7). For this, we need to fix an embedding i p : Q̄ ↪ Q̄ p. Then “p-ordinarity” of f k implies that the pth coefficient of f k in its q-expansion satisfies |a(p; f k)|p = 1. Note that, by the multiplicative reduction hypothesis, a(p; f 0) = ±1. This family yields a Galois representation φ : G → GL 2 (Λ) for a finite flat 𝒪[[T]]-algebra Λ (ref. 12, section 7.5). For simplicity, we assume Λ = 𝒪[[T]]. Then writing as φk the specialization of φ via 1 + T ↦ u k for u = 1 + p, φk is the Galois representation of the cusp form f k. Then the Pontryagin dual Sel*(ad(φ)) of Sel(ad(φ)) is shown by Wiles and Taylor to be a torsion 𝒪[[T]]-module of finite type, and its characteristic power series is given by the characteristic power series of the Λ-adic congruence module C 0,Λ.
Before giving the definition of C
0,Λ, we note
that we have taken cohomological formulation of Galois representations.
In this paper, we characterize Galois representations by the
characteristic polynomial of geometric Frobenii
Frobq at primes q ≠ p. For
example, φk is characterized by
This normalization is dual to the one taken in ref. 4, but it is
all right for our purpose because ad(φk) =
ad(φk
∨).
To define C0,Λ, we need to introduce the space
S
Λ of p-ordinary Λ-adic cusp
forms. For that, we consider the subspace
S
k+2(Γ0(p), ω−k;
Q̄) of S
k+2(Γ0(p),
ω−k) made of cusp forms f with
a(n; f) ∈ Q̄ for all n. We
consider the Q̄
p-span
S
k+2(Γ0(p), ω−k;
Q̄
p) of
S
k+2(Γ0(p), ω−k;
Q̄) in Q̄
p[[q]]
via q-expansion. We write
S
k+2
ord(Γ0(p),
ω−k; Q̄
p) for the
subspace of S
k+2(Γ0(p),
ω−k; Q̄
p) spanned by
all p-ordinary eigenforms. An element ℱ ∈
S
Λ is a formal q-expansion
Σn=1
∞
a
n(T)q
n ∈
Λ[[q]] such that the specialization
ℱk via 1 + T ↦
u
k is the q-expansion of an element in
S
k+2
ord(Γ0(p),
ω−k; Q̄
p) for all
k ≥ 0. Then S
Λ is free of
finite rank over Λ on which Hecke operators T(n) naturally
act (ref. 12, section 7.3). Hereafter we write ℱ for the unique
Λ-adic form such that ℱk = f
k
for all k ≥ 0. Let H be the Λ-subalgebra
of EndΛ(S
Λ) generated by
T(n) for all n, and define a Λ-algebra
homomorphism λ : H → Λ by
ℱ|h = λ(h)ℱ. We also have another λ′ of
H into EndΛ(ker(λ)) given by multiplication
by h ∈ H on ker(λ). Then we define
Then it is easy to see that C0,Λ ≅
Λ/(η(T)) for an element η(T) ∈ Λ. We can
deduce from the result of Wiles and Taylor in ref. 4 (theorem 3.3) and
ref. 9 that
Here the characteristic ideal charA(M) for
a torsion A-module of finite type M over a normal
noetherian ring A is given by the product of prime divisors
P in A with exponent given by
lengthAP
M
P of the localization
M
P at P. Note that, as shown in ref.
7 (theorem 0.1), for a canonical period U(f
k)
associated to f
k,
up to p-adic units. This formula is not completely
satisfactory, because the p-adic L-function
η(T) is determined only up to units in Λ. For Λ-adic
forms of CM type, we can choose a suitable Katz p-adic
L-function in place of η (11, 13–15). In general, we can
only make a conjecture on the existence of a canonical
p-adic L-function
L
p(ad(φ)) with precise interpolation property
(16), which generates charΛ(Sel*(ad(φ))) =
(η(T)) after extending scalar to the p-adic integer
ring 𝒪Ω of the p-adic completion Ω of
Q̄
p.
4. Two-Variable Case
Now we look at the universal character ν : G → 𝒪[[S]]× deforming the identity character of G. As already said, our formulation is cohomological, and hence ν(Frobq) = qω(q)−1 for geometric Frobenius Frobq. Writing Q ∞ for the cyclotomic Z p-extension of Q and Γ = Gal(Q ∞/Q), the tautological character: Γ ↪ 𝒪[[Γ]] induces the above ν for S = γ − 1 for a generator γ of Γ. Then we consider Sel*(ad(φ) ⊗ ν−1), which is a module over 𝒪[[T, S]] of finite type (1). Classically, the Selmer group involving the cyclotomic variable S is defined in terms of cohomology groups over the cyclotomic Z p tower Q ∞. As shown by Greenberg (ref. 1, proposition 3.2; see also ref. 2, section 3.1), our Selmer group Sel(ad(φ) ⊗ ν−1) over Q is isomorphic to the classical one over Q ∞. Recently, we have proven a control theorem for Sel(ad(φ) ⊗ ν−1) giving the following theorem.
Theorem 1. The module Sel*(ad(φ) ⊗ ν−1) is a torsion 𝒪[[T, S]]-module of finite type. Moreover, the characteristic power series of Sel*(ad(φ) ⊗ ν−1) is of the form SΨ(T, S) in 𝒪[[T, S]] and Ψ(T, 0)|η(T) da/dT (T) in 𝒪[[T]], where a(T) is the eigen value of T(p) for ℱ lifting f 0 (2).
In early 1980s, we constructed (17) a two-variable p-adic
L-function L(T, S) in
η(T)−1
S𝒪[[T, S]] such that for even
m with −k ≤ m ≤ 0,
for a factor E like an Euler p-factor and a
simple constant ∗. This L-function ηL again
has ambiguity by units in Λ, although L(T, S) is uniquely
determined. In ref. 16, the existence of a canonical p-adic
L-functions L
p(ad(φ) ⊗
ν−1) in 𝒪[[T, S]] [for ad(φ) ⊗
ν−1] with precise interpolation property is
conjectured. In particular, we should have an equality:
Anyway, the denominator and the numerator are not yet known to
exist in general in spite of the known existence of the ratio
L(T, S). Because of this, we need to use η(T)
as a replacement of L
p(ad(φ)).
Theorem 2. (R. Greenberg and J. Tilouine).
Write ηL(T, S) = SΦ(T, S). We have
We know that da/dT (0) ≠ 0 by the theorem of St.
Etienne (18) due to four people at St. Etienne in France. Thus if one
can prove the divisibility Φ|Ψ in 𝒪[[T, S]], the
following conjecture follows.
Main Conjecture. We have Φ = Ψ up to a unit in 𝒪[[T, S]].
Actually this conjecture is close to being proven, assuming the following ordinarity conjecture on the local structure of Weissauer’s Galois representations, as discussed in the lectures of E. Urban at the Mehta Research Institute (Allahabad, India). Let us explain Urban’s strategy. First of all, there is a theory of (nearly) p-ordinary 𝒪[[T, S]]-adic forms on GSp(4), developed mainly by Tilouine and Urban (19, 20). A cohomological Hecke eigenform f on GSp(4)/Q is called nearly p-ordinary if its eigenvalues for two standard Hecke operators at p are p-adic units under the fixed embedding Q̄ into Q̄ p. Here the word cohomological means that the system of Hecke eigenvalues for f appears in the middle cohomology H 3 with coefficients in a polynomial representation L of a Siegel modular variety for GL(4)/Q. In other words, f belongs to a discrete series representation whose Harish–Chandra parameter is the sum of the highest weight of L and the half sum of positive roots. For each cohomological eigenform f, Weissauer has attached a p-adic modular Galois representation ρf into GL(4) with characteristic polynomials of Frobenii outside p given by the Hecke polynomial (see ref. 21). Here is the ordinarity conjecture for the Galois representation.
Ordinarity Conjecture. Assume that f is nearly p-ordinary. Then the image of the decomposition group at p of ρf is in a Borel subgroup of GSp(4).
Weissauer’s construction gave a compatible system of l-adic representations attached to f, and ρf is one of its members. When ρf is crystalline, we have two characteristic polynomials at p. One is that of the crystalline Frobenius L cris(X), and the other, L et(X), is that of the Frobenius at p of a non-p-adic member of the compatible system. The p-ordinarity conjecture follows in this case if one can prove L cris(X) = L et(X), which is a standard conjecture and is known to be true at least for constant sheaves (that is, so to speak, the weight 0 case).
It is enough to prove the ordinarity conjecture for crystalline ρf for the following reason. We can glue Weissauer’s Galois representations by means of Taylor’s pseudorepresentations and attach to each 𝒪[[T, S]]-adic eigen cusp form 𝒢 a Galois representation ρ𝒢 : G → GL 4(F 𝕀) for the field of fractions F 𝕀 of a finite extension 𝕀 of 𝒪[[T, S]]. Thus at densely populated points on Spec(𝕀), ρ𝒢 specializes into Weissauer’s Galois representations. Furthermore, ρ𝒢 has densely populated specializations on Spec(𝕀) which are crystalline at p. Thus if one can prove the p-ordinarity conjecture for crystalline specializations, the image under ρ𝒢 of each decomposition group at p is in a Borel subgroup in GSp(4), and hence the ordinarity conjecture for all specializations follows.
We now come back to the strategy for a proof of the Main Conjecture. We
look at the Klingen-style 𝒪[[T, S]]-adic Eisenstein
series ℰ induced from the Λ-adic form ℱ. The Galois
representation ρℰ attached to ℰ has values in the
standard maximal parabolic subgroup, that is, it is of the following
form:
The constant term of ℰ at the nonstandard parabolic subgroup ℙ
is almost equal to ℱ times η(T)L(T, S). Here we mean by
nonstandard the parabolic subgroup given by
Thus the Eisenstein ideal Eis giving congruence between
ℰ and another 𝒪[[T, S]]-adic cusp form 𝒢 should be
generated by η(T)L(T, S). In particular, under the
p-ordinarity conjecture, Urban has shown for such Eisenstein
primes P dividing Φ(T, S), if 𝒢 ≡ ℰ mod
P for a cusp form 𝒢, ρ𝒢 has values in
GSp(4) and is irreducible. It was a nontrivial task to prove
this because the representation is residually reducible. We also note
that, to prove this, we again need the result of Wiles (4) proving the
conjecture in ref. 11. The fact that ρ𝒢 has values in
GSp(4) is essential in the proof because it guarantees that
the adjoint action of ρℰ on the unipotent radical of
the standard maximal parabolic subgroup is actually isomorphic to
ad(φ) ⊗ ν−1. The extension of νdet(φ)
⊗tφ−1 mod P by φ
mod P induced from ρ𝒢 can be made nonsplit
because of the irreducibility of ρ𝒢. This nontrivial
extension gives rise to a nontrivial cocycle in Sel(ad(φ) ⊗
ν−1) under the Ordinarity Conjecture. This is a
GSp(4) version of an argument of Wiles in (22) applied to
GL(2). Since it is true for each height one prime
P dividing Eis, we conclude that the Eisenstein
ideal Eis of ℰ divides Ψ, assuming the Ordinarity
Conjecture. To establish the divisibility Φ|Eis, in
other words, to establish the congruence 𝒢 ≡ ℰ mod
P
e for P
e|Φ, we need
to have precise information on ℰ (not just its existence), for
example, its Fourier coefficients, its Whittaker model, and so on.
Acknowledgments
Although each author had already worked out some of their share of the work presented here before they visited the Mehta Research Institute of Mathematics and Mathematical Physics (MRI, Allahabad, India) in January and February, 1996, the coordination in bringing all the efforts into a general framework was done while they were visiting Allahabad. We are grateful to Prof. Dipendra Prasad at MRI for giving us the opportunity of working together and to the audience at MRI for patiently listening to our lectures on the subject whose formulation was not yet definite. H.H. acknowledges the support from the National Science Foundation during the preparation of the paper.
Footnotes
-
↵ † To whom reprint requests should be addressed.
-
This paper was presented at a colloquium entitled “Elliptic Curves and Modular Forms,” organized by Barry Mazur and Karl Rubin, held March 15–17, 1996, at the National Academy of Sciences in Washington, DC.
- Copyright © 1997, The National Academy of Sciences of the USA





