After a short break while traveling, I resume the discussion of my paper with Fouvry and Michel, continuing from here. But first, I’m happy to mention two proofs of the counting identities mentioned in my introductory post. One, which D. Zywina has sent me, is a nice proof based on explicit computations with modular forms and functions. The other is in fact just an application of a more general result of Deligne and Flicker on local systems on the projective line minus four points (see Section 7 of this preprint).
Now, if memory serves, at the end of the last post, we saw that analytic techniques reduce the study of our sums
to the study of correlation sums
where is the normalized Fourier transform modulo .
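For readers who like to experiment, here is a minimal numerical sketch of a correlation sum. The formulas are not reproduced in the text above, so everything in the code is an assumption made for illustration: the prime is called `p`, the weight is a list `K` indexed by residues modulo `p`, the normalization is `K_hat(t) = p^(-1/2) * sum_x K(x) e(tx/p)`, and the correlation sum is taken to be the sum of `K_hat(gamma.z) * conj(K_hat(z))` over those `z` whose image under the fractional linear transformation `gamma` is finite. The sample weight is also just a convenient choice.

```python
import cmath

def e_p(x, p):
    """Additive character x -> exp(2*pi*i*x/p)."""
    return cmath.exp(2j * cmath.pi * (x % p) / p)

def fourier_transform(K, p):
    """Assumed normalization: K_hat[t] = p^(-1/2) * sum_x K[x] * e(t*x/p)."""
    return [sum(K[x] * e_p(t * x, p) for x in range(p)) / p ** 0.5
            for t in range(p)]

def mobius(gamma, z, p):
    """Apply gamma = (a, b, c, d) to z in F_p; None means the image is infinity."""
    a, b, c, d = gamma
    den = (c * z + d) % p
    if den == 0:
        return None
    return (a * z + b) * pow(den, -1, p) % p

def correlation_sum(K, gamma, p):
    """Sum of K_hat(gamma.z) * conj(K_hat(z)) over z with finite image (assumed form)."""
    K_hat = fourier_transform(K, p)
    total = 0j
    for z in range(p):
        w = mobius(gamma, z, p)
        if w is not None:
            total += K_hat[w] * K_hat[z].conjugate()
    return total

p = 13  # small illustrative prime
# Illustrative weight: K(x) = e(x^{-1}/p) for x != 0, and K(0) = 0.
K = [0] + [e_p(pow(x, -1, p), p) for x in range(1, p)]

# For the identity matrix, Plancherel gives sum |K(x)|^2 = p - 1 (up to rounding).
print(abs(correlation_sum(K, (1, 0, 0, 1), p)))
# A non-trivial matrix, z -> 1/z, for comparison with sqrt(p).
print(abs(correlation_sum(K, (0, 1, 1, 0), p)), p ** 0.5)
```

The identity matrix always gives a large sum by Plancherel, so it is a useful sanity check on the normalization; the interesting question is how many other matrices behave that way.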
For these, we need to show that “most” are small in order to conclude. Fixing a parameter , let us therefore define to be the set of matrices in such that
The discussion in the previous post shows that if, for some , all matrices in are upper-triangular, we get
for any . But we can deal with some more exceptions. More precisely, we show that this estimate is still valid, with an implied constant depending only on , if the matrices in are all either
1. parabolic (they have a single fixed point in ), or
2. upper-triangular, or
3. elements that either fix or permute two distinct points taken from a finite list, say containing at most pairs .
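The three types in this list can be told apart by looking at fixed points of the fractional linear action on the projective line. Here is a small sketch, assuming the matrices are 2×2 invertible matrices modulo a prime, written as tuples `(a, b, c, d)` and acting by `z -> (az + b)/(cz + d)`; the names `fixed_points` and `p`, and the string `"inf"` for the point at infinity, are conventions of this example only.

```python
def fixed_points(gamma, p):
    """Fixed points of gamma = (a, b, c, d) acting on P^1(F_p).

    Points of F_p are the integers 0..p-1; the point at infinity is "inf".
    """
    a, b, c, d = (x % p for x in gamma)
    assert (a * d - b * c) % p != 0, "gamma must be invertible mod p"
    points = []
    for z in range(p):
        den = (c * z + d) % p
        # z is fixed when its image is finite and (a*z + b) = z * (c*z + d).
        if den != 0 and (a * z + b) % p == (z * den) % p:
            points.append(z)
    # Infinity is sent to a/c, so it is fixed exactly when c = 0.
    if c == 0:
        points.append("inf")
    return points

p = 7
print(fixed_points((1, 1, 0, 1), p))   # translation z -> z + 1: one fixed point (parabolic)
print(fixed_points((2, 0, 0, 1), p))   # z -> 2z: upper-triangular, fixes 0 and infinity
print(fixed_points((0, 1, 1, 0), p))   # z -> 1/z: fixes the two points with z^2 = 1
print(fixed_points((0, -1, 1, 0), p))  # z -> -1/z: no fixed point defined over F_7
```

A matrix is upper-triangular exactly when it fixes infinity, and a non-trivial one is parabolic when the returned list has a single element.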
Compared with the previous discussion, the subtlety here is that it can indeed happen that such matrices appear in the transformations we apply to , and proving that their contributions remain under control involves some rather fun analysis.
This summarizes, in a rather hurried fashion, the first part of our paper. Logically, we obtain statements which are self-contained, but which are only applicable directly in a few cases (the case is an excellent example where this can be done). To go further, we need to use algebro-geometric tools.
And here comes a dilemma well-known to anyone who has had to present research involving two relatively distant areas of mathematics, so that specialists of one may not know the other. I can give a concise definition of the class of weights that we consider, and it will be immediately familiar and natural to algebraic geometers — but not to most analytic number theorists.
So, instead, I will just say (very fast) that these weights, which we call “irreducible trace weights”, are trace functions of geometrically irreducible -adic middle-extension Fourier sheaves pointwise pure of weight on . Then I will defer to a later post a more leisurely run-through of this definition, together with more examples of weights and their formalisms, and with some further analytic properties which are of independent interest.
The reason this class of weights “works” can, however, be quickly summarized by a rather miraculous property, which is essentially a consequence of the Riemann Hypothesis: assuming has small complexity (in a precise sense, saying basically that it comes from a sheaf with small rank and small ramification), the correlation sum is either , where the implied constant depends only on the numerical invariant measuring the complexity of the weight, or we have an equality
for all and some fixed complex number of modulus . From this second part, we see that, for a suitable , the set discussed above is contained in the group of all (viewed in ) for which there exists (of modulus ) such that the identity above holds. This provides us with enough structure on the set of “bad” matrices (with large correlation sums), from which the bound
can fairly simply be deduced for irreducible trace weights using the automorphic part of our paper.
Indeed, we distinguish two cases, depending on the structure of the subgroup :
- If has order coprime to , we use the classification of such subgroups, and see that is either contained in the normalizer of some maximal torus, i.e., in the stabilizer of a two-point set (and hence these weights fall under the third case (3) of allowed “bad” matrices), or otherwise is of order at most , and consists only of semisimple elements (which allows us again to apply (3), with possibly more than one pair involved, but fewer than );
- If , we find some element in which is of order , hence unipotent. Thus acts transitively on (minus at most a single point); the formula defining the subgroup above implies easily that is then basically of a very restricted type, namely
for some fixed matrix and fixed complex number and integer . But this weight comes from a specific (Fourier transform of an) Artin-Schreier sheaf (it might not be the one defining originally, but a fortiori, we can assume it is!). For this sheaf, a rather simple analysis shows that is a unipotent group isomorphic to (unless or , which are exceptional cases). So the bad matrices are either trivial or parabolic, and we can appeal to case (1) to handle these weights…
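The unipotent case can be illustrated concretely. The sketch below assumes matrices are taken modulo a prime `p` and considered up to scalars, and that the relevant unipotent elements are conjugates of the translation `z -> z + 1`; the helper names (`projective_order`, `act`, `orbit`) are hypothetical and exist only for this example. It checks that such an element has order `p` and that its orbit through a non-fixed point has `p` elements, i.e. everything except the single fixed point.

```python
def mat_mul(g, h, p):
    """Product of two 2x2 matrices (a, b, c, d) modulo p."""
    a, b, c, d = g
    e, f, r, s = h
    return ((a * e + b * r) % p, (a * f + b * s) % p,
            (c * e + d * r) % p, (c * f + d * s) % p)

def is_scalar(g, p):
    a, b, c, d = g
    return b % p == 0 and c % p == 0 and (a - d) % p == 0

def projective_order(g, p):
    """Smallest n >= 1 such that g^n is a scalar matrix modulo p (g invertible)."""
    n, power = 1, tuple(x % p for x in g)
    while not is_scalar(power, p):
        power = mat_mul(power, g, p)
        n += 1
    return n

def act(g, z, p):
    """Action on P^1(F_p); the point at infinity is the string "inf"."""
    a, b, c, d = g
    if z == "inf":
        return "inf" if c % p == 0 else a * pow(c, -1, p) % p
    den = (c * z + d) % p
    return "inf" if den == 0 else (a * z + b) * pow(den, -1, p) % p

def orbit(g, z, p):
    """Orbit of z under the cyclic group generated by g."""
    seen, w = [z], act(g, z, p)
    while w != z:
        seen.append(w)
        w = act(g, w, p)
    return seen

p = 11
translation = (1, 1, 0, 1)  # z -> z + 1, fixes only infinity
conjugate = (1, 0, 1, 1)    # z -> z/(z + 1), fixes only 0
print(projective_order(translation, p), len(orbit(translation, 0, p)))
print(projective_order(conjugate, p), len(orbit(conjugate, "inf", p)))
```

Both lines print the pair `p`, `p` (here 11 and 11): the element has order `p`, and its orbit is the whole projective line minus its unique fixed point.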
I will conclude for today with another fact we noticed only very recently: in the special case it turns out that the correlation sums had already appeared in a paper of Friedlander and Iwaniec on incomplete Kloosterman sums, when is lower-triangular (and non-diagonal). In the Appendix to this paper, Birch and Bombieri give two proofs of the estimate (for lower-triangular matrices): one is geometric (based on counting points on surfaces over finite fields), but the second one has amusing links to our arguments, with a camouflaged sighting of the group structure of the group of lower-triangular matrices and of the fact that is contained in a subgroup of … (Interestingly, there is no trace of modular forms in this paper of Friedlander and Iwaniec, so the coincidence is rather unexpected.)