Conductors of one-variable transforms of trace functions

In one of the recent posts by T. Tao on the progress of the Polymath8 project, the question arose of whether such functions as
\varphi(x)=\sum_{y\in\mathbf{F}_p} e\Bigl(\frac{f(x,y)}{p}\Bigr)
defined for x\in\mathbf{F}_p and a rational function f\in\mathbf{F}_p(X,Y), are trace functions, and more importantly, what is their conductor (see this, and the following, comments). In particular, if f is obtained by reduction modulo primes of a fixed rational function in \mathbf{Q}(X,Y), one could expect that the answer is “Yes”, with a bound for the conductor independent of p.

This is in fact a question that É. Fouvry, Ph. Michel and I had considered in special cases (or variants) in some of our papers. In fact, it is natural to consider more generally the linear map sending a function K\,:\, \mathbf{F}_p\rightarrow \mathbf{C} to the function
\varphi(x)= T_{f}(K)(x)=\sum_{y\in\mathbf{F}_p} K(y) e\Bigl(\frac{f(x,y)}{p}\Bigr),
(which is an analogue of an integral operator) and to ask whether this linear map sends trace functions to trace functions, and if yes, if the conductor of \varphi is bounded in terms of the conductor of K and the degree of the numerator and denominator of f.

The most important case, which is crucial in all our series of papers, is the Fourier transform, which corresponds simply to f(X,Y)=XY. We proved the desired property (which we view, rather naturally I think, as a form of “continuity” of the Fourier transform, in an algebraic sense) in that case using Deligne’s definition and Laumon’s study of the sheaf-theoretic Fourier transform. Most importantly, in order to estimate the conductor of the Fourier transform of a trace function, we used Laumon’s theory of the local Fourier transform, which is rather deep.

It is by no means clear (and this is a rather interesting question of algebraic geometry!) that such a local theory exists, and leads to the continuity property, for arbitrary “kernels” e(f(x,y)/p). However, we figured out a way to prove the conductor bound that bypasses these local results. It applies to the Fourier transform, and although it is then much less precise than what is obtained from Laumon’s theory, it gives a (possibly) more accessible proof of the continuity property.

We have just put here our preprint with this result. It is not yet submitted to arXiv because we are considering various possibilities for either extensions of applications, but the proof of the main result is complete.

The paper was rather interesting to write. On the one hand, it turns out that it is not needed for the original question from Polymath8: Philippe found an elementary argument that reduces the specific example of the problem which reduces it, essentially, to a special case of the Fourier transform (this is written down in the Deligne section of the Polymath8 paper). On the other hand, although we had thought a bit about this question beyond the Fourier transform, we had not made progress. The reason, in retrospect, is that in order to treat the general transform
(as defined above), we begin by treating the special case
T_f(1)(x)=\sum_{y\in\mathbf{F}_p} e\Bigl(\frac{f(x,y)}{p}\Bigr),
and without the motivation from the Polymath8 project, we had not thought of this first step.

The reason that things work this way is, as all other main ideas in this paper, very easy to explain by writing down and manipulating sums, and assuming that those behave always as the best Riemann Hypothesis over finite fields suggest. But the actual arguments are purely algebraico-geometric, and we end up using quite a bit of the general formalism of étale cohomology, but not the Riemann Hypothesis (which is morally as things should be.)

I will give here the informal sketch: first of all, the function
\varphi(x)= T_{f}(K)(x)=\sum_{y\in\mathbf{F}_p} K(y) e\Bigl(\frac{f(x,y)}{p}\Bigr),
is indeed a trace function if K is one, in almost all cases: precisely, the existence of higher-direct image sheaves with compact support, the proper base change theorem, and the Grothendieck trace formula, show that
where t_i is the trace function of the sheaf
R^ip_{1,!} (p_2^*\mathcal{F}\otimes \mathcal{L}),
where p_1,\ p_2 are the two projections (x,y)\mapsto x and (x,y)\mapsto y, the sheaf \mathcal{F} is the one with trace function K, and the sheaf \mathcal{L} is the Artin-Schreier sheaf on the plane with trace function
for x,\ y\in\mathbf{F}_p. In many cases, both t_0 and t_2 vanish, and then \varphi is (minus) the trace function of the sheaf
\mathcal{G}=R^1p_{1,!} (p_2^*\mathcal{F}\otimes \mathcal{L}).

This essentially answers the first question in full generality: the “transform” K\mapsto T_f(K) maps trace functions to trace functions.

Now consider the conductor of the sheaf \mathcal{G} above. We defined it as the sum of three terms. Two are relatively accessible: they are the (generic) rank of the sheaf, and the number of singularities. We can basically determine these if we know the dimension of all fibers: the maximum dimension gives an upper bound for the rank, and a result of Deligne says (roughly) that the singularities are the points where the fiber has smaller than maximal dimension. So let us assume that we can bound these quantities for the transform sheaf \mathcal{G}. Then there only remains to estimate the third term, which is the sum of the Swan conductors at the singularities. This is rather delicate, at least for people for whom — and this applies to us… — the Swan conductor remains a rather mysterious and subtle data.

The first idea is that, assuming the other two pieces of the conductor are under control, the sum of the Swan conductors is bounded by a global invariant that may be accessible in applications. Namely, if a sheaf \mathcal{M} is lisse on a dense open subset U of the affine line then the Euler-Poincaré characteristic formula (of Néron, Ogg, Shafarevitch) easily proves that
\sum_x \mathrm{swan}_x(\mathcal{M})\ll \text{(rank of }\mathcal{M}\text{)}+ \text{(nb. of sing.)} + \dim H^1_c(\bar{U},\mathcal{M}).
So we have to deal with the dimension of the cohomology group H^1_c(\bar{U},\mathcal{M}). The point is that, in good circumstances, and especially if \mathcal{M} is of weight 0, we can expect that
\dim H^1_c(\bar{U},\mathcal{M})=\limsup p^{-n/2} |S_n|,
where S_n is the sum of the trace function of \mathcal{M} over the points of U(\mathbf{F}_{p^n}). Estimating this becomes a problem of analytic number theory, and we may hope to succeed.

For instance, if we apply this principle to
with \mathcal{L} the Artin-Schreier sheaf associated to a rational function f as before, the sum S_n is simply
p^{-n/2}S_n=\frac{1}{p^{n/2}}\sum_{x\in U(\mathbf{F}_{p^n})}\sum_{y\in\mathcal{F}_{p^n}}e\Bigl(\frac{\mathrm{Tr}_nf(x,y)}{p}\Bigr),
were \mathrm{Tr}_n is the trace from \mathbf{F}_{p^n} to \mathbf{F}_p.

In good circumstances, we know square-root cancellation for this two-variable character sum, and we obtain a bound for the limsup of p^{-n/2}S_n, which depends only on the degree of the numerator and denominator of f, using Bombieri’s bounds for sums of Betti numbers for such exponential sums (or the generalizations of Adolphson-Sperber, or those of Katz.)

This deals (optimistically) with the transform with kernel \mathcal{L} when the input sheaf is the trivial sheaf, which we note is the case in the Polymath8 case. Now for the second idea: assume we consider
now, and try to use the same principle. With K denoting the trace function of \mathcal{F}, the sum S_n now satisfies
p^{-n/2}S_n=\frac{1}{p^{n/2}}\sum_{x\in U(\mathbf{F}_{p^n})}\sum_{y\in \mathbf{F}_{p^n}} K(y)e\Bigl(\frac{\mathrm{Tr}_nf(x,y)}{p}\Bigr).

In impeccable style, we exchange the two sums of course. We get
 p^{-n/2}S_n=\frac{1}{p^{n/2}}\sum_{y\in \mathbf{F}_{p^n}} K(y) \sum_{x\in U(\mathbf{F}_{p^n})}e\Bigl(\frac{\mathrm{Tr}_nf(x,y)}{p}\Bigr)=\frac{1}{p^{n/2}}\sum_{y\in \mathbf{F}_{p^n}} K(y)L(y)
L(y)=\sum_{x\in U(\mathbf{F}_{p^n})}e\Bigl(\frac{\mathrm{Tr}_nf(x,y)}{p}\Bigr).
But, by the first step, applied to f^*(X,Y)=f(Y,X) instead of f(X,Y), the function L is a trace function with conductor bounded in terms of the degrees of f^*, or equivalently of f. Thus p^{-n/2}S_n is the inner-product, over \mathbf{F}_{p^n}, of the trace functions of two sheaves with bounded conductor, and we can expect both to have weight 0. We can then expect quasi-orthogonality from the Riemann Hypothesis, and a resulting bound for the limsup that depends only on the conductors of these two sheaves, i.e., on the conductor of \mathcal{F} (for K) and on the degrees of the numerator of denominator of f (for L). This is the desired conclusion.

This sketch explains why we can prove the results in our paper. In many cases, it is certainly a valid reasoning, but it is not easy to make it rigorous in great generality. The basic problems are that it depends on the sums S_n having square-root cancellation (which for the transform of the trivial sheaf is a non-trivial assumption), and also on S_n detecting all of the cohomology space, and not just the part of weight 1: by Deligne’s Riemann Hypothesis, the eigenvalues of Frobenius on H^1_c(\bar{U},\mathcal{G}) are of weight \leq 1 if \mathcal{G} has weight 0, but the limsup only gives the number of eigenvalues of weight 1, and having too many smaller eigenvalues would create problem.

We work around these possible difficulties by dropping the diophantine motivation, and going straight at the dimension of H^1_c(\bar{U},\mathcal{G}). To do this, we need algebraic analogues of the two fundamental analytic steps we used:

(1) expressing the sum S_n for the trivial sheaf as a two-variable character sum;

(2) exchanging the order of the two sums when inserting a general input sheaf \mathcal{F}.

Both of these are replaced by (very elementary) arguments with Leray spectral sequences. This is a relatively well-known idea (it is part of the “dictionary” in Deligne’s survey on Sommes trigonométriques in SGA 4 ½ and quite a few concrete examples are found in papers and books of Katz), but it is the first time we use it ourselves. I will survey and explain this in a later post, since it seems that a good concrete example of the use of spectral sequences in analytic number theory might be a useful thing to have somewhere…

The reader who opens the PDF file of our preprint might be surprised to see that the paper in more than thirty pages long, in comparison with the rather simple-looking discussion above. The length is justified partly by the two motivating discussions we have included (the diophantine argument with S_n, and a self-contained algebraic treatment of the important case of the Fourier transform). But it also turns out that taking care of the “easy” parts of the conductor requires somewhat lengthy elementary arguments with rational functions. Most importantly maybe, we must take into account the fact that, in contrast with our previous works, we now have to handle general constructible \ell-adic sheaves, and not only middle-extension sheaves: there is no reason for our transformed sheaves to be so-well behaved in general. This requires adding a further component to the conductor (roughly, the support and dimension of the fibers of the “punctual part” of the sheaf, e.g., the conductor of a sheaf supported at 0 with fiber of dimension n\geq 1 must increase with n), and we also need to control it before applying the previous ideas. We also prove, both as a useful too and as a by-product, the analogue of the Bombieri bounds for a general input sheaf \mathcal{F}: the Betti numbers
\dim H^i_c(\mathbf{A}^2\times\bar{\mathbf{F}}_p,p_2^*\mathcal{F}\otimes \mathcal{L})
are bounded in terms of the conductor of \mathcal{F} and the degree of the numerator and denominator of f.

Published by


I am a professor of mathematics at ETH Zürich since 2008.