1 Introduction
Throughout this paper, $f$ will denote a multiplicative function. We let
and assume that all three Dirichlet series are absolutely convergent in Re $(s)>1$ . Here the middle relation above should be viewed as defining the numbers $\unicode[STIX]{x1D6EC}_{f}(n)$ . We will restrict attention to the class ${\mathcal{C}}(\unicode[STIX]{x1D705})$ of multiplicative functions $f$ for which
where $\unicode[STIX]{x1D705}$ is some fixed positive constant. If $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ then it follows that $|f(n)|\leqslant d_{\unicode[STIX]{x1D705}}(n)$ for all $n$ , where $d_{\unicode[STIX]{x1D705}}$ denotes the $\unicode[STIX]{x1D705}$ -divisor function (the coefficient of $n^{-s}$ in $\unicode[STIX]{x1D701}(s)^{\unicode[STIX]{x1D705}}$ ). The class ${\mathcal{C}}(\unicode[STIX]{x1D705})$ includes many of the most useful multiplicative functions. For example, when $\unicode[STIX]{x1D705}\leqslant 1$ , the class ${\mathcal{C}}(\unicode[STIX]{x1D705})$ includes all completely multiplicative functions with $|f(p)|\leqslant \unicode[STIX]{x1D705}$ for all primes $p$ . If $f_{1}\in {\mathcal{C}}(\unicode[STIX]{x1D705}_{1})$ and $f_{2}\in {\mathcal{C}}(\unicode[STIX]{x1D705}_{2})$ then the convolution $f_{1}\ast f_{2}$ is in ${\mathcal{C}}(\unicode[STIX]{x1D705}_{1}+\unicode[STIX]{x1D705}_{2})$ . Therefore if $\unicode[STIX]{x1D705}\in \mathbb{N}$ then ${\mathcal{C}}(\unicode[STIX]{x1D705})$ includes any Dirichlet convolution of $\unicode[STIX]{x1D705}$ functions each belonging to ${\mathcal{C}}(1)$ . Thus our framework includes the Möbius and Liouville functions, $k$ -divisor functions, Hecke eigenvalues of automorphic forms (provided they satisfy the Ramanujan conjecture), and finitely many convolutions of such functions. One could weaken the assumptions further if needed (for example, using a weak Ramanujan hypothesis instead, as in [Reference SoundararajanSou10]), but we restrict ourselves to ${\mathcal{C}}(\unicode[STIX]{x1D705})$ for simplicity.
1.1 A generalization of Halász’s theorem
We begin by generalizing the classical version of Halász’s theorem to the class ${\mathcal{C}}(\unicode[STIX]{x1D705})$ .
Theorem 1.1 (Halász’s theorem).
Let $\unicode[STIX]{x1D705}$ be a fixed positive number, and let $x$ be large. Then, uniformly for all $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ ,
The following corollary may be easier to work with in practice.
Corollary 1.2. For given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , define $M=M(x)$ by
Then
Notice $M\geqslant -o(1)$ , as $|F(1+1/\text{log}\,x+it)|\leqslant \unicode[STIX]{x1D701}(1+1/\text{log}\,x)^{\unicode[STIX]{x1D705}}\leqslant (1+\log x)^{\unicode[STIX]{x1D705}}$ . Therefore at worst Corollary 1.2 matches the trivial bound $|\sum _{n\leqslant x}f(n)|\leqslant \sum _{n\leqslant x}d_{\unicode[STIX]{x1D705}}(n)\ll _{\unicode[STIX]{x1D705}}x(\log x)^{\unicode[STIX]{x1D705}-1}$ . This lossless quality of the analytic bound is one of the key features of Halász’s theorem. On the other hand, from Lemma 2.7 below it follows that $e^{-M}(\log x)^{\unicode[STIX]{x1D705}}$ is always $\gg 1$ , so that at best Corollary 1.2 produces a bound of $\ll x(\log \log x)^{\max (\unicode[STIX]{x1D705},1)}/\text{log}\,x$ .
Existing proofs of Halász’s theorem [Reference Granville and SoundararajanGS03, Reference HalászHal68, Reference HalászHal71, Reference MontgomeryMon, Reference Montgomery and VaughanMV01, Reference TenenbaumTen95] establish an inequality like Theorem 1.1, and then a consequence like Corollary 1.2 (usually with $\unicode[STIX]{x1D705}=1$ , sometimes less sharply). These proofs are all built on the original fundamental arguments of Halász, and employ ‘average of averages’ type arguments, whereas our proof uses a more direct and hopefully more intuitive ‘triple convolution’ application of Perron’s formula, which generalises naturally to diverse other situations (such as short intervals and arithmetic progressions). The reader may also consult [Reference Granville, Harper and SoundararajanGHS15] where the argument is presented in the simpler context of function fields, and [Reference Granville, Harper and SoundararajanGHS18] where an even simpler approach is presented in the case $\unicode[STIX]{x1D705}=1$ .
This new proof can help us to identify the ‘main term(s)’ for the mean value of $f$ up to $x$ . Indeed, the following result will be proved in a forthcoming paper by the authors on asymptotics in Halász’s theorem. Fix $\unicode[STIX]{x1D716}>0$ . For $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , there are intervals $I_{j},1\leqslant j\leqslant \ell$ , with $\ell \ll _{\unicode[STIX]{x1D716}}1$ , each of length $1/(\log x)^{C(\unicode[STIX]{x1D716})}$ such that
for $c_{0}=1+1/\text{log}\,x$ . These integrals can be determined under appropriate assumptions, to provide asymptotics for various interesting examples (see § 9, which cites asymptotics determined in our forthcoming asymptotics paper).
1.2 Multiplicative functions in short intervals
We are interested in the mean value of a multiplicative function in a short interval $[x,x+z)$ with $z$ as small as possible.
Theorem 1.3 (Halász’s theorem for intervals).
Given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , real $x$ and $0<\unicode[STIX]{x1D6FF}<\frac{1}{4}$ , we have
A simplified version of this theorem is given in the following corollary.
Corollary 1.4. Given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , real $x$ and $0<\unicode[STIX]{x1D6FF}<\frac{1}{4}$ , we have
where
Theorem 1.3 and Corollary 1.4 are established in § 3. We remark that several aspects of Corollary 1.4 are in general best possible. For instance, taking $f(n)=n^{it}$ for any given $|t|\leqslant x^{\unicode[STIX]{x1D6FF}}$ then $f(n)$ will be almost constant on the interval $(x,x+x^{1-\unicode[STIX]{x1D6FF}}]$ , so there will be no cancellation in the mean value. This shows that the increased range of $t$ in the definition of $M_{\unicode[STIX]{x1D6FF}}$ , as well as the lack of a denominator $1+1/\text{log}\,x+it$ (as compared with the definition of $M$ in Corollary 1.2), are both necessary here.
The term $(\unicode[STIX]{x1D6FF}\log x)^{\unicode[STIX]{x1D705}}$ in Corollary 1.4 is also necessary, in view of the following example: suppose that we are given values of $f(p^{k})$ for all $p\leqslant x^{1-\unicode[STIX]{x1D6FF}}$ , consistent with $f\in C(\unicode[STIX]{x1D705})$ . We are interested in extending the definition of $f$ to large primes in such a way that certain sums over the corresponding $f(n)$ are maximized. Since any prime $p>x^{1-\unicode[STIX]{x1D6FF}}$ divides at most one integer in $(x,x+x^{1-\unicode[STIX]{x1D6FF}}]$ , one can choose $f(p)$ in such a way that for all such multiples the corresponding values $f(mp)$ all point in the same direction. Therefore, maximizing over all ways to extend $f\in C(\unicode[STIX]{x1D705})$ (given the values at primes below $x^{1-\unicode[STIX]{x1D6FF}}$ ), we have
Hence, using the prime number theorem,
If, for example, each $|f(p)|=\unicode[STIX]{x1D705}$ for $p\leqslant X^{\unicode[STIX]{x1D6FF}}$ then this is $\gg _{\unicode[STIX]{x1D705}}(X^{1-\unicode[STIX]{x1D6FF}}/\text{log}\,X)(\unicode[STIX]{x1D6FF}\log X)^{\unicode[STIX]{x1D705}}$ .
1.3 Slow variation of mean-values of multiplicative functions
We are also interested in bounding how much the mean value of a multiplicative function can vary in short multiplicative intervals; a general version of this principle forms the main input in [Reference SoundararajanSou10]. The example $f(n)=n^{i\unicode[STIX]{x1D6FC}}$ shows that the mean value $(1/x)\sum _{n\leqslant x}f(n)$ (which is ${\sim}x^{i\unicode[STIX]{x1D6FC}}/(1+i\unicode[STIX]{x1D6FC})$ ) might rotate with $x$ ; but here we may consider the ‘renormalized’ mean-values $(1/x^{1+i\unicode[STIX]{x1D6FC}})\sum _{n\leqslant x}f(n)$ , which now varies slowly with $x$ . The following result (which will be established in § 4) establishes a general result of this flavor. It extends the earlier work of Granville and Soundararajan [Reference Granville and SoundararajanGS03] (improving on Elliott [Reference ElliottEll89]) obtained in the special case $\unicode[STIX]{x1D705}=1$ .
Theorem 1.5 (Lipschitz theorem).
Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ be given, and let $x$ be large. Let $t_{1}=t_{1}(x)$ be a point in $[-(\log x)^{\unicode[STIX]{x1D705}},(\log x)^{\unicode[STIX]{x1D705}}]$ where the maximum of $|F(1+1/\text{log}\,x+it)|$ (for $t$ in this interval) is attained. For all $1\leqslant w\leqslant x^{1/3}$ we have
If $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ is always non-negative, then the estimate above holds with the improved exponent $\min (\unicode[STIX]{x1D705}(1-1/\unicode[STIX]{x1D70B}),1)$ .
In Examples 2 and 3 of § 9, we show that when $\unicode[STIX]{x1D705}=1$ there are examples of functions in ${\mathcal{C}}(1)$ where the exponent $1-2/\unicode[STIX]{x1D70B}$ that appears above is attained, and there are examples where the exponent $1-1/\unicode[STIX]{x1D70B}$ for non-negative functions is attained. In [Reference Granville and SoundararajanGS03] it was suggested that the right Lipschitz exponent for $\unicode[STIX]{x1D705}=1$ might be as large as $1$ ; we now see that this is not the case.
1.4 Multiplicative functions in arithmetic progressions
We now establish analogues of Halász’s Theorem 1.1 and Corollary 1.2 for multiplicative functions in arithmetic progressions. There is a strong analogy with the situation in short intervals, with Dirichlet characters $\unicode[STIX]{x1D712}$ modulo $q$ being introduced here and playing a similar role as the ‘continuous characters’ $(n^{it})_{|t|\leqslant x^{\unicode[STIX]{x1D6FF}}}$ in the short interval theorems.
Theorem 1.6 (Halász’s theorem for arithmetic progressions).
Given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , if $x\geqslant q^{4}$ and $(a,q)=1$ then
where $F_{\unicode[STIX]{x1D712}}(s):=\sum _{n=1}^{\infty }f(n)\unicode[STIX]{x1D712}(n)/n^{s}$ denotes the twisted Dirichlet series.
For each character $\unicode[STIX]{x1D712}~(\text{mod}~q)$ we define $M_{\unicode[STIX]{x1D712}}=M_{\unicode[STIX]{x1D712}}(x)$ as follows:
The following corollary gives a simplified version of Theorem 1.6.
Corollary 1.7. Given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ let $M_{q}:=\min _{\unicode[STIX]{x1D712}}M_{\unicode[STIX]{x1D712}}(x)$ . Then
1.5 Isolating characters that correlate with a multiplicative function, and the pretentious large sieve
Given a multiplicative function $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ we shall show in § 5 that there cannot be many characters $\unicode[STIX]{x1D712}~(\text{mod}~q)$ that correlate with $f$ . For instance, there cannot be many characters $\unicode[STIX]{x1D712}~(\text{mod}~q)$ for which the quantity $M_{\unicode[STIX]{x1D712}}(x)$ is small. By extracting the few characters $\unicode[STIX]{x1D712}$ that correlate with $f$ , one can obtain a more refined understanding of the distribution of $f$ in arithmetic progressions. For a given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ and a character $\unicode[STIX]{x1D712}~(\text{mod}~q)$ define
Theorem 1.8. Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , let $X$ be large, and $q$ be a natural number with $q\leqslant X^{1/4}$ . Put $Q=q\log X$ . For any natural number $J\geqslant 1$ , there are at most $J$ characters $\unicode[STIX]{x1D712}~(\text{mod}~q)$ for which
If ${\mathcal{X}}_{J}$ denotes the set of at most $J$ characters for which (1.2) holds, then for all $x$ in the range $\sqrt{X}\leqslant x\leqslant X^{2}$ and all reduced residue classes $a~(\text{mod}~q)$ we have
In the statement of this theorem, we mean that there exists an implicit constant (depending only on $\unicode[STIX]{x1D705},J$ ) such that (1.2) can only hold for at most $J$ characters. The same remark applies to Theorem 1.11 below.
Theorem 1.8 gives non-trivial bounds once $X\geqslant q^{A}$ for a sufficiently large constant $A$ . Precursors to this result may be found in the work of Elliott [Reference ElliottEll02a], where the effect of the principal character was removed and the difference estimated. Theorem 1.8 together with related variants will be established in § 6.
We next describe one of these variants, which we call ‘the pretentious large sieve’. For any sequence of complex numbers $a_{n}$ with $n\leqslant x$ , the orthogonality relation for characters gives
and, using the Cauchy–Schwarz inequality, this is
This simple large sieve inequality is, in general, the best one can do. For a multiplicative function $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ this becomes, when $x\geqslant q^{2}$ ,
using Lemma 2.3 below. Our work shows that if a handful of characters are removed, then the above large sieve estimate can be improved in the range $x\geqslant q^{A}$ for a suitably large $A$ .
Theorem 1.9 (The pretentious large sieve).
Suppose that $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ . Let $X$ , $q$ and $Q$ be as in Theorem 1.8. Let $J\geqslant 1$ be a natural number, and let ${\mathcal{X}}_{J}$ consist of the at most $J$ characters $\unicode[STIX]{x1D712}$ for which (1.2) holds. Then for all $\sqrt{X}\leqslant x\leqslant X^{2}$ we have
1.6 Primes in short intervals and arithmetic progressions
We now give some applications of our work to counting primes in short intervals and arithmetic progressions. We shall obtain qualitative versions of Hoheisel’s theorem that there are many primes in intervals $[x,x+x^{1-\unicode[STIX]{x1D6FF}}]$ for some $\unicode[STIX]{x1D6FF}>0$ , and Linnik’s theorem that the least prime $p\equiv a~(\text{mod}~q)$ is below $q^{L}$ for an absolute constant $L$ . In both situations, more precise results are known, but our work is perhaps simpler in comparison with other approaches, avoiding the use of log-free zero-density estimates and refined zero-free regions. In the case of Linnik’s theorem, our work bears several features in common with Elliott’s proof in [Reference ElliottEll02b].
Corollary 1.10. Let $\unicode[STIX]{x1D6FF}>0$ be small, and $x$ be a large real number. Then, with implied constants being absolute,
Now we turn to primes in arithmetic progressions, where the story is more complicated because of the possibility of exceptional characters that might pretend to be the Möbius function.
Theorem 1.11. Let $q$ be a natural number, and $x\geqslant q^{10}$ be large. Put $Q=q\log x$ and let $J\geqslant 1$ be a natural number. Let ${\mathcal{X}}_{J}$ denote the set of non-principal characters $\unicode[STIX]{x1D712}~(\text{mod}~q)$ such that
Then ${\mathcal{X}}_{J}$ has at most $J$ elements, and for any non-principal character $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}$ we have
Further for any reduced residue class $a~(\text{mod}~q)$ we have
Corollary 1.12. Let $q$ be a natural number and $x\geqslant q^{10}$ be large. Set $Q=q\log x$ . Then, for any reduced residue class $a~(\text{mod}~q)$ ,
unless there is a non-principal quadratic character $\unicode[STIX]{x1D712}~(\text{mod}~q)$ such that
If this exceptional character exists, then
Theorem 1.11 also leads to an improvement of the large sieve estimate for primes, once one removes the effect of a few characters.
Corollary 1.13 (The pretentious large sieve for the primes).
Let $q$ , $x$ and $Q$ be as in Theorem 1.11, and let $\widetilde{{\mathcal{X}}}_{J}$ denote the set ${\mathcal{X}}_{J}$ of Theorem 1.11 extended to include the principal character. Then
Theorem 1.11 and Corollaries 1.12 and 1.13 will be established in § 7. Finally in § 8 we shall deduce the following qualitative form of Linnik’s theorem.
Corollary 1.14 (Linnik’s theorem).
There exists a constant $L$ such that if $(a,q)=1$ then there is a prime $p\equiv a~(\text{mod}~q)$ with $p\ll q^{L}$ .
2 Halász’s theorem for the class ${\mathcal{C}}(\unicode[STIX]{x1D705})$ : proofs of Theorem 1.1 and Corollary 1.2
In this section, we establish Theorem 1.1. The strategy of the proof also generalizes readily to give variants of Halász’s theorem in short intervals and arithmetic progressions, and we shall give these versions in the next section.
Let $p(n)$ and $P(n)$ denote (respectively) the smallest and largest prime factors of $n$ . Given a parameter $y\geqslant 10$ , define the multiplicative functions $s(\cdot )$ (supported on small primes) and $\ell (\cdot )$ (supported on large primes) by
so that $f$ is the convolution of $s$ and $\ell$ . Therefore setting (for $s$ with Re $(s)>1$ )
We define $\unicode[STIX]{x1D6EC}_{s}$ and $\unicode[STIX]{x1D6EC}_{\ell }$ analogously. Note that $s$ , $\ell$ , ${\mathcal{S}}$ and ${\mathcal{L}}$ all depend on the parameter $y$ . We also remark that since ${\mathcal{S}}$ is defined by a finite Euler product, ${\mathcal{S}}(s)$ converges and is analytic for Re $(s)>0$ . The following proposition, and variants of it, will play a key role in our proof of Halász’s theorem.
Proposition 2.1. Let $10\leqslant y\leqslant \sqrt{x}$ , let $\unicode[STIX]{x1D702}=1/\text{log}\,y$ so that $0<\unicode[STIX]{x1D702}<1/2$ . Set $c_{0}=1+1/\text{log}\,x$ . Then, for any $1\leqslant T\leqslant x^{9/10}$ ,
equals
The crucial point here is the presence of multiple Dirichlet polynomials inside the integral (visibly four, but really three since the factors ${\mathcal{S}}$ and ${\mathcal{L}}$ go together). To understand why this is useful one may think of the circle method for ternary problems (for example, sums of three primes). In such situations, three generating functions are involved, and a satisfactory bound is obtained by using an $L^{\infty }$ estimate for one of the generating functions and then using $L^{2}$ estimates (Parseval’s identity) to bound the remaining two. Proposition 2.1 allows us to employ a similar strategy here, bounding ${\mathcal{S}}(s-\unicode[STIX]{x1D6FC}-\unicode[STIX]{x1D6FD}){\mathcal{L}}(s+\unicode[STIX]{x1D6FD})$ in magnitude, and then using a Plancherel bound (see Lemma 2.6) for the remaining two sums.
We also remark on the introduction of the extra integrals over $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ , which (as the reader will see below) is a technical device for (essentially) introducing logarithmic factors into some of our Dirichlet polynomials. It is possible to run our arguments without these extra integrals, just including the relevant logarithms explicitly, and indeed this is what we do in the short note [Reference Granville, Harper and SoundararajanGHS18] where we present our simplest proof of Halász’s theorem, restricted to the case $\unicode[STIX]{x1D705}=1$ . But when we go on to generalise our arguments to short intervals and arithmetic progressions, the calculations are nicer with the extra integrals than with the explicit ‘contaminating’ logarithms.
2.1 Introducing many convolutions
The kernel of Proposition 2.1 is contained in the following slightly simpler identity, which we will tailor (see Lemma 2.2) to obtain a more flexible variant. (Thus Lemma 2.1 itself is not actually needed, strictly speaking, for our subsequent work.)
Lemma 2.1. For any $x>2$ with $x$ not an integer, and any $c>1$ , we have
Proof. We give two proofs. In the first proof, we interchange the integrals over $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ , and $s$ . First perform the integral over $\unicode[STIX]{x1D6FD}$ . Since $\int _{\unicode[STIX]{x1D6FD}=0}^{\infty }F^{\prime }(s+\unicode[STIX]{x1D6FC}+\unicode[STIX]{x1D6FD})\,d\unicode[STIX]{x1D6FD}=1-F(s+\unicode[STIX]{x1D6FC})$ we are left with
and now performing the integral over $\unicode[STIX]{x1D6FC}$ this is
Perron’s formula now shows that the above matches the left-hand side of the stated identity.
For the second (of course closely related) proof, Perron’s formula gives that
Integrating now over $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ gives
where we use that $\sum _{\ell m=n}\unicode[STIX]{x1D6EC}_{f}(\ell )f(m)=f(n)\log n$ (which follows upon comparing coefficients in $(-F^{\prime }/F)(s)\cdot F(s)=-F^{\prime }(s)$ ).◻
Lemma 2.2. Let $F$ , ${\mathcal{S}}$ , ${\mathcal{L}}$ , $x$ and $y$ be as above. Let $\unicode[STIX]{x1D702}>0$ and $c>1$ be real numbers. Then
Proof. First we perform the integral over $\unicode[STIX]{x1D6FD}$ in (2.3), obtaining
The term arising from ${\mathcal{L}}(s+\unicode[STIX]{x1D6FC}+2\unicode[STIX]{x1D702})$ above gives the third term in (2.4), using Perron’s formula. The term arising from ${\mathcal{L}}(s+\unicode[STIX]{x1D6FC})$ gives
upon evaluating the integral over $\unicode[STIX]{x1D6FC}$ . Perron’s formula now matches the two terms above with the first two terms in (2.4). This establishes our identity.◻
To bound the contributions of the second and third terms in (2.4), as well as other related quantities, we use a well-known result of Shiu establishing a Brun–Titchmarsh inequality for non-negative multiplicative functions.
Lemma 2.3. Let $\unicode[STIX]{x1D705}$ and $\unicode[STIX]{x1D716}$ be fixed positive real numbers. Let $x$ be large, and suppose $x^{\unicode[STIX]{x1D716}}\leqslant z\leqslant x$ and $q\leqslant z^{1-\unicode[STIX]{x1D716}}$ . Then uniformly for all $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ we have
where $a~(\text{mod}~q)$ is any reduced residue class.
Proof. This follows from [Reference ShiuShi80, Theorem 1]. ◻
Lemma 2.4. Keep notations as above. For $\unicode[STIX]{x1D702}=1/\text{log}\,y$ , we have
Proof. Applying Lemma 2.3 (with $z=x$ and $q=1$ there) we find that
As $\sum _{k\leqslant K}|\unicode[STIX]{x1D6EC}(k)|/k^{\unicode[STIX]{x1D6FC}}\ll K^{1-\unicode[STIX]{x1D6FC}}$ by the Chebyshev bound $\unicode[STIX]{x1D713}(x):=\sum _{k\leqslant x}\unicode[STIX]{x1D6EC}(k)\ll x$ , we have
which is
as desired. ◻
2.2 Tailoring the integral
Now we turn to the task of bounding the integrals in (2.3), with a view to proving Proposition 2.1 and ultimately Halász’s Theorem 1.1. Fix $\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FD}\in [0,2\unicode[STIX]{x1D702}]$ . If we write the terms in the Dirichlet series in (2.3) as $a,b,c,d$ respectively, then by Perron’s formula the integral over $s$ equals a sum of the shape $\sum _{abcd\leqslant x}(bcd)^{-\unicode[STIX]{x1D6FC}}(bd)^{-\unicode[STIX]{x1D6FD}}$ . Both $c$ and $d$ are ${>}y$ , by the definition of ${\mathcal{L}}$ , and so $a$ , $b$ , $c$ , and $d$ must all be ${<}x/y$ . Hence in (2.3) we may replace
and similarly for $({\mathcal{L}}^{\prime }/{\mathcal{L}})(s+\unicode[STIX]{x1D6FC}+\unicode[STIX]{x1D6FD})$ . Moreover, with these Dirichlet series truncated to finite sums we may move the line of integration a little to the left.
Choose $c_{\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FD}}=c_{0}-\unicode[STIX]{x1D6FC}-\unicode[STIX]{x1D6FD}/2$ , so that (after a change of variables) the inner integral over $s$ in (2.3) is
We now show that the integral in (2.5) may be truncated at $T$ with an error term that is at most the second error term in Proposition 2.1. To do this, we shall find useful the following consequence of the Chebyshev bound $\unicode[STIX]{x1D713}(x)\ll x$ : for all $0<|\unicode[STIX]{x1D706}|\leqslant 1$
Lemma 2.5. For $x^{9/10}\geqslant T\geqslant 1$ , the part of the integral in (2.5) with $|t|\geqslant T$ , integrated over $0\leqslant \unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FD}\leqslant 2/\text{log}\,y$ , is bounded by $O(x(\log x)^{\unicode[STIX]{x1D705}}/T)$ .
Proof. The error introduced in truncating the integral in (2.5) at $T$ is, by the quantitative Perron formula (see [Reference DavenportDav00, ch. 17]),
We bound this error term by splitting the values of $k_{1}k_{2}mn=N$ into various ranges. The terms with $N\leqslant x/2$ or $N>3x/2$ are bounded by
by (2.6). Integrating over $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ leads to a bound $\ll x(\log y)^{\unicode[STIX]{x1D705}}/T\ll x(\log x)^{\unicode[STIX]{x1D705}}/T$ .
Now we turn to the terms with $k_{1}k_{2}mn=:N$ where $x/2\leqslant N\leqslant 3x/2$ . Note that $m$ is the largest divisor of $N$ free of prime factors ${>}y$ , so that $(m,N/m)=1$ . Integrating over $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ , we find that
where $r$ only has prime factors ${>}y$ . Using $\sum _{ab=c}\unicode[STIX]{x1D6EC}(a)d_{\unicode[STIX]{x1D705}}(b)=(1/\unicode[STIX]{x1D705})d_{\unicode[STIX]{x1D705}}(c)\log c$ twice, we deduce that the above is
Hence these terms contribute
splitting the sum over $N$ into intervals $(x(1+j/T),x(1+(j+1)/T)]$ for $-T/2\leqslant j<T$ , proving the lemma. The final inequality here requires an upper bound for the sum of $d_{\unicode[STIX]{x1D705}}(N)$ in intervals of length $x/T$ around $x$ , which follows from Lemma 2.3.◻
Proof of Proposition 2.1.
We begin with the identity in Lemma 2.2, replacing the inner integral in (2.3) by (2.5). We truncate the inner integral obtaining the error in Lemma 2.5. The terms in (2.4) are bounded by Lemma 2.4. Finally we replace $\unicode[STIX]{x1D6FD}$ by $2\unicode[STIX]{x1D6FD}$ . ◻
2.3 Proof of Theorem 1.1
With Proposition 2.1 in hand, we may proceed to the proof of Halász’s Theorem 1.1. We will bound the integral in Proposition 2.1 by a suitable application of Cauchy–Schwarz and the following simple mean value theorem for Dirichlet polynomials.
Lemma 2.6. For any complex numbers $\{a(n)\}_{n\geqslant 1}$ and any $T\geqslant 1$ we have
Proof. Let $\unicode[STIX]{x1D6F7}$ be an even non-negative function with $\unicode[STIX]{x1D6F7}(t)\geqslant 1$ for $-1\leqslant t\leqslant 1$ , for which $\widehat{\unicode[STIX]{x1D6F7}}$ , the Fourier transform of $\unicode[STIX]{x1D6F7}$ , is compactly supported. For example, take $\unicode[STIX]{x1D6F7}(x)=(1/(\sin 1)^{2})(\sin x/x)^{2}$ and note that $\hat{\unicode[STIX]{x1D6F7}}(x)$ is supported in $[-1,1]$ . We may then bound our integral by
Since $2|a(m)a(n)|\leqslant |a(m)|^{2}+|a(n)|^{2}$ and by symmetry, the above is
Since $\widehat{\unicode[STIX]{x1D6F7}}$ is compactly supported we know that $|\widehat{\unicode[STIX]{x1D6F7}}(t)|\ll 1$ for all $t$ , and that for a given $m$ , our sum over $n$ is only supported on those values of $n$ for which $|n-m|\ll m/T$ . Therefore, using the Brun–Titchmarsh theorem, the sum over $n$ is seen to be $\ll m$ . The lemma follows.◻
Proof of Theorem 1.1.
Let $x$ be large, and take $T=(\log x)^{\unicode[STIX]{x1D705}+1}$ and $y=T^{2}$ in our work above. The error terms in Proposition 2.1 are $\ll x(\log \log x)^{\unicode[STIX]{x1D705}}/\text{log}\,x$ , and it remains to handle the integral in (2.1). For given $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ in $[0,\unicode[STIX]{x1D702}]$ , the integral there is (since $\unicode[STIX]{x1D702}=1/\text{log}\,y$ is small)
Since $\unicode[STIX]{x1D6FC}$ , $\unicode[STIX]{x1D6FD}$ , $c_{0}-1$ are all at most $1/\text{log}\,y$ ,
and so
Using Cauchy–Schwarz, Lemma 2.6 (noting that $y=T^{2}$ ), and then (2.6), the integral in (2.7) is
Combining the last two estimates, we find that the quantity in (2.7) is
Writing $\unicode[STIX]{x1D70E}=\unicode[STIX]{x1D6FD}+1/\text{log}\,x$ so that $\min (\log x,1/\unicode[STIX]{x1D6FD})\asymp 1/\unicode[STIX]{x1D70E}$ , the integral in (2.1) is
Since $|F(1+\unicode[STIX]{x1D70E}+it)|\leqslant \unicode[STIX]{x1D701}(1+\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D705}}\ll (1/\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D705}}$ ,
and so the quantity in (2.9) may be bounded by
completing the proof of Theorem 1.1.◻
The following simple lemma establishes limits on the quality of bounds that can be deduced from Halász’s theorem, and will be useful in deducing Corollary 1.2.
Lemma 2.7. Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ . Suppose $0<\unicode[STIX]{x1D70E}\leqslant 1$ , and $T$ is such that $T\geqslant 4\unicode[STIX]{x1D701}(1+\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D705}/2}$ . Then
Proof. For any $c>0$ , a simple contour shift argument gives
Therefore
The portion of the integral with $|t|>T$ contributes an amount that is bounded in magnitude by
and the lemma follows. ◻
Proof of Corollary 1.2.
We use the maximum modulus principle in the region
to bound $\max _{|t|\leqslant (\log x)^{\unicode[STIX]{x1D705}}}|F(1+\unicode[STIX]{x1D70E}+it)/(1+\unicode[STIX]{x1D70E}+it)|$ . By the definition of $M=M(x)$ , the maximum of this quantity along the vertical line segment $u=1+1/\text{log}\,x$ is $e^{-M}(\log x)^{\unicode[STIX]{x1D705}}$ , and by Lemma 2.7 this is $\gg 1$ . On the horizontal segments $|v|=(\log x)^{\unicode[STIX]{x1D705}}$ , $|F(u+iv)/(u+iv)|$ may be bounded by ${\leqslant}\unicode[STIX]{x1D701}(1+1/\text{log}\,x)^{\unicode[STIX]{x1D705}}/(\log x)^{\unicode[STIX]{x1D705}}=O(1)$ , and the same bound holds on the vertical line segment $u=2$ . Thus
Moreover, bounding $|F(1+\unicode[STIX]{x1D70E}+it)|$ by $\unicode[STIX]{x1D701}(1+\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D705}}$ we also have
Inserting these bounds into Halász’s Theorem 1.1 the integral there is
and the corollary follows. ◻
In some applications it may happen that even though the function $f$ belongs a priori to some class ${\mathcal{C}}(\unicode[STIX]{x1D705})$ the average size of $|f(p)|$ might be smaller. For example, if $f(n)$ is the indicator function of sums of two squares then $f$ lies in ${\mathcal{C}}(1)$ , whereas the average size of $f(p)$ is $\frac{1}{2}$ . Another example comes from the normalized Fourier coefficients of holomorphic eigenforms. These lie naturally in ${\mathcal{C}}(2)$ , but the average size of these coefficients at primes is smaller than $1$ in absolute value. In such situations, one would like refined versions of our results taking the average behavior of $|f(p)|$ into account. We give a sample result of this sort in the context of Corollary 1.2, and it would be desirable to flesh out similar variants for our other results.
Corollary 2.8. Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ and put $G(s)=\sum _{n=1}^{\infty }|f(n)|n^{-s}$ . Suppose that $0<\unicode[STIX]{x1D706}\leqslant \unicode[STIX]{x1D705}$ is such that $G(1+\unicode[STIX]{x1D70E})\ll 1/\unicode[STIX]{x1D70E}^{\unicode[STIX]{x1D706}}$ for all $\unicode[STIX]{x1D70E}>0$ . Then
where
Proof. We proceed as in the proof of Corollary 1.2 but now note that $|F(1+\unicode[STIX]{x1D70E}+it)|\leqslant |G(1+\unicode[STIX]{x1D70E})|\ll (1/\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D706}}$ , and so
Inserting this into Halász’s Theorem 1.1 we obtain this corollary. ◻
3 Halász’s theorem in short intervals and progressions: proofs of Theorems 1.3 and 1.6
3.1 Short intervals
Proof of Theorem 1.3.
Since $|f(n)|\leqslant d_{\unicode[STIX]{x1D705}}(n)$ and
we may assume that $\unicode[STIX]{x1D6FF}$ is small in the theorem (else the term $(\unicode[STIX]{x1D6FF}\log x)^{\unicode[STIX]{x1D705}}$ in the stated bound dominates). We may also assume that $\unicode[STIX]{x1D6FF}>1/\text{log}\,x$ else the result follows from Theorem 1.1. We follow the strategy of the proof of Theorem 1.1 with suitable modifications. We choose $T=x^{\unicode[STIX]{x1D6FF}}(\log x)^{\unicode[STIX]{x1D705}+2}$ , and $y=T^{2}$ , and define $s$ , $\ell$ , ${\mathcal{S}}$ and ${\mathcal{L}}$ as in § 2. We apply Lemma 2.2 twice, at $x+x^{1-\unicode[STIX]{x1D6FF}}$ and at $x$ , and subtract. Deferring the treatment of the secondary terms in (2.4) for a moment, we first explain how to bound the difference of contour integrals.
The difference of contour integrals mentioned above is (in place of (2.3))
As described in § 2.2, we change variables and truncate this integral at height $T$ , the error term in Lemma 2.5 being acceptable since $T=x^{\unicode[STIX]{x1D6FF}}(\log x)^{\unicode[STIX]{x1D705}+2}$ . Then we proceed as in the proof of Halász’s Theorem 1.1 in § 2.3, with the only difference being that our integral here (in the step analogous to (2.7)) is multiplied through by a factor
Thus, in place of (2.9), we have the bound
The contribution of those $\unicode[STIX]{x1D70E}$ for which the maximum of $|F(1+\unicode[STIX]{x1D70E}+it)|$ occurs with $T\geqslant |t|>\frac{1}{2}x^{\unicode[STIX]{x1D6FF}}(\log x)^{\unicode[STIX]{x1D705}}$ , is (since $|F(1+\unicode[STIX]{x1D70E}+it)|\ll 1/\unicode[STIX]{x1D70E}^{\unicode[STIX]{x1D705}}$ )
This is acceptable for the estimate stated in Theorem 1.3.
It remains to deal with the terms corresponding to (2.4), which here are bounded by
Using Lemma 2.3, the first term is
To bound the second term, we distinguish the cases when $mn\leqslant 2\sqrt{x}$ and when $mn>2\sqrt{x}$ so that $k\leqslant 2\sqrt{x}$ . In the first case, we use the Brun–Titchmarsh inequality to estimate the sum over $k$ , and thus such terms contribute
which is $\ll _{\unicode[STIX]{x1D705}}x^{1-\unicode[STIX]{x1D6FF}}(\log y)^{\unicode[STIX]{x1D705}}/\text{log}\,x$ . In the second case, we sum over $mn$ first using Lemma 2.3 and then sum over $k$ ; thus, we obtain
This completes our proof of Theorem 1.3.◻
To deduce Corollary 1.4 (and also in the proof of Theorem 1.5), we will find useful the following variant of the maximum modulus principle, which may be found as Lemma 2.2 of [Reference Granville and SoundararajanGS03].
Lemma 3.1. Let $a_{n}$ be a sequence of complex numbers such that $A(s)=\sum _{n=1}^{\infty }a_{n}n^{-s}$ is absolutely convergent for Re $(s)\geqslant 1$ . For all real numbers $T\geqslant 1$ and all $0\leqslant \unicode[STIX]{x1D6FC}\leqslant 1$ we have
and for any $w\geqslant 1$
Proof of Corollary 1.4.
Take $a_{n}=f(n)/n^{1/\text{log}\,x}$ , and $T=\frac{1}{2}x^{\unicode[STIX]{x1D6FF}}(\log x)^{\unicode[STIX]{x1D705}}$ in Lemma 3.1, so that for $1/\text{log}\,x\leqslant \unicode[STIX]{x1D70E}\leqslant 1$
where $c_{0}=1+1/\text{log}\,x$ as before. Combining this with the upper bound $|F(1+\unicode[STIX]{x1D70E}+it)|\leqslant \unicode[STIX]{x1D701}(1+\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D705}}\ll 1/\unicode[STIX]{x1D70E}^{\unicode[STIX]{x1D705}}$ in appropriate ranges of $\unicode[STIX]{x1D70E}$ , we deduce Corollary 1.4 from Theorem 1.3 (in much the same way as we deduced Corollary 1.2 from Theorem 1.1).◻
3.2 Arithmetic progressions
In this section we begin our study of the mean value of $f$ in an arithmetic progression $a~(\text{mod}~q)$ with $(a,q)=1$ . Our starting point is the orthogonality relation of characters
and now we may invoke our earlier work in understanding the mean-values of the multiplicative functions $f(n)\overline{\unicode[STIX]{x1D712}}(n)$ .
Proof of Theorem 1.6.
Starting with (3.3), we take $T=q^{2}(\log x)^{\unicode[STIX]{x1D705}+2}$ and $y=T^{2}$ , and apply Lemma 2.2 to each sum $\sum _{n\leqslant x}f(n)\overline{\unicode[STIX]{x1D712}}(n)$ . After summing over $\unicode[STIX]{x1D712}~\text{mod}~q$ , the second and third terms in (2.4) contribute
We bound these terms in a similar manner to the argument in the previous section. Thus, by Lemma 2.3, the first term in (3.4) is
To bound the second term in (3.4), we distinguish the cases when $mn\leqslant \sqrt{x}$ and when $mn>\sqrt{x}$ so that $k\leqslant \sqrt{x}$ . In the first case, we use the Brun–Titchmarsh inequality to estimate the sum over $k$ (noting that $k$ must lie in a fixed progression modulo $q$ ), and obtain
which is $\ll _{\unicode[STIX]{x1D705}}x(\log y)^{\unicode[STIX]{x1D705}}/(\unicode[STIX]{x1D719}(q)\log x)$ . Finally in the second case when $mn>\sqrt{x}$ , we first bound the sum over $mn$ using Lemma 2.3 and then perform the sum over $k$ (which is now at most $\sqrt{x}$ ). Thus such terms contribute (multiplying through by $(mn/\sqrt{x})^{\unicode[STIX]{x1D6FC}}$ since $mn>\sqrt{x}$ )
These bounds are all acceptable for Theorem 1.6.
Now we turn to the corresponding contour integrals in (2.3). Arguing as in § 2.2, we may truncate the corresponding integrals at height $T$ , incurring an error term in the analogue of Lemma 2.5 of
which again is acceptable for Theorem 1.6. After summing over $\unicode[STIX]{x1D712}$ mod $q$ , we are left with a main term (analogously to Proposition 2.1) of
where ${\mathcal{S}}_{\overline{\unicode[STIX]{x1D712}}}$ and ${\mathcal{L}}_{\overline{\unicode[STIX]{x1D712}}}$ are appropriate twists of ${\mathcal{S}}$ and ${\mathcal{L}}$ respectively. To bound this we need the following extension of Lemma 2.6.
Lemma 3.2. Let $T\geqslant q^{1+\unicode[STIX]{x1D716}}$ be given. Then for any complex numbers $a(n)$ we have
Proof. Arguing as in Lemma 2.6, the left-hand side of (3.5) may be bounded by
and the lemma follows upon using the Brun–Titchmarsh theorem to bound the number of primes in an interval that are also in a given arithmetic progression. ◻
We can now proceed much as in the proof of Halász’s Theorem 1.1 (replacing $\overline{\unicode[STIX]{x1D712}}$ by $\unicode[STIX]{x1D712}$ for convenience). The contribution from any $\unicode[STIX]{x1D6FC}$ , $\unicode[STIX]{x1D6FD}$ fixed is (arguing as in (2.7))
Using Cauchy–Schwarz and (3.5) (which is permissible since $y=T^{2}$ and $T=q^{2}(\log x)^{\unicode[STIX]{x1D705}+2}$ ), the second factor above is bounded by
by (2.6). Integrating over $\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FD}\in [0,\unicode[STIX]{x1D702}]$ , and taking $\unicode[STIX]{x1D70E}=\unicode[STIX]{x1D6FD}+1/\text{log}\,x$ with $c_{0}=1+1/\text{log}\,x$ so that $c_{0}+\unicode[STIX]{x1D6FD}=1+\unicode[STIX]{x1D70E}$ , we obtain a bound of
for our main term. Now $|F_{\unicode[STIX]{x1D712}}(1+\unicode[STIX]{x1D70E}+it)|\ll 1/\unicode[STIX]{x1D70E}^{\unicode[STIX]{x1D705}}$ and so the contribution to the integral from any $|t|\geqslant (\log x)^{\unicode[STIX]{x1D705}}$ is $\ll 1$ , which is an acceptable error, so we may restrict the maximum to the range $|t|\leqslant (\log x)^{\unicode[STIX]{x1D705}}$ . The result follows.◻
4 A Lipschitz estimate for mean-values: proof of Theorem 1.5
Let $x$ be large, and $c_{0}=1+1/\text{log}\,x$ . Recall that $t_{1}=t_{1}(x)$ is chosen such that $|F(c_{0}+it_{1})|$ is a maximum in the range $|t_{1}|\leqslant (\log x)^{\unicode[STIX]{x1D705}}$ . Given $1\leqslant w\leqslant x^{1/3}$ , our goal in Theorem 1.5 is to bound
We apply Lemma 2.2 twice, with $x$ and $x/w$ , and divide through by $x^{1+it_{1}}$ and $(x/w)^{1+it_{1}}$ respectively. The terms arising from the second and third sums in (2.4) may be bounded (as in Lemma 2.4) by $O((\log y)^{\unicode[STIX]{x1D705}}/\text{log}\,x)$ . The integral arising from (2.3) is
We now tailor this integral as in § 2.2, taking there $T=(\log x)^{\unicode[STIX]{x1D705}+1}$ , as before, and setting $y=\max (ew,T^{2})$ . Then, up to an error term $O((\log y)^{\unicode[STIX]{x1D705}}/(\log x))$ which is acceptable for Theorem 1.5, we may bound the magnitude of (4.1) by
From here on we follow the argument in § 2.3, making suitable modifications.
Thus, arguing as in (2.7) and (2.8), we may bound (for given $\unicode[STIX]{x1D6FC}$ , $\unicode[STIX]{x1D6FD}$ ) the inner integral in (4.2) by
Using the triangle inequality and since $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ are below $\unicode[STIX]{x1D702}\leqslant 1/\text{log}(ew)$ , we may bound the maximum in (4.3) by
Insert this bound in (4.3), and then into (4.2); we conclude that the quantity in (4.2) is bounded by
A small calculation allows us to bound the contribution of the second term above by
(The $\log (\log x/\text{log}\,ew)$ term can be replaced by $1$ except when $\unicode[STIX]{x1D705}=1$ .)
It remains to handle the first term in (4.4), for which we require the following lemma.
Lemma 4.1. Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ . Select a real number $t_{1}$ with $|t_{1}|\leqslant (\log x)^{\unicode[STIX]{x1D705}}$ which maximizes $|F(1+1/\text{log}\,x+it_{1})|$ . Then for $|t|\leqslant (\log x)^{\unicode[STIX]{x1D705}}$ we have
If $f(n)\geqslant 0$ for all $n$ then we obtain the analogous result with $1/\unicode[STIX]{x1D70B}$ in place of $2/\unicode[STIX]{x1D70B}$ .
Proof. If $|t-t_{1}|\leqslant 1/\text{log}\,x$ then the stated estimate is immediate, and we suppose below that $(\log x)^{\unicode[STIX]{x1D705}}\gg |t-t_{1}|\geqslant 1/\text{log}\,x$ . By the maximality of $t_{1}$ , we have $|F(1+1/\text{log}\,x+it)|\leqslant |F(1+1/\text{log}\,x+it_{1})|$ , and so, setting $\unicode[STIX]{x1D70F}=|t_{1}-t|/2$ , we obtain
We may now estimate the sum above using the prime number theorem (in the strong form $\unicode[STIX]{x1D713}(x)=x+O(x\exp (-c\sqrt{\log x}))$ ) and partial summation. For a suitably large constant $C=C_{\unicode[STIX]{x1D705}}$ , put $Y=\max (\exp (C(\log \log x)^{2}),\exp (1/\unicode[STIX]{x1D70F}))$ . For $n\leqslant Y$ we use $|\text{cos}(\unicode[STIX]{x1D70F}\log n)|\leqslant 1$ , while for larger $n$ we can exploit the fact that $\int _{0}^{1}|\text{cos}(2\unicode[STIX]{x1D70B}y)|\,dy=2/\unicode[STIX]{x1D70B}$ . In this way, we obtain (and the reader may consult the proof of Lemma 2.3 of [Reference Granville and SoundararajanGS03] for further details if needed)
from which the first assertion of the lemma follows.
If $f(n)\geqslant 0$ for all $n$ , then using $F(c_{0}+it)=\overline{F(c_{0}-it)}$ we find
Now we argue as above, exploiting here that $\int _{0}^{1}\max \{0,\cos (2\unicode[STIX]{x1D70B}y)\}\,dy=1/\unicode[STIX]{x1D70B}$ .◻
Before applying the lemma, we comment that the estimates given there are in general the best possible. Suppose $\unicode[STIX]{x1D705}=1$ . Given $t_{1}$ with $|t_{1}|\leqslant \log x$ consider $f\in {\mathcal{C}}(1)$ defined by $f(p)=\text{sign}(\cos (t_{1}\log p))$ . Here the bound of the lemma is attained for $t=-t_{1}$ . Similarly for the non-negative case, consider the function defined by $f(p)=(1+\text{sign}(\cos (t_{1}\log p)))/2$ .
For more discussion on these examples, see § 9 below.
Now we return to the first term in (4.4), seeking a bound for
Note that if $|t|\geqslant (\log x)^{\unicode[STIX]{x1D705}}/10$ then the quantity in (4.6) may be bounded trivially by $\ll 1$ . Now restrict to the range $|t|\leqslant (\log x)^{\unicode[STIX]{x1D705}}/10$ , and distinguish the cases $|t_{1}|\leqslant (\log x)^{\unicode[STIX]{x1D705}}/4$ and $|t_{1}|>(\log x)^{\unicode[STIX]{x1D705}}/4$ . In the second case $|t_{1}|>(\log x)^{\unicode[STIX]{x1D705}}/4$ , we have
upon using Lemma 3.1 with $a_{n}=f(n)/n^{1/\text{log}\,x}$ , and Lemma 4.1. In the first case when $|t_{1}|\leqslant (\log x)^{\unicode[STIX]{x1D705}}/4$ , put $A(1+it)=F(c_{0}+i(t+t_{1}))$ , so that using the second part of Lemma 3.1
Appealing now to Lemma 4.1 where $A(1+it)=F(c_{0}+i(t+t_{1}))$ , the above is bounded by
after a little calculation. Thus in all cases we may bound the quantity in (4.6) by
Using this estimate in (4.4), we conclude that the first term there contributes
The bound in Theorem 1.5 for general $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ follows upon combining this estimate with (4.5). The bound for non-negative functions $f$ follows similarly, using now the stronger input of Lemma 4.1 for that situation.
5 Repulsion results
We first show that $|F(c_{0}+it)|$ can only be large for $t$ lying in a few rare intervals; in other words, that large values ‘repel’ one another. Such results are useful in many contexts (see for example Lemma 5.1 of [Reference SoundararajanSou10] which formulates a similar result in a slightly more general setting); while we will not use Proposition 5.1 directly in this paper, it will be useful in our forthcoming asymptotics paper in extracting asymptotic formulae (such as the examples given in § 9).
Proposition 5.1. Let $0\leqslant \unicode[STIX]{x1D70C}\leqslant 1$ . If $t_{1},\ldots ,t_{\ell }\in \mathbb{R}$ satisfy $|t_{j}|\leqslant (\log x)^{\unicode[STIX]{x1D705}}$ , and $|t_{j}-t_{k}|\geqslant 1/(\log x)^{\unicode[STIX]{x1D70C}}$ for all $j\neq k$ , and $\unicode[STIX]{x1D6FF}_{1},\ldots ,\unicode[STIX]{x1D6FF}_{\ell }\geqslant 0$ then
Proof of Proposition 5.1.
The proof uses a simple but powerful ‘duality’ idea that is fundamental in large values theory for Dirichlet polynomials. Since each $|\unicode[STIX]{x1D6EC}_{f}(n)|\leqslant \unicode[STIX]{x1D705}\unicode[STIX]{x1D6EC}(n)$ , we have
By Cauchy–Schwarz the square of this sum is
The first factor above is $\log \log x+O(1)$ . Expanding out the sum over $j$ , the second factor on the right-hand side is
where the first term comes from the $j=k$ terms, and the second term from the bound
whenever $j\neq k$ , since $2(\log x)^{\unicode[STIX]{x1D705}}\geqslant |t_{j}-t_{k}|\geqslant 1/(\log x)^{\unicode[STIX]{x1D70C}}$ . (This bound follows because we have $|\unicode[STIX]{x1D701}(s)|\asymp 1/|s-1|$ when $s$ is in a neighbourhood of the pole at $s=1$ , while if $s$ is uniformly bounded away from $1$ we have $|\unicode[STIX]{x1D701}(s)|\ll \log (2+|s|)$ when Re $(s)\geqslant 1$ .) The proposition follows.◻
Of greater use in the present paper is Proposition 5.2 below, which generalizes Proposition 5.1 to Dirichlet characters. To prove that we require the following lemma.
Lemma 5.1. Let $\unicode[STIX]{x1D712}~(\text{mod}~q)$ be a non-principal character. Let $x\geqslant 10$ , and suppose that $|t|\leqslant (\log x)^{A}$ for some constant $A$ , and $y\geqslant q\log x$ . Then for all $\unicode[STIX]{x1D70E}\geqslant 1+1/\text{log}\,x$ we have
where (for Re $(s)>1$ )
Proof. This follows from [Reference KoukoulopoulosKou13, Lemma 4.1], where an elementary proof based on a sieve argument is given.
The following alternative argument, which uses some basic analytic properties of $L$ -functions, may also be illuminating. Suppose, for simplicity, that $\unicode[STIX]{x1D712}$ is primitive, and put $\mathfrak{a}=0$ or $1$ depending on whether $\unicode[STIX]{x1D712}$ is even or odd. The completed $L$ -function $\unicode[STIX]{x1D6EC}(s,\unicode[STIX]{x1D712})=(q/\unicode[STIX]{x1D70B})^{s/2}\unicode[STIX]{x1D6E4}((s+\mathfrak{a})/2)L(s,\unicode[STIX]{x1D712})$ extends to an entire function of the complex plane of order $1$ (see [Reference DavenportDav00, ch. 9]). We claim that for fixed $t\in \mathbb{R}$ , the size of the completed $L$ -function $|\unicode[STIX]{x1D6EC}(\unicode[STIX]{x1D70E}+it,\unicode[STIX]{x1D712})|$ is monotone increasing in $\unicode[STIX]{x1D70E}>1$ . To see this, suppose $\unicode[STIX]{x1D70E}_{1}\geqslant \unicode[STIX]{x1D70E}_{2}>1$ , and note that by Hadamard’s factorization formula (see (16) and (18) of [Reference DavenportDav00, ch. 12])
where the product is over all non-trivial zeros $\unicode[STIX]{x1D70C}$ of $L(s,\unicode[STIX]{x1D712})$ , which satisfy $0\leqslant \text{Re}(\unicode[STIX]{x1D70C})\leqslant 1$ . Since $0\leqslant \text{Re}(\unicode[STIX]{x1D70C})\leqslant 1$ and $\unicode[STIX]{x1D70E}_{1}\geqslant \unicode[STIX]{x1D70E}_{2}>1$ , each individual term in our product above is ${\geqslant}1$ , and our claim follows.
Now if $\unicode[STIX]{x1D70E}\geqslant 1+1/\text{log}\,y$ then
so that the lemma follows in this case. If $1+1/\text{log}\,x\leqslant \unicode[STIX]{x1D70E}\leqslant 1+1/\text{log}\,y$ , then using Stirling’s formula and the monotonicity observation above we find that
This completes the lemma for primitive $\unicode[STIX]{x1D712}$ , and it is a simple matter to extend to the general case.◻
Before stating our proposition, let us recall some notation used previously in § 3.2. If $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ is given, and $y$ is a real parameter, and $\unicode[STIX]{x1D712}~(\text{mod}~q)$ a Dirichlet character, then (with the sum being over all $n$ with prime factors all larger than $y$ )
Proposition 5.2. Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ . Suppose that $\unicode[STIX]{x1D712}_{1},\ldots ,\unicode[STIX]{x1D712}_{J}$ are distinct Dirichlet characters $~(\text{mod}~q)$ , and that $t_{1},\ldots ,t_{J}$ are real numbers with $|t_{j}|\leqslant (\log x)^{\unicode[STIX]{x1D705}}$ . Let $\unicode[STIX]{x1D6FF}_{1},\ldots ,\unicode[STIX]{x1D6FF}_{J}$ be non-negative real numbers. For $x\geqslant 10$ and $y$ in the range $q\log x\leqslant y\leqslant \sqrt{x}$ we have
Proof. The proof is similar to that of Proposition 5.1. We start with
By Cauchy–Schwarz the square of the sum here is
The first factor is $\log (\log x/\text{log}\,y)+O(1)$ . Expanding out the sum over $j$ , the second factor is
The terms $j=k$ contribute ${\leqslant}J\log (\log x/\text{log}\,y)+O(J)$ , while by Lemma 5.1 the terms $j\neq k$ contribute $O_{\unicode[STIX]{x1D705}}(J^{2})$ . We conclude that
and the proposition follows. ◻
6 Multiplicative functions in arithmetic progressions and the pretentious large sieve
Given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ and a Dirichlet character $\unicode[STIX]{x1D712}~(\text{mod}~q)$ , recall that we defined
The proof of Halász’s theorem in arithmetic progressions presented in § 3.2 may be refined to take into account the contribution of a given set of characters. Recall that for a character $\unicode[STIX]{x1D712}~(\text{mod}~q)$ we defined $M_{\unicode[STIX]{x1D712}}=M_{\unicode[STIX]{x1D712}}(x)$ by the relation
Proposition 6.1. Let ${\mathcal{X}}$ be a set of characters $~(\text{mod}~q)$ and put
Then, with
we have
Proof. We start with the orthogonality relation (3.3), omitting the characters in ${\mathcal{X}}$ , so that
and then apply Lemma 2.2 to each of the inner sums taking $T=q^{2}(\log x)^{\unicode[STIX]{x1D705}+2}$ and $y=T^{2}$ . Consider the contribution of the second term arising from (2.4), which is
A similar decomposition holds for the third term arising from (2.4). The first term above together with its counterpart arising from the third term in (2.4) contribute the expression in (3.4), and may be bounded as before by $\ll x(\log y)^{\unicode[STIX]{x1D705}}/(\unicode[STIX]{x1D719}(q)\log x)$ . Similarly, the second term above together with its counterpart may be handled by Lemma 2.4, and is bounded by $|{\mathcal{X}}|x(\log y)^{\unicode[STIX]{x1D705}}/(\unicode[STIX]{x1D719}(q)\log x)$ . From here on, we may follow the argument of § 3.2, and making small modifications to that argument yields the proposition.◻
The following result contains Theorem 1.8.
Theorem 6.1. Let $q$ be a natural number, and let $J\geqslant 1$ be a given integer. Let $X$ be large with $X\geqslant q^{10}$ , and set $Q=q\log X$ . Consider the following three definitions of a set of $J$ exceptional characters.
(i) The set ${\mathcal{X}}_{J}^{1}$ consists of the characters $\unicode[STIX]{x1D712}$ corresponding to the $J$ largest values of $\max _{\sqrt{X}\leqslant x\leqslant X}|S_{f}(x,\unicode[STIX]{x1D712})|/x$ .
(ii) The set ${\mathcal{X}}_{J}^{2}$ consists of the characters $\unicode[STIX]{x1D712}$ corresponding to the $J$ smallest values of $M_{\overline{\unicode[STIX]{x1D712}}}(X)$ .
(iii) The set ${\mathcal{X}}_{J}^{3}$ consists of the characters $\unicode[STIX]{x1D712}$ corresponding to the $J$ largest values of $\max _{|t|\leqslant (\log X)^{\unicode[STIX]{x1D705}}}|{\mathcal{L}}_{\overline{\unicode[STIX]{x1D712}}}(1+1/\text{log}\,X+it)|$ ; here take $y=q^{4}(\log X)^{2\unicode[STIX]{x1D705}+4}$ in the definition of ${\mathcal{L}}_{\overline{\unicode[STIX]{x1D712}}}$ .
In all of the cases $\ell =1$ , $2$ , or $3$ one has, uniformly for all $x$ in the range $\sqrt{X}\leqslant x\leqslant X^{2}$ ,
Further, for any character $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}^{1}\cap {\mathcal{X}}_{J}^{2}\cap {\mathcal{X}}_{J}^{3}$ one has, uniformly for all $x$ in the range $\sqrt{X}\leqslant x\leqslant X^{2}$ ,
Proof. By Proposition 5.2, there are at most $J$ characters $\unicode[STIX]{x1D712}$ with
For $\sqrt{X}\leqslant x\leqslant X^{2}$ , it is easy to check that the difference between $\log |{\mathcal{L}}_{\overline{\unicode[STIX]{x1D712}}}(1+1/\text{log}\,x+it)|$ and $\log |{\mathcal{L}}_{\overline{\unicode[STIX]{x1D712}}}(1+1/\text{log}\,X+it)|$ is $O(1)$ . Therefore, we have that there are at most $J$ characters $\unicode[STIX]{x1D712}$ with
For a character $\unicode[STIX]{x1D713}$ different from these $J$ characters, one has
In other words, for such a character $\unicode[STIX]{x1D713}$ one has
In particular, we have this lower bound for all $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}^{2}\cap {\mathcal{X}}_{J}^{3}$ . The estimate (6.1) for $\ell =2$ and $3$ (that is, cases (ii) and (iii)) follows now from Proposition 6.1, noting that the contribution from the $(\log Q)^{\unicode[STIX]{x1D705}}$ error term in Proposition 6.1 is smaller than the upper bound in (6.1).
If $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}^{2}\cap {\mathcal{X}}_{J}^{3}$ , then applying Corollary 1.2 (the first corollary of Halász’s Theorem 1.1) to $f\overline{\unicode[STIX]{x1D713}}$ yields (6.2). By the maximality in the definition of ${\mathcal{X}}_{J}^{1}$ , we deduce that the estimate (6.2) must also hold for any character $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}^{1}$ .
Now we handle (6.1) in case (i). We have already shown the result for the set ${\mathcal{X}}_{J}^{3}$ . Taking the difference between the sums defining $E_{f,{\mathcal{X}}_{J}^{1}}(x;q,a)$ and $E_{f,{\mathcal{X}}_{J}^{3}}(x;q,a)$ yields that
Using (6.2), this yields a bound on the difference that is acceptable, and so the theorem follows in case (i) also. ◻
Proof of Theorem 1.9.
Let ${\mathcal{X}}$ be a set of characters $~(\text{mod}~q)$ , (for Theorem 1.9 we will be interested in the set ${\mathcal{X}}_{J}$ of at most $J$ characters $\unicode[STIX]{x1D712}$ for which (1.2) holds), and consider
Expanding out the inner sum, we obtain
Thus
Taking ${\mathcal{X}}$ to be the set ${\mathcal{X}}_{J}^{1}$ of Theorem 6.1, and inserting (6.1) into (6.4), we obtain the bound claimed in Theorem 1.9. The set ${\mathcal{X}}_{J}^{1}$ may contain more characters than the set ${\mathcal{X}}_{J}$ , but if it does then (trivially by definition of ${\mathcal{X}}_{J}$ ) the contribution from any extra characters in ${\mathcal{X}}_{J}^{1}\backslash {\mathcal{X}}_{J}$ is acceptably small.◻
7 Counting primes in short intervals and arithmetic progressions
Here and in the next section, we wish to understand (given $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ )
where ${\mathcal{N}}$ is either a short interval near $x$ , or an arithmetic progression up to $x$ (with uniformity in the modulus $q$ ). One can relate the prime sums in (7.1) to multiplicative functions as follows. Set $G(s)=1/F(s)$ , and let $g(n)$ denote the corresponding multiplicative function (so that $g(p)=-f(p)$ for all primes $p$ ). Note that $G^{\prime }/G=-F^{\prime }/F$ and so $\unicode[STIX]{x1D6EC}_{g}(n)=-\unicode[STIX]{x1D6EC}_{f}(n)$ , whence $g$ also belongs to ${\mathcal{C}}(\unicode[STIX]{x1D705})$ . Comparing Dirichlet series coefficients in the identity $-F^{\prime }/F=(1/F)\cdot (-F^{\prime })=G\cdot (-F^{\prime })$ , we obtain
We can try to understand averages of the RHS over $n\in {\mathcal{N}}$ using the hyperbola method, noting that the sums over $k$ and $\ell$ now involve multiplicative functions. However, this approach is doomed to failure because the Halász type results allow us to save at most one logarithm (at least when $\unicode[STIX]{x1D705}=1$ ), whereas the trivial bound obtained by taking absolute values in our expression above for $\unicode[STIX]{x1D6EC}_{f}(n)$ with $f\in {\mathcal{C}}(1)$ is, (in the particular case that $f=1$ and $g=\unicode[STIX]{x1D707}$ , for simplicity),
To overcome this hurdle, we perform a ‘pre-sieving’ to eliminate integers with prime factors below $y$ for a suitable $y$ , and reprove several of our earlier results, though now restricted to multiplicative functions supported only on large prime factors.
7.1 Multiplicative functions supported only on the primes ${>}y$
Suppose $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ with $f(p^{k})=0$ for all $p\leqslant y$ and $k\geqslant 1$ . Thus ${\mathcal{S}}(s)=1$ and ${\mathcal{L}}(s)=F(s)$ , and making small modifications to the proof of Proposition 2.1 (specifically in the analogues of Lemmas 2.4 and 2.5), we arrive at the following result.
Proposition 7.1. Given $\sqrt{x}>y\geqslant 10$ , let $\frac{1}{2}>\unicode[STIX]{x1D702}=1/\text{log}\,y>0$ and $c_{0}=1+1/\text{log}\,x$ . Let $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ with $f(p^{k})=0$ for all primes $p\leqslant y$ and $k\geqslant 1$ . Then, for any $1\leqslant T\leqslant x^{9/10}$ ,
In fact, the proofs here simplify a little because in bounding some error terms there is no sum over smooth numbers. For instance, in the proof of Lemma 2.4 we only have the terms with $m=1$ , so that there is no Euler product over primes $p\leqslant y$ , and hence no $(\log y)^{\unicode[STIX]{x1D705}}$ term.
We can obtain analogues of our previous results for multiplicative functions supported on large primes, by following the earlier proofs, using ${\mathcal{S}}(s)=1$ and replacing applications of Proposition 2.1 with Proposition 7.1 above. Thus if $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ , with $f(n)=0$ if $n$ has a prime factor ${\leqslant}y$ , where $(\log x)^{2(\unicode[STIX]{x1D705}+2)}\leqslant y\leqslant \sqrt{x}$ , then the argument of Halász’s Theorem 1.1 with $T=\log x\cdot (\log x/\text{log}\,y)^{\unicode[STIX]{x1D705}}$ yields
We may now follow the proof of Corollary 1.2 using the bound (for $0<\unicode[STIX]{x1D70E}\leqslant 2/\text{log}\,y$ )
in place of the bound $|F(1+\unicode[STIX]{x1D70E}+it)/(1+\unicode[STIX]{x1D70E}+it)|\leqslant \unicode[STIX]{x1D701}(1+\unicode[STIX]{x1D70E})^{\unicode[STIX]{x1D705}}\ll 1/\unicode[STIX]{x1D70E}^{\unicode[STIX]{x1D705}}$ . Thus we obtain that
where
Next we turn to the analogue of Theorem 1.3 in this setting. We obtain, assuming that $x^{2\unicode[STIX]{x1D6FF}}(\log x)^{2(\unicode[STIX]{x1D705}+2)}\leqslant y\leqslant \sqrt{x}$ and $0<\unicode[STIX]{x1D6FF}<\frac{1}{4}$ , and taking $T=x^{\unicode[STIX]{x1D6FF}}\log ^{2}x\cdot (\log x/\text{log}\,y)^{\unicode[STIX]{x1D705}}$ ,
where
Similarly, we may formulate an analogue for arithmetic progressions. Thus, if $(a,q)=1$ and $q^{4}(\log x)^{2(\unicode[STIX]{x1D705}+2)}\leqslant y\leqslant \sqrt{x}$ , then (for $f\in {\mathcal{C}}(\unicode[STIX]{x1D705})$ with $f(p^{k})=0$ for $p\leqslant y$ )
where
For our work on primes in arithmetic progressions what will be useful are analogues of Theorem 1.8. Using Proposition 5.2, we find that if $q\log x\leqslant y\leqslant \sqrt{x}$ then for any natural number $J$ there are at most $J$ characters $\unicode[STIX]{x1D712}~(\text{mod}~q)$ with
If ${\mathcal{X}}_{J}$ denotes the set of at most $J$ characters for which the above holds, then for all $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}$ by (7.3) we have
Moreover, arguing as in § 6, we find that
7.2 Primes in short intervals: proof of Corollary 1.10
Let $\unicode[STIX]{x1D6FF}>0$ be small, and set
Let $1_{y}$ be the indicator function of those numbers with all prime factors ${>}y$ , and $\unicode[STIX]{x1D707}_{y}$ be the Möbius function restricted to numbers with all prime factors ${>}y$ . Our interest is in counting primes in the interval $[x,x+x^{1-\unicode[STIX]{x1D6FF}}]$ and for $n$ in this interval we may write $\unicode[STIX]{x1D6EC}_{1_{y}}(n)$ , which is usually $\unicode[STIX]{x1D6EC}(n)$ , as
where $K$ and $L$ are parameters to be chosen later with $KL=x+x^{1-\unicode[STIX]{x1D6FF}}$ , $y^{2}\leqslant K,L\leqslant x/y^{2}$ and $L\leqslant \sqrt{x}$ . Thus
We begin by analyzing the first term in (7.8). Temporarily setting $u=(\log L)/(2\log y)$ , a weak form of the fundamental lemma from sieve theory (see for example [Reference Friedlander and IwaniecFI10, Corollary 6.10]) gives
and so
Thus the first term in (7.8) is
Using the sieve again, the above equals
Setting this aside for the moment, we turn to the second term in (7.8). Applying (7.4) to $f=\unicode[STIX]{x1D707}_{y}$ (and with $\unicode[STIX]{x1D705}=1$ ) we obtain
where
To estimate the above quantity, we invoke the following proposition, whose proof is postponed for a few paragraphs.
Proposition 7.2. Let $x\geqslant y\geqslant 2$ , and suppose $t\in \mathbb{R}$ . Then
In our application of Proposition 7.2 we have $|t|\leqslant x^{\unicode[STIX]{x1D6FF}}(\log (x/\ell )/\text{log}\,y)\leqslant x^{\unicode[STIX]{x1D6FF}}\log x\leqslant y^{1/2}$ so that $\log (y+|t|)\asymp \log y$ . Therefore Proposition 7.2 implies that $e^{-M_{\unicode[STIX]{x1D6FF},\ell }}\ll ((\log y)/\text{log}(x/\ell ))^{1/4}$ , so that
Performing now the sum over $\ell$ , and using the sieve once again, the second term in (7.8) is
We now choose $u=3\log (\log x/\text{log}\,y)$ so that $e^{-u}=(\log y/\text{log}\,x)^{3}$ , and $L=y^{2u}\leqslant \sqrt{x}$ . Then, combining (7.9) and (7.10), we conclude that
It remains to simplify the main term in (7.11). We may finesse this issue as follows. Summing up (7.11) over all intervals of length $x^{1-\unicode[STIX]{x1D6FF}}$ between $x$ and $2x$ we find
The prime number theorem (which, incidentally, may be deduced from Halász’s theorem) now allows us to evaluate the right-hand side above. We conclude that
which establishes Corollary 1.10. It remains lastly to prove Proposition 7.2.
Proof of Proposition 7.2.
Set $c_{0}=1+1/\text{log}\,x$ . If $|t|\leqslant 1/\text{log}\,y$ then the product in the proposition may be estimated trivially as
We may therefore assume that $|t|\geqslant 1/\text{log}\,y$ . The elementary inequality $3+4\cos \unicode[STIX]{x1D703}+\cos 2\unicode[STIX]{x1D703}\geqslant 0$ yields (as in usual proofs of the zero-free region, see e.g. [Reference DavenportDav00, ch. 13])
Now [Reference KoukoulopoulosKou13, Lemma 2.4] implies thatFootnote 2 for all $z\geqslant y\geqslant 2$
Partial summation gives
and we now input the asymptotic formula above. This leads to the main term
as $|t|\geqslant 1/\text{log}\,y$ . Moreover the error term contributes (after integrating by parts)
which completes the proof.
Alternatively, we may upper bound $\prod _{p>y}|1-1/p^{c_{0}+2it}|^{-1}$ by using the monotonicity principle described in Lemma 5.1. If $|t|\leqslant 1/\text{log}\,y$ , or if $y+|t|\geqslant x$ then Proposition 7.2 is immediate, so we may assume that $|t|\geqslant 1/\text{log}\,y$ and $y+|t|\leqslant x$ . Next observe that
Next in the range $|t|\geqslant 1/\text{log}\,y$ , using Stirling’s formula we obtain
where $\unicode[STIX]{x1D709}(s)=s(s-1)\unicode[STIX]{x1D70B}^{-s/2}\unicode[STIX]{x1D6E4}(s/2)\unicode[STIX]{x1D701}(s)$ denotes Riemann’s $\unicode[STIX]{x1D709}$ -function, and the final inequality holds because $y+|t|\leqslant x$ and $|\unicode[STIX]{x1D709}(\unicode[STIX]{x1D70E}+2it)|$ is monotone increasing for $\unicode[STIX]{x1D70E}>1$ .◻
7.3 Primes in progressions: Theorem 1.11 and Corollaries 1.12 and 1.13
Set $y=q^{4}(\log x)^{6}$ , and let $1_{y}$ and $\unicode[STIX]{x1D707}_{y}$ be as in the previous section. Recall that $Q=q\log x$ , so that $\log y\asymp \log Q$ . Let ${\mathcal{X}}_{J}$ be the set of non-principal characters $\unicode[STIX]{x1D712}~(\text{mod}~q)$ for which
By Proposition 5.2 applied to the function $\unicode[STIX]{x1D707}_{y}$ , we know that ${\mathcal{X}}_{J}$ contains no more than $J$ elements. Let $\widetilde{{\mathcal{X}}}_{J}$ denote the set ${\mathcal{X}}_{J}$ extended to include the principal character.
As before, let $K$ and $L$ be parameters with $y^{2}\leqslant K,L\leqslant x/y^{2}$ , and with $KL=x$ and $L\leqslant \sqrt{x}$ . Using the decomposition of $\unicode[STIX]{x1D6EC}_{1_{y}}(n)$ , as in (7.8), we obtain that
equals
We now follow the argument in the previous section to handle the two sums above, starting with the first sum. Set $u=(\log L)/(2\log y)$ , as in the previous section. Using partial summation and the fundamental lemma from sieve theory [Reference Friedlander and IwaniecFI10, Corollary 6.10] we obtain (for any reduced residue class $b~(\text{mod}~q)$ )
If $\unicode[STIX]{x1D712}\in {\mathcal{X}}_{J}$ is a non-principal character, then using (7.14)
For the principal character, we obtain similarly that
Putting these results together for the inner sums over $\ell$ in the first term of (7.13), we may bound that term by
Now we turn to the second term in (7.13). By (7.7) (with $x$ there replaced by $x/\ell$ and then $K$ here), we obtain
Therefore the second term in (7.13) may be bounded by
upon using the sieve once again. Recall that $L=y^{2u}$ and choose once again $u=3\log (\log x/\text{log}\,y)$ , (so that in particular $L\leqslant \sqrt{x}$ , provided $\log x/\text{log}\,y$ is large). Then combining the above with (7.13) and (7.15) we conclude that
This establishes (1.5) of Theorem 1.11, using the prime number theorem to account for the contribution of the principal character.
Now if $\unicode[STIX]{x1D713}$ is a non-principal character mod $q$ , with $\unicode[STIX]{x1D713}\not \in {\mathcal{X}}_{J}$ then
by the last displayed equation. This implies (1.4), completing the proof of Theorem 1.11. Alternatively, arguing as in our proof of (1.5) (with the same parameters, but replacing the use of (7.7) by (7.3) applied to $f=\unicode[STIX]{x1D707}_{y}\unicode[STIX]{x1D712}$ ), we obtain that for any non-principal character $\unicode[STIX]{x1D712}~(\text{mod}~q)$
This again proves (1.4), and moreover (7.16) will be used in the proof of Corollary 1.12. ◻
Proof of Corollary 1.13.
Corollary 1.13 follows from Theorem 1.11 in much the same way that Theorem 1.9 was deduced from Theorem 1.8, in § 6.◻
Proof of Corollary 1.12.
Apply our work above with $J=1$ . If ${\mathcal{X}}_{1}$ is empty, then there is no exceptional character and a stronger form of (1.6) follows from Theorem 1.11. If ${\mathcal{X}}_{1}$ is non-empty, then the unique character that it contains must be real (else its conjugate would also belong to the set). Now (1.8) follows at once from Theorem 1.11. Using (7.16) and (1.8) we see that if (1.6) fails, then we must have
If the maximum above occurs at a point $t_{0}$ , then taking logarithms we have
It remains to deduce (1.7) from (7.17). From (7.17) it follows that
Using the prime number theorem and partial summation (as in our proof of Lemma 4.1, and recalling that $\int _{0}^{1}|\text{cos}(2\unicode[STIX]{x1D70B}u)|\,du=2/\unicode[STIX]{x1D70B}$ ) we find that for $|t_{0}|\leqslant (\log x)/\text{log}\,y$ ,
Comparing the above two estimates, provided $\log x/\text{log}\,y$ is large enough we obtain (since $\unicode[STIX]{x1D70B}/(\unicode[STIX]{x1D70B}-2)<3$ )
Hence
Bounding the first sum using (7.17) and the second sum by
we deduce (1.7).◻
8 Linnik’s theorem: proof of Corollary 1.14
Let $L$ be a sufficiently large fixed constant; what ‘sufficiently large’ means will evolve from satisfying certain inequalities in the proof. Suppose that $(a,q)=1$ and every prime $p\equiv a~(\text{mod}~q)$ is ${>}q^{L}$ . We appeal to Corollary 1.12 with $x=q^{L}$ , where $L$ is chosen large enough so that the error terms in Corollary 1.12 are all much smaller than the main terms.
Clearly (1.6) does not hold, and therefore we must have an exceptional real character $\unicode[STIX]{x1D712}~(\text{mod}~q)$ for which (1.7) and (1.8) hold. In particular, (1.7) gives
Now put $L_{0}=\exp (\sqrt{\log L})$ , and apply Theorem 1.11 with $J=1$ for $x$ in the range $q^{L_{0}}\leqslant x\leqslant q^{L}$ . The set ${\mathcal{X}}_{1}$ consists precisely of one character, namely the exceptional character $\unicode[STIX]{x1D712}$ : indeed
Theorem 1.11, equation (1.5), now tells us that (since there are no primes $p\equiv a~(\text{mod}~q)$ below $q^{L}$ ) for all $q^{L_{0}}\leqslant x\leqslant q^{L}$ we have
Partial summation, using this relation, yields
so that
We proved above that the left-hand side is ${\leqslant}30\log \log L+O(1)$ , and so $\unicode[STIX]{x1D712}(a)$ must be $1$ , implying that
At this stage, we invoke a sieve estimate from Friedlander and Iwaniec [Reference Friedlander and IwaniecFI10] (see Proposition 24.1, and the preceding discussion). Put
This is a natural object in number theory: if $\unicode[STIX]{x1D712}$ is the quadratic character that corresponds to a fundamental discriminant $D$ , then $a(n)$ counts (after accounting for automorphisms) the number of representations of $n$ by binary quadratic forms of discriminant $D$ . Friedlander and Iwaniec consider the sequence $a(n)$ over $n\leqslant q^{L}$ and $n\equiv a~(\text{mod}~q)$ (with $\unicode[STIX]{x1D712}(a)=1$ ), and sieve this sequence to extract the primes. (Note that if $\unicode[STIX]{x1D712}(a)=1$ then for a prime $p\equiv a~(\text{mod}~q)$ one has $a(p)=1+\unicode[STIX]{x1D712}(p)=2$ .) They establish that if $x\geqslant q^{O(1)}$ , and $z\leqslant x^{1/8}$ then
The idea behind this result is that one can usually sieve (as in the fundamental lemma) up to $z$ being a small power of $x$ , and if there are few large primes $p$ for which $\unicode[STIX]{x1D712}(p)=1$ then one has a sifting problem of low dimension so that it becomes possible to estimate accurately the number of primes left after sieving. The estimate (8.2) is useful only if $\sum _{\substack{ z<p\leqslant x \\ \unicode[STIX]{x1D712}(p)=1}}1/p$ is very small, but thanks to (8.1) this is precisely the situation we are in!
We therefore apply (8.2) taking $x=q^{L}$ and $z=q^{L_{0}}$ , so that
If $L$ is suitably large, it then follows that
say. And we have $L(1,\unicode[STIX]{x1D712})\gg 1/\sqrt{q}$ (see e.g. [Reference DavenportDav00, ch. 6]), so we have shown that the least prime $p\equiv a~(\text{mod}~q)$ is below $q^{L}$ . Note that we do not need Siegel’s theorem here; the effective lower bound $L(1,\unicode[STIX]{x1D712})\gg 1/\sqrt{q}$ suffices.
9 Examples of asymptotics
We will prove (1.1) in a forthcoming paper by the authors on asymptotics in Halász’s theorem, and that will allow us to find asymptotics for the mean values of certain interesting $f$ . Here we discuss three important examples given in there.
Example 1. If $F(s)=\unicode[STIX]{x1D701}(s)^{\unicode[STIX]{x1D6FD}}$ so that $f(n)=d_{\unicode[STIX]{x1D6FD}}(n)$ , then
This is a well-known result of Selberg and Delange.
Example 2. Consider the multiplicative function given by $f(p)=g(\log p)$ , where $g(t)=$ sign $(\cos (2\unicode[STIX]{x1D70B}t))$ . Here $g(t)=1$ if $-\frac{1}{4}\leqslant t\leqslant \frac{1}{4}$ mod 1, and $g(t)=-1$ otherwise. The Fourier coefficients of $g$ are given by
and one can prove that $F(c_{0}+2i\unicode[STIX]{x1D70B}m)\sim c_{m}(\log x)^{\widehat{g}(m)}$ for certain non-zero constants $c_{m}$ , where $c_{-m}=\overline{c_{m}}$ . Thus $t_{1}$ can be taken to be either $2\unicode[STIX]{x1D70B}$ or $-2\unicode[STIX]{x1D70B}$ , and one can show that
for some constant $c\neq 0$ . Taking $y=e^{1/4}$ with, say $t_{1}=-2\unicode[STIX]{x1D70B}$ , we deduce that
Hence the Lipschitz exponent for $f$ is at most $1-2/\unicode[STIX]{x1D70B}$ .
For any given $y$ in the range $1+\unicode[STIX]{x1D716}\leqslant y\leqslant \sqrt{x}$ we can let $f(p)=g(T\log p)$ for a carefully selected real number $T=T(y)$ to obtain a lower bound on the difference that is of the same size, up to a constant, as the difference in (9.1):
This gives examples, when $\unicode[STIX]{x1D705}=1$ , showing that the exponent $1-2/\unicode[STIX]{x1D70B}$ in Theorem 1.5 cannot be made larger.
Example 3. Consider the multiplicative function given by $f(p)=g_{+}(\log p)$ , where $g_{+}(t)=(1+g(t))/2$ . Here $g_{+}(t)=1$ if $-\frac{1}{4}\leqslant t\leqslant \frac{1}{4}$ , and $g_{+}(t)=0$ otherwise, so that $f(n)\geqslant 0$ for all $n\geqslant 1$ . One can show that
and one can develop this example to show that, when $\unicode[STIX]{x1D705}=1$ , the exponent $1-1/\unicode[STIX]{x1D70B}$ for real, non-negative $f$ in Theorem 1.5 cannot be made larger.