1. Introduction
The study of quadratic forms over number fields is a rich and highly developed area of mathematics. Let K be a number field of degree $d\geqslant 2$ over $\mathbb {Q}$ , and let
be a nonsingular quadratic form, with symmetric coefficients $c_{i,j}\in \mathfrak {o}_K$ . For given $N\in \mathfrak {o}_K$ , it is very natural to ask about the solubility of
with $x_1,\dots ,x_n\in \mathfrak {o}_K$ . If $n\geqslant 4$ , a number field version of the Hardy–Littlewood circle method is capable of establishing the Hasse principle for these equations. When $n\geqslant 5$ , this follows from work of Skinner [Reference Skinner13], and for $n=4$ it is carried out by Helfrich in a 2015 PhD thesis [Reference Helfrich8].
In this paper, we shall introduce the notion of a generalised quadratic form over K and ask about the Hasse principle in this new setting. We shall always assume that $K/\mathbb {Q}$ is a Galois extension of degree d that is totally real. (Our methods can handle arbitrary number fields, but doing so causes extra notational complexity and gives no new insight into the arithmetic of generalised quadratic forms.) We may now make the following definition.
Definition 1.1. Let $n\geqslant 2$ . A generalised quadratic form is given by
for symmetric coefficients $c_{i,j,\tau ,\tau '}=c_{j,i,\tau ',\tau }\in \mathfrak {o}_K$ .
We will be interested in the set of $(x_1,\dots ,x_n)\in \mathfrak {o}_K^n$ for which
for given $N\in \mathfrak {o}_K$ , in which case $x_i^{\tau }$ should be interpreted as the conjugate of $x_i$ under $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . Definition 1.1 encompasses standard integral quadratic forms over $\mathfrak {o}_K$ and forms defined using norms and traces. For example, let $\operatorname {\mathrm {Tr}}_{K/\mathbb {Q},H}:K\to K$ be the partial trace, defined via $\operatorname {\mathrm {Tr}}_{K/\mathbb {Q},H}(u)=\sum _{\tau \in H} u^{\tau }$ for any subset $H\subset \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . Then, a natural generalisation of the question about representing elements of $\mathfrak {o}_K$ as a sum of squares is to ask about the existence of $\mathbf {x}\in \mathfrak {o}_K^n$ such that
for given $N\in \mathfrak {o}_K$ and a given subset $H\subset \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ .
The coefficients of a generalised quadratic form $F(X_1,\dots ,X_n)$ form a $dn\times dn$ matrix $\mathbf {M}=(c_{i,j,\tau ,\tau '})_{(i,\tau )\times (j,\tau ')}$ . In the generic setting, we might expect this matrix to have full rank, but there are many cases of interest where the rank is much smaller. For example, standard quadratic forms produce a coefficient matrix $\mathbf {M}$ , which after reordering rows and columns, contains a $n\times n$ block matrix in the upper left corner and has zeros everywhere else. Our methods break down in the completely generic situation, and so our interest in this paper lies at the opposite end of the spectrum, in which the rank of $\mathbf {M}$ is not much bigger than n.
Let $W:(K\otimes _{\mathbb {Q}} \mathbb {R})^n\to \mathbb {R}_{\geqslant 0}$ be a smooth weight function, whose precise construction is deferred until §4. Our main results will comprise of asymptotic formulae for sums of the shape
as $P\rightarrow \infty $ for given $N\in \mathfrak {o}_K$ and suitable generalised quadratic forms F. When $N=0$ , we shall simply write $N_{W}(F;P)=N_{W}(F,0;P)$ .
1.1. Homogeneous setting
Of particular interest is the case $N=0$ , which we now assume. For standard quadratic forms $Q\in \mathfrak {o}_K[X_1,\dots ,X_n]$ , studying nontrivial zeros of Q over $\mathfrak {o}_K$ is equivalent to studying K-rational points on the smooth quadric $X\subset \mathbb {P}_K^{n-1}$ cut out by $Q=0$ . This, in turn, can be accessed via the Weil restriction (or restriction of scalars). The Weil restriction $R_{K/\mathbb {Q}} X$ is an algebraic variety whose set of $\mathbb {Q}$ -points is canonically in bijection with the K-rational points of X. In the setting where $Q\in \mathfrak {o}_K[X_1,\dots ,X_n]$ is a nonsingular quadratic form, the Weil restriction $R_{K/\mathbb {Q}}X$ is a smooth complete intersection of d quadrics in $\mathbb {P}_{\mathbb {Q}}^{dn-1}$ , all of which are defined over $\mathbb {Q}$ . However, the set of complete intersections that arise in this way is a very limited subset of the family of all smooth codimension d complete intersections of quadrics over $\mathbb {Q}$ in $\mathbb {P}_{\mathbb {Q}}^{dn-1}$ . Our first result shows that, after Weil restriction, the space of generalised quadratic forms is rich enough to capture the arithmetic over $\mathbb {Q}$ of arbitrary codimension d complete intersections of quadrics in $\mathbb {P}_{\mathbb {Q}}^{M-1}$ , provided that $d\mid M$ .
Let $F(X_1,\dots ,X_n)$ be a generalised quadratic form, and let $\omega _1,\dots ,\omega _d $ be a $\mathbb {Z}$ -basis for $\mathfrak {o}_K$ . Any element $\mathbf {x}\in \mathfrak {o}_K^n$ can be written $\mathbf {x}=\omega _1\mathbf {u}_1+\cdots +\omega _d\mathbf {u}_d$ for $(\mathbf {u}_1,\dots ,\mathbf {u}_d)\in \mathbb {Z}^{dn}$ . Taking the Weil restriction corresponds to writing down a set of quadratic forms $Q_1,\dots ,Q_d\in \mathbb {Z}[\mathbf {U}_1,\dots ,\mathbf {U}_d]$ , in $dn$ variables such that
We henceforth call $\{Q_1,\dots ,Q_d\}$ the descended system. We shall prove the following result in §2.
Theorem 1.2. Let $K/\mathbb {Q}$ be a Galois extension of degree d. Then there is a bijection between the space of generalised quadratic forms in n variables over K and systems of d rational quadratic forms in $dn$ variables.
It is interesting to note that this theorem is valid for any fixed degree d Galois extension $K/\mathbb {Q}$ . It follows from the bijection in Theorem 1.2 that the question of $\mathfrak {o}_K$ -solubility for a generalised quadratic form is equivalent to the question of $\mathbb {Z}$ -solubility for the descended system. It presents an intriguing challenge to gain insight into smooth codimension d complete intersections of quadrics in $\mathbb {P}_{\mathbb {Q}}^{M-1}$ over $\mathbb {Q}$ by working with generalised quadratic forms.
It follows from work of Birch [Reference Birch1] that the usual Hardy–Littlewood asymptotic formula holds for systems of quadrics over $\mathbb {Q}$ , provided that $M>B+2d(d+1)$ , where B is the affine dimension of the ‘Birch singular locus’ of the descended system. (Note that one can take $B\leqslant d-1$ when the descended system is a smooth complete intersection.) Breakthrough work of Rydin Myerson [Reference Rydin Myerson11] handles smooth codimension d complete intersections of quadrics in $\mathbb {P}_{\mathbb {Q}}^{M-1}$ when $M\geqslant 9d$ . The latter result is particularly significant since it allows one to handle arbitrary generalised quadratic forms over K in $n\geqslant 9$ variables, provided that the descended system defines a smooth complete intersection of codimension d.
Our main results will concern a special class of generalised quadratic forms, in which only one nontrivial automorphism appears and in which the conjugated variables separate completely from the unconjugated variables. These examples are chosen to represent a first step on the way to a fuller understanding of generalised quadratic forms, and yet exhibit enough features that make them untreatable by other methods. In the light of Theorem 1.2, a complete understanding of generalised quadratic forms must lie rather deep.
Let $Q\in \mathfrak {o}_K[X_1,\dots ,X_n]$ and $R\in \mathfrak {o}_K[X_1,\dots ,X_m]$ be quadratic forms in n and m variables, respectively, for $1\leqslant m\leqslant n$ . The generalised quadratic forms we shall treat take the shape
for a fixed nontrivial automorphism $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . Let $\rho _1,\dots ,\rho _d$ be the d distinct embeddings of K into $\mathbb {R}$ , where we recall that K is totally real. For each $1\leqslant l\leqslant d$ , we define $l_\tau $ through the relation
Suppose that $\mathbf {A}$ is the $n\times n$ symmetric matrix defining Q and that $\mathbf {B}$ is the $n\times n$ symmetric matrix given by the condition that its upper left $m\times m$ submatrix defines R, with all other entries equal to $0$ . For any $1\leqslant l\leqslant d$ , we shall write $\mathbf {A}^{(l)}$ and $\mathbf {B}^{(l)}$ for the l-th embeddings of $\mathbf {A}$ and $\mathbf {B}$ , respectively. We make the following key hypotheses about $\mathbf {A}$ and $\mathbf {B}$ .
Assumption 1. Assume that the descended system
has codimension d in $\mathbb {P}^{dn-1}$ . Furthermore, assume that $\det \mathbf {A}\neq 0$ and that the upper left $m\times m$ submatrix of $\mathbf {B}$ is nonsingular.
Our first result deals with the special case $m=1$ .
Theorem 1.3. Let $K/\mathbb {Q}$ be a totally real Galois extension of degree $d\geqslant 2$ . Suppose that $m=1$ and that Assumption 1 holds. Assume that $\det (\mathbf {A}^{(l)}+t\mathbf {B}^{(l_\tau )})$ is a constant polynomial in t, for each $1\leqslant l\leqslant d$ , where $l_\tau $ is defined via Equation (1.4). Let $n\geqslant 6$ and assume that the descended system has nonsingular points everywhere locally. Then there exist constants $c>0$ and $\Delta>0$ such that
The implied constants in our work are always allowed to depend on K and F. The generalised quadratic form $ 2X_1X_2 + a (X_1^{\tau })^2 + \tilde {Q}(X_3,\ldots , X_n) $ meets the hypotheses of the theorem, for example, where $\tilde Q\in \mathfrak {o}_K[X_3,\ldots , X_n]$ is a nonsingular quadratic form and $a\in \mathfrak {o}_K$ is nonzero.
We are also able to prove an asymptotic formula for $N_{W}(F;P)$ for arbitrary $m\geqslant 1$ , provided we make additional assumptions about the matrices $\mathbf {A}$ and $\mathbf {B}$ .
Assumption 2. For all $1\leqslant l\leqslant d$ , assume that $ \operatorname {\mathrm {rank}} (\mathbf {A}^{(l)}+t\mathbf {B}^{(l_\tau )})\geqslant n-1$ , for all $t\in \mathbb {R}$ , where $l_\tau $ is defined via Equation (1.4).
Assumption 3. For all $1\leqslant l\leqslant d$ , assume that $\det (\mathbf {A}^{(l)}+t\mathbf {B}^{(l_\tau )})$ has degree at least $m-1$ , viewed as a polynomial in t.
When $m=1$ and $\det (\mathbf {A}^{(l)}+t\mathbf {B}^{(l_\tau )})$ has degree exactly $0$ in Assumption 3, we see that Assumption 2 is implied by Assumption 1 since then $\operatorname {\mathrm {rank}} (\mathbf {A}^{(l)}+t\mathbf {B}^{(l_\tau )})= \operatorname {\mathrm {rank}} (\mathbf {A}^{(l)})=n$ . For general $m\geqslant 1$ , Assumption 2 is similar to one that is commonly made in the study of pairs of quadratic forms. Indeed, suppose one is given two matrices $A,B\in M_{n\times n}(L)$ over an algebraically closed field L of characteristic not equal to $2$ , with associated quadratic forms $Q_A$ and $Q_B$ . It follows from Reid’s thesis [Reference Reid10, Prop. 2.1] that the rank of any element in the pencil $\lambda A+\mu B$ , with $(\lambda ,\mu )\neq (0,0)$ , is never smaller than $n-1$ , provided the intersection $Q_A=Q_B=0$ is nonsingular as a projective variety and of the expected dimension. In our situation, by contrast, we only look at the pencil $\mathbf {A}^{(l)}+t\mathbf {B}^{(l_\tau )}$ since the matrix $\mathbf {B}^{(l_\tau )}$ has rank m by construction. (We shall relate this situation to the properties of an appropriate singular locus in Lemma 5.1 below.)
We are now ready to reveal our main result in the homogeneous setting.
Theorem 1.4. Let $K/\mathbb {Q}$ be a totally real Galois extension of degree $d\geqslant 2$ . Suppose that Assumptions 1–3 hold and that $ n> 3m+4-4m/d. $ Assume that the descended system has nonsingular points everywhere locally. Then there exist constants $c>0$ and $\Delta>0$ such that
On taking $m=1$ , we note that this result subsumes Theorem 1.3 when $n\geqslant 7$ . If one makes further assumptions on Q, one can do even better. Suppose, for example, that the last $n-m$ variables split off from Q so that
for quadratic forms $Q_1$ and $Q_2$ over $\mathfrak {o}_K$ . Then it seems likely that a classical version of the circle method can be employed. On summing trivially over the first m-variables of the associated exponential sums, one would be left with handling an exponential sum in $n-m$ variables involving $Q_2.$ If $Q_2$ has rank at least $5$ , then Skinner’s treatment over number fields [Reference Skinner13] would yield the necessary saving. This ought to allow $n\geqslant m+5$ in the statement of Theorem 1.4 if $Q(0,\dots ,0,\ X_{m+1},\dots ,X_n)$ has rank at least $5$ .
1.2. Inhomogeneous setting
We now assume that $N\in \mathfrak {o}_K$ is nonzero. Then we may write $N=\omega _1 N_1+\cdots +\omega _dN_d$ , where $N_1,\dots ,N_d\in \mathbb {Z}$ are not all zero. We shall henceforth call $\{Q_1-N_1,\dots ,Q_d-N_d\}$ the shifted descended system, where $Q_1,\dots ,Q_d$ are obtained from F via Equation (1.2), continuing to call $\{Q_1,\dots ,Q_d\}$ the associated descended system.
Our next result demonstrates that sharper results are available if $N\neq 0$ and $Q, R$ are both diagonal. Suppose that
for $1\leqslant m \leqslant n$ and nonzero $a_1,\dots ,a_n,b_1,\dots ,b_m\in \mathfrak {o}_K$ , and where $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ is a fixed nontrivial automorphism. Taking $m=n$ and $a_i=b_i=1$ for $1\leqslant i\leqslant n$ , we are led to an instance of the partial trace problem in Equation (1.1) with $H=\{\mathrm {id}, \tau \}$ . We will prove the following result.
Theorem 1.5. Let $K/\mathbb {Q}$ be a totally real Galois extension of degree $d\geqslant 2$ . Assume that $N\in \mathfrak {o}_K$ is nonzero and that $ n\geqslant m+4. $ Suppose that the descended system has codimension d and a nonsingular real point and that the shifted descended system has nonsingular points over $\mathbb {Z}_p$ for every prime p. Then there exist constants $c>0$ and $\Delta>0$ such that
The implied constant in this result is allowed to depend on N, in addition to K and F. In order to illustrate our result, take the quadratic number field $K=\mathbb {Q}(\sqrt {2})$ in Equation (1.5) and assume that $a_1,\dots ,a_n,b_1,\dots ,b_m\in \mathbb {Z}$ are all nonzero. Then it follows from Theorem 1.5 that our work treats the shifted descended system
when $n\geqslant m+4$ and $N_1,N_2\in \mathbb {Z}$ are not both zero.
1.3. Some words on the proof
Let $F(X_1,\dots ,X_n)$ be a generalised quadratic form defined over $\mathfrak {o}_K$ , and let $N\in \mathfrak {o}_K$ . Our analysis of $N_{W}(F,N;P)$ relies on a Fourier-analytic interpretation of the indicator function
Browning and Vishe [Reference Browning and Vishe2, Thm 1.2] have extended to arbitrary number fields the smooth $\delta $ -function technology of Duke–Friedlander–Iwaniec [Reference Duke, Friedlander and Iwaniec4], as later refined by Heath-Brown [Reference Heath-Brown5]. This will underpin the work in this paper, affording us the opportunity to extract nontrivial savings, in the spirit of Kloosterman’s method, in the proof of Theorem 1.5. We will be led to an expression for $N_W(F,N;P)$ in Equation (4.6), involving an infinite sum over nonzero integral ideals $\mathfrak {b}$ . The next stage is to apply Poisson summation, but an obstacle arises from the fact that it is no longer possible to break into residue classes modulo $\mathfrak {b}$ for generalised quadratic forms F. Instead, we shall break into residue classes modulo a larger ideal ${{}^{{G}}\mathfrak {b}}$ , which is the least common multiple of the ideals $\mathfrak {b}^{\tau ^{-1}}$ , as $\tau $ ranges over the automorphisms that actually occur in F. Poisson summation then leads to the analysis of certain exponential sums $S_{\mathfrak {b}}(N;\mathbf {m})$ and oscillatory integrals $I_{\mathfrak {b}}(N;\mathbf {m})$ , which are indexed by $\mathfrak {b}\subset \mathfrak {o}_K$ and suitable vectors $\mathbf {m}\in K^n$ . While the treatment of $S_{\mathfrak {b}}(N;\mathbf {m})$ is relatively standard, the main challenge is to understand $I_{\mathfrak {b}}(N;\mathbf {m})$ . When F is a standard quadratic form, these integrals factorise into a product of d oscillatory integrals, one for each of the d real embeddings of K. This reduces the problem to looking at oscillatory integrals over $\mathbb {R}^n$ . For generic generalised quadratic forms, it seems very difficult to obtain the kind of cancellation one needs for the method to go through for the relevant oscillatory integrals over $\mathbb {R}^{dn}$ .
We now summarise the contents of the paper. In §2, we shall prove Theorem 1.2 by spelling out the connection between generalised quadratic forms over K and descended systems over $\mathbb {Q}$ . In §3, we collect together some useful facts from algebraic number theory. The rest of the paper will be concerned with estimating the size of the counting function $N_{W}(F,N;P)$ , as $P\to \infty $ . In order to facilitate future investigation, we shall present most of the arguments for arbitrary generalised quadratic forms in §4. Next, in §5 we shall specialise to the case (1.3) and $N=0$ , in order to deduce Theorems 1.3 and 1.4. Finally, §6 will deal with Theorem 1.5, which pertains to the diagonal generalised quadratic form (1.5) and $N\neq 0$ .
2. Generalised quadratic forms and the descended system
In this section, we shall prove Theorem 1.2, by making explicit the correspondence between generalised quadratic forms F and the descended system of d quadratic forms over ${\mathbb Q}$ in $dn$ variables. Let $K/\mathbb {Q}$ be a degree d Galois number field, which (in this section only) need not be totally real. Assume that we are given a set of coefficients $(c_{i,j,\tau ,\tau '})$ of a generalised quadratic form, with $c_{i,j,\tau ,\tau '}=c_{j,i,\tau ',\tau }$ for all $1\leqslant i,j\leqslant n$ and $\tau ,\tau '\in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . We can write each coefficient $c_{i,j,\tau ,\tau '}\in K$ with respect to the basis $\{{\omega }_1,\ldots , {\omega }_d\}$ as $ c_{i,j,\tau ,\tau '}=\sum _{k=1}^d c_{i,j,\tau ,\tau '}^{(k)}{\omega }_k. $ We proceed to compute the descended system explicitly by writing $ X_i=\sum _{k=1}^d U_{k,i}{\omega }_k, $ for $1\leqslant i\leqslant n$ . Then
Let $\{\rho _1,\ldots , \rho _d\}$ be a dual basis of $\{{\omega }_1,\ldots , {\omega }_d\}$ with respect to the trace so that $( \operatorname {\mathrm {Tr}}_{K/\mathbb {Q}}(\rho _i\omega _j))_{i,j}$ is the identity matrix and any $\alpha \in K$ can be written in the form $\alpha =\sum _{p=1}^d \operatorname {\mathrm {Tr}}_{K/\mathbb {Q}}(\alpha \rho _p) \omega _p$ . Thus, $F(X_1,\ldots , X_n)$ is equal to
and we arrive at our descended system (1.2), with
for rational coefficients
By construction, the coefficients $\beta _{p,l,i,m,j}$ satisfy $\beta _{p,l,i,m,j}=\beta _{p,m,j,l,i}$ , for all $1\leqslant p,l,m\leqslant d$ and $1\leqslant i,j\leqslant n$ . Moreover, they depend linearly on the given set of coefficients $(c_{i,j,\tau ,\tau '}^{(k)})$ . Now, the space of all tuples $(c_{i,j,\tau ,\tau '}^{(k)})$ of rational numbers satisfying the symmetry relation $c_{i,j,\tau ,\tau '}^{(k)}=c_{j,i,\tau ',\tau }^{(k)}$ can be parametrised by ${\mathbb Q}^{\frac {1}{2}dn(dn+1)d}$ . Similarly, the space of all symmetric rational tuples $(\beta _{p,l,i,m,j})$ is naturally parametrised by ${\mathbb Q}^{\frac {1}{2}dn(dn+1)d}$ . We define the map
We claim that this map is an injective linear map. This implies that there is a bijection between generalised quadratic forms in n variables and systems of d rational quadratic forms in $nd$ variables, as claimed in Theorem 1.2.
To check the claim, we assume that $\beta _{p,l,i,m,j}=0$ for all $1\leqslant p,l,m\leqslant d$ and $1\leqslant i,j\leqslant n$ . By the nondegeneracy of the trace as a bilinear form, we deduce that
Note that the matrix $({\omega }_l^{\tau })_{\substack {1\leqslant l\leqslant d\\ \tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})}}$ is of maximal rank, and hence we obtain
Applying the same argument again, we finally obtain
and hence $c_{i,j,\tau ,\tau '}^{(k)}=0$ for all $1\leqslant k\leqslant d$ .
3. Recap from algebraic number theory
In this section, we collect together some of the facts about algebraic number fields that are important in our work. As usual, $K/\mathbb {Q}$ is a totally real Galois extension of degree d. We shall henceforth write $\mathfrak {o}=\mathfrak {o}_K$ for its ring of integers. In §3.1 and §3.2, we recall some facts about ideals and discuss the construction of primitive characters modulo ideals, respectively. The need to deal with generalised quadratic forms naturally leads to two basic objects that can be associated to a given integral ideal $\mathfrak {b}$ in K, both of which depend on the particular generalised quadratic form we are working with and will be introduced in §3.3.
3.1. Properties of ideals
For any fractional ideal $\mathfrak {a}$ in K, one defines the dual ideal
In particular, $\hat {\mathfrak {a}}= \mathfrak {a}^{-1}\mathfrak {d}^{-1}$ , where $ \mathfrak {d} = \{\alpha \in K: \alpha \hat {\mathfrak {o}}\subseteq \mathfrak {o}\} $ denotes the different ideal of K and is itself an integral ideal. One notes that $\hat {\mathfrak {o}}=\mathfrak {d}^{-1}$ . Furthermore, we have $\hat {\mathfrak {a}}\subseteq \hat {\mathfrak {b}}$ if and only if $\mathfrak {b}\subseteq \mathfrak {a}$ . An additional integral ideal featuring in our work is the denominator ideal
associated to any $\gamma \in K$ . Recall that $\operatorname {\mathrm {N}} \mathfrak {a}=|\mathfrak {o}/\mathfrak {a}|$ is the ideal norm of any integral ideal $\mathfrak {a}$ . One important property of the ideal norm is that $\operatorname {\mathrm {N}}\mathfrak {a}^{\tau }=\operatorname {\mathrm {N}} \mathfrak {a}$ for any $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q}).$ (This follows from the isomorphism $\mathfrak {o}/\mathfrak {a}\to \mathfrak {o}/\mathfrak {a}^{\tau }$ given by $\alpha \mapsto \alpha ^{\tau }$ .) Furthermore, we have $\operatorname {\mathrm {N}}\mathfrak {a}\in \mathfrak {a}$ for any integral ideal $\mathfrak {a}$ .
We will write $(\mathfrak {a},\mathfrak {b})=\mathfrak {a}+\mathfrak {b}$ for the greatest common divisor of two integral ideals $\mathfrak {a},\mathfrak {b}\subset \mathfrak {o}$ . When these ideals are coprime, meaning that $\mathfrak {a}+\mathfrak {b}=\mathfrak {o}$ , we shall adopt the abuse of notation $(\mathfrak {a},\mathfrak {b})=1$ . We close this section by recording the following basic result.
Lemma 3.1. Let $\varepsilon>0$ , and let $\mathfrak {b},\mathfrak {c}$ be integral ideals. Then
-
(i) there exists $\alpha \in \mathfrak {b}$ such that $\mathrm {ord}_{\mathfrak {p}}(\alpha )=\mathrm {ord}_{\mathfrak {p}}(\mathfrak {b})$ for every prime ideal $\mathfrak {p}\mid \mathfrak {c}$ ;
-
(ii) there exists $\alpha \in \mathfrak {b}$ and an unramified prime ideal $\mathfrak {p}$ coprime to $\mathfrak {b}^{\tau }$ for all $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ , with $\operatorname {\mathrm {N}}\mathfrak {p} \ll (\operatorname {\mathrm {N}} \mathfrak {b})^{\varepsilon }$ , such that $(\alpha )=\mathfrak {b}\mathfrak {p}$ .
Proof. Part (i) is [Reference Browning and Vishe2, Lemma 2.2(i)] and part (ii) follows from an obvious modification to the proof of [Reference Browning and Vishe2, Lemma 2.2(ii)].
We shall also require a version of the Chinese remainder theorem, as in [Reference Skinner12, Lemma 3].
Lemma 3.2. Suppose that $\mathfrak {a},\mathfrak {a}_1,\mathfrak {a}_2$ are integral ideals such that $\mathfrak {a}=\mathfrak {a}_1\mathfrak {a}_2$ , with $\mathfrak {a}_1$ and $\mathfrak {a}_2$ coprime. Let $\alpha _1, \alpha _2\in \mathfrak {o}$ satisfy $\mathrm {ord}_{\mathfrak {p}}(\alpha _1)=\mathrm {ord}_{\mathfrak {p}}(\mathfrak {a}_1)$ and $\mathrm {ord}_{\mathfrak {p}}(\alpha _2)=\mathrm {ord}_{\mathfrak {p}}(\mathfrak {a}_2)$ , for all $\mathfrak {p}\mid \mathfrak {a}$ . Then
3.2. Construction of primitive characters
Let $\psi (\cdot )=\exp (2\pi i \operatorname {\mathrm {Tr}}_{K/\mathbb {Q}}(\cdot ))$ be a character on K. The following result gives a way to construct primitive characters $\mathfrak {o}/\mathfrak {b}\to \mathbb {C}$ .
Lemma 3.3. Let $\sigma _0(\cdot )=\psi (\gamma \cdot ) :K\to \mathbb {C}$ , for any $\gamma \in K$ , and let $\mathfrak {b}\subsetneq \mathfrak {o}$ be an integral ideal. Then $\sigma _0$ is a nontrivial primitive additive character modulo $\mathfrak {b}$ if and only if $\mathfrak {a}_\gamma =\mathfrak {b}\mathfrak {e}$ for some $\mathfrak {e}\mid \mathfrak {d}$ such that $(\mathfrak {d}/\mathfrak {e},\mathfrak {b})=1$ .
Proof. We begin by showing that $\sigma _0$ is an additive character modulo $\mathfrak {b}$ if and only if $\mathfrak {b}\mathfrak {d}\subset \mathfrak {a}_\gamma $ . But $\sigma _0$ is an additive character modulo $\mathfrak {b}$ if and only if $\sigma _0(x+z)=\sigma _0(x)$ for all $x\in \mathfrak {o}$ and $z\in \mathfrak {b}$ . But this happens if and only if $\gamma z \in \hat {\mathfrak {o}}$ for all $z\in \mathfrak {b}$ , which is if and only if $\mathfrak {b}\mathfrak {d}\subset \mathfrak {a}_\gamma $ . This establishes the claim.
Now, suppose that $\mathfrak {b}\mathfrak {d}\subset \mathfrak {a}_\gamma $ , which means that $\mathfrak {a}_\gamma \mid \mathfrak {b}\mathfrak {d}$ . Thus, there is an integral ideal $\mathfrak {h}$ such that $\mathfrak {b}\mathfrak {d}=\mathfrak {a}_\gamma \mathfrak {h} $ . We wish to show that $\sigma _0$ is primitive if and only if $\mathfrak {h}\mid \mathfrak {d}$ with $(\mathfrak {h},\mathfrak {b})=1$ . To do so, we note that $\sigma _0$ is primitive if and only if $\mathfrak {a}_\gamma \nmid \mathfrak {b}_1 \mathfrak {d}$ for all $\mathfrak {b}_1\mid \mathfrak {b}$ with $\mathfrak {b}_1\neq \mathfrak {b}$ . Indeed, if $\mathfrak {a}_\gamma \mid \mathfrak {b}_1 \mathfrak {d}$ for some proper divisor $\mathfrak {b}_1\mid \mathfrak {b}$ , then $\gamma z \in \hat {\mathfrak {o}}$ for every $z\in \mathfrak {b}_1$ , which would mean that $\sigma _0$ is a character modulo $\mathfrak {b}_1$ . Suppose that $\sigma _0$ is primitive, and suppose that there is a prime ideal $\mathfrak {p}\mid \mathfrak {h}$ such that $\mathfrak {p}\mid \mathfrak {b}$ . Writing $\mathfrak {h}'=\mathfrak {h}\mathfrak {p}^{-1}$ and $\mathfrak {b}'=\mathfrak {b}\mathfrak {p}^{-1}$ , it follows that $\mathfrak {b}'\mathfrak {d}=\mathfrak {a}_\gamma \mathfrak {h}' $ , whence $\mathfrak {a}_\gamma \mid \mathfrak {b}'\mathfrak {d}$ , which is a contradiction. Thus, $\mathfrak {h}$ is coprime to $\mathfrak {b}$ and we must have $\mathfrak {h}\mid \mathfrak {d}$ . Suppose now that $\mathfrak {b}\mathfrak {d}=\mathfrak {a}_\gamma \mathfrak {h}$ for some $\mathfrak {h}\mid \mathfrak {d}$ such that $(\mathfrak {h},\mathfrak {b})=1$ . We wish to deduce that $\sigma _0$ is primitive, for which we suppose for a contradiction that there exists a proper divisor $\mathfrak {b}_1\mid \mathfrak {b}$ such that $\mathfrak {a}_\gamma \mid \mathfrak {b}_1 \mathfrak {d}$ . Writing $\mathfrak {b}=\mathfrak {b}_1\mathfrak {b}_2$ and recalling that $\mathfrak {b}\mathfrak {d}=\mathfrak {a}_\gamma \mathfrak {h}$ , we deduce that $\mathfrak {h} = (\mathfrak {a}_\gamma ^{-1}\mathfrak {b}_1 \mathfrak {d})\mathfrak {b}_2$ , whence $\mathfrak {b}_2\mid \mathfrak {h}$ , which is impossible since $(\mathfrak {h},\mathfrak {b})=1$ .
Finally we note that $\sigma _0$ is a trivial character if and only if $\gamma \in \hat {\mathfrak {o}}$ , which is equivalent to $\mathfrak {a}_\gamma \supseteq \mathfrak {d}$ . This is clearly impossible for any primitive character $\sigma _0$ modulo a proper ideal $\mathfrak {b}\subsetneq \mathfrak {o}$ since $(\mathfrak {d}/\mathfrak {e},\mathfrak {b})=1$ if $\mathfrak {a}_\gamma =\mathfrak {b}\mathfrak {e}$ .
We proceed to define a particularly convenient additive character modulo $\mathfrak {b}$ . Associated to any nonzero integral ideal $\mathfrak {b}$ is the subset $\mathfrak {F}(\mathfrak {b})\subset K$ given by
Note that condition (i) implies that $\alpha \in \mathfrak {b}\mathfrak {d}$ and condition (ii) implies that $\mathfrak {p}_1\nmid \mathfrak {b}^{\tau }\mathfrak {d}$ for any $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . We may now record a variant of [Reference Browning and Vishe2, Lemma 2.3], which shows that $\mathfrak {F}(\mathfrak {b})\neq \emptyset $ for any choice of $\mathfrak {b}$ .
Lemma 3.4. Let $\mathfrak {b}\subsetneq \mathfrak {o}$ be a nonzero ideal. Then there exists $\gamma \in \mathfrak {F}(\mathfrak {b})$ such that $\psi (\gamma \cdot )$ defines a nontrivial primitive additive character modulo $\mathfrak {b}$ .
Proof. We consider the integral ideal $\mathfrak {c}=\mathfrak {b}\mathfrak {d}$ . Observe that $\mathfrak {d}^{\tau }=\mathfrak {d}$ for all $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ since $\mathfrak {d}=\hat {\mathfrak {o}}^{-1}$ and the trace is invariant under the action of the Galois group. Taking $\varepsilon =1$ in Lemma 3.1(ii), we can find $\alpha \in \mathfrak {c}$ and a prime ideal $\mathfrak {p}_1$ coprime to $\mathfrak {c}^{\tau }=\mathfrak {b}^{\tau }\mathfrak {d}$ , for every $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ , with $\operatorname {\mathrm {N}}\mathfrak {p}_1 \ll \operatorname {\mathrm {N}} \mathfrak {b}$ and such that $(\alpha )=\mathfrak {c}\mathfrak {p}_1$ . It follows from Lemma 3.1(i) that there exists $\nu \in \mathfrak {p}_1$ such that $((\nu ), \mathfrak {c}^{\tau })=1$ for any $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . But this implies that $g=N_{K/\mathbb {Q}}(\nu )$ is coprime to $\mathfrak {c}^{\tau }$ , for any $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ , with $g\in \mathfrak {p}_1$ . We will show that $\mathfrak {c}=\mathfrak {a}_\gamma $ with $\gamma =g/\alpha $ , after which an application of Lemma 3.3 with $\mathfrak {e}=\mathfrak {d}$ will complete the proof. To check the claim we note that
since $\mathfrak {p}_1\mid (g)$ and $\mathfrak {b}\mathfrak {d}$ is coprime with $(g)$ .
3.3. The G-invariant ideal and an important $\mathbb {Z}$ -module
Let F be a generalised quadratic form, as in Definition 1.1. Let $G=G_F\subset \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ be the subset of automorphisms $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ that actually appear in F. Note that $G=\{\mathrm {id}\}$ if and only if F is a standard quadratic form. For any integral ideal $\mathfrak {b}$ , we define the G-invariant ideal to be
This is the least common multiple of the ideals $\mathfrak {b}^{\tau ^{-1}}$ for $\tau \in G$ .
Next, associated to our generalised quadratic form F is a generalised bilinear form
This defines a map $K^n\times K^n\to K$ , with
for any vectors $\mathbf {x},\mathbf {y},\mathbf {u},\mathbf {v}\in K^n$ . (However, this fails to be a bilinear form on $K^n$ since $B(\lambda \mathbf {x};\mathbf {y})$ , $B(\mathbf {x};\lambda \mathbf {y})$ and $\lambda B( \mathbf {x};\mathbf {y})$ needn’t be equal for $\lambda \in K$ .)
For any ideal $\mathfrak {b}\subset \mathfrak {o}$ , let
This is an additive group, and it is clear that ${{}^{{G}}\mathfrak {b}}^n\subset {\mathcal H}_{\mathfrak {b}} \subset {\mathfrak o}^n$ , where ${{}^{{G}}\mathfrak {b}}$ is the G-invariant ideal defined in Equation (3.1). By testing the hypothesis with ${\mathbf a} \equiv \mathbf {0} \mbox { mod } {{}^{{G}}\mathfrak {b}}$ , we have $F({\mathbf h}) \equiv 0 \mbox { mod } {\mathfrak {b}}$ for any ${\mathbf h}\in {\mathcal H}_{\mathfrak {b}}$ . Hence,
We claim that ${\mathcal H}_{\mathfrak {b}}$ has the structure of a finitely generated ${\mathbb Z}$ -module. To see this, let ${\mathbf e}_i$ be the ith unit vector, for $1\leqslant i\leqslant n$ . Observe that $(\operatorname {\mathrm {N}} {\mathfrak b}) {\omega }_j{\mathbf e}_i\in {\mathcal H}_{\mathfrak b}$ for all $1\leqslant i\leqslant n$ and $1\leqslant j\leqslant d$ . Hence, the image of ${\mathcal H}_{\mathfrak b}$ under the isomorphism ${\mathfrak o}^n \cong {\mathbb Z}^{nd}$ is a lattice of full rank and, thus, finitely generated as a ${\mathbb Z}$ -module.
The set ${\mathcal H}_{\mathfrak {b}}$ will emerge naturally in our analysis of certain key exponential sums, and it will be important to have an estimate for its index in $\mathfrak {o}^n$ . In the special case (1.3), it will be easier to calculate ${\mathcal H}_{\mathfrak b}$ directly, but for now we content ourselves with proving a general bound. In the following lemma, we consider the coefficient matrix $(c_{i,j,\tau ,\tau '})_{(i,\tau )\times (j,\tau ')}$ of a generalised quadratic form as a $nd\times nd$ matrix.
Lemma 3.5. There is a constant $C_1>0$ , depending only on F, such that for all $\mathfrak {b}$ we have
Moreover, there is an integral ideal $\mathfrak {d}_1$ such that one can take $C_1=1$ for all ideals $\mathfrak {b}$ with $({\mathfrak b},\mathfrak {d}_1)=1$ .
Proof. Let ${\Delta }= \operatorname {\mathrm {rank}} (c_{i,j,\tau ,\tau '})_{(i,\tau )\times (j,\tau ')}$ . Let ${\mathcal S}\subset \{1,\ldots , n\}\times \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ be a subset of indices such that the vectors $(c_{i,j,\tau ,\tau '})_{(j,\tau ')}$ , $(i,\tau )\in {\mathcal S}$ , are linearly independent and $|{\mathcal S}|$ is maximal. Then, for any $(k,\sigma )\in \{1,\ldots , n\}\times \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ there are numbers $a_{i,\tau }^{(k,\sigma )}$ (for $(i,\tau )\in {\mathcal S}$ ) such that
for all $1\leqslant j\leqslant n$ and $\tau '\in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . Let ${\alpha }\in {\mathfrak o}$ such that ${\alpha } a_{i,\tau }^{(k,\sigma )}\in {\mathfrak o}$ for all $(i,\tau )\in {\mathcal S}$ and $(k,\sigma )\in \{1,\ldots , n\}\times \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ . Now, set
Observe that ${\mathcal H}_{\mathfrak b}^{\prime }\subset {\mathcal H}_{\mathfrak b}$ . Moreover, if ${\mathfrak b}$ and $({\alpha })$ are coprime, then the ideal $({\alpha })$ may be omitted in the definition of ${\mathcal H}_{\mathfrak b}^{\prime }$ .
Finally, we observe that there is an injection
Hence, $|{\mathfrak o}^n/{\mathcal H}_{\mathfrak b}^{\prime }|\leqslant (\operatorname {\mathrm {N}} ({\alpha }{\mathfrak b}))^{{\Delta }},$ which suffices since ${\mathcal H}_{\mathfrak b}^{\prime }\subset {\mathcal H}_{\mathfrak b}$ .
We now wish to provide an alternative upper bound involving ${\mathcal H}_{\mathfrak b}$ under a suitable assumption on the generalised quadratic form.
Definition 3.6. We say that $F(X_1,\dots ,X_n)$ is admissible if there exist vectors
such that $B({\mathbf v}_i;{\mathbf h})=0$ for all $1\leqslant i\leqslant n$ if and only if ${\mathbf h}=\mathbf {0}$ .
In this language, a standard quadratic form is admissible if and only if it is nonsingular. We may now prove the following result.
Lemma 3.7. Assume that F is admissible. Then there exists a constant $C_2>0$ , depending only on F, such that
Moreover, there exists an integral ideal $\mathfrak {d}_2$ such that one can take $C_2=1$ for all ideals ${\mathfrak b}$ with $({\mathfrak b},\mathfrak {d}_2)=1$ .
We can use this result to get information about the index of ${\mathcal H}_{\mathfrak b}$ in $\mathfrak {o}^n$ via the identity
Lemma 3.7 is of the expected magnitude, which we can see by considering the case of the standard diagonal quadratic form $F(\boldsymbol {X})=\sum _{i=1}^n c_iX_i^2$ , for example, with nonzero $c_1,\dots ,c_n\in \mathfrak {o}$ . In this case, $G=\{\mathrm {id}\}$ and ${{}^{{G}}\mathfrak {b}}=\mathfrak {b}$ . It therefore follows that $|{\mathcal H}_{\mathfrak {b}}/{{}^{{G}}\mathfrak {b}}^n|\ll 1$ since ${\mathcal H}_{\mathfrak {b}}=(2c_1)^{-1}\mathfrak {b}\times \cdots \times (2c_n)^{-1}\mathfrak {b}$ .
Proof of Lemma 3.7.
Let ${\mathbf v}_1,\dots , {\mathbf v}_n$ be a set of vectors as in Definition 3.6. By scaling these vectors with a rational integer, we may assume that ${\mathbf v}_i\in {\mathfrak o}^n$ for all $1\leqslant i\leqslant n$ . We define the auxiliary set
and observe that ${\mathcal H}_{\mathfrak b}\subset \widetilde {{\mathcal H}}_{\mathfrak b}$ . Next, consider the map
which is injective by the definition of admissibility in Definition 3.6. Let ${\Gamma }$ be the image of ${\mathfrak o}^n$ under the map $\varphi $ . Then $\varphi $ induces an isomorphism
Note that ${\Gamma }$ only depends on $B(\boldsymbol {X};\boldsymbol {Y})$ and vectors ${\mathbf v}_1,\dots ,{\mathbf v}_n$ and hence can be taken to be independent of the ideal ${\mathfrak b}$ . As in Equation (3.4), we therefore obtain
where $C_{\Gamma }$ is a constant only depending on ${\Gamma }$ . Moreover, there is an ideal $\mathfrak {d}_2$ such that $|{\Gamma }/({\mathfrak b}^n\cap {\Gamma })|=(\operatorname {\mathrm {N}}{\mathfrak b})^n$ whenever $(\mathfrak {d}_2,{\mathfrak b})=1$ . This completes the proof of the lemma.
4. Enter the circle method
Our primary tool in this paper is a number field version of the Hardy–Littlewood circle method to interpret the function $\delta _K$ in Equation (1.6). Let K be a totally real Galois extension of $\mathbb {Q}$ of degree d. Let $Q\geqslant 1$ , and let $\alpha \in \mathfrak {o}$ . Then we shall use the version worked out by Browning and Vishe [Reference Browning and Vishe2, Thm. 1.2]. This states that there exists a positive constant $c_Q=1+O_A(Q^{-A})$ , for any $A>0$ and an infinitely differentiable function $h: (0,\infty )\times \mathbb R\rightarrow \mathbb {R}$ such that
where $\operatorname {\mathrm {N}}\mathfrak {b}=|\mathfrak {o}/\mathfrak {b}|$ denotes the norm of the ideal $\mathfrak {b}$ and the notation $\sum ^{*}_{\sigma {\,(\operatorname {\mathrm {mod}}{{\mathfrak {b}}})}}$ means that the sum is taken over primitive additive characters modulo $\mathfrak {b}$ . Furthermore, we have $h(x,y)\ll x^{-1}$ and $h(x,y)\neq 0$ only if $x\leqslant \max \{1,2|y|\} $ .
We fix some notation before proceeding further. Let $D_K$ be the discriminant of K, and note that $D_K>0$ since K is totally real. Let $\rho _{1},\dots ,\rho _{d}:K \hookrightarrow \mathbb {R}$ be the distinct real embeddings of K, and let $V= K\otimes _{\mathbb {Q}} \mathbb {R} \cong \mathbb {R}^d $ . There is a canonical embedding $K\hookrightarrow V$ given by $\alpha \mapsto (\rho _{1}(\alpha ), \dots , \rho _{d}(\alpha ))$ . We identify K with its image in V. If $v=(v_1,\dots ,v_d)\in V$ , then we extend the norm and trace on K to get functions $\operatorname {\mathrm {Nm}}(v):V\to \mathbb {R}$ and $\operatorname {\mathrm {Tr}}(v):V\to \mathbb {R}$ , with
We extend the absolute value on $\mathbb {R}$ to give a norm on V via $| v | = \max _{1\leqslant l\leqslant d}|v_l|$ , which we extend to $V^n$ in the obvious way.
Let $N\in \mathfrak {o}$ and let $F(X_1,\dots ,X_n)$ be a generalised quadratic form defined over $\mathfrak {o}$ . Our central concern is with the asymptotic behaviour of the sum
as $P\rightarrow \infty $ , for $W\in \mathcal {W}_n^+(V)$ , where $\mathcal {W}_n^+(V)$ is the class of smooth weight functions described in [Reference Browning and Vishe2, §2.2]. Our goal in this section is to lay some groundwork that will be useful for Theorems 1.3–1.5 but which applies to arbitrary generalised quadratic forms.
First, in §4.1 we shall discuss the link between the descended system associated to F and the ‘embedded system’ that arises from looking at all of the different real embeddings of F. In §4.2, we shall construct the weight function W that features in our counting function $ N_{W}(F,N;P)$ . In §4.3, we shall combine Equation (4.1) with Poisson summation in order to arrive at a preliminary expression for $ N_{W}(F,N;P) $ in Lemma 4.1. In §4.4, we make some preliminary investigations into exponential sums and similarly for exponential integrals in §4.5. In §4.6, we shall discuss the main term that comes from the trivial character after Poisson summation is applied. Finally, in §4.7 we shall make some initial observations concerning the contribution from the nontrivial characters.
4.1. The embedded system
Let $F(X_1,\dots ,X_n)$ be a generalised quadratic form, and let $\{\omega _1,\dots ,\omega _d\}$ be a $\mathbb {Z}$ -basis for $\mathfrak {o}$ . We have seen in Equation (1.2) how there is a descended system $\{Q_1,\dots ,Q_d\}$ of quadratic forms, that is associated to F via
with variables $\mathbf {U}_l=(U_{l1},\dots ,U_{ln})$ for $1\leqslant l\leqslant d$ .
We will need to be able to relate the descended system to the embedded system, which amounts to how $F(\mathbf {x})$ embeds in V for given $\mathbf {x}\in K^n$ . We extend $F:K^n\to K$ to get a map $V^n\to V$ , through the identification of K with V. Associated to $\mathbf {x}$ is the vector $ (\mathbf {x}^{(1)}, \dots , \mathbf {x}^{(d)}), $ with $\mathbf {x}^{(l)}\in \mathbb {R}^n$ for $1\leqslant l\leqslant d$ . Let $l\in \{1,\dots ,d\}$ . To any $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ may be associated a unique integer $l_\tau \in \{1,\dots ,d\}$ such that Equation (1.4) holds. Then we define
where $c_{i,j,\tau ,\tau }^{\prime (l)}=\rho _l(c_{i,j,\tau ,\tau '})\in \mathbb {R}$ . With this notation, we have
Thus, $\rho _l(F(\mathbf {x}))$ is a real quadratic form in the $dn$ variables $\mathbf {x}^{(1)},\dots ,\mathbf {x}^{(d)}$ . We call $\{F^{(1)},\dots ,F^{(d)}\}$ the embedded system. In particular, it is clear that $N_{K/\mathbb {Q}}(F(\mathbf {x}))=\operatorname {\mathrm {Nm}}(F(\mathbf {x}))$ and
for any $v=(v_1,\dots ,v_d)\in V$ and $\mathbf {x}\in V^n$ , identities that we shall often make use of in our analysis of the exponential integrals in §4.5.
Note that if F is a standard quadratic form, then $\rho _l(F(\mathbf {x}))=F^{(l)}(\mathbf {x}^{(l)})$ for $1\leqslant l\leqslant d$ . One positive effect of this is that the relevant oscillatory integrals factorise into a product of d integrals, one for each embedding. The situation is much more complicated for generalised quadratic forms since there is usually no such factorisation.
Let $\mathbf {A}=(\omega _j^{(i)})_{1\leqslant i,j\leqslant d}$ , where $\omega _j^{(i)}=\rho _i(\omega _j)$ . Then $(\det \mathbf {A})^2=D_K$ . Moreover, on recalling that $\mathbf {x}=\omega _1\mathbf {u}_1+\cdots +\omega _d\mathbf {u}_d$ , we have
where $\mathbf {W}$ is the $dn\times dn$ block matrix
Switching appropriate rows and columns takes $\mathbf {W}$ to $\mathrm {Diag}(\mathbf {A},\dots ,\mathbf {A})$ , whence $\det \mathbf {W}=(\det \mathbf {A})^{n}=D_K^{n/2}$ . In particular, it follows that
for any $1\leqslant l\leqslant d$ , under the transformation (4.4).
4.2. Construction of the weight W
We assume that the descended system is of codimension d and has a nonsingular real point. This means that there exists $\underline {\boldsymbol {\xi }}=(\boldsymbol {\xi }_1,\dots ,\boldsymbol {\xi }_d)\in \mathbb {R}^{dn}$ such that $J_{Q_1,\dots , Q_d}(\underline {\boldsymbol {\xi }})$ has rank d, where
is the associated $d\times dn$ Jacobian matrix. Define the smooth weight function
and let $\delta>0$ be a small parameter. In this paper, we shall work with the weight function $W:V^n\to \mathbb {R}_{\geqslant 0}$ , which is given by
where $\mathbf {x}$ is identified with $(\mathbf {x}^{(1)},\dots ,\mathbf {x}^{(d)})$ , and where $\mathbf {W}$ is the matrix in Equation (4.5). It is clear that W is infinitely differentiable and that it is supported on the region $|\mathbf {W}^{-1}\mathbf {x}-\underline {\boldsymbol {\xi }}| \leqslant \delta $ . Ultimately, we will want to work with a value of $\delta $ that is sufficiently small but which still satisfies $1\ll \delta \leqslant 1$ for an absolute implied constant.
4.3. Poisson summation
It follows from Equation (4.1) that
for any $Q\geqslant 1$ . Here, the constant $c_Q$ satisfies $ c_Q=1+O_A(Q^{-A}), $ for any $A>0$ . Furthermore, we have $h(x,y)\ll x^{-1}$ for all y and $h(x,y)\neq 0$ only if $x\leqslant \max \{1,2|y|\} $ .
In our work, we will take $Q=P$ and we henceforth follow the convention that the implied constant in any estimate involving W is allowed to depend implicitly on the parameters that enter into its definition of $\mathcal {W}_n(V)$ in [Reference Browning and Vishe2, §2.2]. Likewise, the integer N and the number field K are considered fixed once and for all so that all implied constants are allowed to depend implicitly on N and K. In view of the fact that $h(x,y)\neq 0$ only if $x\leqslant \max (1,2|y|) $ , it is clear that the sum over $\mathfrak {b} $ is restricted to $\operatorname {\mathrm {N}} \mathfrak {b}\ll Q^{d}=P^d $ .
If F were a standard quadratic form over $\mathfrak {o}$ , we would proceed by breaking the sum over $\mathbf {x}$ into residue classes modulo $\mathfrak {b}$ before executing an application of Poisson summation. This would ultimately lead to an expression of the form [Reference Browning and Vishe2, Thm. 5.1]. For generalised quadratic forms F, this route is not directly accessible, since for given $\mathbf {a},\mathbf {h}\in \mathfrak {o}^n$ and any primitive character $\sigma $ modulo $\mathfrak {b}$ , one may have $\sigma (F(\mathbf {a}+\mathbf {h}))\neq \sigma (F(\mathbf {a}))$ even when $\mathbf {h}\in \mathfrak {b}^n$ . In this way, we see that a special role will be played by the set ${\mathcal H}_{\mathfrak {b}}$ that was introduced in §3.3.
Lemma 4.1. We have
where the sum over $\mathfrak {b}$ is over nonzero integral ideals and
Proof. Our approach is based on breaking the $\mathbf {x}$ -sum in Equation (4.6) into residue classes modulo ${{}^{{G}}\mathfrak {b}}$ . Since $Q=P$ and ${{}^{{G}}\mathfrak {b}}^n\subset {\mathcal H}_{\mathfrak {b}}$ , it follows that this sum equals
for any primitive character $\sigma $ modulo $\mathfrak {b}$ . We apply the multidimensional Poisson summation formula (cf. [Reference Browning and Vishe2, §5]). Recalling that K is totally real, we find that the inner $\mathbf {x}$ -sum is equal to
where we recall that $\widehat {{{}^{{G}}\mathfrak {b}} }={{}^{{G}}\mathfrak {b}}^{-1}\mathfrak {d}^{-1}$ is the dual of ${{}^{{G}}\mathfrak {b}}$ . Putting everything together in Equation (4.6), we have therefore established that
with $S_{\mathfrak {b}} (N;\mathbf {m})$ as in the statement of the lemma and
A simple change of variables yields $\tilde I_{\mathfrak {b}} (\mathbf {m})=P^{dn} I_{\mathfrak {b}} (N/P^2;P\mathbf {m})$ , as required.
4.4. The exponential sum
We proceed by discussing $S_{\mathfrak {b}} (N;\mathbf {m})$ in Lemma 4.1, for $\mathbf {m}\in \widehat {{}^{{G}}\mathfrak {b}}^n$ . Let $\gamma =g/\alpha \in \mathfrak {F}(\mathfrak {b})$ be as in Lemma 3.4. Then we have
for any $x\in \mathfrak {o}$ . It follows that
Our work hinges upon the following upper bound for this sum.
Lemma 4.2. We have
where ${\mathcal H}_{\mathfrak {b}}$ is given by Equation (3.3).
Proof. For fixed $a\in ({\mathfrak o}/{\mathfrak {b}})^{*}$ , we have
in the notation of Equation (3.2). We observe that the function ${\mathbf u} \mapsto \psi (2{\gamma } a B({\mathbf u};{\mathbf h}))$ is a character modulo ${{}^G\mathfrak {b} }^n$ , and it is the trivial character precisely when
We rewrite ${\gamma }$ in the form ${\gamma }=g/{\alpha }$ with $({\alpha })={\mathfrak {b}}{\mathfrak {d}}{\mathfrak p}_1$ for some prime ideal ${\mathfrak p}_1$ and $g\in {\mathfrak p}_1\cap {\mathbb Z}$ with the property that $((g),{\mathfrak {d}}{{}^G\mathfrak {b} })=1$ . Thus, the above condition is equivalent to the condition
Since $a \in ( {\mathfrak o}/{\mathfrak {b}})^{*}$ , $g\in {\mathfrak p}_1$ and $((g),{\mathfrak {d}}{{}^G\mathfrak {b} })=1$ , this is equivalent to saying that
Finally, since this condition on $\mathbf {u}$ is invariant modulo ${{}^G\mathfrak {b} }^n$ , this is equivalent to the condition $2 B({\mathbf u};{\mathbf h}) \in {\mathfrak {b}}$ , for all ${\mathbf u} \in {\mathfrak o}^n$ , which is equivalent to specifying that ${\mathbf h} \in {\mathcal H}_{\mathfrak {b}}$ , by Equation (3.3). The statement of the lemma now follows.
Corollary 4.3. Assume that F is admissible in the sense of Definition 3.6. Let ${\mathfrak {b}}$ be an integral ideal, and let ${\mathbf m}\in K^n$ . Then $ S_{\mathfrak {b}}(N;{\mathbf m})\ll (\operatorname {\mathrm {N}}\mathfrak {b})^{1-n/2} (\operatorname {\mathrm {N}} {{}^{{G}}\mathfrak {b}})^{n}. $
It is straightforward to show that $S_{\mathfrak {b}}(N;\mathbf {m})$ vanishes unless $\mathbf {m}$ satisfies additional constraints, as demonstrated in the following result.
Lemma 4.4. We have $S_{\mathfrak {b}} (N;\mathbf {m})=0$ unless $\mathbf {m}.\mathbf {h}\in \mathfrak {d}^{-1}$ for all $\mathbf {h}\in \mathcal {H}_{\mathfrak {b}}$ .
Proof. Returning to the definition of $S_{\mathfrak {b}}(N;\mathbf {m})$ in Lemma 4.1 and noting that ${{}^{{G}}\mathfrak {b}}^n\subset {\mathcal H}_{\mathfrak {b}}\subset \mathfrak {o}^n$ , we may write
However, orthogonality of characters gives
Since we automatically have $\mathbf {m}.\mathbf {h}\in \mathfrak {d}^{-1}$ for any $\mathbf {m} \in \widehat {{{}^{{G}}\mathfrak {b}} }^n$ and $\mathbf {h}\in {{}^{{G}}\mathfrak {b}}^n$ , the statement of the lemma follows.
We shall also need to establish a multiplicativity property for the exponential sums. This is achieved in the following result.
Lemma 4.5. Let ${\mathfrak {b}}$ be a nonzero integral ideal, and suppose that $\mathfrak {b}=\mathfrak {b}_1\mathfrak {b}_2$ for integral ideals $\mathfrak {b}_1,\mathfrak {b}_2$ such that $ \gcd (\operatorname {\mathrm {N}}\mathfrak {b}_1,\operatorname {\mathrm {N}}\mathfrak {b}_2)=1. $ Then, for any $N\in \mathfrak {o}$ and any $\mathbf {m}\in \widehat {{{}^{{G}}\mathfrak {b}}}^n$ , we have
Proof. According to Lemma 3.4, there exists $\gamma =g/\alpha \in \mathfrak {F}(\mathfrak {b})$ such that $\psi (\gamma \cdot )$ is a primitive character modulo $\mathfrak {b}$ . Then, Equation (4.7) implies that
Let us write $\operatorname {\mathrm {N}}\mathfrak {b}_i\hspace{-0.5pt}=\hspace{-0.5pt}b_i$ for $i=1,2$ . The assumption $\gcd (b_1,b_2)\hspace{-0.5pt}=\hspace{-0.5pt}1$ implies that $({{}^{{G}}\mathfrak {b}}_1,{{}^{{G}}\mathfrak {b}}_2)\hspace{-0.5pt}=\hspace{-0.5pt}1$ . Moreover, we have $b_1\in \mathfrak {b}_1$ , $b_2\in \mathfrak {b}_2$ and
According to Lemma 3.1(i), we find elements $\lambda , \mu \in \mathfrak {o}$ such that $\mathrm {ord}_{\mathfrak {p}}(\lambda )=\mathrm {ord}_{\mathfrak {p}}(\mathfrak {b}_1)$ and $\mathrm {ord}_{\mathfrak {p}}(\mu )=\mathrm {ord}_{\mathfrak {p}}(\mathfrak {b}_2)$ for all $\mathfrak {p} \mid {{}^{{G}}\mathfrak {b}}_1{{}^{{G}}\mathfrak {b}}_2$ . It follows from the Chinese remainder theorem, in the form Lemma 3.2, that we can write $a=\mu b+\lambda c$ for $b {\,(\operatorname {\mathrm {mod}}{{\mathfrak {b}_1}})}$ and $c {\,(\operatorname {\mathrm {mod}}{{\mathfrak {b}_2}})}$ . Likewise, we claim that we can write $\mathbf {x}=b_2\mathbf {b}+b_1\mathbf {c}$ , for $\mathbf {b} {\,(\operatorname {\mathrm {mod}}{{{{}^{{G}}\mathfrak {b}}_1}})}$ and $\mathbf {c} {\,(\operatorname {\mathrm {mod}}{{{{}^{{G}}\mathfrak {b}}_2}})}$ . To prove the claim it suffices to show that there is an isomorphism $\mathfrak {o}/{{}^{{G}}\mathfrak {b}}_1\times \mathfrak {o}/{{}^{{G}}\mathfrak {b}}_2\to \mathfrak {o}/{{}^{{G}}\mathfrak {b}}$ , given by $(u,v)\mapsto b_2u+b_1v$ . This map is clearly well defined since $b_1\in {{}^{{G}}\mathfrak {b}}_1$ and $b_2\in {{}^{{G}}\mathfrak {b}}_2$ . Moreover, injectivity follows from the coprimality conditions $((b_2),{{}^{{G}}\mathfrak {b}}_1)=((b_1),{{}^{{G}}\mathfrak {b}}_2)=1$ , which are a direct consequence of Equation (4.8). The claim follows, since the cardinalities are the same, by the Chinese remainder theorem.
In summary, on observing that $b_1\in {{}^{{G}}\mathfrak {b}}_1$ and $b_2\in {{}^{{G}}\mathfrak {b}}_2$ , it follows that
We claim that $\psi (\gamma \mu b_2^2\cdot )$ defines a primitive character modulo $\mathfrak {b}_1$ . For this, we note that
since $b_2\in \mathfrak {b}_2$ and $g\in \mathfrak {p}_1$ . Now, $(g)$ is coprime to $\mathfrak {b}_1\mathfrak {d}$ and $(\mu b_2)$ is coprime to $\mathfrak {b}_1$ . Thus, it follows that
where $\mathfrak {e}=\mathfrak {d}/(\mathfrak {d},\mu b_2^2\mathfrak {b}_2^{-1})$ . Clearly, $\mathfrak {e}\mid \mathfrak {d}$ . We claim that $(\mathfrak {d}/\mathfrak {e},\mathfrak {b}_1)=1$ . To see this, note that $\mathfrak {d}/\mathfrak {e}$ is equal to the common divisor $(\mathfrak {d}, \mu b_2^2 \mathfrak {b}_2^{-1})$ . Now, $\mu $ is coprime to $\mathfrak {b}_1$ and so is $\mathfrak {b}_2$ . Hence, the common divisor of these ideals most be coprime to the ideal $\mathfrak {b}_1$ , as claimed. Thus, Lemma 3.3 establishes the claim that $\psi (\gamma \mu b_2^2\cdot )$ is a primitive character modulo $\mathfrak {b}_1$ . It follows that
where $\overline {\operatorname {\mathrm {N}}\mathfrak {b}_2}$ is the multiplicative inverse of $\operatorname {\mathrm {N}}\mathfrak {b}_2$ modulo $\mathfrak {b}_1$ . Similarly,
from which the lemma follows.
Corollary 4.6. Let ${\mathfrak {b}}$ be a nonzero integral ideal, and suppose that $\mathfrak {b}=\mathfrak {b}_1\mathfrak {b}_2$ for integral ideals $\mathfrak {b}_1,\mathfrak {b}_2$ such that $ \gcd (\operatorname {\mathrm {N}}\mathfrak {b}_1,\operatorname {\mathrm {N}}\mathfrak {b}_2)=1. $ Then $S_{\mathfrak {b}}(N;\mathbf {0})= S_{\mathfrak {b}_1} (N;\mathbf {0}) S_{\mathfrak {b}_2} (N;\mathbf {0}). $
Proof. On making an obvious change of variables to the a-sum and the $\mathbf {x}$ -sum in Equation (4.7), we note that $S_{\mathfrak {b}}(c^2N;\mathbf {0})=S_{\mathfrak {b}}(N;\mathbf {0})$ for any $c\in \mathbb {Z}$ which is coprime to $\mathfrak {b}$ . The statement now follows from an application of Lemma 4.5.
4.5. The exponential integral
In this section, we discuss the exponential integral $I_{\mathfrak {b}}(t;\mathbf {k})$ that appears in Lemma 4.1 for given $t\in V$ and $\mathbf {k}\in V^n$ . It will be convenient to set
with which notation we have
We now bring into play the work in [Reference Browning and Vishe2, §6]. It follows from an application of Fourier inversion, as in [Reference Browning and Vishe2, Eq. (6.3)], that there exists a function $p_\rho (v):V\to \mathbb {C}$ such that
where
In our analysis, it will be useful to have the notion of a height function on V. Accordingly, we define $\mathfrak {H}:V\to \mathbb {R}_{\geqslant 1}$ via
for $v=(v_1,\dots ,v_d)\in V$ . In the closing stages of our argument, we will need to estimate integrals involving powers of $\mathfrak {H}(v)$ over various regions in V. First, it follows from [Reference Browning and Vishe2, Lemma 5.3] that
We can use this to deduce two further bounds that will play important roles.
For any $A\geqslant 1$ and $\varepsilon>0$ , we claim that
If $\alpha <-1$ , then we can clearly assume that $\varepsilon <-\alpha -1$ . But then the conditions of integration imply that $(\mathfrak {H}(v)/A)^{-\alpha -1-\varepsilon }\geqslant 1$ , whence
by Equation (4.11).
Next, for any $B\geqslant 1$ and $\varepsilon>0$ , we claim that
To see this, we note that $(B/\mathfrak {H}(v))^{\alpha +1+\varepsilon }\geqslant 1$ , under the conditions of the integral, if $\alpha \geqslant -1$ . But then
by Equation (4.11).
Returning to the function $p_\rho (v)$ in Equation (4.9), the following result summarises its key properties and is extracted from [Reference Browning and Vishe2, Lemmas 6.3 and 6.4].
Lemma 4.7. For any $\varepsilon>0$ , we have $ p_\rho (v)\ll P^{\varepsilon }, $ for any $v\in V$ . Moreover, for any $\varepsilon>0$ and $A\geqslant 1$ , we have
Recall here that $\rho>0$ . The next result is a straightforward consequence of the previous result, once combined with Equation (4.9) and the bound
which follows from the fact that W is compactly supported.
Corollary 4.8. Let $\varepsilon>0$ . Let $t\in V$ and $\mathbf {k}\in V^n$ . Then
for any $A\geqslant 1$ , where
It is interesting to pause and reflect on the corresponding situation for cubic forms G over a number field K that was considered in [Reference Browning and Vishe2], recalling that we are assuming K to be totally real in our setting. In [Reference Browning and Vishe2], crucial use was made of the fact that the integral over $\mathbf {x}$ factors as
since $\operatorname {\mathrm {Tr}} (vG(\mathbf {x}) )=\sum _{l=1}^{d}v^{(l)}G^{(l)} (\mathbf {x}^{(l)})$ , where $G^{(l)}=\rho _l(G)$ is a cubic form over $\mathbb {R}$ . We have chosen our main example Equation (1.3) in order that a similar property holds. Such a factorisation is not necessarily enjoyed for arbitrary generalised quadratic forms F, however, and it seems very difficult to analyse the integrals $K(v,\mathbf {k})$ in generic situations.
Define
for fixed $v\in V$ , where $F^{(l)}$ is the quadratic form (4.2). Thus, $\mathcal {Q}$ is a quadratic form over $\mathbb {R}$ in $dn$ variables. Let us write, temporarily, $\underline {\mathbf {x}}=(\mathbf {x}^{(1)},\dots ,\mathbf {x}^{(d)})$ and $ \underline {\mathbf {k}}=(\mathbf {k}^{(1)},\dots ,\mathbf {k}^{(d)}). $ Then, in the light of Equation (4.3), we may write
A general study of these exponential integrals has been carried out by Heath-Brown and Pierce [Reference Heath-Brown and Pierce6, Lemma 3.1]. Assuming that the support of W is contained in $[-1,1]^{dn}$ , we may appeal to their work, which we record here for the convenience of the reader.
Lemma 4.9. Let $\mathcal {Q}\in \mathbb {R}[X_1,\dots ,X_m]$ be a quadratic form with coefficients of maximum modulus $\|\mathcal {Q}\|$ and eigenvalues $\rho _1,\dots ,\rho _m$ . Let $\boldsymbol {\lambda }\in \mathbb {R}^m$ , and suppose that $w:\mathbb {R}^m\to \mathbb {R}$ is any smooth weight function supported on $[-1,1]^m$ . Then
Furthermore, if $|\boldsymbol {\lambda }|\geqslant 4\|\mathcal {Q}\|$ , then the integral is $O_{w,A} (|\boldsymbol {\lambda }|^{-A})$ for any $A\geqslant 1$ .
We will apply this result with $\boldsymbol {\lambda }=\underline {\mathbf {k}}$ and with the real quadratic form in Equation (4.14). Note that $\|\mathcal {Q}\|\ll | v|$ . Next, define
where $\mathbf {M}^{(l)}$ is the $dn\times dn$ matrix associated to $F^{(l)}$ . The function ${\mathcal {F}}(v)$ is a real form of degree $dn$ in the variables $v_1,\dots ,v_d$ . The following estimate is a direct consequence of Lemma 4.9.
Corollary 4.10. Assume $| \mathbf {k}| \gg | v|$ . Then $ K(v,\mathbf {k})\ll _A |\mathbf {k}|^{-A}, $ for any $A\geqslant 1$ . Moreover, $ K(v,\mathbf {k})\ll \min \{1,|{\mathcal {F}}(v)|^{-1/2}\} $ for any $\mathbf {k}\in V^n$ .
Unfortunately, it appears difficult to extract anything useful from the second bound, unless the generalised quadratic form is assumed to have extra structure.
4.6. Contribution from the trivial character
In this section, we study the overall contribution from the vector $\mathbf {m}=\mathbf {0}$ in the expression for $N_{W}(F,N;P)$ in Lemma 4.1. This contribution is
in the notation of that result.
It will ease notation if we put $t=N/P^2\in \mathbb {R}$ . Assuming that the descended system has codimension d, we begin by analysing the exponential integral $I_{\mathfrak {b}}(t;\mathbf {0})$ , writing
where $\rho =\operatorname {\mathrm {N}}\mathfrak {b}/P^d$ and
where, by an abuse of notation, $\mathrm {d} \mathbf {x}$ is the surface measure obtained by eliminating d variables from the equation $F(\mathbf {x})-t=v$ . We shall think of $f(y)$ as a function of $\mathbf {y}=(y_1,\dots ,y_d)$ on $\mathbb {R}^d$ , in which t is fixed and bounded absolutely. The following result summarises its main properties.
Lemma 4.11. Assume that the descended system has codimension d. There exist positive constants $C,C_0,C_1,\dots $ such that the function $f: \mathbb {R}^d\to \mathbb {R}$ is a smooth weight function that is supported on $[-C,C]^d$ and satisfies
for any $\mathbf {y}\in [-C,C]^d$ and any $i_1,\dots ,i_d\geqslant 0$ . The constants $C,C_0,C_1,\dots $ depend only on the coefficients of F and the parameter $\delta $ in the definition of W.
Proof. In the course of the proof, it will be convenient to write $\underline {\mathbf {s}}=(\mathbf {s}_1,\dots ,\mathbf {s}_d)$ , $\underline {\mathbf {u}}=(\mathbf {u}_1,\dots ,\mathbf {u}_d)$ , $\underline {\boldsymbol {\xi }}=(\boldsymbol {\xi }_1,\dots ,\boldsymbol {\xi }_d)$ and $\mathbf {t}=(t_1,\dots ,t_d)$ . Recall the definition of the weight function W in §4.2 for a suitable fixed $\underline {\boldsymbol {\xi }}\in \mathbb {R}^{dn}$ . Making the change of variables in Equation (4.4), we see that
where ${\boldsymbol \tau }=\mathbf {A}^{-1} \mathbf {t}$ and $\mathbf {w}=\mathbf {A}^{-1}\mathbf {y}$ . It will clearly suffice to prove the properties recorded in the lemma for the integral on the right-hand side, $\widetilde {f}({\mathbf {w}})$ say, regarded as a function of $\mathbf {w}$ . Making the change of variables $\underline {\mathbf {s}}=\underline {\mathbf {u}}-\underline {\boldsymbol {\xi }}$ , we have
It is now clear that $\widetilde {f}(\mathbf {w})=0$ unless $\mathbf {w}\in [-C,C]^d$ for suitable $C>0$ .
Next, we recall that $J_{Q_1,\dots ,Q_d}(\underline {\boldsymbol {\xi }})$ has rank d. We may assume without loss of generality that
Let $\varphi :\mathbb {R}^{dn}\to \mathbb {R}^{dn}$ be given by
The implicit function theorem implies that there exist open subsets $W',W\subset \mathbb {R}^{dn}$ with $\underline {\mathbf {0}}\in W'$ and $\varphi (\underline {\mathbf {0}})\in W$ such that $\varphi :W'\to W$ is a bijection and has differentiable inverse $\varphi ^{-1}$ on W. It is now clear that we wish to choose $\delta>0$ small enough to ensure that $\underline {\mathbf {s}}\in W'$ whenever $|\underline {\mathbf {s}}|\leqslant \delta $ .
We may now conclude that
where $\underline {\mathbf {s}}'= (s_{1,2},\dots ,s_{1,n},\dots ,s_{d,2},\dots ,s_{d,n}) $ , and $s_{1,1},s_{2,1},\dots , s_{d,1}$ are implicitly given by $\underline {\mathbf {s}}'$ and $\mathbf {w}$ , and
is the associated Jacobian. Since $\varphi ^{-1}$ is smooth, this implies that $\widetilde {f}(\mathbf {w})$ is infinitely differentiable and that its partial derivatives satisfy the bound claimed in the lemma.
Now, it follows from Corollary 4.8 that for $t=N/P^2\in \mathbb {R}$ , and $\varepsilon $ fixed as in the corollary, we have
Furthermore, in view of Lemma 4.11, it follows from Equation (4.16) and [Reference Browning and Vishe2, Lemma 4.1] that
for any $A\geqslant 0$ , where
if $\underline {\mathbf {x}}=(\mathbf {x}_1,\dots ,\mathbf {x}_d)$ . According to Equation (4.17), we have $f(0)= D_K^{-n/2} \sigma _\infty (t),$ where
is the usual singular integral for the descended system. In particular, arguing as in Davenport and Lewis [Reference Davenport and Lewis3, §6], a standard argument ensures that $\sigma _\infty (t)>0$ since $\underline {\boldsymbol {\xi }}$ is a nonsingular real point on the descended system.
We summarise our preliminary treatment of the main term in the following result.
Lemma 4.12. Let $\varepsilon>0$ . Then, for any $A\geqslant 1$ , we have
where $\sigma _\infty (N/P^2)>0$ is given by Equation (4.18).
In order to proceed further, it is clear that one requires a good enough upper bound for $S_{\mathfrak {b}}(N;\mathbf {0})$ in order to show that the error term is satisfactory and the sum over $\mathfrak {b}$ can be extended to infinity. Such a bound is available for admissible F, thanks to Corollary 4.3. Although we omit details, one can use it to prove that
for any admissible F such that $n\geqslant 5$ . In the setting of Theorems 1.4 and 1.5, we shall produce better bounds for $S_{\mathfrak {b}} (N;\mathbf {0})$ which allow such a deduction under milder hypotheses.
We close this section with a formal analysis of the singular series
ignoring issues of convergence. This is summarised in the following result.
Lemma 4.13. Assume that $\mathfrak {S}(N)$ is absolutely convergent. Then
We have $\mathfrak {S}(N)>0$ if the shifted descended system has a nonsingular p-adic solution for every prime p.
Proof. We may write
say. It follows from Corollary 4.6 that $S(k_1k_2)=S(k_1)S(k_2)$ if $k_1,k_2$ are coprime integers. Hence,
Since K is Galois, we may assume that p admits a factorisation $(p)=(\mathfrak {p}_1\cdots \mathfrak {p}_r)^e$ , with $\operatorname {\mathrm {N}} \mathfrak {p}_1=\cdots =\operatorname {\mathrm {N}}\mathfrak {p}_r=p^f$ . Let $\ell \geqslant 0$ , and let $I_\ell $ denote the set of integral ideals $\mathfrak {b}=\mathfrak {p}_1^{k_1}\cdots \mathfrak {p}_r^{k_r}$ , with $0\leqslant k_i\leqslant \ell e$ for $1\leqslant i\leqslant r$ . Then the union of $I_\ell $ over $\ell \geqslant 0$ exactly matches the set of integral ideals whose norm is a power of p. Hence,
It follows from Equation (4.7) that
on extending the inner sum to a sum over elements of $(\mathfrak {o}/p^\ell \mathfrak {o})^n$ . Hence, on rearranging, we obtain
where
We claim that $\psi (\gamma a\cdot )$ runs over all characters modulo $p^\ell $ , as a runs over $(\mathfrak {o}/\mathfrak {b})^{*}$ and $\mathfrak {b}$ runs over $I_\ell $ . On one hand, since $a\in (\mathfrak {o}/\mathfrak {b})^{*}$ , each character $\psi (\gamma a\cdot )$ is a primitive character modulo $\mathfrak {b}$ . Since $\mathfrak {b}=\mathfrak {p}^{k_1}\cdots \mathfrak {p}_r^{k_r}$ , for $k_1,\dots ,k_r\leqslant \ell e$ and $p^\ell =(\mathfrak {p}_1\cdots \mathfrak {p}_r)^{\ell e}$ , we conclude that each such character induces a character modulo $p^\ell $ . In order to complete the proof of the claim, it remains to show that we get all $p^{\ell d}$ characters modulo $p^\ell $ this way. But the number of characters is precisely
as required.
We may now conclude from orthogonality of characters that
from which the first part of the lemma follows. The second part is standard. Using Equation (1.2), the solubility of $F(\mathbf {x})- N$ in $\mathfrak {o}/p^\ell \mathfrak {o}$ can be reduced to the solubility of a shifted descended system $Q_i(\mathbf {u}_1,\dots ,\mathbf {u}_d) -N_i$ modulo primes powers, for $1\leqslant i\leqslant d$ , where we have written $N=\omega _1N_1+\cdots +\omega _d N_d$ . Arguing as in work of Birch [Reference Birch1, Lemma 7.1], for example, the existence of nonsingular p-adic zeros of this system is enough to deduce that $\mathfrak {S}(N)>0$ . The details of this will not be repeated here.
4.7. Contribution from the nontrivial characters
In this section, we make some initial steps in the treatment of the contribution from the nonzero vectors $\mathbf {m}$ in the asymptotic formula for $N_W(F,N;P)$ in Lemma 4.1. This contribution is
where
The primary task is to establish conditions under which there is an absolute constant $\Delta>0$ such that $E(N;P)=O(P^{-\Delta })$ .
We now place ourselves in the context of the generalised quadratic forms (1.3) and make some initial steps that will be common to Theorems 1.3–1.5. It will be convenient to consider the overall contribution from $\mathfrak {b}$ such that $\operatorname {\mathrm {N}}\mathfrak {b}$ and $\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}}$ are constrained to lie in dyadic intervals. Note that $\operatorname {\mathrm {N}}\mathfrak {b}\ll P^d$ and $\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}}\leqslant (\operatorname {\mathrm {N}}\mathfrak {b})^2$ since $\#G=2$ . Accordingly, we let $X,Y$ be parameters such that
We then write $E(N;P;X,Y)$ for the overall contribution to $E(N;P)$ from nonzero ideals $\mathfrak {b}\subset \mathfrak {o}$ for which
We denote by $\mathcal {B}(X,Y)$ the set of all such ideals. On summing over dyadic intervals for $X,Y$ satisfying Equation (4.20), it will suffice to establish the existence of $\Delta>0$ such that
for any $X,Y$ satisfying Equation (4.20).
It follows from Corollary 4.8 that
for any $A\geqslant 1$ , where $K(u,P\mathbf {m})$ is given by Equation (4.10) and
Hence, Equation (4.19) yields
for any $A\geqslant 1$ , where $\mathcal {B}(X,Y)$ is the set of nonzero ideals $\mathfrak {b}\subset \mathfrak {o}$ for which $ X\leqslant \operatorname {\mathrm {N}}\mathfrak {b} <2X$ and $Y\leqslant \operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}} <2Y$ .
5. Homogeneous case: proof of Theorems 1.3 and 1.4
We begin by proving a general result about rank drop in pencils of quadratic forms in situations where one of the matrices has much smaller rank. It parallels the basic fact in Reid’s thesis [Reference Reid10, Prop. 2.1] about rank drop in pencils $\nu _1A+\nu _2 B$ , for suitable $n\times n$ matrices $A,B$ , and shows how Assumption 2 can be deduced from an appropriate hypothesis about the shape of the associated singular locus.
Lemma 5.1. Let L be an algebraically closed field of characteristic not equal to $2$ , and let $m<n$ . Consider two matrices $A,B\in M_{n\times n}(L)$ such that B has only nonzero entries in the upper left $m\times m$ submatrix, which we also assume to be nonsingular. Let $\det (A)\neq 0$ . Assume that all singular points of the intersection of the two quadratic forms associated to A and B have the shape $(0,\mathbf {x}")$ with $\mathbf {x}"=(x_{m+1},\ldots , x_n)$ , and that the intersection has codimension $2$ . Then we have
Proof. Assume that there is some $\lambda \in L$ with
Let $V_0\subset \mathbb {A}^n$ be the affine subspace given by the kernel of $A+\lambda B$ . Then $\dim V_0\geqslant 2$ . Let $\mathbb {P}(V_0)=V\subset \mathbb {P}^{n-1}$ , and let $Q_B\subset \mathbb {P}^{n-1}$ be the quadric given by the matrix B. Then $\dim V\geqslant 1$ and $\dim Q_B=n-2$ as projective varieties. We deduce that the intersection $V\cap Q_B$ is nonempty. Consider a point $\mathbf {x}=(\mathbf {x}',\mathbf {x}")\in L^n\setminus \{0\}$ in the affine cone of $V\cap Q_B$ , where $\mathbf {x}'\in L^m$ and ${\mathbf x}" \in L^{n-m}$ . Then we deduce that
We deduce that ${\mathbf x}$ lies on the quadric given by A and as it is in the kernel of $A+\lambda B$ , it is a singular point of the intersection $Q_A \cap Q_B$ . We claim that ${\mathbf x}'\neq 0$ , that is, ${\mathbf x}$ is not of the shape $(0,{\mathbf x}")$ . Assume for a moment that ${\mathbf x}=(0,{\mathbf x}")$ . Note that
This is a contradiction to A being nonsingular. Hence, we found a singular point of the intersection $Q_A\cap Q_B$ which is not of the form $(0,{\mathbf x}")$ .
The main aim of this section is to carry out the proof of Theorems 1.3 and 1.4, which corresponds to taking $N=0$ and
as in Equation (1.3). Suppose that $\mathbf {A}$ is the $n\times n$ symmetric matrix defining Q and that $\mathbf {B}$ is the $n\times n$ symmetric matrix given by the condition that its upper left $m\times m$ submatrix defines R, with all other entries are equal to $0$ . We may proceed under the assumption that Assumptions 1–3 hold.
We have two tasks remaining. The first is to show that the sum over $\mathfrak {b}$ in Lemma 4.12 can be extended to infinity, with acceptable error, and the second is to prove that Equation (4.21) holds. We’ll need some more preparations for estimating the relevant exponential sum in Lemma 4.12 and Equation (4.23). Recalling the definition (3.3) of ${\mathcal H}_{\mathfrak {b}}$ , we lower bound its index in $\mathfrak {o}^n$ .
Lemma 5.2. There exist nonzero constants $\kappa _1,\dots ,\kappa _n, \tilde {\kappa }_1,\dots ,\tilde {\kappa }_m\in K$ , depending only on F and K such that
Moreover, we have $\kappa _1^{-1},\dots ,\kappa _n^{-1}, \tilde {\kappa }_1^{-1},\dots ,\tilde {\kappa }_m^{-1}\in \mathfrak {o}$ .
Proof. Assume that $\mathbf {A}$ has symmetric entries $a_{i,j}\in \mathfrak {o}$ , for $1\leqslant i,j\leqslant n$ , and that $\mathbf {B}$ has symmetric entries $b_{i,j}\in \mathfrak {o}$ for $1\leqslant i,j\leqslant m$ . Then the associated bilinear form takes the shape
Now, $\mathbf {h}\in \mathcal {H}_{\mathfrak {b}}$ if and only if $2B(\mathbf {h},\mathbf {k})\in \mathfrak {b}$ for all $\mathbf {k}\in \mathfrak {o}^n$ . Let $\omega _1,\ldots , \omega _d$ be an integral basis of $\mathfrak {o}$ with ${\omega }_1=1$ . Let $l\in \{1,\ldots , d\}$ and $j\in \{1,\dots ,n\}$ and consider a vector $\mathbf {k}$ such that the j-th entry is equal to ${\omega }_l$ and all other entries are equal to zero. Then the condition $B(\mathbf {h},\mathbf {k})\in \mathfrak {b}$ implies that
As the matrix $({\omega }_l^{\tau })_{1\leqslant l\leqslant d, \tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})}$ is invertible, this implies that there exists $\beta \in K$ with $\beta ^{-1}\in \mathfrak {o}$ such that
Thus, we find that $\mathbf {A}{\mathbf h}\in (\beta \mathfrak {b})^n$ and $\mathbf {B}(\mathbf {h}')^{\tau } \in (\beta \mathfrak {b})^m$ , where $\mathbf {h}'=(h_1,\dots ,h_m).$ As both matrices $\mathbf {A}$ and $\mathbf {B}$ are nonsingular, this implies that
Putting these together, the statement of the lemma easily follows.
Corollary 5.3. Let $N\in \mathfrak {o}$ and let F be given by Equation (1.3). Suppose that Assumption 1 holds. Then
Moreover, $S_{\mathfrak {b}} (N;\mathbf {m})=0$ unless $ m_i\in \mathfrak {d}^{-1}\mathfrak {b}^{-1}$ for $m<i\leqslant n$ .
Proof. It follows from Lemma 5.2 that $|\mathfrak {o}^n/{\mathcal H}_{\mathfrak {b}}|\gg (\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}})^{m}(\operatorname {\mathrm {N}}\mathfrak {b})^{n-m}$ . Thus, Equation (3.4) implies that
Inserting this into Lemma 4.2 yields the desired upper bound. We have already observed in Lemma 4.4 that $S_{\mathfrak {b}} (N;\mathbf {m})=0$ unless $\mathbf {m}.\mathbf {h}\in \mathfrak {d}^{-1}$ for all $ \mathbf {h}\in \mathcal {H}_{\mathfrak {b}}$ . Noting that ${{}^{{G}}\mathfrak {b}}^m\times \mathfrak {b}^{n-m}\subset \mathcal {H}_{\mathfrak {b}}$ , the second part easily follows.
Returning to Lemma 4.12, it immediately follows from this that the overall contribution from the tail $\operatorname {\mathrm {N}}\mathfrak {b}\gg P^d$ is
Since $\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}}\geqslant \operatorname {\mathrm {N}}\mathfrak {b}$ , this is acceptable provided that $n>4$ , which is certainly implied by the hypotheses in Theorems 1.3 and 1.4. Thus, we can focus our remaining efforts on establishing Equation (4.21).
Our next goal is to analyse the integrals $K(u,P\mathbf {m})$ in Equation (4.23) for the case that F has the shape $F(\mathbf {x})=Q(\mathbf {x})+R(x_1^{\tau }, x_2^{\tau },\ldots , x_m^{\tau }),$ for $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ some fixed automorphism. Taking the lth embedding into the real numbers gives
where we write $\mathbf {x}^{(l)}=\rho _l(\mathbf {x})$ . For each $1\leqslant l\leqslant d$ , we define $l_\tau $ through the relation (1.4). With this notation, we obtain
Hence,
with
Note that $G^{(l)}(\mathbf {x}^{(l)})$ is a quadratic form in $\mathbf {x}^{(l)}$ and hence can be represented by a symmetric matrix, which can be diagonalised using an orthogonal base change. Thus, for every tuple $u=(u_1,\ldots , u_d)$ , there exists a diagonal matrix $\operatorname {\mathrm {Diag}}(\eth _{l,i}(u))_{1\leqslant i\leqslant n}$ and an orthogonal matrix $M_l(u)\in O(n)$ such that
Set
With the change of coordinates $M_l(u)\mathbf {x}^{(l)}=\mathbf {y}^{(l)}$ , we get
We are now ready to prove the following result.
Lemma 5.4. For any $\varepsilon>0$ , the integral $K^{(l)}(u,P\mathbf {m})$ is essentially supported on the set of u and $\mathbf {m}$ for which
and
Moreover, we have
Proof. Recall that $M_l(u)\in O(n)$ . In particular, all entries of $M_l(u)$ are bounded independently of u and we obtain
uniformly in u for all $k\in \mathbb {N}$ . The result now follows from Lemma 4.9.
Henceforth, we take $N=0$ and write $E(P;X,Y)=E(0;P;X,Y)$ in Equation (4.23). We shall adhere to common convention and allow the value of $\varepsilon>0$ to change at each appearance so that $P^{\varepsilon }\log P\ll P^{\varepsilon }$ , for example. Moreover, all implied constants are allowed to depend on $\varepsilon $ .
Applying Corollary 5.3, we deduce that
Let $\delta \in {{}^{{G}}\mathfrak {b}}\mathfrak {d}$ , and let $\mathfrak {p}_1$ be a prime ideal coprime to ${{}^{{G}}\mathfrak {b}}\mathfrak {d}$ , with $\operatorname {\mathrm {N}}\mathfrak {p}_1 \ll (\operatorname {\mathrm {N}} \mathfrak {b})^{\varepsilon /d}$ , such that $(\delta )={{}^{{G}}\mathfrak {b}}\mathfrak {d}\mathfrak {p}_1$ . On multiplying $\delta $ by an appropriate unit, there is no loss of generality in assuming that
for $1\leqslant l\leqslant d$ , since $Y\leqslant \operatorname {\mathrm {N}} {{}^{{G}}\mathfrak {b}} <2Y$ . We are led to make the change of variables
for $1\leqslant i\leqslant n$ , so that $\mathbf {c}=(c_1,\dots ,c_n)\in \mathfrak {o}^n$ . Then
We may now write
Define the function
Let $\mathcal {R}(\mathbf {m})$ be the set of $u\in \mathcal {U}$ such that
and
We now have
Let
where $\mathcal {C}(u,\mathfrak {b})$ is the set of nonzero vectors $\mathbf {c}\in \mathfrak {o}^n$ for which Equation (5.3) holds,
and
Then we have
Our next goal is to estimate $L(u)$ . For each $1\leqslant l\leqslant d$ , we sort the eigenvalues $\eth _{l,i}(u)$ in a way such that
Note that we can always achieve this by adjusting the orthogonal matrix $M_l(u)$ with suitable permutations. Moreover, for all $1\leqslant i\leqslant n$ and $1\leqslant l\leqslant d$ , we have
It will now be useful to make the observation
We proceed by proving the following result.
Lemma 5.5. Let $u\in V$ such that $\mathfrak {H}(u)\leqslant P^{d+\varepsilon }/X$ . If $L(u)\neq 0$ , then
Moreover, we have $ L(u)\ll P^{\varepsilon } X J(u), $ where
Proof. Let us write $\mathbf {c}=(\mathbf {c}',\mathbf {c}")$ , where $\mathbf {c}'=(c_1,\dots ,c_m)$ and $\mathbf {c}"=(c_{m+1},\dots ,c_n)$ . Keeping in mind Equation (5.3), we first fix a choice of $\mathbf {c}"\in \left (\mathfrak {b}^{-1} {{}^{{G}}\mathfrak {b}} \right )^{n-m}$ satisfying
for $m+1\leqslant i\leqslant n$ and $1\leqslant l\leqslant d$ . Choose $\lambda \in K$ such that $(\lambda )=\mathfrak {b}^{-1} {{}^{{G}}\mathfrak {b}} \mathfrak {p}_2^{-1}$ , for a suitable prime ideal $\mathfrak {p}_2$ of norm $O(P^{\varepsilon }).$ We may assume that $\lambda $ is well shaped in the sense of Equation (5.1), on multiplying by a suitable unit. Thus, $ X^{1/d}Y^{1/d}\ll |\lambda ^{(l)}| \ll X^{1/d}Y^{1/d+\varepsilon }, $ for $1\leqslant l\leqslant d$ . Making the change of variables $\mathbf {c}"=\lambda \mathbf {d}"$ and recalling that $\operatorname {\mathrm {N}}\mathfrak {b} \asymp X$ and $\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}} \asymp Y$ , we must have
for $m+1\leqslant i\leqslant n$ and $1\leqslant l\leqslant d$ .
We begin by showing that Equation (5.7) holds if $L(u)\neq 0$ . Thus, there exists $\mathbf {c}\neq \mathbf {0}$ counted by $L(u)$ . Suppose first that $\mathbf {c}"\neq \mathbf {0}$ . Then there exists $i\in \{m+1,\dots ,n\}$ such that
whence $ 1\ll P^{-d+\varepsilon } X \mathfrak {H}(u)\ll P^{-d+\varepsilon } Y \mathfrak {H}(u)^2 $ since $X\leqslant Y$ . This is satisfactory for Equation (5.7). Suppose next that $\mathbf {c}'\neq \mathbf {0}$ . In particular, we have
As $M_l(u)$ is an orthogonal matrix, this implies that
whence Equation (5.6) yields
This completes the proof of Equation (5.7) under the assumption that $L(u)\neq 0$ .
Turning now to the estimation of $L(u)$ , it readily follows from a result in Lang [Reference Lang9, Thm. 0 in §V.1] that the overall number of vectors $\mathbf {d}"$ is
It remains to count the number of vectors $\mathbf {c}'$ associated to a particular choice of $\mathbf {c}"$ . Let $L(u,\mathfrak {b},\mathbf {c}")$ be the number of $\mathbf {c}'\in \mathfrak {o}^m$ such that Equation (5.8) holds. Assume that the matrix $M_l(u)$ is given by $M_l(u)=(m_{l\alpha \beta })_{1\leqslant \alpha ,\beta \leqslant n}$ . Write
with $M_l^{\prime }(u)=(m_{l\alpha \beta })_{\substack {1\leqslant \alpha \leqslant n\\ 1\leqslant \beta \leqslant m}}$ and $M_l^{\prime \prime }(u)=(m_{l\alpha \beta })_{\substack {1\leqslant \alpha \leqslant n\\ m< \beta \leqslant n}}$ . Then we consider the system of inequalities
where $\mathbf {r}_l=(r_{li})_{1\leqslant i\leqslant n}=M_l^{\prime \prime }(u) \mathbf {c}^{\prime \prime (l)}$ .
Write
Then, for $1\leqslant i\leqslant n$ , we can write
Let H be the $dn\times dm$ matrix given by
with $1\leqslant l\leqslant d$ , $1\leqslant i\leqslant n$ , $1\leqslant k\leqslant d$ , $1\leqslant \beta \leqslant m$ , and consider the lattice
Then $L(u,\mathfrak {b},\mathbf {c}")$ counts lattice points in $\Lambda $ which lie in a box of side length
We claim that the successive minima of the lattice $\Lambda $ are bounded above and below by constants depending only on K and n. Taking this on faith, it will then follow that
which will settle the lemma, on summing over $O(X)$ choices for $\mathfrak {b}\in \mathcal {B}(X,Y)$ .
To check the claim, we order the index tuples $(l,i)$ and $(k,\beta )$ in the matrix H lexicographically. Write
with the $n\times m$ matrix $B_l=(m_{li\beta })_{1\leqslant i\leqslant n,1\leqslant \beta \leqslant m}$ . Note that $B_l$ has orthogonal and norm one columns for $1\leqslant l\leqslant d$ . We can then write H as a block matrix
Let $B=B(u)$ be the $nd\times md$ matrix which is a diagonal block matrix, with the matrices $B_1,\ldots , B_d$ on the diagonal. Let W be the $md\times md$ block matrix, with blocks ${\omega }_k^{(l)} E_{m}$ at each place $1\leqslant l,k\leqslant d$ , where $E_m$ is the m-dimensional identity matrix. Then
Consider the lattice $\Gamma = W\mathbb {Z}^{md}\subset \mathbb {R}^{md}$ , and note that this only depends on the basis ${\omega }_1,\ldots , {\omega }_d$ . Moreover, if $\mathbf {w}\in \Gamma $ , then
Hence, the successive minima of the lattice $\Lambda $ coincide with those of $\Gamma $ , which thereby establishes the claim.
It follows from the previous result that
where $f(u)$ is given by Equation (5.4) and $\mathcal {U}^{*}$ is the set of $u\in \mathcal {U}$ such that Equation (5.7) holds.
Recall that the $\eth _{l,i}(u)$ are the eigenvalues of the matrix associated to the quadratic form
The next result collects together a number of properties concerning the size of the eigenvalues $\eth _{l,i}(u)$ .
Lemma 5.6. Assume that Assumptions 1–3 hold, and suppose that $\tilde m\geqslant m-1$ is the degree of the polynomial appearing in Assumption 3. For each $1\leqslant l\leqslant d$ , we order the eigenvalues $\eth _{l,i}(u)$ such that
Then there exist constants $C_1,\dots ,C_d>0$ such that the following holds:
-
(1) If $|u_{l_\tau }|\leqslant C_l |u_{l}|$ , then
$$ \begin{align*}|\eth_{l,1}(u)|\ll |u_l|\quad \text{ and } \quad |\eth_{l,n-1}(u)|\gg |u_l|.\end{align*} $$Moreover, if $m=1$ and $\tilde m=0$ , then
$$ \begin{align*}|\eth_{l,1}(u)|\ll |u_l|\quad \text{ and } \quad |\eth_{l,n}(u)|\gg |u_l|.\end{align*} $$ -
(2) If $|u_{l_\tau }|> C_l |u_l|$ , then
$$ \begin{align*}|\eth_{l,m+1}(u)\cdots \eth_{l,n}(u)|\gg \frac{|u_l|^{n-\tilde m}}{|u_{l_\tau}|^{m-\tilde m}}.\end{align*} $$
Proof. To begin with, according to Assumption 3, for each $1\leqslant l\leqslant d$ there exists a constant $C_l$ such that
for $|t|\geqslant C_l$ .
We start by examining the case $|u_{l_\tau }|\leqslant C_l |u_l|$ . The first bound $|\eth _{l,1}(u)|\ll |u_l|$ follows directly from Equation (5.5). Assume now that $u_l\neq 0$ . Note that each of the eigenvalues $\eth _{l,i}(u)$ arises by multiplication with $u_l$ from the eigenvalues of the matrix corresponding to
Write $\tilde {\eth }_{l,i}(u)$ for those eigenvalues in the same ordering. Assume that the lower bound $|\tilde {\eth }_{l,n-1}(u)|\gg 1$ is not satisfied. Thus, there exists a sequence of $t_j$ in the range $|t_j|\leqslant C_l$ such that $\tilde {\eth }_{l,n-1}(t_j) \rightarrow 0 $ , for $j\rightarrow \infty $ , where we write $\tilde {\eth }_{l,n-1}(t)$ for the second smallest eigenvalues of $Q^{(l)}+tR^{(l_\tau )}$ . As the set of t is compact there is a convergent subsequence, convergent to $t'$ say, with $\operatorname {\mathrm {rank}} (Q^{(l)}+t' R^{(l_\tau )})<n-1$ . This contradicts Assumption 2.
Now, we consider the case $m=1$ and $\tilde m=0$ . By Assumptions 1 and 3, we deduce that $\det (Q^{(l)} + tR^{(l_\tau )})$ is a nonzero constant independent of t. In particular, the rank of this matrix is always n and the argument above shows that $|\eth _{l,n}(u)|\gg |u_l|$ .
Next, we consider the case $|u_{l_\tau }|> C_l |u_l|$ and $u_l\neq 0$ . Again, we write $\tilde {\eth }_{l,i}(u)$ for the eigenvalues of $Q^{(l)} + \frac {u_{l_\tau }}{u_l} R^{(l_\tau )}.$ Note that we have
Moreover, we observe that
We therefore find that
From this, we obtain the lower bound
which completes the proof of the lemma.
We now continue with our analysis of $E(P;X,Y)$ in Equation (5.9). Recall our assumptions on $X,Y$ in Equation (4.20). Let $E_1(P;X,Y)$ denote the overall contribution from the case $Y\geqslant P^d$ , and let $E_2(P;X,Y)$ denote the remaining contribution. The following pair of results treats these two quantities in turn.
Lemma 5.7. Assume $\tilde m\geqslant m-1$ . Then $E_1(P;X,Y)\ll P^{d(2-n/2+m/2)+\varepsilon }+P^{-dm+\varepsilon }$ .
Proof. On recalling the definition Equation (5.4) of $f(u)$ , we deduce from Equation (5.9) that
since Equation (5.5) implies that
Here, we recall that $\mathcal {U}^{*}$ is the set of $u\in \mathcal {U}$ such that Equation (5.7) holds. Consider for a moment a fixed value of l. If $C_l|u_l| \geqslant |u_{l_\tau }|$ , then Lemma 5.6 yields
If $C_l|u_l| < |u_{l_\tau }|$ and $\tilde m\geqslant m-1$ , then Lemma 5.6 yields
In either of these two cases, we therefore have
whence
It follows from Equation (5.6) that
The contribution gets maximal for $Y\asymp X^2$ , in which case we get the upper bound
The first term is satisfactory for the lemma. If $2-n/2+3m/2\leqslant 0$ , then the second term is $O( P^{-dm+\varepsilon })$ , which is satisfactory. If $n\leqslant 3+3m$ , on the other hand, then we take $X\ll P^d$ and get the satisfactory upper bound $O(P^{d(2-n/2+m/2)+\varepsilon })$ .
Lemma 5.8. Assume that $n\geqslant m+4$ and $\tilde m \geqslant m-1$ . Let
Then $E_2(P;X,Y)$ is
Proof. For $1\leqslant i\leqslant m$ , we clearly have
Hence, we find that $E_2(P;X,Y)$ is
If $C_l|u_l|\geqslant |u_{l_\tau }|$ , then Lemma 5.6 leads to the bound
For the case $C_l |u_l| < |u_{l_\tau }|$ , we still have
since $\tilde m\geqslant m-1$ . We now deduce that in either case we have
It now follows from Equation (5.6) that $E_2(P;X,Y)$ is
Let $I_1$ denote the contribution to the integral from those u for which there exists at least one $u_l$ with $|u_l|\gg (P/Y^{1/d})^{2} $ , and let $I_2$ denote the remaining contribution.
On recalling that $\mathcal {U}^{*}$ is the set of $u\in \mathcal {U}$ such that Equation (5.7) holds, it is clear that
Turning to $I_1$ , we see that
by Equation (5.6). But if there is one $u_l$ with $|u_l|\gg (P/Y^{1/d})^{2} $ , then clearly
Hence, since $P^{-1}Y^{1/d}\ll 1$ , it now follows that
In summary, we have shown that
with $I_1,I_2$ as above.
Since $n\geqslant m+4$ , the exponent of $\mathfrak {H}(u)$ in $I_2$ is less than or equal to $-1$ . If $\frac {n-m-2+\kappa }{2}>1$ , then it follows from Equation (4.12) that
However, if $\frac {n-m-2+\kappa }{2}=1$ , then we apply Equation (4.13) to deduce that the same bound holds.
On the other hand, Equations (4.12) and (4.13) also yield
We conclude that
We now consider these three terms separately, starting with the third and recalling that $n-m\geqslant 4$ . If $-3m/4+n/4-(4-\kappa )/4\leqslant 0,$ then we get an upper bound
In the opposite case, we get the upper bound $ \ll P^{-dm/2+\varepsilon }\ll P^{-m+\varepsilon }, $ on using $Y\leqslant P^d$ and $d\geqslant 2$ .
We now turn to the second term. If $2m/d+n/4-5m/4-(4-\kappa )/4\leqslant 0,$ then we get the upper bound
In the opposite case, on using $X\geqslant Y^{1/2}$ , we get the upper bound
If $d\geqslant 3$ or $\kappa =0$ , then we reduce to the case above. If $d=2$ and $\kappa =1$ , on the other hand, we obtain the upper bound
since $Y\leqslant P^d$ . Clearly, $(n+m-4)/2\geqslant m$ if $n\geqslant m+4$ , whence this case contributes $O(P^{-m+\varepsilon })$ , which is satisfactory.
It remains to deal with the first term. Again, we use the lower bound $X\geqslant Y^{1/2}$ , allowing us to bound the first term by
If $d\geqslant 3$ or $\kappa =0$ , then we get $O(P^{-2m + 3md/2 + (4-\kappa )d/2 - dn/2+\varepsilon })$ , which is satisfactory. Alternatively, if $d=2$ and $\kappa =1$ , then we get
This is $\ll P^{-1/2+\varepsilon }$ since $n\geqslant m+4$ , which thereby completes the proof of the lemma.
It remains to combine Lemmas 5.7 and 5.8. We make the assumption
Under this assumption, the bound in Lemma 5.7 is $O(P^{-d/2+\varepsilon })$ . Moreover, the bound in Lemma 5.8 is
Hence, since $d\geqslant 2$ and $m\geqslant 1$ , it finally follows that Equation (4.21) holds for a suitable $\Delta>0$ , provided that $n\geqslant m+5$ and
where $\kappa $ is defined in the statement of Lemma 5.8.
Suppose first that $m=1$ and place ourselves under the hypotheses of Theorem 1.3. Then Assumptions 1–3 hold with $\tilde m=0$ . Hence, $\kappa =1$ and the condition on n reduces to $n\geqslant 6$ , as required for Theorem 1.3. Assume now that $m\geqslant 1$ , but $\kappa =0$ . Since $d\geqslant 2$ , we have $3m+4-4m/d\geqslant m+4$ , from which the statement of Theorem 1.4 follows.
6. Inhomogeneous case: proof of Theorem 1.5
In this section, we complete the proof of Theorem 1.5. We note that the quadratic form in Equation (1.5) is a special case of Equation (1.3), with $\mathbf {A}=\operatorname {\mathrm {Diag}}(a_1,\dots ,a_n)$ and $\mathbf {B}=\operatorname {\mathrm {Diag}}(b_1,\dots ,b_m,0,\dots ,0)$ . Hence, Corollary 5.3 applies to the situation considered in Theorem 1.5. In particular, assuming that $n>4$ , the argument in the previous section shows that the sum over $\mathfrak {b}$ in Lemma 4.12 can be extended to infinity with acceptable error. Since the assumption $n>4$ is implied by the hypotheses in Theorem 1.5, this leaves us free to focus our efforts on proving Equation (4.21).
In the present setting, it will be vital to obtain additional cancellation from the sum over primitive characters in $S_{\mathfrak {b}}(N;\mathbf {m})$ . We plan to improve on Corollary 5.3 in generic situations, beginning with an examination of a particular exponential sum modulo degree 1 prime ideals. The saving we shall achieve is linked to the fact that $N\neq 0$ and will also involve the special generalised quadratic form
that is the analogue of the dual form in our setting. (Note that it has coefficients in $\mathfrak {o}$ .) For any unramified prime ideal $\mathfrak {p}$ and any vector $\mathbf {v}\in \widehat {{}^{{G}}\mathfrak {p}}^n$ , it will be convenient to observe that $\mathrm {ord}_{\mathfrak {p}}(G(\mathbf {v}))\geqslant -2$ , since $\mathrm {ord}_{\mathfrak {p}} (v_i)\geqslant -1$ and $\mathrm {ord}_{\mathfrak {p}} (v_i^{\tau })\geqslant -1$ for any $v_i\in \widehat {{}^{{G}}\mathfrak {p}}$ . With this in mind, we proceed by proving the following bound for $S_{\mathfrak {p}}(N;\mathbf {v})$ .
Lemma 6.1. Let $\mathfrak {p}$ be a prime ideal of residue degree $1$ , and let $\mathbf {v}\in \widehat {{}^{{G}}\mathfrak {p}}^n$ . Then
where
Proof. Let $\mathfrak {p}$ be a prime ideal of residue degree $1$ so that $\operatorname {\mathrm {N}}\mathfrak {p}=p$ for a rational prime p. We may assume that p is unramified in K and that
since the desired estimate is trivial otherwise. Since $K/\mathbb {Q}$ is Galois, this means that there is a factorisation $(p)=\mathfrak {p}_1\cdots \mathfrak {p}_d$ into prime ideals, where $\mathfrak {p}_1,\dots ,\mathfrak {p}_d$ are the d conjugates of $\mathfrak {p}$ under $\operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ , satisfying $\operatorname {\mathrm {N}}\mathfrak {p}_i=p$ for $1\leqslant i\leqslant d$ .
It will be convenient to write $S_{\mathfrak {p}}=S_{\mathfrak {p}} (N;\mathbf {v})$ and $\tilde {\mathfrak {p}}=\mathfrak {p}^{\tau ^{-1}}$ in the proof. Then $\mathfrak {p}$ and $\tilde {\mathfrak {p}}$ are distinct prime ideals, with ${}^{{G}}\mathfrak {p}=\mathfrak {p} \tilde {\mathfrak {p}}$ and $\operatorname {\mathrm {N}} \mathfrak {p}=\operatorname {\mathrm {N}}\tilde {\mathfrak {p}}=p$ . Choose $\gamma =g/\alpha \in \mathfrak {F}(\mathfrak {p})$ as in Lemma 3.4 so that $\psi (\gamma \cdot )$ is a primitive character modulo $\mathfrak {p}$ . Then we can write
as in Equation (4.7).
Lemma 3.2 yields
for suitable $\lambda ,\mu \in \mathfrak {o}$ such that
Clearly,
and
since the characters $\psi ( \gamma \lambda \cdot )$ and $\psi ( \gamma \mu ^{\tau } \cdot )$ are both trivial on $\mathfrak {o}$ . Putting everything together, it follows that
where
We estimate the first sum trivially via $\Sigma _0\ll (\operatorname {\mathrm {N}}\tilde {\mathfrak {p}})^{n-m}=(\operatorname {\mathrm {N}}\mathfrak {p})^{n-m}$ .
The second sum factorises as
Recall from the definition of $\mathfrak {F}(\mathfrak {p})$ that $\alpha \in \mathfrak {p} \mathfrak {d}$ . Hence,
since $\mathrm {ord}_{\tilde {\mathfrak {p}}}(\mu )=1$ and $\mathbf {v}\in \widehat {{}^{{G}}\mathfrak {p}}^n$ , by assumption. Making the change of variables
where $\overline {2a\mu ^2a_i}$ denotes the multiplicative inverse of $2a\mu ^2 a_i$ modulo $\mathfrak {p}$ , we are led to the expression
since $\overline {4}-\overline {2}\equiv -\overline {4}\ {(\operatorname {\mathrm {mod}}{{\mathfrak {p}}})}$ . The inner sum is a classical Gauss sum, as found in work of Hecke [Reference Hecke7, Satz 155], for example. We obtain
where
This completes the proof of the identity
It turns out that the remaining sum $\Sigma _2(a)$ can also be interpreted as a product of Gauss sums. First, we observe that we have the factorisation
on making the change of variables $u=w^{\tau }$ . The trace is left invariant under conjugation. On recalling that $g\in \mathbb {Z}$ so that $g^{\tau }=g$ , it therefore follows that
since $(\gamma \alpha )^{\tau }=g=\gamma \alpha $ . Hence,
where
for $1\leqslant i\leqslant m$ . The inner sum is a Gauss sum that we can evaluate, as previously. This yields
We now piece everything together in Equation (6.2). To begin with, it follows from squaring and differencing that
Since $\mathfrak {p}\nmid 2$ , we see that the inner sum is $\operatorname {\mathrm {N}}\mathfrak {p}$ if $u\in \mathfrak {p}$ and and $0$ otherwise. Hence, it follows that $ |\tau _{\mathfrak {p}}|=\sqrt {\operatorname {\mathrm {N}}\mathfrak {p}}, $ from which we deduce that
where
Since $\mathfrak {p}\nmid 2a_1\cdots a_n b_1\cdots b_m \mu \lambda ^{\tau }$ , we may replace $\overline a$ by $4a a_1\cdots a_n b_1\cdots b_m \mu ^2(\lambda ^{\tau })^2$ by a in order to obtain
with
with G is given by Equation (6.1). One notes that $\mu ^2 (\lambda ^{\tau })^2 \alpha ^2G( \mathbf {v})\in \mathfrak {o}$ when $\mathbf {v}\in (\widehat{{}^{{G}}\mathfrak{p}})^n$ . In particular,
Thus, $K_{\mathfrak {p}}$ is a Kloosterman sum, if $2\mid m+n$ , and a Salié sum if $2\nmid m+n$ . It follows that
The statement of the lemma is now clear.
We are now ready to reveal our final estimate for the exponential sum $S_{\mathfrak {b}} (N;\mathbf {m})$ .
Lemma 6.2. Let $\varepsilon>0$ . Let $\mathfrak {b}\subset \mathfrak {o}$ be a nonzero ideal, and let $\mathbf {m}\in \widehat {{}^{{G}}\mathfrak {b}}^n$ . Then
where G is given by Equation (6.1).
Proof. There is a factorisation $\mathfrak {b}=\mathfrak {b}_1\mathfrak {b}_2$ , where $\operatorname {\mathrm {N}}\mathfrak {b}_1$ is square-free and $\operatorname {\mathrm {N}}\mathfrak {b}_2$ is square-full, with $\gcd (\operatorname {\mathrm {N}}\mathfrak {b}_1,\operatorname {\mathrm {N}}\mathfrak {b}_2)=1$ . It follows from Lemma 4.5 and Corollary 5.3 that
We now turn to $S_{\mathfrak {b}_1} (\overline {\operatorname {\mathrm {N}}\mathfrak {b}_2}^2N;(\operatorname {\mathrm {N}}\mathfrak {b}_2)\mathbf {m})$ , in which we note that
for $1\leqslant i\leqslant n$ . Since $\operatorname {\mathrm {N}} \mathfrak {b}_1$ is square-free, we have a factorisation
for distinct prime ideals $\mathfrak {q}_1,\cdots ,\mathfrak {q}_r$ of residue degree $1$ such that $\operatorname {\mathrm {N}}\mathfrak {q}_1,\cdots ,\operatorname {\mathrm {N}}\mathfrak {q}_r$ are distinct rational primes. Let
for $1\leqslant i\leqslant r$ . It now follows from a further application of Lemma 4.5 that
In particular, we plainly have $(\operatorname {\mathrm {N}}\mathfrak {b}_2)c_i\mathbf {m}\in (\widehat {{}^{{G}}\mathfrak {q}_i})^n$ for $1\leqslant i\leqslant r$ .
We are now aligned for an application of Lemma 6.1. For each $i\in \{1,\dots ,r\}$ , this yields
where
Note that
since $\operatorname {\mathrm {N}}{}^{{G}}\mathfrak {q}_i=(\operatorname {\mathrm {N}}\mathfrak {q}_i)^2$ . Thus,
Combining these estimates in Equation (6.3), we conclude that
since $(\operatorname {\mathrm {N}} \mathfrak {b}_1)(\operatorname {\mathrm {N}} \mathfrak {b}_2)=\operatorname {\mathrm {N}}\mathfrak {b}$ and $(\operatorname {\mathrm {N}} {{}^{{G}}\mathfrak {b}}_1)(\operatorname {\mathrm {N}} {{}^{{G}}\mathfrak {b}}_2)=\operatorname {\mathrm {N}} {{}^{{G}}\mathfrak {b}}$ . The statement of the lemma is now clear.
Our next task is to analyse the oscillatory integral $K(u,P\mathbf {m})$ when F is given by Equation (1.5), based on Equation (4.15). To the fixed automorphism $\tau \in \operatorname {\mathrm {Gal}}(K/\mathbb {Q})$ in Equation (1.5), we can associated a unique integer $l_\tau \in \{1,\dots ,d\}$ , as in Equation (1.4). We therefore have
for $1\leqslant l\leqslant d$ . Let $\mathbf {A}_l=\operatorname {\mathrm {Diag}}(a_1^{(l)},\dots , a_n^{(l)})$ and $\mathbf {B}_l=\operatorname {\mathrm {Diag}}(b_1^{(l)},\dots ,b_m^{(l)}, 0,\dots ,0)$ . Then it follows that the quadratic form Equation (4.14) has an underlying matrix which is the block diagonal matrix
If $\mathbf {m}$ is given coordinates $ \underline {\mathbf {m}}=(\mathbf {m}^{(1)},\dots ,\mathbf {m}^{(d)}) $ on $V^n$ , then we have
where $G^{(l)}$ has underlying matrix $u_l \mathbf {A}_l +u_{l_\tau }\mathbf {B}_l$ . Since this matrix is diagonal, on assuming that the weight W is chosen suitably, we may further factorise to obtain
where we write
for $i\leqslant m$ , and
for $i>m$ .
Lemma 6.3. Let
for $1\leqslant i\leqslant m$ . Then, for any $\varepsilon>0$ , $K(u,P\mathbf {m})$ is essentially supported on the set of u and $\mathbf {m}$ for which
for $1\leqslant l\leqslant d$ . Moreover, we have
Proof. Clearly, we get exponential decay in $K(u,P\mathbf {m})$ unless $P|\mathbf {m}|\ll |u| P^{\varepsilon }$ , as we now assume. However, on examining each of the factors in $K(u,P\mathbf {m})$ separately, the essential support of $K(u,P\mathbf {m})$ is rendered clear. Next, for each $i\leqslant m$ and $1\leqslant l\leqslant d$ , we have $ \rho _l(L_i(u))=a_i^{(l)}u_l+b_i^{(l_\tau )}u_{l_\tau }. $ The second derivative bound for exponential integrals yields
for $i\leqslant m$ , and
for $i> m$ . The statement is now clear.
We now piece everything together in our expression (4.23) for $E(N;P;X,Y)$ . We shall continue to adhere to the convention that the value of $\varepsilon>0$ is allowed to change at each appearance and that all implied constants are allowed to depend on $\varepsilon $ .
Recall the definition of $\mathcal {U}$ from Equation (4.22). Combining Equation (4.23) and Lemma 6.3, we obtain
where now
and $\mathcal {R}(\mathbf {m})$ denotes the set of $u\in \mathcal {U}$ such that Equation (6.4) holds.
We now make the exact same change of variables $\mathbf {c}=\delta \mathbf {m}$ that we made previously in Equation (5.2). Then, in particular, we can assume that Equation (5.3) holds. Moreover, on dropping the information about $G(\mathbf {m})$ , Lemma 6.2 yields
where
In this notation, we conclude that
Let
where $\mathcal {C}(u,\mathfrak {b})$ is the set of nonzero vectors $\mathbf {c}\in \mathfrak {o}^n$ for which Equation (5.3) holds and
for $1\leqslant l\leqslant d$ . Then we may write
rendering our next task to estimate $L(u)$ . The following result is an analogue of Lemma 5.5.
Lemma 6.4. Let $u\in V$ be such that $\mathfrak {H}(u)\leqslant P^{d+\varepsilon }/X$ . If $L(u)\neq 0$ , then
Moreover, we have $ L(u)\ll P^{\varepsilon } X J(u), $ where
Proof. Let us write $\mathfrak {b}=\mathfrak {b}_1\mathfrak {b}_2$ , where $\operatorname {\mathrm {N}}\mathfrak {b}_1$ is square-free and $\operatorname {\mathrm {N}}\mathfrak {b}_2$ is square-full, with $\gcd (\operatorname {\mathrm {N}}\mathfrak {b}_1,\operatorname {\mathrm {N}}\mathfrak {b}_2)=1$ . Then $ g(\mathfrak {b})= \operatorname {\mathrm {N}}\mathfrak {b}_2 \operatorname {\mathrm {N}}\mathfrak {h}, $ where $\mathfrak {h}$ is the greatest common ideal divisor of $\mathfrak {b}_1$ and N. In summary, we may now write
In order to proceed, we assume without loss of generality that $u\in V$ satisfies
Let us write $\mathbf {c}=(\mathbf {c}',\mathbf {c}")$ , where $\mathbf {c}'=(c_1,\dots ,c_m)$ and $\mathbf {c}"=(c_{m+1},\dots ,c_n)$ . Keeping in mind Equation (5.3), we first fix a choice of $\mathbf {c}"\in \left ((\mathfrak {b}_1\mathfrak {b}_2)^{-1} {{}^{{G}}\mathfrak {b}}_1 {{}^{{G}}\mathfrak {b}}_2\right )^{n-m}$ satisfying
for $m+1\leqslant i\leqslant n$ and $1\leqslant l\leqslant d$ . We claim that there exists $\lambda \in K$ such that
for a suitable prime ideal $\mathfrak {p}_2$ of norm $O(P^{\varepsilon })$ . To begin with, it follows from part (ii) of Lemma 3.1 that there exists $\lambda _3\in \mathfrak {o}$ such that $(\lambda _3)=(\mathfrak {b}_1\mathfrak {b}_2)^{-1} {{}^{{G}}\mathfrak {b}}_1 {{}^{{G}}\mathfrak {b}}_2 \mathfrak {p}_3$ for a suitable prime ideal $\mathfrak {p}_3$ of norm $O(P^{\varepsilon })$ . A second application of this result reveals that there exists $\lambda _2\in \mathfrak {o}$ and a prime ideal $\mathfrak {p}_2$ of norm $O(P^{\varepsilon })$ such that $(\lambda _2)=\mathfrak {p}_3\mathfrak {p}_2$ . The claim now follows with $\lambda =\lambda _3/\lambda _2$ .
On multiplying by units, we can further assume that
for $1\leqslant l\leqslant d$ , on recalling that $\operatorname {\mathrm {N}}\mathfrak {b}_1\operatorname {\mathrm {N}}\mathfrak {b}_2 \asymp X$ and $\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}}_1\operatorname {\mathrm {N}}{{}^{{G}}\mathfrak {b}}_2 \asymp Y$ . Making the change of variables $\mathbf {c}"=\lambda \mathbf {d}"$ , we deduce that for $m+1\leqslant i\leqslant n$ , we have $d_i\in \mathfrak {o}$ and
for $1\leqslant l\leqslant d$ . In particular, if $\mathbf {c}"\neq \mathbf {0}$ , then there exists $i\in \{m+1,\dots ,n\}$ such that
Recalling that $\operatorname {\mathrm {Nm}}(u)\leqslant \mathfrak {H}(u)\ll P^{d+\varepsilon }/X$ , we deduce that
Moreover, arguing as in Lemma 5.5, it readily follows from a result in Lang [Reference Lang9, Thm. 0 in §V.1] that the overall number of vectors $\mathbf {d}"$ is $O(P^{\varepsilon })$ . We must next address the number of $\mathbf {c}'\in \mathfrak {o}^m$ , with $(\mathbf {c}',\mathbf {c}")\neq \mathbf {0}$ , which satisfy
for $1\leqslant i\leqslant m$ and $1\leqslant l\leqslant d$ . It is clear that
Together, Equations (6.9) and (6.10) yield the first part of the lemma.
Appealing once more to Lang [Reference Lang9, Thm. 0 in §V.1], we deduce that the number of $\mathbf {c}'$ is
in the notation of the lemma. Returning to Equation (6.8), we deduce that
since there are $O(1)$ ideal divisors $\mathfrak {h}\mid N$ when $N\in \mathfrak {o}$ is nonzero. Finally, the lemma follows on noting that there are $O(\sqrt {X})$ integral ideals such that $\operatorname {\mathrm {N}}\mathfrak {b}_2$ is a square-full integer of modulus at most X.
We may now apply Lemma 6.4 in Equation (6.6). Let R denote the set of $u\in \mathcal {U}$ such that Equation (6.7) holds. On recalling the definition (6.5) of $f(u)$ , we deduce that
where
The following result deals with this integral.
Lemma 6.5. We have
where
Proof. In the proof of this result, we shall make frequent use of the observation that
for any $i\in \{1,\dots ,n\}$ , which follows from Equation (5.6).
We may assume without loss of generality that the range of integration is restricted to satisfy
We further break the range of integration into $m+1$ regions. For $0\leqslant t\leqslant m$ , let $R_t$ denote the set of $u\in V$ with $\mathfrak {H}(u)\leqslant P^{d+\varepsilon }/X$ , such that Equations (6.7) and (6.12) hold, with
(Note that the left inequality is vacuous when $t=0$ and similarly for the right-hand inequality when $t=m$ .) In particular, it is clear that $ R_m=\emptyset $ when the second inequality in Equation (6.7) holds. Moreover, when $t\in \{1,\dots ,m-1\}$ , we observe that
since $\mathfrak {H}(L_t(u))\geqslant 1$ . We have
Thus,
where
on taking $\mathfrak {H}(L_1(u))\cdots \mathfrak {H}(L_t(u))\geqslant 1$ .
We first deal with $ I^{(0)}(X,Y)$ . Recalling that $\mathfrak {H}(L_i(u))\ll \mathfrak {H}(u)^2$ for $1\leqslant i\leqslant m$ , it follows that
If $ 3m/2-n/2\geqslant -1$ , then Equation (4.13) yields
Alternatively, if $3m/2-n/2< -1$ , then the left-hand side is $O(1)$ by Equation (4.11). Thus,
which is satisfactory.
Terms with $1\leqslant t\leqslant m-1$ only contribute when $Y\leqslant P^d$ . Arguing as above, it follows from Equations (4.11) and (4.13) that
since $X\leqslant Y$ . This is satisfactory for the lemma.
It remains to estimate $I^{(m)}(X,Y)$ . In this case, we may assume that u satisfies the first inequality in Equation (6.7) since $R_m=\emptyset $ otherwise. Hence, Equation (4.12) yields
which is satisfactory and so completes the proof of the lemma.
It is now time to return to our goal of proving that Equation (4.21) holds for a suitable $\Delta>0$ for any $X,Y\geqslant 1$ satisfying Equation (4.20). We wish to do so under the assumption that $n-m\geqslant 4$ . Applying Lemma 6.5 in Equation (6.11), the overall contribution to $E(N;P;X,Y)$ from the final term is seen to be
on taking $X\leqslant Y$ and $m\geqslant 1$ . This is $O(P^{-d+\varepsilon })$ , if $n-m\geqslant 4$ , which is satisfactory for Equation (4.21). Next, the second term in Lemma 6.5 makes the overall contribution
If $m\geqslant 2$ , we take $Y\leqslant X^2$ in the first term, and $X,Y\geqslant 1$ in the second term. Assuming that $n-m\geqslant 4$ , this yields
which is satisfactory for Equation (4.21). If $m=1$ , on the other hand, then we get the contribution
by Equation (4.20) and the assumption $n\geqslant m+4=5$ , together with the fact that $Y\leqslant P^d$ when $c_Y\neq 0$ .
Turning to the contribution to Equation (6.11) from the first term in Lemma 6.5, we see that this is
Taking $X\ll P^d$ , the first term is $O(P^{-d(n-m-3)/2+\varepsilon })$ , which is $O(P^{-d/2+\varepsilon })$ , if $n-m\geqslant 4$ . The second term is plainly $P^{-dm+\varepsilon }$ if $\frac {3}{2}-n/2+3m/2\leqslant 0$ , and it is
otherwise, on taking $X\ll P^d$ . This is $O(P^{-d/2+\varepsilon })$ if $n-m\geqslant 4$ . All of our estimates are satisfactory for Equation (4.21), which therefore concludes the proof of Theorem 1.5.
Acknowledgements
The authors are grateful to Jayce Getz for asking questions that set this project in motion and to the anonymous referee for useful comments. T.B. was supported by a FWF grant (DOI 10.55776/P32428) and by a grant from the Institute for Advanced Study School of Mathematics. L.B.P. was partially supported by NSF DMS-2200470 and DMS-1652173, and thanks the Hausdorff Centre for Mathematics for hosting research visits.
Competing interest
None.