Hostname: page-component-78c5997874-s2hrs Total loading time: 0 Render date: 2024-11-10T06:38:15.729Z Has data issue: false hasContentIssue false

THE PAUCITY PROBLEM FOR CERTAIN SYMMETRIC DIOPHANTINE EQUATIONS

Published online by Cambridge University Press:  23 September 2022

TREVOR D. WOOLEY*
Affiliation:
Department of Mathematics, Purdue University, 150 N. University Street, West Lafayette, IN 47907-2067, USA
Rights & Permissions [Opens in a new window]

Abstract

Let $\varphi _1,\ldots ,\varphi _r\in {\mathbb Z}[z_1,\ldots z_k]$ be integral linear combinations of elementary symmetric polynomials with $\text {deg}(\varphi _j)=k_j\ (1\le j\le r)$, where $1\le k_1<k_2<\cdots <k_r=k$. Subject to the condition $k_1+\cdots +k_r\ge \tfrac {1}{2}k(k-~1)+2$, we show that there is a paucity of nondiagonal solutions to the Diophantine system $\varphi _j({\mathbf x})=\varphi _j({\mathbf y})\ (1\le j\le r)$.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press on behalf of Australian Mathematical Publishing Association Inc.

1 Introduction

Our focus in this paper is on systems of simultaneous Diophantine equations defined by symmetric polynomials. Let $\varphi _1,\ldots ,\varphi _r\in {\mathbb Z}[z_1,\ldots ,z_k]$ be symmetric polynomials with $\text {deg}(\varphi _j)=k_j\ (1\le j\le r)$ , so that, for any permutation $\pi $ of $\{1,2,\ldots ,k\}$ , one has

$$ \begin{align*} \varphi_j(x_{\pi 1},\ldots ,x_{\pi k})=\varphi_j(x_1,\ldots ,x_k)\quad (1\le j\le r). \end{align*} $$

Given any such permutation $\pi $ , the system of Diophantine equations

(1.1) $$ \begin{align} \varphi_j(x_1,\ldots ,x_k)=\varphi_j(y_1,\ldots ,y_k)\quad (1\le j\le r) \end{align} $$

plainly has the trivial solutions obtained by putting $y_i=x_{\pi i}\ (1\le i\le k)$ . Denoting by $T_k(X)$ the number of these trivial solutions with $1\le x_i,y_i\le X\ (1\le i\le k)$ , one is led to the question of whether there is a paucity of nondiagonal solutions. Thus, fixing k and ${\boldsymbol \varphi }$ and writing $N_k(X;{\boldsymbol \varphi })$ for the number of solutions of the system (1.1) in this range, one may ask whether as $X\rightarrow \infty $ , one has

$$ \begin{align*} N_k(X;{\boldsymbol \varphi})=T_k(X)+o(T_k(X)), \end{align*} $$

or equivalently, whether $N_k(X;{\boldsymbol \varphi })=k!X^k+o(X^k)$ . This question has been examined extensively in the diagonal case where the polynomials under consideration take the shape $\varphi _j({\mathbf x})=x_1^{k_j}+\cdots +x_k^{k_j}$ , as can be surmised from the references cited in this paper. Relatively little consideration has been afforded to more general symmetric polynomials. It transpires that by adapting a strategy applied previously in a special case of Vinogradov’s mean value theorem (see [Reference Vaughan and Wooley17]), we are able to settle this paucity problem for numerous systems of the type (1.1) in a particularly strong form.

Further notation is required to describe our conclusions. Define the elementary symmetric polynomials ${\sigma }_j({\mathbf z})\in {\mathbb Z}[z_1,\ldots ,z_k]$ for $j\ge 0$ by means of the generating function identity

(1.2) $$ \begin{align} \sum_{j=0}^k{\sigma}_j({\mathbf z})t^{k-j}=\prod_{i=1}^k(t+z_i). \end{align} $$

We restrict attention primarily to symmetric polynomials of the shape

(1.3) $$ \begin{align} \varphi_j({\mathbf z})=\sum_{l=1}^ka_{jl}\sigma_l({\mathbf z})\quad (1\le j\le r), \end{align} $$

with fixed coefficients $a_{jl}\in {\mathbb Z}$ for $1\le j\le r$ and $1\le l\le k$ . By taking appropriate integral linear combinations of the polynomials $\varphi _1,\ldots ,\varphi _r$ , it is evident that in our investigations concerning $N_k(X;{\boldsymbol \varphi })$ , we may suppose that $\text {deg}(\varphi _j)=k_j\ (1\le j\le r)$ , where the exponents $k_j$ satisfy the condition

(1.4) $$ \begin{align} 1\le k_1<k_2<\cdots <k_r=k. \end{align} $$

This can be seen by applying elementary row operations on the reversed $r\times k$ matrix of coefficients $A=(a_{j,k-l+1})$ , reducing to an equivalent system having the same number of solutions, in which the new coefficient matrix $A'$ has upper triangular form and $r'\le r$ nonvanishing rows.

A simple paucity result is provided by our first theorem. It is useful here and elsewhere to introduce the auxiliary quantity

(1.5) $$ \begin{align} w({\boldsymbol \varphi})=\tfrac{1}{2}k(k+1)-k_1-k_2-\cdots -k_r. \end{align} $$

Here and throughout this paper, implicit constants in the notations of Landau and Vinogradov may depend on $\varepsilon $ , k and the coefficients of ${\boldsymbol \varphi }$ .

Theorem 1.1. Let $\varphi _1,\ldots ,\varphi _r$ be symmetric polynomials of the shape (1.3), having respective degrees $k_1,\ldots ,k_r$ satisfying (1.4). Then, for each $\varepsilon>0$ ,

$$ \begin{align*} N_k(X;{\boldsymbol \varphi})=T_k(X)+O(X^{w({\boldsymbol \varphi})+1+\varepsilon}). \end{align*} $$

In particular, when $k_1+\cdots +k_r\ge \tfrac {1}{2}k(k-1)+2$ , one has

$$ \begin{align*} N_k(X;{\boldsymbol \varphi})=k!X^k+O(X^{k-1+\varepsilon}). \end{align*} $$

A specialisation of the system (1.1) illustrates the kind of results made available by Theorem 1.1. Fix a choice of coefficients $a_{jl}\in {\mathbb Z}$ for $1\le l\le k-r$ and $k-r+1\le j\le k$ , and denote by $M_{k,r}(X;{\mathbf a})$ the number of integral solutions of the simultaneous equations

(1.6) $$ \begin{align} {\sigma}_j({\mathbf x})+\sum_{l=1}^{k-r}a_{jl}{\sigma}_l({\mathbf x}) ={\sigma}_j({\mathbf y})+\sum_{l=1}^{k-r}a_{jl}{\sigma}_l({\mathbf y})\quad (k-r+1\le j\le k), \end{align} $$

in variables ${\mathbf x}=(x_1,\ldots ,x_k)$ and ${\mathbf y}=(y_1,\ldots ,y_k)$ with $1\le x_i,y_i\le X$ .

Corollary 1.2. Suppose that $k,r\in {\mathbb N}$ and $(k-r)(k-r+1)<2k-2$ . Then there is a paucity of nondiagonal solutions in the system of equations (1.6). In particular, for each $\varepsilon>0$ , one has

$$ \begin{align*} M_{k,r}(X;{\mathbf a})=T_k(X)+O(X^{\frac{1}{2}(k-r)(k-r+1)+1+\varepsilon}). \end{align*} $$

Results analogous to those of Theorem 1.1 and Corollary 1.2 for diagonal Diophantine systems involving r equations are almost always limited to systems possessing only $r+1$ pairs of variables, and these we now describe. In the case $r=1$ , Hooley applied sieve methods to investigate the equation

$$ \begin{align*} x_1^k+x_2^k=y_1^k+y_2^k, \end{align*} $$

with $k\ge 3$ , and ultimately established the paucity of nondiagonal solutions with a power saving (see [Reference Hooley9, Reference Hooley10]). Strong conclusions have been derived when $k=3$ by Heath-Brown [Reference Heath-Brown8] using ideas based on quadratic forms, enhancing earlier work of the author that extends from cubes to general cubic polynomials [Reference Wooley20]. When $r=2$ , $k_1<k_2$ and $k_2\ge 3$ , the paucity of nondiagonal solutions has been established for the pair of simultaneous equations

$$ \begin{align*} \left. \begin{aligned} x_1^{k_1}+x_2^{k_1}+x_3^{k_1}&=y_1^{k_1}+y_2^{k_1}+y_3^{k_1}\\ x_1^{k_2}+x_2^{k_2}+x_3^{k_2}&=y_1^{k_2}+y_2^{k_2}+y_3^{k_2} \end{aligned}\right\}. \end{align*} $$

Sharp results are available in the case $(k_1,k_2)=(1,3)$ (see [Reference Vaughan and Wooley16]), with nontrivial conclusions available for the exponent pair $(1,k)$ when $k\ge 3$ (see [Reference Greaves5, Reference Skinner and Wooley14]). The cases $(2,3)$ and $(2,4)$ were tackled successfully via affine slicing methods (see [Reference Tsui and Wooley15, Reference Wooley, Berndt, Diamond and Hildebrand19]), with the remaining cases of this type covered by Salberger [Reference Salberger, Duke and Tschinkel12] using variants of the determinant method. Most other examples in which it is known that there is a paucity of nondiagonal solutions are closely related to the Vinogradov system of equations

$$ \begin{align*} x_1^{\kern2pt j}+\cdots +x_{r+1}^{\kern2pt j}=y_1^{\kern2pt j}+\cdots +y_{r+1}^{\kern2pt j}\quad (1\le j\le r). \end{align*} $$

The paucity problem has been solved here by Vaughan and the author [Reference Vaughan and Wooley17], with similar conclusions when the equation of degree r is replaced by one of degree $r+1$ . Recent work [Reference Wooley21] shows that the missing equation of degree r in this last result can be replaced by one of degree $r-d$ , provided that d is not too large. Meanwhile, when the exponents $k_j$ satisfy (1.4), results falling just short of paucity have been obtained in general for systems of the shape

$$ \begin{align*} x_1^{k_j}+\cdots +x_{r+1}^{k_j}=y_1^{k_j}+\cdots +y_{r+1}^{k_j}\quad (1\le j\le r), \end{align*} $$

(see [Reference Parsell and Wooley11, Reference Wooley, Pollington and Moran18]). In the special case $\mathbf k=(1,3,\ldots ,2r-1)$ , Brüdern and Robert [Reference Brüdern and Robert2] have even established the desired paucity result. We should note also that the existence of nondiagonal solutions, often exhibited by remarkable parametric formulae, has long been the subject of investigation, as recorded in Gloden’s book [Reference Gloden4], with notable recent contributions by Choudhry [Reference Choudhry3].

Taking k large and $r=k-\lfloor \!\sqrt {2k}\rfloor +1$ , we see that (1.6) constitutes a Diophantine system of r equations in $k=r+\!\!\sqrt {2r}+O(1)$ pairs of variables having a paucity of nondiagonal solutions. Corollary 1.2 therefore exhibits a large class of Diophantine systems in which the barrier described in the previous paragraph is emphatically surmounted. There are two exceptions to the rule noted in that paragraph. First, Salberger and the author [Reference Salberger and Wooley13, Corollary 1.4 and Theorem 5.2] have established the paucity of nondiagonal solutions in situations where the underlying equations have very large degree in terms of the number of variables. Second, work of Bourgain et al. [Reference Bourgain, Garaev, Konyagin and Shparlinski1, Theorem 26] and Heap et al. [Reference Heap, Sahay and Wooley7, Theorem 1.2] examines systems of equations determined by relations of divisor type having the shape

(1.7) $$ \begin{align} (x_1+{\theta})(x_2+{\theta})\cdots (x_k+{\theta})=(y_1+{\theta})(y_2+{\theta})\cdots (y_k+{\theta}), \end{align} $$

in which ${\theta }\in {\mathbb C}$ is algebraic of degree d over ${\mathbb Q}$ . When $d\le k$ , this relation generates d independent symmetric Diophantine equations, and it is shown that the number of integral solutions of (1.7) with $1\le x_i,y_i\le X$ is asymptotically $T_k(X)+O(X^{k-d+1+\varepsilon })$ . In particular, when $d=2$ , we obtain a pair of simultaneous Diophantine equations in k variables having a paucity of nondiagonal solutions. For example, when $k=4$ and ${\theta }=\!\sqrt {-1}$ , we obtain the system

$$ \begin{align*} \left.{\begin{aligned} x_1x_2x_3x_4-&x_1x_2-x_2x_3-x_3x_4-x_4x_1-x_2x_4-x_1x_3\\ &=y_1y_2y_3y_4-y_1y_2-y_2y_3-y_3y_4-y_4y_1-y_2y_4-y_1y_3\\ x_1x_2x_3+x_2&x_3x_4+x_3x_4x_1+x_4x_1x_2-x_1-x_2-x_3-x_4\\ &=y_1y_2y_3+y_2y_3y_4+y_3y_4y_1+y_4y_1y_2-y_1-y_2-y_3-y_4 \end{aligned}}\right\} \end{align*} $$

having $4!X^4+O(X^{3+\varepsilon })$ integral solutions with $1\le x_i,y_i\le X$ .

We have avoided discussion of systems (1.1) containing the equation

(1.8) $$ \begin{align} x_1x_2\cdots x_k=y_1y_2\cdots y_k. \end{align} $$

Here, when $r\ge 2$ , one may parametrise the solutions of (1.8) in the shape

$$ \begin{align*} x_1={\alpha}_1{\alpha}_2\cdots {\alpha}_k,\quad x_2&={\beta}_1{\beta}_2\cdots {\beta}_k,\quad \ldots ,\quad x_k={\omega}_1{\omega}_2\cdots {\omega}_k,\\ y_1={\alpha}_1{\beta}_1\cdots {\omega}_1,\quad y_2&={\alpha}_2{\beta}_2\cdots {\omega}_2,\quad \ldots , \quad y_k={\alpha}_k{\beta}_k\cdots {\omega}_k, \end{align*} $$

to provide a paucity result by simple elimination. The reader will find all of the ideas necessary to complete this elementary exercise in [Reference Vaughan and Wooley16].

This paper is organised as follows. In Section 2 we derive a multiplicative relation among the variables ${\mathbf x},{\mathbf y}$ of the system (1.1). This may be applied to obtain Theorem 1.1 and Corollary 1.2. The polynomials $\varphi _j({\mathbf z})$ that are the subject of Theorem 1.1 are integral linear combinations of the elementary symmetric polynomials ${\sigma }_1({\mathbf z}),\ldots ,{\sigma }_k({\mathbf z})$ . In Section 3 we examine the extent to which our methods are applicable when the polynomials $\varphi _j({\mathbf z})$ are permitted to depend nonlinearly on ${\sigma }_1({\mathbf z}),\ldots ,{\sigma }_k({\mathbf z})$ .

Our basic parameter is X, a sufficiently large positive number. Whenever $\varepsilon $ appears in a statement, either implicitly or explicitly, we assert that the statement holds for each $\varepsilon>0$ . We make frequent use of vector notation in the form ${\mathbf x}=(x_1,\ldots ,x_k)$ . Here, the dimension k will be evident to the reader from the ambient context.

2 Multiplicative relations from symmetric polynomials

Our initial objective in this section is to obtain a multiplicative relation between the variables underlying the system (1.1). Let $\varphi _1,\ldots ,\varphi _r$ be polynomials of the shape (1.3), having respective degrees $k_1,\ldots ,k_r$ satisfying (1.4). We define the complementary set of exponents ${\mathcal R}={\mathcal R}({\boldsymbol \varphi })$ by putting

(2.1) $$ \begin{align} {\mathcal R}({\boldsymbol \varphi})=\{1,\ldots ,k\}\setminus \{k_1,\ldots ,k_r\}. \end{align} $$

The counting function $N_k(X;{\boldsymbol \varphi })$ remains unchanged when we replace the polynomials $\varphi _j\ (1\le j\le r)$ by any collection of linearly independent integral linear combinations, and thus we may suppose that these polynomials take the shape

(2.2) $$ \begin{align} \varphi_j({\mathbf z})=a_j{\sigma}_{k_j}({\mathbf z})-\sum_{\substack{1\le l<k_j\\ l\in {\mathcal R}}} b_{jl}{\sigma}_l({\mathbf z})\quad (1\le j\le r), \end{align} $$

where $a_j\in {\mathbb Z}\setminus \{0\}$ and $b_{jl}\in {\mathbb Z}$ .

Given a solution ${\mathbf x},{\mathbf y}$ of the system (1.1) counted by $N_k(X;{\boldsymbol \varphi })$ , we define the integers $h_l=h_l({\mathbf x},{\mathbf y})$ for $l\in {\mathcal R}$ by putting

(2.3) $$ \begin{align} h_l({\mathbf x},{\mathbf y})={\sigma}_l({\mathbf x})-{\sigma}_l({\mathbf y}). \end{align} $$

Thus, since $1\le x_i,y_i\le X\ (1\le i\le k)$ , one has $|h_l({\mathbf x},{\mathbf y})|\le 2^kX^l\ (l\in {\mathcal R})$ . By making use of relations (2.2) and (2.3), the system (1.1) becomes

(2.4) $$ \begin{align} a_j({\sigma}_{k_j}({\mathbf x})-{\sigma}_{k_j}({\mathbf y})) =\sum_{\substack{1\le l<k_j\\ l\in {\mathcal R}}} b_{jl}h_l\quad (1\le j\le r). \end{align} $$

Then, by wielding (1.2) in combination with (2.3) and (2.4), we obtain

(2.5) $$ \begin{align} \prod_{i=1}^k(t+x_i)-\prod_{i=1}^k(t+y_i)&=\sum_{m=0}^kt^{k-m} ({\sigma}_m({\mathbf x})-{\sigma}_m({\mathbf y})) \notag \\ &=\sum_{m\in {\mathcal R}}h_mt^{k-m}+\sum_{j=1}^ra_j^{-1}t^{k-k_j} \sum_{\substack{1\le l<k_j\\ l\in {\mathcal R}}}b_{jl}h_l. \end{align} $$

Put $A=a_1a_2\cdots a_r$ and $c_j=A/a_j\ (1\le j\le r)$ . Also, define

$$ \begin{align*} \psi_l(t)=At^{k-l}+\sum_{\substack{1\le j\le r\\ k_j>l}}c_jb_{jl}t^{k-k_j}\quad (l\in {\mathcal R}), \end{align*} $$

and then set

(2.6) $$ \begin{align} \Psi(t;{\mathbf h})=\sum_{l\in {\mathcal R}}h_l\psi_l(t). \end{align} $$

Notice here that the polynomial $\Psi (t;{\mathbf h})$ has integral coefficients and degree at most $k-1$ with respect to t. Moreover, it follows from (2.5) that

(2.7) $$ \begin{align} A\bigg( \prod_{i=1}^k(t+x_i)-\prod_{i=1}^k(t+y_i)\bigg) =\Psi(t;{\mathbf h}). \end{align} $$

We may now record the multiplicative relation employed in our proof of Theorem 1.1. Suppose that ${\mathbf x},{\mathbf y}$ is an integral solution of the system (1.1), where $\varphi _1,\ldots ,\varphi _r$ are symmetric polynomials as described above. Then, with

$$\begin{align*}h_l={\sigma }_l({\mathbf x})-{\sigma }_l({\mathbf y})\quad (l\in {\mathcal R}),\end{align*}$$

we deduce by substituting $t=-y_j$ into (2.7) that

(2.8) $$ \begin{align} A\prod_{i=1}^k(x_i-y_j)=\Psi(-y_j;{\mathbf h})\quad (1\le j\le k). \end{align} $$

Proof of Theorem 1.1

Divide the solutions of the system (1.1) with $1\le x_i,y_i\le X$ into two types. A solution ${\mathbf x},{\mathbf y}$ will be called potentially diagonal when $\{x_1,\ldots ,x_k\}=\{y_1,\ldots ,y_k\}$ , and nondiagonal when, for some index j with $1\le j\le k$ , one has either $y_j\not \in \{x_1,\ldots ,x_k\}$ or $x_j\not \in \{y_1,\ldots ,y_k\}$ .

We first consider potentially diagonal solutions ${\mathbf x},{\mathbf y}$ . Suppose, if possible, that ${\mathbf x},{\mathbf y}$ satisfies the condition that the polynomial $\Psi (t;{\mathbf h})$ defined in (2.6) is identically zero as a polynomial in t. Since $A\ne 0$ , it follows from (2.7) that

$$ \begin{align*} \prod_{i=1}^k(t+x_i)=\prod_{i=1}^k(t+y_i). \end{align*} $$

The roots of the polynomials on the left- and right-hand sides here must be identical, and so too must be their respective multiplicities. Thus $(x_1,\ldots ,x_k)$ is a permutation of $(y_1,\ldots ,y_k)$ , and it follows that the number of solutions ${\mathbf x},{\mathbf y}$ of this type counted by $N_k(X;{\boldsymbol \varphi })$ is precisely $T_k(X)$ .

Suppose next that ${\mathbf x},{\mathbf y}$ is a potentially diagonal solution with $\Psi (t;{\mathbf h})$ not identically zero, where ${\mathbf h}$ is defined via (2.3). In this situation, there are distinct integers $w_1,\ldots ,w_s$ , for some integer s with $1\le s\le k$ , such that

(2.9) $$ \begin{align} \{x_1,\ldots ,x_k\}=\{y_1,\ldots ,y_k\}=\{w_1,\ldots ,w_s\}. \end{align} $$

Consider any tuple of integers ${\mathbf h}$ with $|h_l|\le 2^kX^l\ (l\in {\mathcal R})$ for which the polynomial $\Psi (t;{\mathbf h})$ is not identically zero. This polynomial has degree at most $k-1$ , and hence there is an integer $\xi $ with $1\le \xi \le k$ for which the integer ${\Theta }=\Psi (\xi ;{\mathbf h})$ is nonzero. We see from (2.7) that $(\xi +w_1)\cdots (\xi +w_s)$ divides ${\Theta }$ . Thus, an elementary divisor function estimate (see, for example, [Reference Hardy and Wright6, Theorem 317]) shows the number of possible choices for $\xi +w_1,\ldots ,\xi +w_s$ to be at most

$$ \begin{align*} \sum_{\substack{d_1,\ldots ,d_s\in \mathbb N\\ d_1\cdots d_s|{\Theta}}}1\le \bigg( \sum_{d|{\Theta}}1\bigg)^s=d(|{\Theta}|)^s\ll |{\Theta}|^\varepsilon. \end{align*} $$

Since one has ${\Theta }=O(X^k)$ , we see that there are at most $O(X^\varepsilon )$ possible choices for $\xi +w_1,\ldots ,\xi +w_s$ , and hence also for $w_1,\ldots ,w_s$ . From here, relation (2.9) implies that, for these fixed choices of ${\mathbf h}$ , the number of possible choices for ${\mathbf x}$ and ${\mathbf y}$ is also $O(X^\varepsilon )$ . On recalling (1.5), we discern that the total number of choices for the tuple ${\mathbf h}$ with $|h_l|\le 2^kX^l\ (l\in {\mathcal R})$ is $O(X^{w({\boldsymbol \varphi })})$ . Let $T_k^*(X;{\boldsymbol \varphi })$ denote the number of potentially diagonal solutions ${\mathbf x},{\mathbf y}$ not counted by $T_k(X)$ . For each of the $O(X^{w({\boldsymbol \varphi })})$ choices for ${\mathbf h}$ associated with these solutions, we have shown that there are $O(X^\varepsilon )$ solutions ${\mathbf x},{\mathbf y}$ , whence

(2.10) $$ \begin{align} T_k^*(X;{\boldsymbol \varphi})\ll X^{w({\boldsymbol \varphi})+\varepsilon}. \end{align} $$

Finally, suppose that ${\mathbf x},{\mathbf y}$ is a nondiagonal solution of the system (1.1). By invoking symmetry (twice), we may suppose that $y_k\not \in \{x_1,\ldots ,x_k\}$ , whence (2.8) shows the integer ${\Theta } =\Psi (-y_k;{\mathbf h})$ to be nonzero. There are $O(X^{w({\boldsymbol \varphi })})$ possible choices for ${\mathbf h}$ and $O(X)$ possible choices for $y_k$ corresponding to this situation. Fixing any one such, and noting that ${\Theta }=O(X^k)$ , an elementary divisor function estimate again shows that there are $O(X^\varepsilon )$ possible choices for $x_1-y_k,\ldots ,x_k-y_k$ satisfying (2.8). Fix any one such choice for these divisors. Since $y_k$ is already fixed, it follows that $x_1,\ldots ,x_k$ are likewise fixed.

It remains to determine $y_1,\ldots ,y_{k-1}$ . Since the tuple ${\mathbf h}$ has already been fixed and $\Psi (t;{\mathbf h})$ has degree at most $k-1$ with respect to t, the polynomial

$$ \begin{align*} A\prod_{i=1}^k(x_i-t)-\Psi(-t;{\mathbf h}) \end{align*} $$

has degree k with respect to t, and has all of its coefficients already fixed. It therefore follows from (2.8) that there are at most k choices for each of the variables $y_1,\ldots ,y_{k-1}$ . Let $T_k^\dagger (X;{\boldsymbol \varphi })$ denote the number of nondiagonal solutions ${\mathbf x},{\mathbf y}$ counted by $N_k(X;{\boldsymbol \varphi })$ . Then we may conclude that

(2.11) $$ \begin{align} T_k^\dagger(X;{\boldsymbol \varphi})\ll X^{w({\boldsymbol \varphi})+1+\varepsilon}. \end{align} $$

On recalling our opening discussion together with the estimates (2.10) and (2.11), we arrive at the upper bound

(2.12) $$ \begin{align} N_k(X;{\boldsymbol \varphi})-T_k(X)=T_k^*(X;{\boldsymbol \varphi})+T_k^\dagger(X;{\boldsymbol \varphi})\ll X^{w({\boldsymbol \varphi})+1+\varepsilon}. \end{align} $$

This delivers the first conclusion of Theorem 1.1. On recalling the definition (1.5) of $w({\boldsymbol \varphi })$ , it follows that when $k_1+\cdots +k_r\ge \tfrac {1}{2}k(k-1)+2$ , one has $w({\boldsymbol \varphi })\le k-2$ , and thus the second conclusion of Theorem 1.1 is immediate from (2.12) and the asymptotic formula $T_k(X)=k!X^k+O(X^{k-1})$ .

Proof of Corollary 1.2

Write

$$ \begin{align*} \varphi_j({\mathbf z})={\sigma}_j(z_1,\ldots ,z_k)+\sum_{l=1}^{k-r}a_{jl}{\sigma}_l(z_1,\ldots ,z_k)\quad (k-r+1\le j\le k). \end{align*} $$

Then we see that the system (1.6) is comprised of symmetric polynomials having degrees $k-r+1,\ldots ,k$ . For this system, it follows from (1.5) that

$$ \begin{align*} w({\boldsymbol \varphi})=\tfrac{1}{2}k(k+1)-\sum_{j=k-r+1}^k j=\tfrac{1}{2}(k-r)(k-r+1). \end{align*} $$

We therefore deduce from Theorem 1.1 that

$$ \begin{align*} M_{k,r}(X;{\mathbf a})=N_k(X;{\boldsymbol \varphi})=T_k(X)+O(X^{\frac{1}{2}(k-r)(k-r+1)+1+\varepsilon}), \end{align*} $$

and so there is a paucity of nondiagonal solutions in the system (1.6) provided that $(k-r)(k-r+1)<2k-2$ . This completes the proof of the corollary.

3 Nonlinear variants

The system of Diophantine equations (1.1) underlying Theorem 1.1 and Corollary 1.2 possesses features inherently linear in nature. Indeed, as is evident from the discussion initiating Section 2, we are able to restrict attention to integral linear combinations of elementary symmetric polynomials of the shape (2.2). This is a convenient but not essential simplification, as we now explain.

We first describe the symmetric polynomials presently in our field of view. Recall the exponents $k_1,\ldots ,k_r$ satisfying (1.4), the complementary set of exponents ${\mathcal R}={\mathcal R}({\boldsymbol \varphi })$ defined in (2.1), and the definition (1.5) of $w({\boldsymbol \varphi })$ . Putting $R=\text {card}({\mathcal R})$ , we label the elements of ${\mathcal R}$ so that ${\mathcal R}=\{l_1,\ldots ,l_R\}$ . We consider symmetric polynomials in the variables ${\mathbf z}=(z_1,\ldots ,z_k)$ of the shape

(3.1) $$ \begin{align} \varphi_j({\mathbf z})=a_j{\sigma}_{k_j}({\mathbf z})-{\Upsilon}_j({\sigma}_{l_1}({\mathbf z}),\ldots ,{\sigma}_{l_R}({\mathbf z}))\quad (1\le j\le r), \end{align} $$

where $a_j\in {\mathbb Z}\setminus \{0\}$ and ${\Upsilon }_j\in {\mathbb Z}[s_1,\ldots ,s_R]$ .

Theorem 3.1. Let $\varphi _1,\ldots ,\varphi _r$ be symmetric polynomials of the shape (3.1) with respective degrees $k_1,\ldots ,k_r$ satisfying (1.4). Then

$$ \begin{align*} N_k(X;{\boldsymbol \varphi})=T_k(X)+O(X^{2w({\boldsymbol \varphi})+1+\varepsilon}). \end{align*} $$

In particular, when $k_1+\cdots +k_r\ge \tfrac {1}{2}k^2+1$ , one has

$$ \begin{align*} N_k(X;{\boldsymbol \varphi})=k!X^k+O(X^{k-1+\varepsilon}). \end{align*} $$

A specialisation again makes for accessible conclusions. Fix the polynomials ${\Upsilon }_j\in {\mathbb Z}[s_1,\ldots ,s_R]$ for $k-r+1\le j\le k$ , and denote by $L_{k,r}(X;{\boldsymbol {\Upsilon }})$ the number of solutions of the simultaneous equations

(3.2) $$ \begin{align} {\sigma}_j({\mathbf x})+{\Upsilon}_j({\sigma}_1({\mathbf x}),\ldots ,{\sigma}_{k-r}({\mathbf x}))={\sigma}_j({\mathbf y})+{\Upsilon}_j({\sigma}_1&({\mathbf y}), \ldots ,{\sigma}_{k-r}({\mathbf y})) \quad (k-r+1\le j\le k), \end{align} $$

in variables ${\mathbf x}=(x_1,\ldots ,x_k)$ and ${\mathbf y}=(y_1,\ldots ,y_k)$ with $1\le x_i,y_i\le X$ .

Corollary 3.2. Suppose that $k,r\in {\mathbb N}$ satisfy $(k-r)(k-r+1)<k-1$ . Then there is a paucity of nondiagonal solutions in the system (3.2). In particular,

$$ \begin{align*} L_{k,r}(X;{\boldsymbol {\Upsilon}})=T_k(X)+O(X^{(k-r)(k-r+1)+1+\varepsilon}). \end{align*} $$

Proof of Theorem 3.1

Our present strategy is very similar to that wrought against Theorem 1.1. Given a solution ${\mathbf x},{\mathbf y}$ of the system (1.1) counted by $N_k(X;{\boldsymbol \varphi })$ , define $h_m({\mathbf z})={\sigma }_{l_m}({\mathbf z})$ for ${\mathbf z}\in \{{\mathbf x},{\mathbf y}\}$ and $1\le m\le R$ . In view of (3.1), the system (1.1) now becomes

(3.3) $$ \begin{align} a_j({\sigma}_{k_j}({\mathbf x})-{\sigma}_{k_j}({\mathbf y}))={\Upsilon}_j({\mathbf h}({\mathbf x}))-{\Upsilon}_j({\mathbf h}({\mathbf y}))\quad (1\le j\le r). \end{align} $$

In the present nonlinear scenario there may be some index j for which the integer on the right-hand side of (3.3) is not equal to ${\Upsilon }_j({\mathbf h}({\mathbf x})-{\mathbf h}({\mathbf y}))$ . The argument here therefore contains extra complications, with weaker quantitative conclusions than Theorem 1.1.

Put $A=a_1a_2\cdots a_r$ and $c_j=A/a_j\ (1\le j\le r)$ . Then, by applying the identity (1.2) together with (3.3), we obtain for $1\le j\le r$ the relation

(3.4) $$ \begin{align} A\bigg( \prod_{i=1}^k(t+x_i)-\prod_{i=1}^k(t+y_i)\bigg)=\Psi(t;{\mathbf h})-\Psi(t;{\mathbf g}), \end{align} $$

where ${\mathbf h}=(h_1({\mathbf x}),\ldots ,h_R({\mathbf x}))$ , ${\mathbf g}=(h_1({\mathbf y}),\ldots ,h_R({\mathbf y}))$ , and we write

$$ \begin{align*} \Psi(t;{\mathbf e})=A\sum_{m\in {\mathcal R}}e_mt^{k-m}+\sum_{j=1}^rc_jt^{k-k_j}{\Upsilon}_j({\mathbf e}). \end{align*} $$

If ${\mathbf x},{\mathbf y}$ is a solution of (1.1) counted by $N_k(X;{\boldsymbol \varphi })$ , then, with $h_m={\sigma }_{l_m}({\mathbf x})$ and $g_m={\sigma }_{l_m}({\mathbf y})$ for $1\le m\le R$ , we deduce by setting $t=-y_j$ in (3.4) that

(3.5) $$ \begin{align} A\prod_{i=1}^k(x_i-y_j)=\Psi(-y_j;{\mathbf h})-\Psi(-y_j;{\mathbf g})\quad (1\le j\le k). \end{align} $$

We now follow the path already trodden in the proof of Theorem 1.1 presented in Section 2. Suppose first that ${\mathbf x},{\mathbf y}$ is a potentially diagonal solution. When the polynomial $\Psi (-t;{\mathbf h})-\Psi (-t;{\mathbf g})$ is identically zero, we find that ${\mathbf x},{\mathbf y}$ is counted by $T_k(X)$ . Meanwhile, when instead this polynomial is not identically zero, a divisor function argument shows that for each fixed choice of ${\mathbf h}$ and ${\mathbf g}$ , there are $O(X^\varepsilon )$ possible choices for ${\mathbf x}$ and ${\mathbf y}$ . We have $|h_m({\mathbf x})|\le 2^kX^{l_m}$ and $|h_m({\mathbf y})|\le 2^kX^{l_m}$ , so the total number of choices for $(h_1,\ldots ,h_R)$ and $(g_1,\ldots ,g_R)$ is $O(X^{2w({\boldsymbol \varphi })})$ , where $w({\boldsymbol \varphi })$ is defined by (1.5). Thus, the total number of potentially diagonal solutions is equal to $T_k(X)+O(X^{2w({\boldsymbol \varphi })+\varepsilon })$ .

Suppose next that ${\mathbf x},{\mathbf y}$ is a nondiagonal solution of the system (1.1). By symmetry, we may again suppose that $y_k\not \in \{x_1,\ldots ,x_k\}$ , and (3.5) shows that the integer ${\Theta } =\Psi (-y_j;{\mathbf h})-\Psi (-y_j;{\mathbf g})$ is nonzero. There are $O(X^{2w({\boldsymbol \varphi })})$ possible choices for ${\mathbf h}$ and ${\mathbf g}$ , and $O(X)$ possible choices for $y_k$ in this scenario. Fix any one such choice and note from (3.5) that ${\Theta }=O(X^k)$ . An elementary divisor function estimate shows there to be $O(X^\varepsilon )$ choices for $x_1-y_k,\ldots ,x_k-y_k$ satisfying (3.5). Fixing any one such choice fixes the integers $x_1,\ldots ,x_k$ . An argument essentially identical to that applied in the proof of Theorem 1.1 shows from here that there are $O(1)$ possible choices for $y_1,\ldots ,y_{k-1}$ . Hence, the number of nondiagonal solutions of the system (1.1) is $O(X^{2w({\boldsymbol \varphi })+1+\varepsilon })$ .

Combining the two contributions to $N_k(X;{\boldsymbol \varphi })$ that we have obtained yields

$$ \begin{align*} N_k(X;{\boldsymbol \varphi})-T_k(X)\ll X^{2w({\boldsymbol \varphi})+1+\varepsilon}, \end{align*} $$

confirming the first conclusion of Theorem 3.1. The definition (1.5) of $w({\boldsymbol \varphi })$ , moreover, implies that when $k_1+\cdots +k_r\ge \tfrac {1}{2}k^2+1$ , one has $2w({\boldsymbol \varphi })\le k-2$ . The second conclusion of Theorem 3.1 therefore follows directly from the first.

Proof of Corollary 3.2

When $k-r+1\le j\le k$ , write

$$ \begin{align*} \varphi_j({\mathbf z})={\sigma}_j(z_1,\ldots ,z_k)+{\Upsilon}_j({\sigma}_1(z_1,\ldots ,z_k),\ldots , {\sigma}_{k-r}(z_1,\ldots ,z_k)). \end{align*} $$

Then we see from (1.5) that the system (3.2) is comprised of symmetric polynomials with $w({\boldsymbol \varphi })=\tfrac {1}{2}(k-r)(k-r+1)$ . Theorem 3.1 therefore shows that

$$ \begin{align*} L_{k,r}(X;{\boldsymbol {\Upsilon}})=N_k(X;{\boldsymbol \varphi})=T_k(X)+O(X^{(k-r)(k-r+1)+1+\varepsilon}), \end{align*} $$

and provided that $(k-r)(k-r+1)<k-1$ , there is a paucity of nondiagonal solutions in the system (3.2). This completes the proof of the corollary.

Acknowledgement

We thank the referee for useful comments.

Footnotes

The author’s work is supported by NSF grants DMS-2001549 and DMS-1854398.

References

Bourgain, J., Garaev, M. Z., Konyagin, S. V. and Shparlinski, I. E., ‘Multiplicative congruences with variables from short intervals’, J. Anal. Math. 124 (2014), 117147.CrossRefGoogle Scholar
Brüdern, J. and Robert, O., ‘Rational points on linear slices of diagonal hypersurfaces’, Nagoya Math. J. 218 (2015), 51100.CrossRefGoogle Scholar
Choudhry, A., ‘Symmetric Diophantine systems’, Acta Arith. 59(3) (1991), 291307.CrossRefGoogle Scholar
Gloden, A., Mehrgradige Gleichungen (P. Noordhoff, Groningen, 1944).Google Scholar
Greaves, G., ‘Some Diophantine equations with almost all solutions trivial’, Mathematika 44(1) (1997), 1436.CrossRefGoogle Scholar
Hardy, G. H. and Wright, E. M., An Introduction to the Theory of Numbers, 6th edn (Oxford University Press, Oxford, 2008).Google Scholar
Heap, W., Sahay, A. and Wooley, T. D., ‘A paucity problem associated with a shifted integer analogue of the divisor function’, J. Number Theory, to appear.Google Scholar
Heath-Brown, D. R., ‘The density of rational points on cubic surfaces’, Acta Arith. 79(1) (1997), 1730.CrossRefGoogle Scholar
Hooley, C., ‘On another sieve method and the numbers that are a sum of two $h$ th powers’, Proc. Lond. Math. Soc. (3) 43(1) (1981), 73109.CrossRefGoogle Scholar
Hooley, C., ‘On another sieve method and the numbers that are a sum of two $h$ th powers: II’, J. reine angew. Math. 475 (1996), 5575.Google Scholar
Parsell, S. T. and Wooley, T. D., ‘A quasi-paucity problem’, Michigan Math. J. 50(3) (2002), 461469.CrossRefGoogle Scholar
Salberger, P., ‘Rational points of bounded height on threefolds’, in: Analytic Number Theory, Clay Mathematics Proceedings, 7 (eds. Duke, W. and Tschinkel, Y.) (American Mathematical Society, Providence, RI, 2007), 207216.Google Scholar
Salberger, P. and Wooley, T. D., ‘Rational points on complete intersections of higher degree, and mean values of Weyl sums’, J. Lond. Math. Soc. (2) 82(2) (2010), 317342.CrossRefGoogle Scholar
Skinner, C. M. and Wooley, T. D., ‘On the paucity of non-diagonal solutions in certain diagonal Diophantine systems’, Q. J. Math. Oxford Series (2) 48(2) (1997), 255277.CrossRefGoogle Scholar
Tsui, W. Y. and Wooley, T. D., ‘The paucity problem for simultaneous quadratic and biquadratic equations’, Math. Proc. Cambridge Philos. Soc. 126(2) (1999), 209221.CrossRefGoogle Scholar
Vaughan, R. C. and Wooley, T. D., ‘On a certain nonary cubic form and related equations’, Duke Math. J. 80(3) (1995), 669735.CrossRefGoogle Scholar
Vaughan, R. C. and Wooley, T. D., ‘A special case of Vinogradov’s mean value theorem’, Acta Arith. 79(3) (1997), 193204.CrossRefGoogle Scholar
Wooley, T. D., ‘A note on symmetric diagonal equations’, in: Number Theory with an Emphasis on the Markoff Spectrum (Provo, UT, 1991) (eds. Pollington, A. D. and Moran, W.) (Marcel Dekker, New York, 1993), 317321.Google Scholar
Wooley, T. D., ‘An affine slicing approach to certain paucity problems’, in: Analytic Number Theory: Proceedings of a Conference in Honor of Heini Halberstam, Vol. 2, Progress in Mathematics, 139 (eds. Berndt, B. C., Diamond, H. G. and Hildebrand, A. J.) (Birkhäuser, Boston, MA, 1996), 803815.Google Scholar
Wooley, T. D., ‘Sums and differences of two cubic polynomials’, Monatsh. Math. 129(2) (2000), 159169.CrossRefGoogle Scholar
Wooley, T. D., ‘Paucity problems and some relatives of Vinogradov’s mean value theorem’, Preprint, 2021, arXiv:2107.12238.Google Scholar