
Discrete restriction estimates for forms in many variables

Published online by Cambridge University Press:  18 September 2023

Brian Cook
Affiliation:
Department of Mathematics, Virginia Tech, Blacksburg, VA, USA (briancookmath@gmail.com; palsson@vt.edu)
Kevin Hughes
Affiliation:
School of Mathematics, The University of Bristol, Bristol, UK; The Heilbronn Institute for Mathematical Research, Bristol, UK (khughes.math@gmail.com)
Eyvindur Palsson
Affiliation:
Department of Mathematics, Virginia Tech, Blacksburg, VA, USA (briancookmath@gmail.com; palsson@vt.edu)

Abstract

We prove discrete restriction estimates for a broad class of hypersurfaces arising in seminal work of Birch. To do so, we use a variant of Bourgain’s arithmetic version of the Tomas–Stein method and Magyar’s decomposition of the Fourier transform of the indicator function of the integer points on a hypersurface.

Type
Research Article
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on Behalf of The Edinburgh Mathematical Society.

1. Introduction

In this paper, we consider discrete restriction estimates associated to integral, positive definite forms. Recall that a form is a homogeneous polynomial, integral means that the coefficients of this polynomial are integers and positive definite means that $\mathcal{Q}({\bf{x}}) \gt 0$ for ${\bf{x}} \neq 0$. The positive definite criterion guarantees that the form is nondegenerate. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be such a form, where ${\bf{x}} = (x_1,x_2,\ldots,x_d)$ with $d \geq 2$, and k denotes the degree of the form $\mathcal{Q}$. We always assume that $k \geq 2$. For each $\lambda \in \mathbb{R} $, the polynomial $\mathcal{Q}$ cuts out a real variety $V_{\mathcal{Q}=\lambda}(\mathbb{R} ) := \{{\bf{x}} \in \mathbb{R}^d : \mathcal{Q}({\bf{x}}) = \lambda\}$ containing a discrete set of integral points $V_{\mathcal{Q}=\lambda}(\mathbb{Z} ) := \{{\bf{x}} \in \mathbb{Z}^d : \mathcal{Q}({\bf{x}}) = \lambda\};$ either or both of these sets are possibly empty depending on the value of λ. For instance, $V_{\mathcal{Q}=\lambda}(\mathbb{R} )$ is empty for negative λ since $\mathcal{Q}$ is positive definite, and $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ is empty for non-integral values of λ.

In our discussion, we always consider a fixed form $\mathcal{Q}$. So, we suppress it from the notation below. For $\lambda \in \mathbb{N} $ and functions $a : \mathbb{Z}^d \to \mathbb{C} $, define the arithmetic extension operator

\begin{equation*} E_\lambda a({\bf{\xi}}) := \sum_{{\bf{x}} \in V_{\mathcal{Q}=\lambda}(\mathbb{Z} )} a({\bf{x}}) e\hspace{-0.5mm}\left( {\bf{x}} \cdot {\bf{\xi}} \right). \end{equation*}

Letting $\omega_\lambda := \textbf{1}_{V_{\mathcal{Q}=\lambda}(\mathbb{Z} )}$, we have $ E_\lambda a({\bf{\xi}}) = \mathcal{F}_{{\mathbb{Z}^d}}({a\cdot \omega_\lambda})({\bf{\xi}}), $ where $\mathcal{F}_{{\mathbb{Z}^d}}$ is the Fourier transform defined on complex-valued functions with domain $\mathbb{Z}^d$. In other words, $E_\lambda$ is the adjoint to the restriction operator $R_{\lambda}$ defined as $ R_{\lambda}f := \mathcal{F}_{{\mathbb{T}^d}}({f}) \cdot \omega_\lambda $ for functions $f : \mathbb{T}^d \to \mathbb{C} $. The extension operator is trivial when the variety has no integer points; that is, when $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ is the empty set. Consequently, we are interested in situations where the variety has many integer points. The prototypical examples here are spheres (centered at the origin) in five or more variables. Here the form is given by the sum of squares $x_1^2+\cdots+x_d^2$, and the cardinality of $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ has order of magnitude $\lambda^{\frac{d}{2}-1}$ for $\lambda \in \mathbb{N} $. According to a theorem of Birch, there is a natural setting for these operators, which we review here.
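As a quick numerical illustration of the order of magnitude $\lambda^{\frac{d}{2}-1}$ for spheres mentioned above, the following minimal Python sketch counts the integer points on spheres in $d = 5$ variables and compares the counts with $\lambda^{3/2}$. The function name `sphere_counts` and the cutoff `limit` are hypothetical choices made for this illustration; the computation plays no role in the arguments of the paper.

```python
from math import isqrt

def sphere_counts(d, limit):
    """counts[n] = #{x in Z^d : x_1^2 + ... + x_d^2 = n} for 0 <= n <= limit."""
    # (value, multiplicity) pairs for one variable: 0 once, m^2 twice (x = +m, -m).
    squares = [(m * m, 1 if m == 0 else 2) for m in range(isqrt(limit) + 1)]
    counts = [1] + [0] * limit      # zero variables: only n = 0 is represented
    for _ in range(d):              # add one variable at a time
        new = [0] * (limit + 1)
        for n, c in enumerate(counts):
            if c:
                for sq, w in squares:
                    if n + sq > limit:
                        break       # squares are sorted, so we can stop early
                    new[n + sq] += c * w
        counts = new
    return counts

d, limit = 5, 2000                  # illustrative values only
counts = sphere_counts(d, limit)
# Ratio of the point count to the predicted order of magnitude lambda^(d/2 - 1).
ratios = [counts[n] / n ** (d / 2 - 1) for n in range(1, limit + 1)]
print(counts[1:6])
print(min(ratios), max(ratios))
```

Printing the smallest and largest ratio over the tested range gives a rough empirical view of the two-sided bound; it is of course no substitute for the asymptotic statement.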

Define the Birch singular locus of the form $\mathcal{Q}$ as the complex variety

\begin{equation*} V_{\mathcal{Q}}^\dagger(\mathbb{C} ) := \{{\bf{x}} \in \mathbb{C}^d : \nabla \mathcal{Q}({\bf{x}}) = {\bf{0}}\}. \end{equation*}

Let $\dim_{\mathbb{C}} (V)$ denote the algebraic dimension of a complex variety V. We will say that an integral form is regular if it satisfies Birch’s criterion:

\begin{equation} d-\dim_{\mathbb{C}} (V_{\mathcal{Q}}^\dagger(\mathbb{C} )) \gt (k-1)2^k. \tag{1} \end{equation}

We define the Birch rank $B(\mathcal{Q})$ of a form $\mathcal{Q}$ to be the co-dimension $d-\dim_{\mathbb{C}} (V_{\mathcal{Q}}^\dagger(\mathbb{C} ))$. The Birch rank is always non-negative since $V_{\mathcal{Q}}^\dagger(\mathbb{C} )$ being a variety in $\mathbb{C}^d$ implies that $\dim_{\mathbb{C}} (V_{\mathcal{Q}}^\dagger(\mathbb{C} )) \leq d$. To justify the term ‘rank’, one should note that this generalizes the notion of rank for quadratic forms. Indeed, for a quadratic form $\mathcal{Q}({\bf{x}}):={\bf{x}}M{\bf{x}}^{\rm T}$ defined by some symmetric d × d-matrix M, a simple calculation gives $B(\mathcal{Q})=\operatorname{rank}(M)$. Here, and in related examples, the point ${\bf{x}} \in \mathbb{Z}^d$ is regarded as a row vector of length d and ${\bf{x}}^{\rm T}$ is its transpose.
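The quadratic case can be checked mechanically. For $\mathcal{Q}({\bf{x}}) = {\bf{x}}M{\bf{x}}^{\rm T}$ the gradient is ${\bf{x}}(M+M^{\rm T})$, so the singular locus is the null space of $M+M^{\rm T}$ and its co-dimension is $\operatorname{rank}(M+M^{\rm T})$, which equals $\operatorname{rank}(M)$ when M is symmetric. The sketch below computes this with exact rational elimination; the helper names `rank` and `birch_rank_quadratic` are hypothetical and introduced only for this illustration.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a rational matrix via Gaussian elimination over Fractions."""
    rows = [[Fraction(v) for v in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0                                   # number of pivots found so far
    for c in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue                        # no pivot in this column
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [vi - factor * vr for vi, vr in zip(rows[i], rows[r])]
        r += 1
    return r

def birch_rank_quadratic(M):
    """Co-dimension of {x : x (M + M^T) = 0}, i.e. rank(M + M^T)."""
    d = len(M)
    S = [[M[i][j] + M[j][i] for j in range(d)] for i in range(d)]
    return rank(S)

identity5 = [[int(i == j) for j in range(5)] for i in range(5)]  # the sphere, d = 5
# (x1 + x2)^2 in three variables: not positive definite, used only to show a rank drop.
degenerate = [[1, 1, 0], [1, 1, 0], [0, 0, 0]]
print(birch_rank_quadratic(identity5), birch_rank_quadratic(degenerate))
```

The rank over $\mathbb{Q}$ agrees with the rank over $\mathbb{C}$, which is why exact rational arithmetic suffices here.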

When Equation (1) is satisfied, Birch [Reference Birch2] tells us that there exists an infinite arithmetic progression $\Gamma_{\mathcal{Q}}$ in $\mathbb{N} $ depending on the form $\mathcal{Q}$ such that for each $\lambda \in \Gamma_{\mathcal{Q}}$, there exists a positive constant $C_{\mathcal{Q}}(\lambda)$ with the property that

\begin{equation} N_{\mathcal{Q}}(\lambda) := \#\{{\bf{n}} \in \mathbb{Z}^d : \mathcal{Q}({\bf{n}}) = \lambda \} = C_{\mathcal{Q}}(\lambda) \lambda^{\frac{d}{k}-1} + O_{\mathcal{Q}}(\lambda^{\frac{d}{k}-1-\delta}) \gt 0 \tag{2} \end{equation}

for some positive δ depending on the form $\mathcal{Q}$. Moreover, there exist constants $c_2 \gt c_1 \gt 0$ such that $c_1 \leq C_{\mathcal{Q}}(\lambda) \leq c_2$ for all $\lambda \in \Gamma_{\mathcal{Q}}$. Based on Birch’s asymptotic Equation (2) and on the usual heuristics of the circle method, one expects the following estimates.

Conjecture 1.

Let $\mathcal{Q}$ be an integral, positive definite form of degree $k \geq 2$ in $d \gt 2k$ variables. For each $1 \leq p \leq \infty$ and ϵ > 0, there exists a positive constant $C_{\mathcal{Q},p,\epsilon}$ such that

\begin{equation} \|E_\lambda a\|_{L^p(\mathbb{T}^d)} \leq C_{\mathcal{Q},p,\epsilon} \lambda^\epsilon (1+\lambda^{\frac{d-k}{2k}-\frac{d}{kp}}) \|a\|_{\ell^2(\mathbb{Z}^d)}. \tag{3} \end{equation}

For $k \geq 3$, we further conjecture that one may remove the ϵ-loss; that is, for each $1 \leq p \leq \infty$, there exists a constant $C_{\mathcal{Q},p}$ such that

\begin{equation} \|E_\lambda a\|_{L^p(\mathbb{T}^d)} \leq C_{\mathcal{Q},p} \left(1+\lambda^{\frac{d-k}{2k}-\frac{d}{kp}}\right) \|a\|_{\ell^2(\mathbb{Z}^d)}. \tag{4} \end{equation}

There are two trivial estimates known for Conjecture 1. The first trivial estimate is the $\ell^2 \to L^2$ estimate, which is furnished by Plancherel’s theorem. The second trivial estimate is the $\ell^2 \to L^\infty$ estimate, which is furnished by the Cauchy–Schwarz inequality and Equation (2) when the latter is known to hold. Conjecture 1 has been intensively studied in the quadratic case, especially for the spherical case $\mathcal{Q}({\bf{x}}) := x_1^2+\cdots+x_d^2$. Even for the sphere, this problem remains open despite major recent advances in the area. See [Reference Bourgain4–Reference Bourgain and Demeter9] for more information regarding the spherical case and [Reference Bourgain and Demeter10] for other quadratic hypersurfaces. In contrast, for forms of higher degree, there are no hitherto known non-trivial estimates towards this problem.
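To make the extension operator and the second trivial estimate concrete, here is a minimal brute-force sketch in Python. It enumerates the integer points on a small sphere, evaluates $E_\lambda a$ at one frequency and checks the Cauchy–Schwarz bound $|E_\lambda a({\bf{\xi}})| \leq N_{\mathcal{Q}}(\lambda)^{1/2} \|a\|_{\ell^2}$. The names `integer_points` and `extension`, the test sequence `a` and the chosen frequency are assumptions made for illustration; the character follows the sign convention $e(t) = {\rm e}^{-2\pi it}$ of Section 2.

```python
import cmath
from itertools import product
from math import sqrt

def e(t):
    """The character e(t) = exp(-2*pi*i*t)."""
    return cmath.exp(-2j * cmath.pi * t)

def integer_points(Q, d, lam, radius):
    """All x in Z^d with |x_i| <= radius and Q(x) == lam (brute force)."""
    box = range(-radius, radius + 1)
    return [x for x in product(box, repeat=d) if Q(x) == lam]

def extension(a, points, xi):
    """E_lambda a(xi): sum of a(x) * e(x . xi) over the given integer points."""
    return sum(a(x) * e(sum(xj * tj for xj, tj in zip(x, xi))) for x in points)

# Illustrative choice: the sphere form in d = 5 variables at level lambda = 5.
sphere = lambda x: sum(c * c for c in x)
pts = integer_points(sphere, d=5, lam=5, radius=2)

a = lambda x: 1.0 + x[0]                      # an arbitrary test sequence
l2_norm = sqrt(sum(abs(a(x)) ** 2 for x in pts))
value = extension(a, pts, xi=(0.1, 0.2, 0.3, 0.4, 0.5))

# Cauchy-Schwarz: |E a(xi)| <= sqrt(#points) * ||a||_2 at every frequency xi.
print(len(pts), abs(value), sqrt(len(pts)) * l2_norm)
```

At ${\bf{\xi}} = {\bf{0}}$ and $a \equiv 1$ on the sphere the bound is attained, which is why the $\ell^2 \to L^\infty$ estimate cannot be improved without further input.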

Our result is an affirmative answer to Conjecture 1 when the form is also assumed to be regular, and it yields Equation (4) when p and d are both sufficiently large. In particular, p will be much larger than the critical exponent $p_c = p_c(\mathcal{Q}) := \frac{2d}{d-k}$. (The critical exponent is defined as the exponent p where the two summands in Equation (3) or (4) balance. Supercritical p means that $p \gt p_c$, while subcritical p means that $p \lt p_c$.) To state our result, we introduce a relevant parameter. For a regular, integral form $\mathcal{Q}$ of degree k in d variables, define the parameter

\begin{equation} \gamma_{\mathcal{Q}} := \frac{1}{6k} \left( \frac{d-\dim(V_{\mathcal{Q}}^\dagger(\mathbb{C} ))}{(k-1)2^k} - 1 \right). \tag{5} \end{equation}

Throughout we assume that d is sufficiently large with respect to k to satisfy the regularity criterion (1). This implies that $\gamma_{\mathcal{Q}} \gt 0$ and $d \gt 2k$. Our main result is the following.

Theorem 1. Let $\mathcal{Q}$ be a regular, positive definite integral form in d variables of degree $k \geq 2$. If $p \gt 2+\frac{2k}{\gamma_{\mathcal{Q}}}$, then Equation (4) holds for $\lambda \in \mathbb{N} $.

We take a moment to orient ourselves with a few examples, recording what Theorem 1 gives in each case and comparing it with known bounds when applicable.

1.1. Spheres

For the form $\mathcal{Q}({\bf{x}}) := |{\bf{x}}|^2$, the singular locus $V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{\nabla \mathcal{Q}({\bf{x}}) = 2{\bf{x}} = {\bf{0}} \}$ is $\{{\bf{0}}\}$. Therefore, the dimension of the singular locus is 0, and we require that $d-0 \gt (2-1)2^2$. More simply, we require that $d \geq 5$ for spheres. Under this assumption on the dimension, $\gamma_{\mathcal{Q}} = (d-4)/48$ and Theorem 1 implies that the supercritical extension estimate Equation (4) holds for $p \gt 2+192/(d-4)$. This range of p is far away from the conjectured critical exponent of $2+4/(d-2)$. Fortunately, in this case, one may replace $\gamma_{\mathcal{Q}}$ (in Equation (13) below) by $(d-1)/4+\epsilon$ for any ϵ > 0 and $d \geq 4$ from [Reference Magyar25]. In turn, this replacement improves the range of p in Theorem 1 to all $p \gt 2+8/(d-3)$ for $d \geq 4$. This recovers the bounds obtained in [Reference Bourgain4] but falls short of their subsequent improvements obtained in [Reference Bourgain6–Reference Bourgain and Demeter9].

1.2. Ellipsoids

Suppose that M is an invertible d × d matrix with integral coefficients such that the associated quadratic form $\mathcal{Q}({\bf{x}}) := {\bf{x}}M{\bf{x}}^{\rm T}$ is positive definite. Spheres correspond to M being the identity matrix. Since M is invertible, $V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{{\bf{0}}\}$, and Theorem 1 implies that the supercritical extension estimate Equation (4) holds for $p \gt 2+192/(d-4)$. While we are unaware of any results in this level of generality for quadratic forms, presumably, a more delicate approach following [Reference Bourgain4, Reference Henriot and Hughes18] would yield a bound closer to the critical exponent.

1.3. k-Spheres

For the form $\mathcal{Q}({\bf{x}}) := x_1^k+\cdots+x_d^k$, where k is an integer greater than 1, its singular locus is $V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{\nabla \mathcal{Q}({\bf{x}}) = k(x_1^{k-1},\dots,x_d^{k-1}) = {\bf{0}} \} = \{{\bf{0}}\}$. Therefore, the dimension of the singular locus is 0, and we require that $d \gt (k-1)2^k$. Under this assumption on the dimension, $\gamma_{\mathcal{Q}} = (d-(k-1)2^k)/(6k(k-1)2^{k})$, and Theorem 1 implies that the supercritical extension estimate Equation (4) holds for $p \gt 2+(12k^2(k-1)2^{k})/(d-(k-1)2^k)$. When k = 3, this becomes d > 16 and $p \gt 2+1728/(d-16)$.
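The arithmetic in the examples above can be reproduced with exact rational arithmetic. The following sketch evaluates Birch’s criterion (1), the parameter $\gamma_{\mathcal{Q}}$ from Equation (5) and the threshold $2+\frac{2k}{\gamma_{\mathcal{Q}}}$ of Theorem 1; the function names are hypothetical, and the dimension of the singular locus is supplied by hand (it is 0 in all three examples).

```python
from fractions import Fraction

def birch_rank(d, dim_singular_locus=0):
    """Co-dimension of the Birch singular locus in C^d."""
    return d - dim_singular_locus

def is_regular(d, k, dim_singular_locus=0):
    """Birch's criterion (1): d - dim(singular locus) > (k - 1) 2^k."""
    return birch_rank(d, dim_singular_locus) > (k - 1) * 2 ** k

def gamma(d, k, dim_singular_locus=0):
    """The parameter gamma_Q of Equation (5)."""
    B = birch_rank(d, dim_singular_locus)
    return Fraction(1, 6 * k) * (Fraction(B, (k - 1) * 2 ** k) - 1)

def p_threshold(d, k, dim_singular_locus=0):
    """Theorem 1 applies for p strictly larger than 2 + 2k / gamma_Q."""
    if not is_regular(d, k, dim_singular_locus):
        raise ValueError("form is not regular in the sense of (1)")
    return 2 + Fraction(2 * k) / gamma(d, k, dim_singular_locus)

def critical_exponent(d, k):
    """p_c = 2d / (d - k)."""
    return Fraction(2 * d, d - k)

for d, k in [(5, 2), (20, 2), (17, 3), (100, 3)]:
    print(d, k, gamma(d, k), p_threshold(d, k), critical_exponent(d, k))
```

Comparing the last two columns shows how far the range of Theorem 1 sits above the critical exponent, in line with the discussion of spheres.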

A peculiar feature of Birch’s method - and hence our results - is that the Birch rank is defined in terms of the complex points of the singular locus rather than its real points. Recall Euler’s theorem: for any form $\mathcal{Q}$, we have the identity

\begin{equation*} \deg(\mathcal{Q}) \mathcal{Q}(x_1,\ldots,x_d) = (x_1,\ldots,x_d) \cdot \nabla\mathcal{Q}(x_1,\ldots,x_d), \end{equation*}

where the · on the right-hand side denotes the inner product of two vectors. By Euler’s theorem, a real singular point for a positive definite form is necessarily 0. In other words, $V_{\mathcal{Q}}^\dagger(\mathbb{R} ) = \{{\bf{0}}\}$ for every positive definite form $\mathcal{Q}$. In contrast, the Birch singular locus can be huge as seen in the following ‘non-example’ of a positive definite form whose singular locus is too large for our theorem and methods to be applicable.

1.4. A non-example

Consider the form $\mathcal{Q}({\bf{x}}) := (x_1^2+\cdots+x_d^2)^2$; its Birch singular locus is

\begin{equation*} V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{{\bf{x}} \in \mathbb{C}^d : 4(x_1^2+\cdots+x_d^2){\bf{x}} = {\bf{0}} \} = \{{\bf{x}} \in \mathbb{C}^d : x_1^2+\cdots+x_d^2 = 0 \} .\end{equation*}

This is a co-dimension 1 complex algebraic set. Consequently, this form fails to satisfy Birch’s regularity criterion (1) regardless of how large d (the number of variables) is. Meanwhile, its real singular locus $V_{\mathcal{Q}}^\dagger(\mathbb{R} )$ is the set $\{{\bf{0}}\}$. When λ is a square, the corresponding restriction operator is closely related to that for the form $x_1^2+\cdots+x_d^2$. When λ is not a square, the behaviour of the corresponding restriction operator is subtle.
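The contrast between real and complex singular points in this non-example is easy to see numerically. The short sketch below evaluates the gradient $4(x_1^2+\cdots+x_d^2){\bf{x}}$ at one complex point on the quadric $x_1^2+\cdots+x_d^2 = 0$ and at one nonzero real point; the function name `grad_non_example` and the two sample points are illustrative assumptions.

```python
def grad_non_example(x):
    """Gradient of Q(x) = (x_1^2 + ... + x_d^2)^2, namely 4 (x . x) x.

    The bilinear product x . x is used (no complex conjugation), as is
    appropriate for a polynomial evaluated at complex points.
    """
    s = sum(c * c for c in x)
    return [4 * s * c for c in x]

complex_point = [1, 1j, 0, 0]   # 1^2 + i^2 = 0, so this lies on the quadric
real_point = [1, 1, 0, 0]       # a nonzero real point

print(grad_non_example(complex_point))  # every entry vanishes
print(grad_non_example(real_point))     # nonzero, as Euler's theorem predicts
```

Any nonzero complex point on the quadric behaves like `complex_point`, which is the source of the co-dimension 1 singular locus.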

Remark 1.1. The expert can formulate conjectures analogous to Conjecture 1 for integral forms and their 0-level sets without difficulty. Practically, this presents only a technical difference from our hypothesis in Theorem 1 that forms are positive definite. Our methods also apply in this setting, but we do not pursue the analogous results in this paper.

Having considered a few examples, let us now discuss our motivations. One motivation is to extend discrete restriction theory for hypersurfaces beyond the setting of spheres and paraboloids. This is the first attempt to do so. This work fits into a broader program, initiated by Magyar in [Reference Magyar24], which seeks to understand discrete (more appropriately termed ‘arithmetic’) harmonic analysis for hypersurfaces. Initial forays into this program have centered around Birch’s theorem and have had applications to maximal functions and ergodic theorems [Reference Magyar24] and [Reference Cook and Hughes12], discrepancy estimates in [Reference Magyar25], Szemerédi-type theorems in [Reference Magyar26] and $\ell^p$-improving estimates in [Reference Hughes21].

Our approach to Theorem 1 is motivated by a previously open question. This question, posed by the second author in May 2014 at the Workshop III: Kakeya problem, Restriction Problem and Sum-product Theory Workshop as part of IPAM’s long program Algebraic Techniques for Combinatorial and Computational Geometry, asks: Can one use Magyar–Stein–Wainger’s decomposition of the surface measure for the integer points on a sphere from [Reference Magyar, Stein and Wainger27] to improve the discrete restriction estimates for the sphere?

This question was natural since Magyar–Stein–Wainger’s decomposition had been successfully used in the aforementioned works of Magyar, but at that time, it was unknown if Magyar–Stein–Wainger’s decomposition could be used to prove non-trivial discrete restriction estimates for the sphere. Our proof of Theorem 1 reveals that Magyar–Stein–Wainger’s decomposition can be used to prove non-trivial discrete restriction estimates for the sphere. Examining [Reference Bourgain4], the second author’s question means: What is the best way to control the error term in the decomposition?

This latter question closely relates to another question, posed by Ákos Magyar at the Georgia Discrete Analysis Conference in May 2018, which asks: how does one incorporate minor arc estimates for higher degree Diophantine equations in order to obtain discrete restriction estimates? At that time, no discrete restriction estimates were known for a single degree 3 or higher multivariate form. Magyar’s question was natural given the fact that for quadratic forms one does not need to use minor arcs but one must grapple with the minor arcs for hypersurfaces of degrees 3 or more. This relates to the first question because the minor arcs contribute the greatest error term in the decomposition formulas for hypersurfaces of degrees 3 and more. In the quadratic cases, there is no need for minor arcs, and they have not made an appearance in previous analyses.

It transpired that Magyar’s question was partially answered in [Reference Henriot and Hughes17] where minor arc estimates were incorporated to prove discrete restriction estimates for ‘k-paraboloids’. While [Reference Henriot and Hughes17] were predominately interested in ϵ-removal lemmas, the methods therein also used minor arc estimates to prove discrete restriction estimates. When one observes that the worst error term (in Magyar’s generalizations of Magyar–Stein–Wainger’s decomposition) arises from the minor arcs, the natural strategy becomes to adapt those methods to handle the other error terms. Our work answers these questions by successfully using this strategy.

We organized our argument to closely follow [Reference Henriot and Hughes17] so that Theorem 1 reduces to proving appropriate estimates for the main term and the error term, but we streamline the approach to fit our purposes. In particular, since we are interested in the sharp discrete restriction estimates, our approach ‘bakes in’ the ϵ-removal. The bounds for the error terms are taken from [Reference Magyar24]. Meanwhile, the bulk of our work lies in handling the main term. This is done in Theorem 2 where we prove a dyadically refined decomposition of the main term, which is better suited to our purposes. Outlined in Section 4, this refinement is this paper’s main technical contribution, which allows us to adapt the Tomas–Stein method in [Reference Bourgain4] to the main term.

Instead of striving to fully optimize every aspect of our argument, we have aimed to give a simplified version of the general method, which hopefully illuminates the main ideas. The main bottleneck in our argument is the poor state of knowledge for minor arcs bounds in Equation (13) that leads us to define $\gamma_{\mathcal{Q}}$ in Equation (5). Any better decay rate of Equation (13) (e.g., replacing $\gamma_{\mathcal{Q}}$ therein by a larger constant) immediately enlarges the range of p in Theorem 1. For example, one can improve the ranges of d and p in Theorem 1 for k-spheres by using superior minor arc estimates available in this case. Such estimates are possible by exploiting the diagonal structure of the underlying Diophantine equation; see [Reference Magyar25] when k = 2 and [Reference Anderson, Cook, Hughes and Kumchev1] when $k \geq 3$ for the best bounds presently known.

There has been much recent progress on decoupling estimates for affine-invariant systems of equations in many variables following [Reference Bourgain and Demeter9, Reference Bourgain, Demeter and Guth11, Reference Wooley29]. (Affine invariance is also known as translation-dilation invariance or as parabolic rescaling.) For instance, see [Reference Guo and Zhang14–Reference Guo and Zorin-Kranich16]. It is important to note the setting of Theorem 1 is far from affine invariant. By combining [Reference Parsell, Prendiville and Wooley28] and [Reference Lai and Ding23], there is a different way to use such decoupling results to prove Equation (3) for sufficiently large $p \gt p_c$. However, this procedure almost surely yields a smaller range of p than Theorem 1 provides, and it becomes increasingly worse as the degree or number of variables increases. Moreover, another method must be used to obtain the sharper estimate (4). The only known way to sharpen an estimate from Equation (3) to (4) is a circle method approach like the one used in the proof of Theorem 1.

1.5. Organization of the paper

The paper is organized as follows. In § 2, we set notation used throughout the paper. In § 3, we give an abstract formulation of Tomas’s method for discrete $L^2$ restriction theorems dating to [Reference Bourgain3]; Lemma 1 therein reduces our problem to proving estimates related to the Fourier transform of the surface measure. In § 4, we recall a decomposition of the surface measure due to Magyar and related estimates from [Reference Magyar24]; this is ‘Magyar’s Decomposition Theorem’. Combining Lemma 1 and Magyar’s Decomposition Theorem, we reduce Theorem 1 to Theorem 2, which is an estimate for the major arcs. In § 5, we prove a bound for the major arc pieces by a further application of Tomas’s methods.

2. Notation

We introduce here some notation that will streamline our exposition.

  • For a positive integer q, we let $\mathbb{Z}/q$ denote the group of integers modulo q and $U_q := \{1 \leq a \lt q : (a,q)=1 \}$ denote its unit group.

  • We write $f(\lambda) \lesssim g(\lambda)$ if there exists a constant C > 0 independent of all λ under consideration (e.g., λ in $\mathbb{N} $ or in $\Gamma_{\mathcal{Q}}$) such that $ |f(\lambda)| \leq C |g(\lambda)|. $ Furthermore, we will write $f(\lambda) \gtrsim g(\lambda)$ if $g(\lambda) \lesssim f(\lambda)$, while we will write $f(\lambda) \eqsim g(\lambda)$ if $f(\lambda) \lesssim g(\lambda)$ and $f(\lambda) \gtrsim g(\lambda)$. Subscripts in the above notation will denote parameters, such as the dimension d or degree k of a form $\mathcal{Q}$, on which the implicit constants may depend.

  • $\mathbb{T}^d$ denotes the d-dimensional torus $(\mathbb{R} /\mathbb{Z} )^d$ identified with the unit cube $[-1/2,1/2]^d$.

  • $*$ denotes convolution on a group such as $\mathbb{Z}^d$, $\mathbb{T}^d$ or $\mathbb{R}^d$. It will be clear from context as to which group the convolution takes place.

  • $e\hspace{-0.5mm}\left( t \right)$ will denote the character ${\rm e}^{-2\pi it}$ for $t \in \mathbb{R} $ or $\mathbb{T} $.

  • For a function $f: \mathbb{Z}^d \to \mathbb{C} $, its $\mathbb{Z}^d$-Fourier transform will be denoted $\mathcal{F}_{{\mathbb{Z}^d}}{f}({\bf{\xi}})$ for ${\bf{\xi}} \in \mathbb{T}^d$. For a function $f: \mathbb{T}^d \to \mathbb{C} $, its $\mathbb{T}^d$-Fourier transform will be denoted $\mathcal{F}_{{\mathbb{T}^d}}{f}({\bf{x}})$ for ${\bf{x}} \in \mathbb{Z}^d$. $\mathcal{F}_{{\mathbb{Z}^d}}$ and $\mathcal{F}_{{\mathbb{T}^d}}$ are defined so that they are inverses of one another. For a function $f: \mathbb{R}^d \to \mathbb{C} $, its $\mathbb{R}^d$-Fourier transform will be denoted $\mathcal{F}_{{\mathbb{R}^d}}{f}({\bf{x}})$ for ${\bf{x}} \in \mathbb{R}^d$.

  • For a function $f: \mathbb{R}^d \to \mathbb{C} $, we define the dilation operator $\operatorname{D}_{t}$ by $\operatorname{D}_{t}f({\bf{x}}) = f({\bf{x}}/t)$.

  • For a ring R, we will use the inner product notation ${\bf{b}} \cdot {\bf{m}}$ for vectors ${\bf{b}},{\bf{m}} \in R^d$ to mean the sum $\sum_{i=1}^d b_i m_i$. This is used for the rings $\mathbb{R} ,\mathbb{Z} ,\mathbb{T} $ and $\mathbb{Z} /q$, where $q \in \mathbb{N} $.

  • We also let $\textbf{1}_X$ denote the indicator function of the set X.

3. The arithmetic Tomas–Stein method

Let $\omega_\lambda$ be the counting measure on $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ for a single integral, positive definite, homogeneous form $\mathcal{Q}$ satisfying (1) and some $\lambda \in \mathbb{Z} $. Let $F = \mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda)$ be the exponential sum corresponding to $\omega_\lambda$. A common approach to problems involving $\omega_\lambda$ is to use the circle method so as to decompose the exponential sum F into a main piece $F_\mathfrak{M}$ and an error term $F_\mathfrak{m}$ corresponding, respectively, to major and minor arcs. (These are analogous to low and high frequency pieces, respectively.) To prove discrete restriction estimates, Bourgain in [Reference Bourgain4] combined this approach with Tomas’s $L^2$ restriction argument in order to reduce matters to the following two estimates:

  • Bounds for the operator given by convolution with the major arc operator $F_\mathfrak{M}$, and

  • A uniform power saving bound on the minor arc piece $F_\mathfrak{m}$.

See [Reference Hu and Li19, Reference Hu and Li20] for a variant. Bourgain’s approach has been abstracted in [Reference Keil22] and [Reference Henriot and Hughes17]. We combine Lemmas 3.3 and 3.6 from [Reference Henriot and Hughes17] to form the following lemma.

Lemma 1. For $\lambda \in \mathbb{N} $, let $F = \mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda)$ be the $\mathbb{Z}^d$-Fourier transform of the arithmetic surface measure $\omega_\lambda$ defined on $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$. Suppose that there exists a decomposition $F = F_{\mathfrak{M}} + F_{\mathfrak{m}}$ such that for each $f \in L^\infty(\mathbb{T}^d)$, we have the estimates

\begin{align} \| F \ast f \|_{L^{p_0}(\mathbb{T}^d)} \lesssim \lambda^{\epsilon} \|f\|_{L^{p_0'}(\mathbb{T}^d)} \quad \text{for some} \ p_0 \leq p_c, \tag{TS1} \end{align}
\begin{align} \| F_{\mathfrak{M}} \ast f \|_{L^{p_1}(\mathbb{T}^d)} \lesssim \lambda^{{\frac{d}{k}-1}-\frac{2d}{kp_1}} \|f\|_{L^{p_1'}(\mathbb{T}^d)} \quad \text{for some} \ p_1 \gt p_c, \ \text{and} \tag{TS2} \end{align}
\begin{align} \| F_{\mathfrak{m}} \|_{L^\infty(\mathbb{T}^d)} \lesssim {\lambda}^{\frac{d}{k}-1-\frac{\zeta}{k}} \quad \text{for some} \ \zeta \in (0,d-k). \tag{TS3} \end{align}

Then $\| F \ast f \|_{L^p(\mathbb{T}^d)} \lesssim \lambda^{\frac{d}{k}-1 - \frac{2d}{kp}} \| f \|_{L^{p^{\prime}}(\mathbb{T}^d)}$ holds for $p \gt \max\left[p_1, \frac{2d - (d-k)p_0}{\zeta} + p_0 \right]$.

In our work, we only use Plancherel’s theorem to exploit the subcritical estimate at $p_0 = 2$; this gives the exponent $p \gt \max\left[ p_1, \frac{2d - (d-k)2}{\zeta} + 2 \right] = \max\left[ p_1, \frac{2k}{\zeta} + 2 \right]$. We give the proof of Lemma 1 for completeness.

Proof of Lemma 1

Set $N = \lceil \lambda^{1/k} \rceil$. Fix $p \gt \max\left[p_1, \frac{2d - (d-k)p_0}{\zeta} + p_0 \right]$ and let a be an element of $\ell^{2}$. For notational convenience, we let E denote the extension operator defined on sequences $a : \mathbb{Z}^d \to \mathbb{C} $ by $Ea := \mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda \cdot \mathcal{F}_{{\mathbb{T}^d}}a) = a*\mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda)$. We may assume that a is not identically zero and by homogeneity normalize a so that $\| a \|_{2} = 1$. We introduce a parameter α > 0 in order to define the level sets and functions

\begin{align*} S_\alpha = \{{\bf{\xi}} \in \mathbb{T}^d : |Ea({\bf{\xi}})| \geq \alpha \} \quad \text{and} \quad f = 1_{S_\alpha} \frac{Ea}{|Ea|}. \end{align*}

By the Cauchy–Schwarz inequality and Birch’s theorem in [Reference Birch2], we have

\begin{equation} \|Ea\|_{L^\infty} \lesssim N^{\frac{d-k}{2}}. \tag{6} \end{equation}

Therefore, we may restrict α to lie in the interval $[0,CN^{\frac{d-k}{2}}]$ for some positive constant C. By Parseval’s identity, we have

\begin{align*} \alpha |S_\alpha| \leq \langle f , Ea \rangle = \langle \mathcal{F}_{{\mathbb{T}^d}}f , \omega_\lambda \cdot a \rangle = \langle \omega_\lambda \cdot \mathcal{F}_{{\mathbb{T}^d}}f , a \rangle. \end{align*}

By Cauchy–Schwarz and the assumption $\| a \|_2 = 1$, it follows that

\begin{align*} \alpha^2 |S_\alpha|^2 \leq \| (\mathcal{F}_{{\mathbb{T}^d}}f) \omega_\lambda \|_{\ell^2}^2 = \langle (\mathcal{F}_{{\mathbb{T}^d}}f) \cdot \omega_\lambda , \mathcal{F}_{{\mathbb{T}^d}}f \rangle. \end{align*}

Another application of Parseval’s identity implies that

\begin{align} \alpha^2 |S_\alpha|^2 \leq \langle f \ast F, f \rangle. \tag{7} \end{align}

By Equation (7), Hölder’s inequality and hypotheses (TS2) and (TS3) of the lemma, we have

\begin{align*} \alpha^2 |S_\alpha|^2 & \leq \| f \ast F_{\mathfrak{M}} \|_{p_1} \| f \|_{p_1^{\prime}} + \| f \ast F_{\mathfrak{m}} \|_\infty \| f \|_1 \\ & \lesssim N^{d-k - \frac{2d}{p_1}} \| f \|_{p_1'}^2 + \| F_{\mathfrak{m}} \|_\infty \| f \|_1^2 \\ &\lesssim N^{d-k - \frac{2d}{p_1}} | {S_\alpha} |^{\frac{2}{p_1^{\prime}}} + N^{d-k-\zeta} | {S_\alpha} |^2. \end{align*}

Therefore, when $ \alpha \gtrsim N^{\frac{d-k}{2}-\frac{\zeta}{2}} $, we have

\begin{equation*} \alpha^2 |S_\alpha|^2 \lesssim N^{d-k - \frac{2d}{p_1}} | {S_\alpha} |^{2 - \frac{2}{p_1}}. \end{equation*}

Rearranging implies that $|S_\alpha| \lesssim \alpha^{-p_1} N^{\frac{(d-k)p_1}{2} - d}$. Since $p \gt p_1$, we have

\begin{align*} \int_{|Ea| \gtrsim N^{\frac{d-k}{2} - \frac{\zeta}{2}}} |Ea|^p \, \mathrm{d}m & = p \int_{CN^{\frac{d-k}{2} - \frac{\zeta}{2}} }^{CN^{\frac{d-k}{2}} } \alpha^{p-1} |S_\alpha| \, \mathrm{d}\alpha \\ &\lesssim N^{\frac{(d-k)p_1}{2} - d} \int_1^{CN^{\frac{d-k}{2}}} \alpha^{p-p_1-1} \, \mathrm{d}\alpha \\ &\lesssim N^{\frac{(d-k)p}{2} - d}. \end{align*}

Altogether, we have

\begin{equation} \int_{|Ea| \gtrsim N^{\frac{d-k}{2} - \frac{\zeta}{2}}} |Ea|^p \, {\rm d}{m} \lesssim N^{\frac{(d-k)p}{2} - d}. \tag{8} \end{equation}

We are left to consider the regime where $|Ea| \lesssim N^{\frac{d-k}{2} - \frac{\zeta}{2}}$. We make use of estimate (TS1) at the exponent $p_0$ to handle this regime. This is possible by the trivial bound (6) as follows:

\begin{align*} \int_{|Ea| \lesssim N^{\frac{d-k}{2} - \frac{\zeta}{2}} } |Ea|^p \, \mathrm{d}m &\lesssim \left(N^{\frac{d-k}{2} - \frac{\zeta}{2}}\right)^{p - p_0} \int_{\mathbb{T}^d} |Ea|^{p_0} \, \mathrm{d}m \lesssim_\epsilon N^{\frac{(d-k-\zeta)(p-p_0)}{2}+ \epsilon}. \end{align*}

Combining this estimate with Equation (8), we have that

\begin{align*} \int |Ea|^p \, \mathrm{d}m &= \int_{|Ea| \lesssim N^{\frac{d-k}{2} - \frac{\zeta}{2}} } |Ea|^p \, \mathrm{d}m + \int_{|Ea| \gtrsim N^{\frac{d-k}{2} - \frac{\zeta}{2}} } |Ea|^p \, \mathrm{d}m \\ &\lesssim_\epsilon N^{\frac{(d-k)p}{2} - d} + N^{\frac{(d-k-\zeta)(p-p_0)}{2}+ \epsilon}. \end{align*}

The latter summand is dominated by the former summand when $\frac{(d-k-\zeta)(p-p_0)}{2} \lt \frac{(d-k)p}{2} - d$. This is equivalent to

\begin{equation*} \frac{(d-k-\zeta)(p-p_0)}{2} = \frac{(d-k)p}{2} - \frac{\zeta p}{2} -\frac{(d-k-\zeta)p_0}{2} \lt \frac{(d-k)p}{2} - d, \end{equation*}

which is equivalent to

\begin{equation*} \frac{\zeta p}{2}+\frac{(d-k-\zeta)p_0}{2} \gt d. \end{equation*}

Rearranging this last expression, we find that we need

\begin{equation*} p \gt \frac{2}{\zeta} \left(d - \frac{(d-k-\zeta)p_0}{2}\right) = \zeta^{-1} (2d - (d-k-\zeta)p_0) = \frac{2d - (d-k)p_0}{\zeta} + p_0. \end{equation*}

This is precisely the range $p \gt \frac{2d - (d-k)p_0}{\zeta} + p_0$ required in the statement of the lemma.
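The final rearrangement can be double-checked with exact arithmetic. The sketch below, with hypothetical function names and illustrative parameter values, computes the exponents of N coming from the two regimes (ignoring ϵ) and confirms that they agree exactly at $p = \frac{2d - (d-k)p_0}{\zeta} + p_0$, which reduces to $\frac{2k}{\zeta}+2$ when $p_0 = 2$.

```python
from fractions import Fraction

def large_regime_exponent(d, k, p):
    """Exponent of N in (8): (d - k) p / 2 - d."""
    return Fraction(d - k) * p / 2 - d

def small_regime_exponent(d, k, zeta, p, p0):
    """Exponent of N from the regime where |Ea| is small, ignoring epsilon."""
    return (Fraction(d - k) - zeta) * (p - p0) / 2

def balance_point(d, k, zeta, p0=2):
    """The value (2d - (d - k) p0) / zeta + p0 at which the two exponents agree."""
    return (Fraction(2 * d) - (d - k) * p0) / zeta + p0

def lemma1_lower_bound(d, k, zeta, p1, p0=2):
    """Lemma 1 applies for p strictly larger than this maximum."""
    return max(Fraction(p1), balance_point(d, k, zeta, p0))

# Illustrative values only: d = 10 variables, degree k = 2, zeta = 1/2.
d, k, zeta = 10, 2, Fraction(1, 2)
p_star = balance_point(d, k, zeta)
print(p_star,
      small_regime_exponent(d, k, zeta, p_star, 2),
      large_regime_exponent(d, k, p_star))
```

For any p beyond the balance point the exponent from the small regime is strictly smaller, so the ϵ-loss there can be absorbed.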

4. Magyar’s decomposition of the surface measure

Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be an integral, positive definite form where ${\bf{x}} = (x_1,\dots,x_d)$. The heavy lifting in our theorem lies in a decomposition of Magyar for the surface measure $\omega_\lambda := \textbf{1}_{\{{\bf{x}} \in \mathbb{Z}^d : \mathcal{Q}({\bf{x}})=\lambda\}}$, where $\lambda \in \mathbb{Z} $; this is the counting measure on the integer points x in $\mathbb{Z}^d$ such that $\mathcal{Q}({\bf{x}})=\lambda$. To state this theorem, we need to introduce a few objects.

For $q \in \mathbb{N} $, $a \in U_q$ and ${\bf{m}} \in \mathbb{Z}^d$, define the normalized Birch–Weyl sums

\begin{equation*} G_{\mathcal{Q}}(a,q;{\bf{m}}) := q^{-d}\sum_{{\bf{b}} \in (\mathbb{Z} /q)^d} e\,\left(\frac{a\mathcal{Q}({\bf{b}})+{\bf{b}}\cdot{\bf{m}}}{q} \right). \end{equation*}

We have the bound

\begin{equation} |G_{\mathcal{Q}}(a,q;{\bf{m}})| \lesssim_\epsilon q^{\epsilon-\kappa_{\mathcal{Q}}} \quad \text{for all} \ \epsilon \gt 0 \tag{9} \end{equation}

uniformly in $a \in U_q$ and ${\bf{m}} \in \mathbb{Z}^d$ with

\begin{equation*} \kappa_{\mathcal{Q}} := \frac{d-\dim V_{\mathcal{Q}}^\dagger(\mathbb{C} )}{2^{k-1}(k-1)}. \end{equation*}

See [Reference Magyar24] for a proof of this fact. The dimension d is sufficiently large so that $\kappa_{\mathcal{Q}} \gt 2$.
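
To illustrate the size of $\kappa_{\mathcal{Q}}$, consider as a worked example the diagonal form $\mathcal{Q}({\bf{x}}) = x_1^k + \cdots + x_d^k$ with k even. This sketch assumes (the definition is not restated in this section) that $V_{\mathcal{Q}}(\mathbb{C} )$ denotes Birch's singular locus $\{{\bf{x}} \in \mathbb{C}^d : \nabla\mathcal{Q}({\bf{x}}) = 0\}$. Since $\partial_{x_i}\mathcal{Q} = kx_i^{k-1}$ vanishes only at $x_i = 0$, this locus is the origin and has dimension 0, so that

\begin{equation*} \kappa_{\mathcal{Q}} = \frac{d}{2^{k-1}(k-1)} \quad \text{and} \quad \kappa_{\mathcal{Q}} \gt 2 \iff d \gt 2^{k}(k-1). \end{equation*}

For the sum of squares (k = 2), this reads $\kappa_{\mathcal{Q}} = d/2$ and requires d > 4. In this case, the bound (9) is consistent with the classical Gauss sum estimate: for a coprime to q, the sum $G_{\mathcal{Q}}(a,q;{\bf{m}})$ factors into d one-dimensional normalized Gauss sums, each of size at most $\sqrt{2}\,q^{-1/2}$.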

Let $d\sigma_{\mathcal{Q}}$ denote the singular measure on $\mathbb{R}^d$ defined as the Gelfand–Leray form whose $\mathbb{R}^d$-Fourier transform is defined distributionally by the oscillatory integral

\begin{equation*} \int_{\mathbb{R} } e\hspace{-0.5mm}\left( t(\mathcal{Q}({\bf{x}})-1) \right) \, {\rm d}t. \end{equation*}

It is known that

(10)\begin{equation} {\rm d}\sigma_{\mathcal{Q}}({\bf{x}}) = {\rm d}S_{\mathcal{Q}}({\bf{x}})/|\nabla \mathcal{Q}({\bf{x}})|, \end{equation}

where ${\rm d}S_{\mathcal{Q}}$ is the Euclidean surface area measure on the hypersurface $\{{\bf{x}} \in \mathbb{R}^d : \mathcal{Q}({\bf{x}})=1 \}$. These measures are compactly supported since $\mathcal{Q}$ is positive definite. We cite the following bound – see Lemma 6 on page 931 of [Reference Magyar24] – for the $\mathbb{R}^d$-Fourier transform of the surface measure:

(11)\begin{equation} |\widetilde{d\sigma_{\mathcal{Q}}}({\bf{\xi}})| \lesssim_\epsilon (1+|{\bf{\xi}}|)^{1-\kappa_{\mathcal{Q}}+\epsilon} \quad \text{for each} \quad {\bf{\xi}} \in \mathbb{R}^d \quad \text{and for all} \ \epsilon \gt 0. \end{equation}
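
As a sanity check on (10) and (11), consider the sphere, $\mathcal{Q}({\bf{x}}) = |{\bf{x}}|^2$. The following is only a sketch; it relies on the standard decay of the Fourier transform of the spherical measure rather than on anything specific to this paper. Here $\nabla\mathcal{Q}({\bf{x}}) = 2{\bf{x}}$ has length 2 on the unit sphere, so

\begin{equation*} {\rm d}\sigma_{\mathcal{Q}} = \tfrac{1}{2}\, {\rm d}S_{\mathcal{Q}} \quad \text{and} \quad |\widetilde{{\rm d}\sigma_{\mathcal{Q}}}({\bf{\xi}})| \lesssim (1+|{\bf{\xi}}|)^{-\frac{d-1}{2}}. \end{equation*}

Assuming $\kappa_{\mathcal{Q}} = d/2$ for this form (that is, $\dim V_{\mathcal{Q}}(\mathbb{C} ) = 0$), the exponent in (11) is $1-\frac{d}{2}+\epsilon = -\frac{d-2}{2}+\epsilon$, which is weaker than the classical exponent $-\frac{d-1}{2}$. The general bound (11) is therefore consistent with, but does not recover, the sharp decay for the sphere.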

Let Ψ be a $C^\infty(\mathbb{R}^d)$ bump function supported in the cube $[-1/8,1/8]^d$ and 1 on the cube $[-1/16,1/16]^d$, where these cubes are regarded as subsets of the torus $\mathbb{T}^d$. For each $q \in \mathbb{N} $, let s be the integer such that $2^s \leq q \lt 2^{s+1}$. For such q and for $a \in U_q$, define the Fourier multipliers

\begin{equation*} \mu^{a/q}_\lambda({\bf{\xi}}) := \sum_{{\bf{m}} \in \mathbb{Z}^d} G_{\mathcal{Q}}(a,q;{\bf{m}}) \Psi\left(2^s\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right)\, \widetilde{{\rm d}\sigma_{\mathcal{Q}}}\left(\lambda^{\frac{1}{k}}\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right) \end{equation*}

for $\xi \in \mathbb{T}^d$. Generalizing work of [Reference Magyar24], Magyar [Reference Magyar, Stein and Wainger27] obtained a flexible decomposition of the surface measure; we choose the following form.

Magyar’s Decomposition Theorem ([Reference Magyar24, Reference Magyar, Stein and Wainger27])

Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a regular, positive definite integral form. For each $\lambda \in \mathbb{N},$ the Fourier transform of the surface measure $\omega_\lambda$ decomposes as

(12)\begin{equation} \lambda^{1-\frac{d}{k}} \cdot \mathcal{F}_{{\mathbb{Z}^d}}{\omega_\lambda}({\bf{\xi}}) = \left( \sum_{s=0}^{\lceil \log_2 \lambda^{1/k} \rceil} \sum_{q=2^s}^{2^{s+1}-1} \sum_{a \in U_q} e\left( -\frac{a\lambda}{q} \right) \mu^{a/q}_\lambda({\bf{\xi}}) \right) + \varepsilon_\lambda({\bf{\xi}}), \end{equation}

where

(13)\begin{equation} \| \varepsilon_\lambda \|_{L^\infty(\mathbb{T}^d)} \lesssim_{\mathcal{Q},\epsilon} \lambda^{\epsilon-\gamma_{\mathcal{Q}}} \quad \mathrm{for\ all} \ \epsilon \gt 0. \end{equation}

Remark 4.1. Our form of the error term $\varepsilon_\lambda$ and its estimate (13) do not explicitly appear in [Reference Magyar24]. We outline the differences and how to prove this form of Magyar’s Decomposition Theorem. Recall that Magyar’s main term takes the shape of the Fourier multiplier

(14)\begin{equation} \sum_{q \in \mathbb{N} } \sum_{a \in U_q} e\left( \frac{-a\lambda}{q} \right) \sum_{{\bf{m}} \in \mathbb{Z}^d} G_{\mathcal{Q}}(a,q;{\bf{m}}) \Psi(q{\bf{\xi}}-{\bf{m}})\, \widetilde{{\rm d}\sigma_{\mathcal{Q}}}\left(\lambda^{1/k}\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right). \end{equation}

The first notable difference is that we have dyadically refined the decomposition so that Equation (14) becomes

(15)\begin{equation} \sum_{s=0}^{\infty} \sum_{q=2^s}^{2^{s+1}-1} \sum_{a \in U_q} e\left( -\frac{a\lambda}{q} \right) \sum_{{\bf{m}} \in \mathbb{Z}^d} G_{\mathcal{Q}}(a,q;{\bf{m}}) \Psi\left(2^s\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right) \,\widetilde{{\rm d}\sigma_{\mathcal{Q}}}\left(\lambda^{1/k}\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right). \end{equation}

This modifies the analysis of Equations (2.15) and (2.16) of Proposition 4 in [Reference Magyar24] in inconsequential ways since $2^s \leq q \lt 2^{s+1}$. In particular, this preserves the estimate (13). The second notable difference is that we truncated the sum over $q \in \mathbb{N} $. Following the analysis of Equation (2.17) of Proposition 4 in [Reference Magyar24], we may truncate Equation (15) to

(16)\begin{equation} \sum_{s=0}^{\lfloor \log_2 \lambda^{1/k} \rfloor} \sum_{q=2^s}^{2^{s+1}-1} \sum_{a \in U_q} e\left( -\frac{a\lambda}{q} \right) \sum_{{\bf{m}} \in \mathbb{Z}^d} G_{\mathcal{Q}}(a,q;{\bf{m}}) \Psi\left(2^s\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right) \, \widetilde{{\rm d}\sigma_{\mathcal{Q}}}\left(\lambda^{1/k}\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right) \end{equation}

and place the difference into the error term $\varepsilon_\lambda$ while maintaining the estimate (13). The expert may immediately verify this by using the Magyar–Stein–Wainger transference principle (see Section 2 of [Reference Magyar, Stein and Wainger27]) and Birch’s Weyl bound (9).

The next theorem establishes (TS2) of Lemma 1; that is, we treat the major arc terms.

Theorem 2. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, regular, integral form satisfying Equation (1). If $p \gt 2+\frac{4}{\kappa_{\mathcal{Q}}-2}$, we have

(17)\begin{equation} \| F_{\mathfrak{M}} \ast f \|_{L^{p}(\mathbb{T}^d)} \lesssim_p \lambda^{\frac{d-k}{k}-\frac{2d}{kp}} \|f\|_{L^{p'}(\mathbb{T}^d)} \end{equation}

for each $\lambda \in \mathbb{N} $.

We may deduce Theorem 1 once Theorem 2 is proved as follows.

Proof of Theorem 1 assuming Theorem 2

Since $k \geq 2$, we have $2k/\gamma_{\mathcal{Q}} \gt 4/(\kappa_{\mathcal{Q}}-2)$, and Lemma 1 reduces Theorem 1 to applying the major arc bound in Theorem 2 and the (minor arc) bound for the error term (13).

5. Proof of Theorem 2

Fix $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ a positive definite form of degree k satisfying Equation (1) and $\lambda \in \mathbb{N} $. Set $N = \lceil \lambda^{1/k} \rceil$. Define the functions

\begin{align*} \Psi_j({\bf{\xi}}) &:= \Psi(2^j{\bf{\xi}}) - \Psi(2^{j+1}{\bf{\xi}}) \quad \text{for} \quad 0 \leq j \lt \lfloor \log_2 N \rfloor \quad \text{and}\\ \Psi_j({\bf{\xi}}) &:= \Psi(2^j{\bf{\xi}}) \quad \text{for} \quad j = \lfloor \log_2 N \rfloor. \end{align*}

Furthermore, for $q \in \mathbb{N} , a \in U_q$ and $0 \leq j \lt \lfloor \log_2 N \rfloor$, define the multipliers

\begin{equation*} \mu^{a/q,j}_\lambda({\bf{\xi}}) := \lambda^{\frac{d}{k}-1}\sum_{{\bf{m}} \in \mathbb{Z}^d} G_{\mathcal{Q}}(a,q;{\bf{m}}) \Psi_j\left(2^{s+1}\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right) \cdot \mathcal{F}_{{\mathbb{R}^d}}\,{{\rm d}\sigma_{\mathcal{Q}}}\left(\lambda^{1/k}\left[{\bf{\xi}}-\frac{{\bf{m}}}{q}\right]\right). \end{equation*}

We will collect these multipliers according to the scale of their moduli; to do so, define, for each $s \geq 0$, the set of fractions

\begin{equation*} \mathcal{R}_s := \{a/q \in \mathbb{Q} : 2^s \leq q \lt 2^{s+1} \ \text{and} \ a \in U_q \}. \end{equation*}

Let $K^{a/q,j}_\lambda := \mathcal{F}_{{\mathbb{T}^d}}{(\mu^{a/q,j}_\lambda)}$ denote the inverse Fourier transform of $\mu^{a/q,j}_\lambda$. We start our proof by establishing an identity for these kernels.

Proposition 1. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, non-singular, integral form satisfying Equation (1) and $\Gamma_{\mathcal{Q}}$ be a set of regular values for the form $\mathcal{Q}$. If $s \geq 0$, then for each $a/q \in \mathcal{R}_s$, we have

(18)\begin{equation} K^{a/q,j}_\lambda({\bf{x}}) = e\left( a\mathcal{Q}({\bf{x}})/q \right) \lambda^{-1} [\mathcal{F}_{{\mathbb{R}^d}}(\operatorname{D}_{2^s\lambda^{-1/k}}\Psi_j)*{\rm d}\sigma_{\mathcal{Q}}](\lambda^{-1/k} {\bf{x}}) \end{equation}

for all ${\bf{x}} \in \mathbb{Z}^d.$

The proof of this proposition follows the proof of Proposition 1 in [Reference Hughes21]; in that proof, one replaces Ψ by $\Psi_j$ and q by $2^s$.

Now that we know the structure of our kernel, we will use a circle method decomposition and a further Littlewood–Paley decomposition to arbitrage $L^1(\mathbb{T}^d) \to L^\infty(\mathbb{T}^d)$ and $L^2(\mathbb{T}^d) \to L^2(\mathbb{T}^d)$ estimates and deduce Theorem 2. These bounds are the content of the following two lemmas.

Lemma 2. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, non-singular, integral form satisfying Equation (1) and $\lambda \in \mathbb{N} $. If $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$, then each major arc piece $\mu^{a/q,j}_\lambda$ satisfies

(19)\begin{equation} \| \mu^{a/q,j}_\lambda \|_{L^\infty(\mathbb{T}^d)} \lesssim_\epsilon 2^{j-s} 2^{j(\epsilon-\kappa)} \lambda^{\frac{d}{k}-\kappa} \quad \text{for} \quad 0 \leq j \lt \lfloor \log_2 N \rfloor -s \end{equation}

and

(20)\begin{equation} \| \mu^{a/q,j}_\lambda \|_{L^\infty(\mathbb{T}^d)} \lesssim_\epsilon 2^{s(\epsilon-\kappa)} \lambda^{\frac{d}{k}-1} \quad \text{for} \quad j = \lfloor \log_2 N \rfloor - s \end{equation}

for all ϵ > 0.

Lemma 3. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, non-singular, integral form satisfying Equation (1) and $\lambda \in \mathbb{N} $. If $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$, then each major arc piece $\mu^{a/q,j}_\lambda$ satisfies

(21)\begin{equation} \| \mathcal{F}_{{\mathbb{T}^d}}{\mu^{a/q,j}_\lambda} \|_{\ell^\infty(\mathbb{Z}^d)} \lesssim 2^{j+s} \lambda^{-1-\frac{1}{k}} \end{equation}

for $0 \leq j \leq \lfloor \log_2 N \rfloor - s$.

Remark 5.1. Note that $j+s = \lfloor \log_2 N \rfloor$ is the natural cutoff because we do not capture any oscillation in $\mathcal{F}_{{\mathbb{R}^d}}{\rm d}\sigma_{\mathcal{Q}}(\lambda^{1/k}{\bf{\xi}})$ when $|{\bf{\xi}}| \lesssim \lambda^{-1/k}$.

Proof of Lemma 2

Fix $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$. For $0 \leq j \lt \lfloor \log_2 N \rfloor -s$, Equation (11) implies that

\begin{equation*} \| \mu^{a/q,j} \|_{L^\infty(\mathbb{T}^d)} \lesssim_\epsilon \lambda^{\frac{d}{k}-1} (2^s)^{\epsilon-\kappa} (\lambda^{1/k}/2^{s+j})^{1-\kappa+\epsilon} \lesssim_\epsilon 2^{j-s} 2^{j(\epsilon-\kappa)} \lambda^{\frac{d}{k}-\kappa} \end{equation*}

for all ϵ > 0 since κ > 2. For $j = \lfloor \log_2 N \rfloor - s$, Equation (11) implies that

\begin{equation*} \| \mu^{a/q,j} \|_{L^\infty(\mathbb{T}^d)} \lesssim_\epsilon 2^{s(\epsilon-\kappa)} \lambda^{\frac{d}{k}-1}. \end{equation*}

Before proving Lemma 3, we need a geometric property of our measures $d\sigma_{\mathcal{Q}}$. The estimate below is best known for $\mathcal{Q}({\bf{x}}) = |{\bf{x}}|^2$; see [Reference Grafakos13] for this estimate. However, we are unaware of a reference for more general hypersurfaces aside from estimate (23) in [Reference Hughes21]. For completeness, we include the statement and its proof below.

Proposition 2. Let ϕ be a Schwartz function on $\mathbb{R}^d$. If t > 0, then

(22)\begin{equation} \| t^{-d}(\operatorname{D}_t\phi) * {\rm d}\sigma_{\mathcal{Q}} \|_{L^\infty(\mathbb{R}^d)} \lesssim t^{-1}. \end{equation}

Proof. Since $\mathcal{Q}$ is positive definite, the variety $V_{\mathcal{Q}=1}(\mathbb{R} )$ is compact. Moreover, Equation (10) implies that for every ball B of radius r > 0, we have

(23)\begin{equation} \sigma(B) \lesssim r^{d-1}. \end{equation}

For each point ${\bf{x}} \in \mathbb{R}^d$, define the sets $S_0({\bf{x}}) := \{{\bf{y}} \in \mathbb{R}^d : |{\bf{x}}-{\bf{y}}| \lt t \}$ and $S_j({\bf{x}}) := \{{\bf{y}} \in \mathbb{R}^d : 2^jt \leq |{\bf{x}}-{\bf{y}}| \lt 2^{j+1}t \}$ for $j \in \mathbb{N} $. By Equation (23), we have that

(24)\begin{equation} \sigma(S_j({\bf{x}})) \lesssim (2^jt)^{d-1} \end{equation}

for each ${\bf{x}} \in \mathbb{R}^d$.

Since ϕ is Schwartz, we have

\begin{equation*} \operatorname{D}_t\phi({\bf{x}}) \lesssim_\phi (1+|{\bf{x}}/t|)^{-M} \end{equation*}

for all $M \in \mathbb{N} $. Therefore,

\begin{equation*} \operatorname{D}_t\phi * {\rm d}\sigma_{\mathcal{Q}}({\bf{x}}) \lesssim (1+|\cdot/t|)^{-M} * {\rm d}\sigma_{\mathcal{Q}}({\bf{x}}) \end{equation*}

for all ${\bf{x}} \in \mathbb{R}^d$. Decomposing $\mathbb{R}^d$ into the sets $S_j({\bf{x}})$, we have

\begin{align*} \operatorname{D}_t\phi * {\rm d}\sigma_{\mathcal{Q}}({\bf{x}}) & \lesssim_{\phi,M} \int_{\mathbb{R}^d} (1+|{\bf{x}}-{\bf{y}}|/t)^{-M} \, {\rm d}\sigma({\bf{y}}) \\ & \lesssim \sum_{j=0}^\infty \int_{S_j({\bf{x}})} (1+|{\bf{x}}-{\bf{y}}|/t)^{-M} \; {\rm d}\sigma({\bf{y}}) \\ & \lesssim \sum_{j=0}^\infty \int_{S_j({\bf{x}})} 2^{-jM} \, {\rm d}\sigma({\bf{y}}). \end{align*}

Using estimate (24), we obtain that

\begin{align*} \operatorname{D}_t\phi * {\rm d}\sigma_{\mathcal{Q}}({\bf{x}}) & \lesssim_{\phi,M} \sum_{j=0}^\infty \sigma_{\mathcal{Q}}(S_j({\bf{x}})) 2^{-jM} \lesssim \sum_{j=0}^\infty (2^jt)^{d-1} 2^{-jM} \lesssim t^{d-1}. \end{align*}

Normalizing by $t^{d}$, we obtain the desired estimate.
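
To make the summation over j in the proof above concrete, one admissible choice (ours, for illustration only) is $M = d$; any $M \gt d-1$ works equally well. The sum is then a geometric series:

\begin{equation*} \sum_{j=0}^\infty (2^jt)^{d-1} 2^{-jd} = t^{d-1} \sum_{j=0}^\infty 2^{-j} = 2t^{d-1}. \end{equation*}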

Proof of Lemma 3

Fix $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$. For each $0 \leq j \leq \lfloor \log_2 N \rfloor - s$, identity (18) and estimate (22) imply that for each ${\bf{x}} \in \mathbb{Z}^d$, we have

\begin{align*} K^{a/q,j}_\lambda({\bf{x}}) & \lesssim_d 2^j (\lambda^{1/k}/2^s)^{-1} \lambda^{-1} \lesssim_d 2^{j+s} \lambda^{-1-\frac{1}{k}} \end{align*}

by taking $\phi = \mathcal{F}_{{\mathbb{R}^d}}(\operatorname{D}_{2^s\lambda^{-1/k}}\Psi_j)$ and $t = \lambda^{1/k}2^{-s}$ in Proposition 2.

Proof of Theorem 2

Let $2 \leq p \leq \infty$ and $f \in L^{p^{\prime}}(\mathbb{T}^d)$ be normalized so that $\|f\|_{L^{p^{\prime}}(\mathbb{T}^d)}=1$. Interpolating the bounds (19) and (21) for $\mu^{a/q,j}_\lambda$ when $0 \leq j+s \lt \lfloor \log_2 N \rfloor$, we obtain

\begin{align*} \| \mu^{a/q,j}_\lambda \ast f \|_{p} & \lesssim_\epsilon \left( 2^{j+s} \lambda^{-1-\frac{1}{k}} \right)^{\frac{2}{p}} \cdot \left( 2^{j-s} 2^{j(\epsilon-\kappa)} \lambda^{\frac{d}{k}-\kappa} \right)^{1-\frac{2}{p}} \\ & = 2^{j(\frac{2}{p}+(1+\epsilon-\kappa)(1-\frac{2}{p}))} \cdot 2^{s(\frac{2}{p}-1+\frac{2}{p})} \cdot \lambda^{(\frac{d}{k}-\kappa)(1-\frac{2}{p})-\frac{2}{p}(1+\frac{1}{k})} \\ & = 2^{j(1+(\epsilon-\kappa)(1-\frac{2}{p}))} \cdot 2^{s(\frac{4}{p}-1)} \cdot \lambda^{(\frac{d}{k}-\kappa)(1-\frac{2}{p})-\frac{2}{p}(1+\frac{1}{k})} . \end{align*}
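
The interpolation step can be spelled out as follows; this is a sketch of the standard Riesz–Thorin argument, and the symbols $A$, $B$ and $T$ are our own shorthand. Write $Tf := \mu \ast f$ for a bounded function μ on $\mathbb{T}^d$, and put $A := \|\mu\|_{L^\infty(\mathbb{T}^d)}$ and $B := \|\mathcal{F}_{{\mathbb{T}^d}}\mu\|_{\ell^\infty(\mathbb{Z}^d)}$. Young's inequality gives $\|Tf\|_{L^\infty(\mathbb{T}^d)} \leq A\|f\|_{L^1(\mathbb{T}^d)}$ and Plancherel's theorem gives $\|Tf\|_{L^2(\mathbb{T}^d)} \leq B\|f\|_{L^2(\mathbb{T}^d)}$, so for $2 \leq p \leq \infty$,

\begin{equation*} \| \mu \ast f \|_{L^p(\mathbb{T}^d)} \leq B^{\frac{2}{p}} A^{1-\frac{2}{p}} \|f\|_{L^{p'}(\mathbb{T}^d)}. \end{equation*}

Applying this with $A$ bounded by (19) and $B$ bounded by (21) yields the first line of the preceding display.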

Summing over fractions $a/q \in \mathcal{R}_s$ for $j \leq s \lt \lfloor \log_2 N \rfloor$, we find that

\begin{equation*} \left\| \left( \sum_{a/q \in \mathcal{R}_s} \mu^{a/q,j}_\lambda({\bf{x}}) \right) \ast f \right\|_{L^p(\mathbb{T}^d)} \lesssim_{\mathcal{Q},\epsilon} 2^{j(1+(\epsilon-\kappa)(1-\frac{2}{p}))} \cdot 2^{s(\frac{4}{p}+1)} \cdot \lambda^{(\frac{d}{k}-\kappa)(1-\frac{2}{p})-\frac{2}{p}(1+\frac{1}{k})}. \end{equation*}
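
The change in the power of $2^s$ from $\frac{4}{p}-1$ to $\frac{4}{p}+1$ comes from the triangle inequality and a count of the fractions involved. Assuming, as the notation suggests, that $U_q$ is a set of residues modulo q (so that $|U_q| \leq q$), we have

\begin{equation*} |\mathcal{R}_s| \leq \sum_{q=2^s}^{2^{s+1}-1} |U_q| \leq 2^s \cdot 2^{s+1} = 2^{2s+1}, \quad \text{so that} \quad 2^{2s} \cdot 2^{s(\frac{4}{p}-1)} = 2^{s(\frac{4}{p}+1)}. \end{equation*}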

Provided $1-\kappa(1-\frac{2}{p}) \lt 0$, which is equivalent to the range $p \gt 2+\frac{2}{\kappa-1}$, we have

\begin{equation*} \left\| \left( \sum_{j=0}^{\lfloor \log_2N \rfloor - s-1} \sum_{a/q \in \mathcal{R}_s} \mu^{a/q,j}_\lambda({\bf{x}}) \right) \ast f \right\|_{L^p(\mathbb{T}^d)} \lesssim_{\mathcal{Q},\epsilon} 2^{s(\frac{4}{p}+1)} \cdot \lambda^{(\frac{d}{k}-\kappa)(1-\frac{2}{p})-\frac{2}{p}(1+\frac{1}{k})}. \end{equation*}
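
For the reader's convenience, the equivalence between the condition on κ and the range of p used above unwinds as follows (elementary algebra, valid for $\kappa \gt 1$ and $p \gt 0$):

\begin{equation*} 1-\kappa\left(1-\frac{2}{p}\right) \lt 0 \iff \frac{2}{p} \lt \frac{\kappa-1}{\kappa} \iff p \gt \frac{2\kappa}{\kappa-1} = 2+\frac{2}{\kappa-1}. \end{equation*}

In this range, and for $\epsilon$ small enough, the exponent $1+(\epsilon-\kappa)(1-\frac{2}{p})$ of $2^j$ is negative, so the sum over $j \geq 0$ is a convergent geometric series bounded independently of N.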

Consequently, when $p \gt 2+\frac{2}{\kappa-1}$, we have

\begin{equation*} \left\| \left( \sum_{s=0}^{\lfloor \log_2N \rfloor} \sum_{j=0}^{\lfloor \log_2N \rfloor - s-1} \sum_{a/q \in \mathcal{R}_s} \mu^{a/q,j}_\lambda({\bf{x}}) \right) \ast f \right\|_{L^p(\mathbb{T}^d)} \lesssim_{\mathcal{Q},p} \lambda^{(\frac{4}{p}+1)/k} \cdot \lambda^{(\frac{d}{k}-\kappa)(1-\frac{2}{p})-\frac{2}{p}(1+\frac{1}{k})}. \end{equation*}

Comparing the exponent of λ with the desired one of $\frac{d}{k}-1-\frac{2d}{kp}$, we find that we have Equation (2) for $p \gt 2+\frac{4}{k\kappa-\kappa-1}$. This is better than the range of $p \gt 2+\frac{4}{\kappa-2}$ claimed in the theorem.

When $j+s = \lfloor \log_2 N \rfloor$, we have

\begin{equation*} \| \mu^{a/q,j}_\lambda \ast f \|_{p} \lesssim_\epsilon \left( \lambda^{-1} \right)^{\frac{2}{p}} \cdot \left( 2^{s(\epsilon-\kappa)} \lambda^{\frac{d}{k}-1} \right)^{1-\frac{2}{p}} = 2^{s(\epsilon-\kappa)(1-\frac{2}{p})} \cdot \lambda^{\frac{d}{k}-1-\frac{2d}{kp}} . \end{equation*}

Summing over the fractions $a/q \in \mathcal{R}_s$, which contributes a factor of at most $2^{2s+1}$, and over $0 \leq s \leq \lfloor \log_2 N \rfloor$ with $j = \lfloor \log_2 N \rfloor - s$, we find that

\begin{equation*} \left\| \sum_{s=0}^{\lfloor\log_2N\rfloor} \sum_{a/q \in \mathcal{R}_s} \mu^{a/q,j}_\lambda \ast f \right\|_{p} \lesssim \lambda^{\frac{d}{k}-1-\frac{2d}{kp}} \end{equation*}

provided that $2+(\epsilon-\kappa)(1-\frac{2}{p}) \lt 0$ for arbitrarily small, positive ϵ. For each $0 \lt \epsilon \lt \kappa-2$, this is equivalent to the range of $p \gt \frac{2(\kappa-\epsilon)}{\kappa-2-\epsilon}$. Thereby, taking ϵ to 0, we arrive at the range of $p \gt \frac{2\kappa}{\kappa-2} = 2+\frac{4}{\kappa-2}$, as claimed.

Acknowledgements

The authors would like to thank the Heilbronn Institute for Mathematical Research for enabling this collaboration through their Focused Research Workshop ‘Efficient Congruencing and Decoupling’ in June 2019. KH thanks Virginia Tech for their hospitality whilst part of this paper was written.

KH thanks Dr. Efthalia Tzitzili for a discussion on positive definite forms. We thank the anonymous referees for their feedback on this paper.

Funding Statement

EP was supported in part by Simons Foundation grant no. 360560.

Competing Interests

The authors declare no competing interests pertaining to the undertaken research.

References

Anderson, T., Cook, B., Hughes, K. and Kumchev, A., Improved $\ell^p$ boundedness for integral k-spherical maximal functions, Discrete Anal. (2018), 1–18. doi: 10.19086/da.3675.
Birch, B. J., Forms in many variables, Proc. Roy. Soc. Ser. A 265 (1961), 245–263.
Bourgain, J., On $\Lambda(p)$-subsets of squares, Israel J. Math. 67(3) (1989), 291–311.
Bourgain, J., Eigenfunction bounds for the Laplacian on the n-torus, Int. Math. Res. Not. 1993(3) (1993), 61–66. doi: 10.1155/S1073792893000066.
Bourgain, J., Analysis results and problems related to lattice points on surfaces, in Harmonic Analysis and Nonlinear Differential Equations (Riverside, CA, 1995), Contemporary Mathematics, Volume 208, (American Mathematical Society, Providence, RI, 1997).
Bourgain, J., Moment inequalities for trigonometric polynomials with spectrum in curved hypersurfaces, Israel J. Math. 193(1) (2013), 441–458.
Bourgain, J. and Demeter, C., Improved estimates for the discrete Fourier restriction to the higher dimensional sphere, Illinois J. Math. 57(1) (2013), 213–227.
Bourgain, J. and Demeter, C., New bounds for the discrete Fourier restriction to the sphere in 4D and 5D, Int. Math. Res. Not. 2015(11) (2015), 3150–3184. doi: 10.1093/imrn/rnu036.
Bourgain, J. and Demeter, C., The proof of the $\ell^{2}$ decoupling conjecture, Ann. of Math. (2) 182(1) (2015), 351–389.
Bourgain, J. and Demeter, C., Decouplings for curves and hypersurfaces with nonzero Gaussian curvature, J. Anal. Math. 133 (2017), 279–311.
Bourgain, J., Demeter, C. and Guth, L., Proof of the main conjecture in Vinogradov’s mean value theorem for degrees higher than three, Ann. of Math. (2) 184(2) (2016), 633–682.
Cook, B. and Hughes, K., Bounds for lacunary maximal functions given by Birch–Magyar averages, Trans. Amer. Math. Soc. 374 (2021), 3859–3879. doi: 10.1090/tran/8152.
Grafakos, L., Classical Fourier Analysis, 2nd ed. (Springer, 2008).
Guo, S. and Zhang, R., On integer solutions of Parsell-Vinogradov systems, Invent. Math. 218(1) (2019), 1–81.
Guo, S. and Zorin-Kranich, P., Decoupling for certain quadratic surfaces of low codimensions, Preprint arXiv:1902.03450 (2019).
Guo, S. and Zorin-Kranich, P., Decoupling for moment manifolds associated to Arkhipov–Chubarikov–Karatsuba systems, Adv. Math. 360 (2020), 1–56.
Henriot, K. and Hughes, K., Discrete restriction estimates of epsilon-removal type for kth-powers and k-paraboloids, Math. Ann. 372(3) (2018), 963–998.
Henriot, K. and Hughes, K., On restriction estimates for discrete quadratic surfaces, Int. Math. Res. Not. 2019(23) (2019), 7139–7159.
Hu, Y. and Li, X., Discrete Fourier restriction associated with KdV equations, Anal. PDE 6(4) (2013), 859–892.
Hu, Y. and Li, X., Discrete Fourier restriction associated with Schrödinger equations, Rev. Mat. Iberoam. 30(4) (2014), 1281–1300.
Hughes, K., $\ell^p$-improving for discrete spherical averages, Ann. H. Lebesgue 3 (2020), 959–980.
Keil, E., On a diagonal quadric in dense variables, Glasg. Math. J. 56(3) (2014).
Lai, X. and Ding, Y., A note on the discrete Fourier restriction problem, Proc. Amer. Math. Soc. 146(9) (2018), 3839–3846.
Magyar, A., Diophantine equations and ergodic theorems, Amer. J. Math. 124(5) (2002), 921–953.
Magyar, A., On the distribution of lattice points on spheres and level surfaces of polynomials, J. Number Theory 122(1) (2007), 69–83.
Magyar, A., On distance sets of large sets of integer points, Israel J. Math. 164 (2008), 251–263.
Magyar, A., Stein, E. M. and Wainger, S., Discrete analogues in harmonic analysis: spherical averages, Ann. of Math. (2) 155(1) (2002), 189–208.
Parsell, S. T., Prendiville, S. M. and Wooley, T. D., Near-optimal mean value estimates for multidimensional Weyl sums, Geom. Funct. Anal. 23(6) (2013), 1962–2024.
Wooley, T., Nested efficient congruencing and relatives of Vinogradov’s mean value theorem, Proc. Lond. Math. Soc. 118(4) (2019), 942–1016.