1. Introduction
In this paper, we consider discrete restriction estimates associated to integral, positive definite forms. Recall that a form is a homogeneous polynomial, integral means that the coefficients of this polynomial are integers and positive definite means that $\mathcal{Q}({\bf{x}}) \gt 0$ for ${\bf{x}} \neq 0$. The positive definite criterion guarantees that the form is nondegenerate. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be such a form, where ${\bf{x}} = (x_1,x_2,\ldots,x_d)$ with $d \geq 2$, and k denotes the degree of the form $\mathcal{Q}$. We always assume that $k \geq 2$. For each $\lambda \in \mathbb{R} $, the polynomial $\mathcal{Q}$ cuts out a real variety $V_{\mathcal{Q}=\lambda}(\mathbb{R} ) := \{{\bf{x}} \in \mathbb{R}^d : \mathcal{Q}({\bf{x}}) = \lambda\}$ containing a discrete set of integral points $V_{\mathcal{Q}=\lambda}(\mathbb{Z} ) := \{{\bf{x}} \in \mathbb{Z}^d : \mathcal{Q}({\bf{x}}) = \lambda\};$ either or both of these sets are possibly empty depending on the value of λ. For instance, $V_{\mathcal{Q}=\lambda}(\mathbb{R} )$ is empty for negative λ since $\mathcal{Q}$ is positive definite, and $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ is empty for non-integral values of λ.
In our discussion, we always consider a fixed form $\mathcal{Q}$. So, we suppress it from the notation below. For $\lambda \in \mathbb{N} $ and functions $a : \mathbb{Z}^d \to \mathbb{C} $, define the arithmetic extension operator
Letting $\omega_\lambda := \textbf{1}_{V_{\mathcal{Q}=\lambda}(\mathbb{Z} )}$, we have $ E_\lambda a({\bf{\xi}}) = \mathcal{F}_{{\mathbb{Z}^d}}({a\cdot \omega_\lambda})({\bf{\xi}}), $ where $\mathcal{F}_{{\mathbb{Z}^d}}$ is the Fourier transform defined on complex-valued functions with domain $\mathbb{Z}^d$. In other words, Eλ is the adjoint to the restriction operator $R_{\lambda}f$ defined as $ R_{\lambda}f := \mathcal{F}_{{\mathbb{T}^d}}({f}) \cdot \omega_\lambda $ for functions $f : \mathbb{T}^d \to \mathbb{C} $. The extension operator is trivial when the variety has no integer points; that is, when $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ is the empty set. Consequently, we are interested in situations where the variety has many integer points. The prototypical examples here are spheres (centered at the origin) in five or more variables. Here the form is given by the sum of squares $x_1^2+\cdots+x_d^2$, and the cardinality of $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ has order of magnitude $\lambda^{\frac{d}{2}-1}$ for $\lambda \in \mathbb{N} $. According to a theorem of Birch, there is a natural setting for these operators, which we review here.
Define the Birch singular locus of the form $\mathcal{Q}$ as the complex variety
Let $\dim_{\mathbb{C}} (V)$ denote the algebraic dimension of a complex variety V. We will say that an integral form is regular if it satisfies Birch’s criterion:
We define the Birch rank, $B(\mathcal{Q})$ of a form $\mathcal{Q}$, to be the co-dimension $d-\dim_{\mathbb{C}} (V_{\mathcal{Q}}^\dagger(\mathbb{C} ))$. The Birch rank is always non-negative since $V_{\mathcal{Q}}^\dagger(\mathbb{C} )$ being a variety in $\mathbb{C}^d$ implies that $\dim_{\mathbb{C}} (V_{\mathcal{Q}}^\dagger(\mathbb{C} )) \leq d$. To justify the term ‘rank’, one should note that this generalizes the notion of rank for quadratic forms. Indeed, for a quadratic form $\mathcal{Q}({\bf{x}}):={\bf{x}}M{\bf{x}}^{\rm T}$ defined by some d × d-matrix M, a simple calculation gives $B(\mathcal{Q})=\operatorname{rank}(M)$. Here, and in related examples, the point ${\bf{x}} \in \mathbb{Z}^d$ is regarded as a row vector of length d and ${\bf{x}}^{\rm T}$ is its transpose.
When Equation (1) is satisfied, Birch [Reference Birch2] tells us that there exists an infinite arithmetic progression $\Gamma_{\mathcal{Q}}$ in $\mathbb{N} $ depending on the form $\mathcal{Q}$ such that for each $\lambda \in \Gamma_{\mathcal{Q}}$, there exists a positive constant $C_{\mathcal{Q}}(\lambda)$ with the property that
for some positive δ depending on the form $\mathcal{Q}$. Moreover, there exists constants $c_2 \gt c_1 \gt 0$ such that $c_1 \leq C_{\mathcal{Q}}(\lambda) \leq c_2$ for all $\lambda \in \Gamma_{\mathcal{Q}}$. Based on Birch’s asymptotic Equation (2) and on the usual heuristics of the circle method, one expects the following estimates.
Conjecture 1.
Let $\mathcal{Q}$ be an integral, positive definite form of degree $k \geq 2$ in $d \gt 2k$ variables. For each $1 \leq p \leq \infty$ and ϵ > 0, there exists a positive constant $C_{\mathcal{Q},p,\epsilon}$ such that
For $k \geq 3$, we further conjecture that one may remove the ϵ-loss; that is, for each $1 \leq p \leq \infty$, there exists a constant $C_{\mathcal{Q},p}$ such that
There are two trivial estimates known for Conjecture 1. The first trivial estimate is the $\ell^2 \to L^2$ estimate, which is furnished by Plancherel’s theorem. The second trivial estimate is the $\ell^2 \to L^\infty$ estimate, which is furnished by the Cauchy–Schwarz inequality and Equation (2) when the latter is known to hold. Conjecture 1 has been intensively studied in the quadratic case, especially for the spherical case $\mathcal{Q}({\bf{x}}) := x_1^2+\cdots+x_d^2$. Even for the sphere, this problem remains open despite major recent advances in the area. See [Reference Bourgain4–Reference Bourgain and Demeter9] for more information regarding the spherical case and [Reference Bourgain and Demeter10] for other quadratic hypersurfaces. In contrast, for forms of higher degree, there are no hitherto known non-trivial estimates towards this problem.
Our result is an affirmative answer to Conjecture 1 when the form is also assumed to be regular, and it yields Equation (4) when p and d are both sufficiently large. In particular, p will be much larger than the critical exponent $p_c = p_c(\mathcal{Q}) := \frac{2d}{d-k}$. (The critical exponent is defined as the exponent p where the two summands in Equation (3) or (4) balance. Supercritical p means that $p \gt p_c$, while subcritical p means that $p \lt p_c$.) To state our result, we introduce a relevant parameter. For a regular, integral form $\mathcal{Q}$ of degree k in d variables, define the parameter
Throughout we assume that d is sufficiently large with respect to k to satisfy the regularity criterion (1). This implies that $\gamma_{\mathcal{Q}} \gt 0$ and $d \gt 2k$. Our main result is the following.
Theorem 1. Let $\mathcal{Q}$ be a regular, positive definite integral form in d variables of degree $k \geq 2$. If $p \gt 2+\frac{2k}{\gamma_{\mathcal{Q}}}$, then Equation (4) holds for $\lambda \in \mathbb{N} $.
We take a moment to orient ourselves with a few examples to record what Theorem 1 gives for these examples and to compare it with known bounds when applicable.
1.1. Spheres
For the form $\mathcal{Q}({\bf{x}}) := |{\bf{x}}|^2$, its singular locus is $V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{\nabla \mathcal{Q}({\bf{x}}) = 2{\bf{x}} = 0 \}$ is $\{{\bf{0}}\}$. Therefore, the dimension of the singular locus is 0, and we require that $d-0 \gt (2-1)2^2$. More simply, we require that $d \geq 5$ for spheres. Under this assumption on the dimension, $\gamma_{\mathcal{Q}} = (d-4)/48$ and Theorem 1 implies that the supercritical extension estimate Equation (4) holds for $p \gt 2+192/(d-4)$. This range of p is far away from the conjectured critical exponent of $2+4/(d-2)$. Fortunately, in this case, one may replace $\gamma_{\mathcal{Q}}$ (in Equation (13) below) by $(d-1)/4+\epsilon$ for any ϵ > 0 and $d \geq 4$ from [Reference Magyar25]. In turn, this replacement improves the range of p in Theorem 1 to all $p \gt 2+8/(d-3)$ for $d \geq 4$. This recovers the bounds obtained in [Reference Bourgain4] but falls short of their subsequent improvements obtained in [Reference Bourgain6–Reference Bourgain and Demeter9].
1.2. Ellipsoids
Suppose that M is an invertible d × d matrix with integral coefficients such that the associated quadratic form $\mathcal{Q}({\bf{x}}) := {\bf{x}}M{\bf{x}}^{\rm T}$ is positive definite. Spheres corresponding to M being the identity matrix. Since M is invertible, $V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{{\bf{0}}\}$, and Theorem 1 implies that the supercritical extension estimate Equation (4) holds for $p \gt 2+192/(d-4)$. While we are unaware of any results in this level of generality for quadratic forms, presumably, a more delicate approach following [Reference Bourgain4, Reference Henriot and Hughes18] would yield a bound closer to the critical exponent.
1.3. k-Spheres
For the form $\mathcal{Q}({\bf{x}}) := x_1^k+\cdots+x_d^k$, where k is an integer greater than 1, its singular locus is $V_{\mathcal{Q}}^\dagger(\mathbb{C} ) = \{\nabla \mathcal{Q}({\bf{x}}) = k(x_1^{k-1},\dots,x_d^{k-1}) = {\bf{0}} \} = \{{\bf{0}}\}$. Therefore, the dimension of the singular locus is 0, and we require that $d \gt (k-1)2^k$. Under this assumption on the dimension, $\gamma_{\mathcal{Q}} = (d-(k-1)2^k)/(6k(k-1)2^{k})$, and Theorem 1 implies that the supercritical extension estimate Equation (4) holds for $p \gt 2+(12k^2(k-1)2^{k})/(d-(k-1)2^k)$. When k = 3, this becomes d > 16 and $p \gt 2+1728/(d-16)$.
A peculiar feature of Birch’s method - and hence our results - is that the Birch rank is defined in terms of the complex points of the singular locus rather than its real points. Recall Euler’s theorem: for any form $\mathcal{Q}$, we have the identity
where the · on the right hand side denotes the inner product of two vectors. By Euler’s theorem, a real singular point for a positive definite form is necessarily 0. In other words, $V_{\mathcal{Q}}^\dagger(\mathbb{R} ) = \{{\bf{0}}\}$ for every positive definite form $\mathcal{Q}$. In contrast, the Birch singular locus can be huge as seen in the following ‘non-example’ of a positive definite form whose singular locus is too large for our theorem and methods to be applicable.
1.4. A non-example
Consider the form $\mathcal{Q}({\bf{x}}) := (x_1^2+\cdots+x_d^2)^2$; its Birch singular locus is
This is a co-dimension 1 complex algebraic set. Consequently, this form fails to satisfy Birch’s regularity criterion (1) regardless of how large d (the number of variables) is. Meanwhile, its real singular locus $V_{\mathcal{Q}}^\dagger(\mathbb{R} )$ is the set $\{{\bf{0}}\}$. When λ is a square, the corresponding restriction operator is closely related to that for the form $x_1^2+\cdots+x_d^2$. When λ is not a square, the behaviour of the corresponding restriction operator is subtle.
Remark 1.1. The expert can formulate conjectures analogous to Conjecture 1 for integral forms and their 0-level sets without difficulty. Practically, this presents only a technical difference from our hypothesis in Theorem 1 that forms are positive definite. Our methods also apply in this setting, but we do not pursue the analogous results in this paper.
Having considered a few examples, let us now discuss our motivations. One motivation is to extend discrete restriction theory for hypersurfaces beyond the setting of spheres and paraboloids. This is the first attempt to do so. This work fits into a broader program, initiated by Magyar in [Reference Magyar24], which seeks to understand discrete (more appropriately termed ‘arithmetic’) harmonic analysis for hypersurfaces. Initial forays into this program have centered around Birch’s theorem and have had applications to maximal functions and ergodic theorems [Reference Magyar24] and [Reference Cook and Hughes12], discrepancy estimates in [Reference Magyar25], Szemeredi-type theorems in [Reference Magyar26] and $\ell^p$-improving estimates in [Reference Hughes21].
Our approach to Theorem 1 is motivated by a previously open question. This question, posed by the second author in May 2014 at the Workshop III: Kakeya problem, Restriction Problem and Sum-product Theory Workshop as part of IPAM’s long program Algebraic Techniques for Combinatorial and Computational Geometry, asks: Can one use Magyar–Stein–Wainger’s decomposition of the surface measure for the integer points on a sphere from [Reference Magyar, Stein and Wainger27] to improve the discrete restriction estimates for the sphere?
This question was natural since Magyar–Stein–Wainger’s decomposition had been successfully used in the aforementioned works of Magyar, but at that time, it was unknown if Magyar–Stein–Wainger’s decomposition could be used to prove non-trivial discrete restriction estimate for the sphere. Our proof of Theorem 1 reveals that Magyar–Stein–Wainger’s decomposition can be used to prove non-trivial discrete restriction estimates for the sphere. Examining [Reference Bourgain4], the second author’s question means: What is the best way to control the error term in the decomposition?
This latter question closely relates to another question, posed by Ákos Magyar at the Georgia Discrete Analysis Conference in May 2018, which asks: how does one incorporate minor arc estimates for higher degree Diophantine equations in order to obtain discrete restriction estimates? At that time, no discrete restriction estimates were known for a single degree 3 or higher multivariate form. Magyar’s question was natural given the fact that for quadratic forms one does not need to use minor arcs but one must grapple with the minor arcs for hypersurfaces of degrees 3 or more. This relates to the first question because the minor arcs contribute the greatest error term in the decomposition formulas for hypersurfaces of degrees 3 and more. In the quadratic cases, there is no need for minor arcs, and they have not made an appearance in previous analyses.
It transpired that Magyar’s question was partially answered in [Reference Henriot and Hughes17] where minor arc estimates were incorporated to prove discrete restriction estimates for ‘k-paraboloids’. While [Reference Henriot and Hughes17] were predominately interested in ϵ-removal lemmas, the methods therein also used minor arc estimates to prove discrete restriction estimates. When one observes that the worst error term (in Magyar’s generalizations of Magyar–Stein–Wainger’s decomposition) arises from the minor arcs, the natural strategy becomes to adapt those methods to handle the other error terms. Our work answers these questions by successfully using this strategy.
We organized our argument to closely follow [Reference Henriot and Hughes17] so that Theorem 1 reduces to proving appropriate estimates for the main term and the error term, but we streamline the approach to fit our purposes. In particular, since we are interested in the sharp discrete restriction estimates, our approach ‘bakes in’ the ϵ-removal. The bounds for the error terms are taken from [Reference Magyar24]. Meanwhile, the bulk of our work lies in handling the main term. This is done in Theorem 2 where we prove a dyadically refined decomposition of the main term, which is better suited to our purposes. Outlined in Section 4, this refinement is this paper’s main technical contribution, which allows us to adapt the Tomas–Stein method in [Reference Bourgain4] to the main term.
Instead of striving to fully optimize every aspect of our argument, we have aimed to give a simplified version of the general method, which hopefully illuminates the main ideas. The main bottleneck in our argument is the poor state of knowledge for minor arcs bounds in Equation (13) that leads us to define $\gamma_{\mathcal{Q}}$ in Equation (5). Any better decay rate of Equation (13) (e.g., replacing $\gamma_{\mathcal{Q}}$ therein by a larger constant) immediately enlarges the range of p in Theorem 1. For example, one can improve the ranges of d and p in Theorem 1 for k-spheres by using superior minor arc estimates available in this case. Such estimates are possible by exploiting the diagonal structure of the underlying Diophantine equation; see [Reference Magyar25] when k = 2 and [Reference Anderson, Cook, Hughes and Kumchev1] when $k \geq 3$ for the best bounds presently known.
There has been much recent progress on decoupling estimates for affine-invariant systems of equations in many variables following [Reference Bourgain and Demeter9, Reference Bourgain, Demeter and Guth11, Reference Wooley29]. (Affine invariance is also known as translation-dilation invariance or as parabolic rescaling.) For instance, see [Reference Guo and Zhang14–Reference Guo and Zorin-Kranich16]. It is important to note the setting of Theorem 1 is far from affine invariant. By combining [Reference Parsell, Prendiville and Wooley28] and [Reference Lai and Ding23], there is a different way to use such decoupling results to prove Equation (3) for sufficiently large $p \gt p_c$. However, this procedure almost surely yields a smaller range of p than Theorem 1 provides, and it becomes increasingly worse as the degree or number of variables increases. Moreover, another method must be used to obtain the sharper estimate (4). The only known way to sharpen an estimate from Equation (3) to (4) is a circle method approach like the one used in the proof of Theorem 1.
1.5. Organization of the paper
The paper is organized as follows. In § 2, we set notation used throughout the paper. In § 3, we give an abstract formulation of Tomas’s method for discrete L 2 restriction theorems dating to [Reference Bourgain3]; Lemma 1 therein reduces our problem to proving estimates related to the Fourier transform of the surface measure. In § 4, we recall a decomposition of the surface measure due to Magyar and related estimates from [Reference Magyar24]; this is ‘Magyar’s Decomposition Theorem’. Combining Lemma 1 and Magyar’s Decomposition Theorem, we reduce Theorem 1 to Theorem 2, which is an estimate for the major arcs. In § 5, we prove a bound for the major arc pieces by a further application of Tomas’s methods.
2. Notation
We introduce here some notation that will streamline our exposition.
• For a positive integer, we let $\mathbb{Z}/q$ denote the group of integers modulo q and $U_q := \{1 \leq a \lt q : (a,q)=1 \}$ denote its unit group.
• We write $f(\lambda) \lesssim g(\lambda)$ if there exists a constant C > 0 independent of all λ under consideration (e.g., λ in $\mathbb{N} $ or in $\Gamma_{\mathcal{Q}}$) such that $ |f(\lambda)| \leq C |g(\lambda)|. $ Furthermore, we will write $f(\lambda) \gt lesssim g(\lambda)$ if $g(\lambda) \lesssim f(\lambda)$, while we will write $f(\lambda) \eqsim g(\lambda)$ if $f(\lambda) \lesssim g(\lambda)$ and $f(\lambda) \gt lesssim g(\lambda)$. Subscripts in the above notation will denote parameters, such as the dimension d or degree k of a form $\mathcal{Q}$, on which the implicit constants may depend.
• $\mathbb{T}^d$ denotes the d-dimensional torus $(\mathbb{R} /\mathbb{Z} )^d$ identified with the unit cube $[-1/2,1/2]^d$.
• $*$ denotes convolution on a group such as $\mathbb{Z}^d$, $\mathbb{T}^d$ or $\mathbb{R}^d$. It will be clear from context as to which group the convolution takes place.
• $e\hspace{-0.5mm}\left( t \right)$ will denote the character ${\rm e}^{-2\pi it}$ for $t \in \mathbb{R} $ or $\mathbb{T} $.
• For a function $f: \mathbb{Z}^d \to \mathbb{C} $, its $\mathbb{Z}^d$-Fourier transform will be denoted $\mathcal{F}_{{\mathbb{Z}^d}}{f}({\bf{\xi}})$ for ${\bf{\xi}} \in \mathbb{T}^d$. For a function $f: \mathbb{T}^d \to \mathbb{C} $, its $\mathbb{T}^d$-Fourier transform will be denoted $\mathcal{F}_{{\mathbb{T}^d}}{f}({\bf{x}})$ for ${\bf{x}} \in \mathbb{Z}^d$. $\mathcal{F}_{{\mathbb{Z}^d}}$ and $\mathcal{F}_{{\mathbb{T}^d}}$ are defined so that they are inverses of one another. For a function $f: \mathbb{R}^d \to \mathbb{C} $, its $\mathbb{R}^d$-Fourier transform will be denoted $\mathcal{F}_{{\mathbb{R}^d}}{f}({\bf{x}})$ for ${\bf{x}} \in \mathbb{R}^d$.
• For a function $f: \mathbb{R}^d \to \mathbb{C} $, we define dilation operator $\operatorname{D}_{t}$ by $\operatorname{D}_{t}f({\bf{x}}) = f({\bf{x}}/t)$.
• For a ring R, we will use the inner product notation ${\bf{b}} \cdot {\bf{m}}$ for vectors ${\bf{b}},{\bf{m}} \in R^d$ to mean the sum $\sum_{i=1}^d b_i m_i$. This is used for the rings $\mathbb{R} ,\mathbb{Z} ,\mathbb{T} $ and $\mathbb{Z} /q$, where $q \in \mathbb{N} $.
• We also let $\textbf{1}_X$ denote the indicator function of the set X.
3. The arithmetic Tomas–Stein method
Let ωλ be the counting measure on $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$ for a single integral, positive definite, homogenous form $\mathcal{Q}$ satisfying (1) and some $\lambda \in \mathbb{Z} $. Let $F = \mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda)$ be the exponential sum corresponding to ωλ. A common approach to problems involving ωλ is to use the circle method so as to decompose the exponential sum F into a main piece $F_\mathfrak{M}$ and an error term $F_\mathfrak{m}$ corresponding, respectively, to major and minor arcs. (These are analogous to low and high frequency pieces, respectively.) To prove discrete restriction estimates, Bourgain in [Reference Bourgain4] combined this approach with Tomas’s L 2 restriction argument in order to reduce matters to the following two estimates:
• Bounds for the operator given by convolution with the major arc operator $F_\mathfrak{M}$, and
• A uniform power saving bound on the minor arc piece $F_\mathfrak{m}$.
See [Reference Hu and Li19, Reference Hu and Li20] for a variant. Bourgain’s approach has been abstracted in [Reference Keil22] and [Reference Henriot and Hughes17]. We combine Lemmas 3.3 and 3.6 from [Reference Henriot and Hughes17] to form the following lemma.
Lemma 1. For $\lambda \in \mathbb{N} $, let $F = \mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda)$ be the $\mathbb{Z}^d$-Fourier transform of the arithmetic surface measure ωλ defined on $V_{\mathcal{Q}=\lambda}(\mathbb{Z} )$. Suppose that there exists a decomposition $F = F_{\mathfrak{M}} + F_{\mathfrak{m}}$ such that for each $f \in L^\infty(\mathbb{T}^d)$, we have the estimates
Then $\| F \ast f \|_{L^p(\mathbb{T}^d)} \lesssim \lambda^{\frac{d}{k}-1 - \frac{2d}{kp}} \| f \|_{L^{p^{\prime}}(\mathbb{T}^d)}$ holds for $p \gt \max\left[p_1, \frac{2d - (d-k)p_0}{\zeta} + p_0 \right]$.
In our work, we only use Plancherel’s theorem to exploit the subcritical estimate at $p_0 = 2$; this gives the exponent $p \gt \max\left[ p_1, \frac{2d - (d-k)2}{\zeta} + 2 \right] = \max\left[ p_1, \frac{2k}{\zeta} + 2 \right]$. We give the proof of Lemma 1 for completeness.
Proof of Lemma 1
Set $N = \lceil \lambda^{1/k} \rceil$. Fix $p \gt \max\left[p_1, \frac{2d - (d-k)p_0}{\zeta} + p_0 \right]$ and let a be an element of $\ell^{2}$. For notational convenience, we let E denote the extension operator defined on sequences $a : \mathbb{Z}^d \to \mathbb{C} $ by $Ea := \mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda \cdot \mathcal{F}_{{\mathbb{T}^d}}a) = a*\mathcal{F}_{{\mathbb{Z}^d}}(\omega_\lambda)$. We may assume that a is not identically zero and by homogeneity normalize a so that $\| a \|_{2} = 1$. We introduce a parameter α > 0 in order to define the level sets and functions
By the Cauchy–Schwarz inequality and Birch’s theorem in [Reference Birch2], we have
Therefore, we may restrict α to lie in the interval $[0,CN^{\frac{d-k}{2}}]$ for some positive constant C. By Parseval’s identity, we have
By Cauchy–Schwarz and the assumption $\| a \|_2 = 1$, it follows that
Another application of Parseval’s identity implies that
By Equation (7), Hölder’s inequality and hypotheses (TS2) and (TS3) of the lemma, we have
Therefore, when $ \alpha \gt lesssim N^{\frac{d-k}{2}-\frac{\zeta}{2}} $, we have
Rearranging implies that $|S_\alpha| \lesssim \alpha^{-p_1} N^{\frac{(d-k)p_1}{2} - d}$. Since $p \gt p_1$, we have
Altogether, we have
We are left to consider that the regime where $|Ea| \lesssim N^{\frac{d-k}{2} - \frac{\zeta}{2}}$. We now make use of estimate (TS1) at the exponent p 0 to handle the regime where $|Ea| \lesssim N^{\frac{d-k}{2} - \frac{\zeta}{2}}$. This is possible by the trivial bound (6) as follows:
Combining this estimate with Equation (8), we have that
The latter summand is dominated by the former summand when $\frac{(d-k-\zeta)(p-p_0)}{2} \lt \frac{(d-k)p}{2} - d$. This is equivalent to
which is equivalent to
Rearranging this last expression, we find that we need
This is precisely the range of $p \gt \frac{2d - (d-k)p_0}{\zeta} + p_0$.
4. Magyar’s decomposition of the surface measure
Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be an integral, positive definite form where ${\bf{x}} = (x_1,\dots,x_d)$. The heavy lifting in our theorem lies in a decomposition of Magyar for the surface measure $\omega_\lambda := \textbf{1}_{\{{\bf{x}} \in \mathbb{Z}^d : \mathcal{Q}({\bf{x}})=\lambda\}}$, where $\lambda \in \mathbb{Z} $; this is the counting measure on the integer points x in $\mathbb{Z}^d$ such that $\mathcal{Q}({\bf{x}})=\lambda$. To state this theorem, we need to introduce a few objects.
For $q \in \mathbb{N} $, $a \in U_q$ and ${\bf{m}} \in \mathbb{Z}^d$, define the normalized Birch–Weyl sums
We have the bound
uniformly in $a \in U_q$ and ${\bf{m}} \in \mathbb{Z}^d$ with
See [Reference Magyar24] for a proof of this fact. The dimension d is sufficiently large so that $\kappa_{\mathcal{Q}} \gt 2$.
Let $d\sigma_{\mathcal{Q}}$ denote the singular measure on $\mathbb{R}^d$ defined as the Gelfand–Leray form whose $\mathbb{R}^d$-Fourier transform is defined distributionally by the oscillatory integral
It is known that
where ${\rm d}S_{\mathcal{Q}}$ is the Euclidean surface area measure on the hypersurface $\{{\bf{x}} \in \mathbb{R}^d : \mathcal{Q}({\bf{x}})=1 \}$. These measures are compactly supported since $\mathcal{Q}$ is positive definite. We cite the following bound – see Lemma 6 on page 931 of [Reference Magyar24] – for the $\mathbb{R}^d$-Fourier transform of the surface measure:
Let Ψ be a $C^\infty(\mathbb{R}^d)$ bump function supported in the cube $[-1/8,1/8]^d$ and 1 on the cube $[-1/16,1/16]^d$, where these cubes are regarded as subsets of the torus $\mathbb{T}^d$. For each $q \in \mathbb{N} $, let s be the integer such that $2^s \leq q \lt 2^{s+1}$. For such q and for $a \in U_q$, define the Fourier multipliers
for $\xi \in \mathbb{T}^d$. Generalizing work of [Reference Magyar24], Magyar [Reference Magyar, Stein and Wainger27] obtained a flexible decomposition of the surface measure; we choose the following form.
Magyar’s Decomposition Theorem ([Reference Magyar24, Reference Magyar, Stein and Wainger27])
Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a regular, positive definite integral form. For each $\lambda \in \mathbb{N},$ the Fourier transform of the surface measure ωλ decomposes as
where
Remark 4.1. Our form of the error term ɛλ and its estimate (13) do not explicitly appear in [Reference Magyar24]. We outline the differences and how to prove this form of Magyar’s Decomposition Theorem. Recall that Magyar’s main term takes the shape as the Fourier multiplier
The first notable difference is that we have dyadically refined the decomposition so that Equation (14) becomes
This modifies the analysis of Equations (2.15) and (2.16) of Proposition 4 in [Reference Magyar24] in inconsequential ways since $2^s \leq q \lt 2^{s+1}$. In particular, this preserves the estimate (13). The second notable difference is that we truncated the sum over $q \in \mathbb{N} $. Following the analysis of Equation (2.17) of Proposition 4 in [Reference Magyar24], we may truncate Equation (15) to
and place the difference into the error term ɛλ while maintaining the estimate (13). The expert may immediately verify this by using the Magyar–Stein–Wainger transference principle (see Section 2 of [Reference Magyar, Stein and Wainger27]) and Birch’s Weyl bound (9).
The next theorem establishes (TS2) of Lemma 1; that is, we treat the major arc terms.
Theorem 2. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, regular, integral form satisfying Equation (1) and $\lambda \in \mathbb{N} $. If $p \gt 2+\frac{4}{\kappa_{\mathcal{Q}}-2}$, we have
for each $\lambda \in \mathbb{N} $.
We may deduce Theorem 1 once Theorem 2 is proved as follows.
5. Proof of Theorem 2
Fix $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ a positive definite form of degree k satisfying Equation (1) and $\lambda \in \mathbb{N} $. Set $N = \lceil \lambda^{1/k} \rceil$. Define the functions
Furthermore, for $q \in \mathbb{N} , a \in U_q$ and $0 \leq j \lt \lfloor \log_2 N \rfloor$, define the multipliers
We will collect these multipliers according to the scale of their moduli; to do so, define, for each $s \geq 0$, the set of fractions
Let $K^{a/q,j}_\lambda := \mathcal{F}_{{\mathbb{T}^d}}{(\mu^{a/q,j}_\lambda)}$ denote the inverse Fourier transform of $\mu^{a/q,j}_\lambda$. We start our proof by establishing an identity for these kernels.
Proposition 1. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, non-singular, integral form satisfying Equation (1) and $\Gamma_{\mathcal{Q}}$ be a set of regular values for the form $\mathcal{Q}$. If $s \geq 0$, then for each $a/q \in \mathcal{R}_s$, we have
for all ${\bf{x}} \in \mathbb{Z}^d.$
The proof of this proposition follows the proof of Proposition 1 in [Reference Hughes21]; in that proof, one replaces Ψ by $\Psi_j$ and q by 2s.
Now that we know the structure of our kernel, we will use a circle method decomposition and a further Littlewood–Paley decomposition to arbitrage $L^1(\mathbb{T}^d) \to L^\infty(\mathbb{T}^d)$ and $L^2(\mathbb{T}^d) \to L^2(\mathbb{T}^d)$ estimates and deduce Theorem 2. These bounds are the content of the two following lemmas.
Lemma 2. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, non-singular, integral form satisfying Equation (1) and $\lambda \in \mathbb{N} $. If $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$, then each major arc piece $\mu^{a/q,j}_\lambda$ satisfies
and
for all ϵ > 0.
Lemma 3. Let $\mathcal{Q}({\bf{x}}) \in \mathbb{Z} [{\bf{x}}]$ be a positive definite, non-singular, integral form satisfying Equation (1) and $\lambda \in \mathbb{N} $. If $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$, then each major arc piece $\mu^{a/q,j}_\lambda$ satisfies
for $0 \leq j \leq \lfloor \log_2 N \rfloor - s$.
Remark 5.1. Note that $j+s = \lfloor \log_2 N \rfloor$ is the natural cutoff because we do not capture any oscillation in $\mathcal{F}_{{\mathbb{R}^d}}d\sigma(\lambda^{1/k}{\bf{\xi}})$ when $\vert\bf\xi\vert\lesssim\lambda^{-1/k}$.
Proof of Lemma 2
Fix $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$. For $0 \leq j \lt \lfloor \log_2 N \rfloor -s$, Equation (11) implies that
for all ϵ > 0 since κ > 2. For $j = \lfloor \log_2 N \rfloor - s$, Equation (11) implies that
Before proving Lemma 3, we need a geometric property of our measures $d\sigma_{\mathcal{Q}}$. The estimate below is best known for $\mathcal{Q}({\bf{x}}) = |{\bf{x}}|^2$; see [Reference Grafakos13] for this estimate. However, we are unaware of a reference for more general hypersurfaces aside from estimate (23) in [Reference Hughes21]. For completeness, we include the statement and its proof below.
Proposition 2. Let ϕ be a Schwartz function on $\mathbb{R}^d$. If t > 0, then
Proof. Since $\mathcal{Q}$ is positive definite, the variety $V_{\mathcal{Q}=1}(\mathbb{R} )$ is compact. Moreover, Equation (10) implies that for every ball B of radius r > 0, we have
For each point ${\bf{x}} \in \mathbb{R}^d$, define the sets $S_0({\bf{x}}) := \{{\bf{y}} \in \mathbb{R}^d : |{\bf{x}}-{\bf{y}}| \lt t \}$ and $S_j({\bf{x}}) := \{{\bf{y}} \in \mathbb{R}^d : 2^jt \leq |{\bf{x}}-{\bf{y}}| \lt 2^{j+1}t \}$ for $j \in \mathbb{N} $. By Equation (23), we have that
for each ${\bf{x}} \in \mathbb{R}^d$.
Since ϕ is Schwartz, we have
for all $M \in \mathbb{N} $. Therefore,
for all ${\bf{x}} \in \mathbb{R}^d$. Decomposing $\mathbb{R}^d$ into the sets $S_j({\bf{x}})$, we have
Using estimate (24), we obtain that
Normalizing by t −d, we obtain the desired estimate.
Proof of Lemma 3
Fix $0 \leq s \leq \lfloor \log_2 N \rfloor$ and $a/q \in \mathcal{R}_s$. For each $0 \leq j \leq \lfloor \log_2 N \rfloor - s$, identity (18) and estimate (22) imply that for each ${\bf{x}} \in \mathbb{Z}^d$, we have
by taking $\textstyle\phi={\mathcal F}_{\mathbb{R}^d}(\mathrm D_{2^s}\;\lambda^{-1/k}\psi_j)$ and $t = \lambda^{1/k}2^{-s}$ in Proposition 2.
Proof of Theorem 2
Let $1 \leq p \leq 2$ and $f \in L^{p^{\prime}}(\mathbb{T}^d)$ be normalized so that $\|f\|_{L^{p^{\prime}}(\mathbb{T}^d)}=1$. Interpolating the bounds (19) and (21) for $\mu^{a/q,j}_\lambda$ when $0 \leq j+s \lt \lfloor \log_2 N \rfloor$, we obtain
Summing over fractions $a/q \in R_s$ for $j \leq s \lt \lfloor \log_2 N \rfloor$, we find that
Provided $1-\kappa(1-\frac{2}{p}) \lt 0$, which is equivalent to the range $p \gt 2+\frac{2}{\kappa-1}$, we have
Consequently, when $p \gt 2+\frac{2}{\kappa-1}$, we have
Comparing the exponent of λ with the desired one of $\frac{d}{k}-1-\frac{2d}{kp}$, we find that we have Equation (2) for $p \gt 2+\frac{4}{k\kappa-\kappa-1}$. This is better than the range of $p \gt 2+\frac{4}{\kappa-2}$ claimed in the theorem.
When $0 \leq j+s = \lfloor \log_2 N \rfloor$, we have
Summing over $0 \leq s \leq \lfloor \log_2 N \rfloor$, we find that
provided that $(\epsilon-\kappa)(1-\frac{2}{p}) \lt 0$ for arbitrarily small, positive ϵ. For each $0 \lt \epsilon \lt \kappa-2$, this is equivalent to the range of $p \gt \frac{2(\kappa-\epsilon)}{\kappa-2-\epsilon}$. Thereby, taking ϵ to 0, we arrive at the range of $p \gt \frac{2\kappa}{\kappa-2} = 2+\frac{4}{\kappa-2}$, as claimed.
Acknowledgements
The authors would like to thank the Heilbronn Institute for Mathematical Research for enabling this collaboration through their Focused Research Workshop ‘Efficient Congruencing and Decoupling’ in June 2019. KH thanks Virginia Tech for their hospitality whilst part of this paper was written.
KH thanks Dr. Efthalia Tzitzili for a discussion on positive definite forms. We thank the anonymous referees for their feedback on this paper.
Funding Statement
EP was supported in part by Simons Foundation grant no. 360560.
Competing Interests
The authors declare no competing interests pertaining to the undertaken research.