Hostname: page-component-78c5997874-v9fdk Total loading time: 0 Render date: 2024-11-10T12:57:23.645Z Has data issue: false hasContentIssue false

Diagonal quartic surfaces with a Brauer–Manin obstruction

Published online by Cambridge University Press:  17 March 2023

Tim Santens*
Affiliation:
Departement wiskunde, KU Leuven, Celestijnenlaan 200B, 3001 Leuven, Belgium Tim.Santens@kuleuven.be
Rights & Permissions [Opens in a new window]

Abstract

In this paper we investigate the quantity of diagonal quartic surfaces $a_0 X_0^4 + a_1 X_1^4 + a_2 X_2^4 +a_3 X_3^4 = 0$ which have a Brauer–Manin obstruction to the Hasse principle. We are able to find an asymptotic formula for the quantity of such surfaces ordered by height. The proof uses a generalization of a method of Heath-Brown on sums over linked variables. We also show that there exists no uniform formula for a generic generator in this family.

Type
Research Article
Copyright
© 2023 The Author(s). The publishing rights in this article are licensed to Foundation Compositio Mathematica under an exclusive licence

1. Introduction

This paper is concerned with failures of the Hasse principle. A $\mathbb {Q}$-variety $X$ satisfies the Hasse principle if $X(\mathbb {A}_\mathbb {Q}) \neq \emptyset$ implies that $X(\mathbb {Q}) \neq \emptyset$. There are multiple known obstructions which explain why varieties fail the Hasse principle, the most important of these being the Brauer–Manin obstruction. This obstruction consists of constructing a subset $X(\mathbb {Q}) \subset X(\mathbb {A}_\mathbb {Q})^{\operatorname {Br}} \subset X(\mathbb {A}_\mathbb {Q})$ called the Brauer–Manin set. We say that $X$ has a Brauer–Manin obstruction to the Hasse principle if $X(\mathbb {A}_\mathbb {Q}) \neq \emptyset$ but $X(\mathbb {A}_\mathbb {Q})^{\operatorname {Br}} = \emptyset$.

In this paper we look at K3 surfaces. These lie at the boundary of our current understanding of the Hasse principle. It was conjectured by Skorobogatov [Reference SkorobogatovSko09] that the Brauer–Manin obstruction is the only obstruction for K3 surfaces. This conjecture is only known for certain Kummer varieties [Reference Harpaz and SkorobogatovHS16, Reference HarpazHar19] when assuming the truth of some big conjectures in number theory (finiteness of Shafarevich–Tate groups) and wide open in general.

Recently Gvirtz, Loughran and Nakahara [Reference Gvirtz, Loughran and NakaharaGLN22] investigated how often diagonal degree $2$ K3 surfaces have a Brauer–Manin obstruction and were able to obtain the correct order of magnitude. We instead look at another family of K3 surfaces, namely the diagonal quartic surfaces. These are defined by the equation

\[ X_{\mathbf{a}}: a_0 X_0^4 + a_1 X_1^4 + a_2 X_2^4 + a_3 X_3^4 = 0 \subset \mathbb{P}_{\mathbb{Q}}^3 \]

for $\mathbf {a} = (a_0, a_1, a_2, a_3)$. Unlike the case of degree $2$ K3 surfaces the Brauer group has already been computed. The algebraic part by Bright [Reference BrightBri02] and the transcendental part by Gvirtz, Ieronymou and Skorobogatov [Reference Ieronymou and SkorobogatovIS15, Reference Gvirtz and SkorobogatovGvSk22]. However, in our case there is no uniform description of the relevant algebras. When compared with the case of degree $2$ K3 surfaces this causes us significant difficulties. However, the method used to resolve these issues also allows us, without too much extra effort, to find an asymptotic formula. This is a stronger type of result than the correct order of magnitude in [Reference Gvirtz, Loughran and NakaharaGLN22]. We hope that this method can also be applied to find asymptotic formulas for other families.

First note that due to [Reference Bright, Browning and LoughranBBL16, Theorem 1.3] a positive proportion of these surfaces have local points everywhere. This positive proportion can be computed as a product of local densities. It has been shown by Bright [Reference BrightBri11] that $0\%$ of these surfaces have a Brauer–Manin obstruction to the Hasse principle. In this paper we are able to improve this result by proving the following asymptotic formula.

Theorem 1.1 There exists a constant $A > 0$ such that

\[ \# \bigg\{ \mathbf{a} \in \mathbb{Z}_{\neq 0}^4: \begin{array}{l} |a_i| \leq T, \textrm{for all } i \in \{0,1,2,3\}, \\ X_{\mathbf{a}} \textrm{ has a BM obstruction to the HP} \end{array}\bigg\} \sim A T^2 (\log T)^{{33}/{8}}. \]

We actually prove the following finer results, denote the set in the left-hand side by $N^{\operatorname {Br}}(T)$, we can then define the subsets

\begin{align*} N_{ = \square}^{\operatorname{Br}}(T) &:= \{ \mathbf{a} \in N^{\operatorname{Br}}(T): a_0 a_1 a_2 a_3 \in \mathbb{Q}^{\times 2} \},\\ N_{ = -\square}^{\operatorname{Br}}(T) &:= \{ \mathbf{a} \in N^{\operatorname{Br}}(T): a_0 a_1 a_2 a_3 \in -\mathbb{Q}^{\times 2} \},\\ N_{ \neq \pm \square}^{\operatorname{Br}}(T) &:= \{ \mathbf{a} \in N^{\operatorname{Br}}(T): a_0 a_1 a_2 a_3 \not \in \pm \mathbb{Q}^{\times 2} \}. \end{align*}

Because $N^{\operatorname {Br}}(T) = N_{ = \square }^{\operatorname {Br}}(T) \sqcup N_{ = -\square }^{\operatorname {Br}}(T) \sqcup N_{ \neq \pm \square }^{\operatorname {Br}}(T)$ it suffices to prove the following theorem.

Theorem 1.2 There exist constants $0 < A, B$ such that

\begin{align*} \#N_{ = \square}^{\operatorname{Br}}(T) &= O(T^2 (\log T)^2), \\ \#N_{ = -\square}^{\operatorname{Br}}(T) &\sim A T^2 (\log T)^{{33}/{8}}, \\ \#N_{ \neq \pm \square}^{\operatorname{Br}}(T) &\sim B T^2 (\log T)^{{45}/{16}}. \end{align*}

Remark 1.3 It is geometrically interesting to instead count points $[a_0: a_1: a_2: a_3] \in \mathbb {P}^3_{\mathbb {Q}}(\mathbb {Q})$ of bounded height such that $X_{\mathbf {a}}$ has a Brauer–Manin obstruction. This is equivalent to counting only those $\mathbf {a}$ such that $\gcd (a_0, a_1, a_2, a_3) = 1$ in Theorems 1.1 and 1.2. Our method is easily able to deal with this variant. We explain in Remark 5.1 which minor modification one has to make to prove this variant.

We recall that a subset of $X(\mathbb {Q})$ is thin if it is a finite union of sets of the form $f(Y(\mathbb {Q}))$ for $Y$ irreducible and $f: Y \to X$ finite and not birational. For example, the subset $\{ \mathbf {a} \in \mathbb {A}^4(\mathbb {Q}) : a_0 a_1 a_2 a_3 \in - \mathbb {Q}^{\times 2}\}$ is thin. Theorem 1.2 implies that $100\%$ of the elements of $N^{\operatorname {Br}}(T)$ lie in this thin set. It has been understood for some time that in Manin-type conjectures one is required to remove a thin set. Thus, the exponent of the logarithm which would be reasonable to geometrically interpret is $45/16$. We give such an interpretation in § 1.1.

Somewhat unusually we do not prove the lower bound by providing an explicit example of a family of varieties which have a Brauer–Manin obstruction to the Hasse principle. Our method is actually only able to provide such a family of order of magnitude $T^2$. It seems very difficult to provide an explicit example of a family of varieties with a Brauer–Manin obstruction of the correct order of magnitude, due to the non-existence of a uniform description of the generator of $\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q})$ in the generic case. We make precise in § 6 what we mean by a uniform generator and prove, using ideas of Uematsu [Reference UematsuUem14, Reference UematsuUem16], that it indeed does not exist.

An important concept in the proof is that of prolific algebras as introduced by Bright [Reference BrightBri15]. Indeed, the proof proceeds by first showing that $50\%$ of the surfaces where a certain algebra $\mathcal {A}$, which depends on certain choices but defines a canonical element of $\operatorname {Br}(X_{\mathbf {a}})/ \operatorname {Br}(\mathbb {Q})$, is locally nowhere prolific have a Brauer–Manin obstruction. Then later we find an asymptotic formula for the amount of surfaces where $\mathcal {A}$ is nowhere locally prolific.

The fact that the thin set where $a_0 a_1 a_2 a_3 \in - \mathbb {Q}^{\times 2}$ dominates can also be interpreted using this notion. Namely, $\mathcal {A}$ being nowhere locally prolific and $X_{\mathbf {a}}$ being locally soluble is more common on this thin set than outside of it.

To prove this $50\%$ result we proceed roughly as follows. Given any choice of $\mathcal {A} \in \operatorname {Br}(X_{\mathbf {a}})$ and a tuple $\mathbf {u} = (u_0, u_1, u_2, u_3) \in \mathbb {Q}^{\times }$ we construct a related algebra $\mathcal {A}_{\mathbf {u}} \in \operatorname {Br}(X_{\mathbf {u}^2 \mathbf {a}})$ where $\mathbf {u}^2 \mathbf {a}= (u_0^2 a_0, u_1^2 a_1, u_2^2 a_2, u_3^2 a_3)$. It is possible to compute the invariants of $\mathcal {A}_{\mathbf {u}}$ in terms of those of $\mathcal {A}$. We use this to show that as long as $\mathcal {A}$ is locally nowhere prolific half of the surfaces $X_{\mathbf {a} \mathbf {u}^2}$ have a Brauer–Manin obstruction as $\mathbf {u}$ varies. Note that we do not need to know what the invariants of $\mathcal {A}$ actually are.

1.1 Structure

We now give an overview of the structure of the paper. In the second section we recall the notion of cyclic algebras and the Brauer–Manin obstruction. We also describe a slight modification of a method by Bright [Reference BrightBri15] which is used to decide whether an algebra is prolific.

The third section starts by constructing two Brauer elements $\mathcal {A}, \mathcal {B}$ which we use in our arguments. We then apply the method in the previous section to compute when they are prolific. We also look at how the invariants of the Brauer element $\mathcal {A}$ change as we multiply the coefficients $\mathbf {a}$ by a square.

The goal of the fourth section is to generalize a method of Heath-Brown [Reference Heath-BrownHea93] on sums over linked pairs of variables. In particular, we need to deal with variables which are linked by quartic residue symbols and not just by Jacobi symbols. This generalization is proven in § 4.2. To prove this in the desired generality, and also for other uses, we prove an uniform asymptotic formula for sums over frobenian multiplicative functions as introduced by Loughran and Matthiesen [Reference Loughran and MatthiesenLM19] in § 4.1.

In this section we also prove a lemma which can be interpreted as a form of the hyperbola method for toric varieties [Reference Pieropan and SchindlerPS20] for functions $f(x_1, \ldots, x_n)$ which can be written as a product of single-variable functions $f_i(x_i)$. The advantage of this new form is that it can also be applied when $\sum _{x_i \leq T} f_i(x_i)$ is not of order of magnitude a power of $T$. These analytic results are of independent interest.

In the fifth section we count $N^{\operatorname {Br}}(T)$. We first show that most of the Brauer–Manin obstructions are caused by $\mathcal {A}$. We then prove that $50\%$ of the surfaces where $\mathcal {A}$ is nowhere locally prolific have a Brauer–Manin obstruction. We will then interpret the amount of surfaces where $\mathcal {A}$ is nowhere locally prolific as a sum over a large amount of linked variables and use the method developed in the third section to reduce the number of variables. We finish by computing an asymptotic formula for the resulting simpler sum using our form of the hyperbola method.

The last section is devoted to proving that there exists no uniform formula for $\mathcal {A}$. We do this using ideas of Uematsu [Reference UematsuUem14, Reference UematsuUem16]. This section is independent of the rest of the paper.

1.2 An interpretation of the exponents

We give a geometric interpretation for the exponents in Theorem 1.2. What follows is quite speculative and we give no proofs of the claims we make.

Let $\pi : \mathfrak {X} \to \mathbb {P}_{\mathbb {Q}}^3$ be the universal diagonal quartic, i.e. $\mathfrak {X} \subset \mathbb {P}_{\mathbb {Q}}^3 \times \mathbb {P}_{\mathbb {Q}}^3$ is defined by the equation

\[ a_0 X_0^4 + a_1 X_1^4 + a_2 X_2^4 + a_3 X_3^4 = 0. \]

Proposition 3.4 can be interpreted as saying that $N^{\operatorname {Br}}(T)$ is a subset of the set of weak Campana points of the Campana orbifold $(\mathbb {P}^3_{\mathbb {Q}}, \frac {1}{2}D)$ where $D = \sum _{i =0}^3 \frac {1}{2} \{a_i = 0 \}$ as defined in [Reference Pieropan, Smeets, Tanimoto and Várilly-AlvaradoPSTV21, Definition 3.3]. This is also what one should expect from arguments such as those in [Reference BrightBri18]. There is as of yet no Manin-type conjecture for the asymptotic behavior of weak Campana points. However, these are in some sense Campana points on the limit of all log blow-ups. To be precise let $B \to \mathbb {P}_{\mathbb {Q}}^3$ be a smooth log blow-up of the log pair $(\mathbb {P}_{\mathbb {Q}}^3, D)$ and let $\tilde {D} \subset B$ be the strict transform of $D$. The set of weak Campana points on $(\mathbb {P}^3_{\mathbb {Q}}, \frac {1}{2}D)$ is the intersection of the images of the Campana points on the orbifolds $(B, \frac {1}{2} \tilde {D})$ as $B \to \mathbb {P}_{\mathbb {Q}}^3$ ranges over all smooth log blow-ups. We are counting Campana points with respect to a height function determined by the hyperplane class $H \in \textrm {Pic}(\mathbb {P}_{\mathbb {Q}}^3)$.

Blow up $\mathbb {P}^3_{\mathbb {Q}}$ first at the disjoint subvarieties $\{a_0 = a_1 = 0\},\ \{a_2 = a_3 = 0\}$, then blow it up at the strict transforms of $\{ a_0 = a_2 = 0\},\ \{a_1 = a_3 = 0\}$ and then blow it up at the strict transforms of $\{a_0 = a_3 = 0\},\ \{a_1 = a_2 = 0\}$. Call the resulting (toric) variety $B$. Denote by $E_{k \ell } \subset B$ the (strict transform) of the exceptional divisor corresponding to $\{a_k = a_\ell = 0\}$.

One can check that the order of magnitude of the set of Campana points on the orbifold $(B,\frac {1}{2} \tilde {D})$ will not change if we do more log blow-ups $B' \to B$. The order of magnitude in this case is $T^2 (\log T)^6$ where $6 = \textrm {rk}(\textrm {Pic}(B)) - 1$. We get a further saving on the exponent $6$ from two parts: first, that the surfaces we count have to be locally soluble; and, second, due to properties of the Brauer group.

First note that the base change $\mathfrak {X} \times _{\mathbb {P}^3_{\mathbb {Q}}} B$ is not smooth. Let $\mathfrak {Y}$ be the blow-up of $\mathfrak {X} \times _{\mathbb {P}^3_{\mathbb {Q}}} B$ at the subvarieties $\mathfrak {X} \times _{\mathbb {P}^3_{\mathbb {Q}}} E_{k \ell } \cap \{X_m = X_n = 0\}$ for all $\{k, \ell, m, n\} = \{0, 1, 2, 3\}$. This variety is smooth. We interpret the exponent of the logarithm as coming from $f: \mathfrak {Y} \to B$.

The first issue is that the surfaces we count have to be locally soluble. As in [Reference Loughran and SmeetsLS16] we should get for each divisor $D \subset B$ a saving in the exponent of the logarithm equal to $1$ minus the $\delta$-invariant of the set of geometrically irreducible components of the fiber $\mathfrak {Y}_{\mathbb {Q}(D)}$ as a $\textrm {Gal}(\overline {\mathbb {Q}(D)}/\mathbb {Q}(D))$ set. This $\delta$-invariant is the probability that an element of $\textrm {Gal}(\overline {\mathbb {Q}(D)}/\mathbb {Q}(D))$ fixes any element of the set. The only divisors whose fiber is not geometrically irreducible are the $E_{k \ell }$. The étale algebra corresponding to this set of irreducible components is

\[ A = \mathbb{Q}(E_{k \ell})[\sqrt[4]{ -a_k/a_\ell}] \times \mathbb{Q}(E_{k \ell})[\sqrt[4]{ -a_m/a_n}]. \]

One computes that the $\delta$-invariant is $\frac {13}{32}$ so the saving in the logarithm is $\frac {19}{32}$.

The other savings should come from some sort of relative version of the computation in Lemma 3.8. An issue when trying to do this relative computation is that the relative version of $\mathcal {A}$ does not live in $\operatorname {Br}(\mathfrak {Y}_{\mathbb {Q}(B)})$, but only in $H^1(\mathbb {Q}(B), \textrm {Pic}(\mathfrak {Y}_{\overline {\mathbb {Q}(B)}}))$. A relative residue map has been defined in [Reference Bright, Browning and LoughranBBL16, (5.9)]:

\[ \operatorname{H}^1(\mathbb{Q}(B), \textrm{Pic}(\mathfrak{Y}_{\overline{\mathbb{Q}(B)}})) \to H^0(\mathbb{Q}(E_{k \ell}), \operatorname{H}^1(\mathfrak{Y}^{\textrm{sm}}_{\overline{\mathbb{Q}(E_{k \ell})}}, \mathbb{Q}/\mathbb{Z})). \]

However, in this case $\mathfrak {Y}^{\textrm {sm}}_{\overline {\mathbb {Q}(E_{k \ell })}}$ is a disjoint union of affine planes so the right-hand side is trivial. This relative residue map is thus not particularly useful.

We see from Lemma 3.8 that the savings should be equal to $1$ minus the chance that an element of $\textrm {Gal}(\overline {\mathbb {Q}(E_{k \ell })}/\mathbb {Q}(E_{k \ell }))$ fixes a geometrically irreducible component of the fiber $\mathfrak {Y}_{\mathbb {Q}(E_{k \ell })}$ and either fixes $\sqrt {a_k a_m/a_{\ell } a_n}$ or does not fix $\sqrt {-1}$. In other words it has to fix at least one of $\sqrt {\pm a_k a_m/ a_{\ell } a_n}$. The saving should thus be equal to $1$ minus the $\delta$-invariant of the $\textrm {Gal}(\overline {\mathbb {Q}(E_{k \ell })}/\mathbb {Q}(E_{k \ell }))$ set corresponding to the étale algebra

\[ A[\sqrt{a_k a_m/a_{\ell} a_n}] \times A[\sqrt{-a_k a_m/a_{\ell} a_n}]. \]

A computation shows that this $\delta$-invariant is $\frac {15}{32}$ so the saving is $\frac {17}{32}$ for each $E_{k \ell }$. The expected power of the logarithm is $6 - 6 \frac {17}{32} = \frac {45}{16}$ which is exactly what we wanted.

One can interpret the order of magnitude $T^2 (\log T)^{{33}/{8}}$ coming from the thin set $\{a_0 a_1 a_2 a_3 \in - \mathbb {Q}^{\times 2 }\}$ in a similar way, but instead of starting with $\mathbb {P}_{\mathbb {Q}}^3$ one has to consider (a desingularization) of the double cover $C = \{a_0 a_1 a_2 a_3 = -b^2\} \subset \mathbb {P}_{\mathbb {Q}}^3 \times \mathbb {A}_{\mathbb {Q}}^1$.

The leading constant in Theorem 1.2 (if one counts $\mathbf {a} = [a_0:a_1: a_2: a_3] \in \mathbb {P}^3_{\mathbb {Q}}(\mathbb {Q})$ of bounded height as in Remark 1.3) is exactly half the leading constant of the analogous counting problem where one counts the $\mathbf {a}$ such that $\mathcal {A}$ for $X_{\mathbf {a}}$ is nowhere locally prolific. This latter counting problem should have a leading constant of the same type as the counting problem in the conjecture [Reference Loughran and SmeetsLS16, Conjecture 1.6].

The known cases of this conjecture [Reference LoughranLou18, Reference Loughran, Takloo-Bighash and TanimotoLTT20, Reference Sofos and Visse-MartindaleSV21] suggest that the constant should take the Peyre-type form

\[ \frac{\alpha(B, \frac{1}{2}\tilde{D}) \# (\textrm{SubBr}/ \operatorname{Br} \mathbb{Q}) \tau( U^{\textrm{SubBr}})} {\gamma}, \]

where:

  1. (i) $\alpha ((B, \frac {1}{2}\tilde {D}), H)$ is the alpha constant of the orbifold $(B, \frac {1}{2}\tilde {D})$ as defined in [Reference Pieropan, Smeets, Tanimoto and Várilly-AlvaradoPSTV21, § 3.3];

  2. (ii) the group $\textrm {SubBr} \subset \operatorname {Br} \mathbb {Q}(\mathbb {P}^3)$ is the subordinate Brauer group which should be defined analogously to [Reference LoughranLou18, Definition 2.8];

  3. (iii) the set $U \subset \mathbb {P}^3_{\mathbb {Q}}(\mathbb {A}_{\mathbb {Q}})$ is the set of all $(\mathbf {a}_v)_v$ such that for each place $v$ the algebra $\mathcal {A}$ is not prolific on $X_{\mathbf {a}_v}$;

  4. (iv) the subset $U^{\textrm {SubBr}} \subset U$ is the set of those elements of $U$ which are orthogonal to $\textrm {SubBr}$ with respect to the Brauer–Manin pairing;

  5. (v) the Tamagawa number $\tau ( U^{\textrm {SubBr}})$ is the volume of $U^{\textrm {SubBr}}$ with respect to a certain Tamagawa measure $\tau$;

  6. (vi) the denominator $\gamma$ is a product of values of the Gamma function.

In most known cases of this conjecture [Reference LoughranLou18, Reference Loughran, Takloo-Bighash and TanimotoLTT20, Reference Sofos and Visse-MartindaleSV21] the number $\gamma$ is a special value of the Gamma function. But recently Loughran, Sofos and Rome [Reference Loughran, Rome and SofosLRS22, Theorem~1.1] proved a case of the conjecture in which $\gamma$ is $\pi^{{3}/{2}} = \Gamma(\frac{1}{2})^3$. They also conjecture a general formula [Reference Loughran, Rome and SofosLRS22, Conjecture 2.14] for $\gamma$. In our case this gives $\gamma = \Gamma(61/16)$ because $k[B \setminus \bigcup_{k, \ell} E_{k \ell}]^{\times} = k^{\times}$.

In our case the subordinate Brauer group must certainly be contained in $\operatorname {Br} B \setminus \bigcup _{k, \ell } E_{k \ell }$ because the $E_{k \ell }$ are the only divisors for which we have a non-trivial $\delta$-invariant. However, $B \setminus \bigcup _{k \neq \ell } E_{k \ell } \cong \mathbb {P}^3_{\mathbb {Q}} \setminus \bigcup _{k \neq \ell } \{a_k = a_{\ell } = 0\}$. In addition, $\{a_k = a_{\ell } = 0\}$ has codimension $2$ so $\operatorname {Br} B \setminus \bigcup _{k, \ell } E_{k \ell } \cong \operatorname {Br} \mathbb {P}^{3}_{\mathbb {Q}} = \operatorname {Br} \mathbb {Q}$. We thus have $\# (\textrm {SubBr}/ \operatorname {Br} \mathbb {Q}) = 1$ and $U^{\textrm {SubBr}} = U$.

It is unclear how to interpret the necessary convergence factors in the Tamagawa measure. In [Reference LoughranLou18, § 5.7.2] a virtual Artin character is used, but this has no obvious analogue in our case.

It is conjectured by Lehman and Tanimoto [Reference Lehmann and TanimotoLT17, § 1.2] that the non-Zariski dense part of the thin set in Manin's conjecture comes from finite maps which are étale in codimension $1$. The map $C \to \mathbb {P}^3_{\mathbb {Q}}$ defining the thin set is ramified at the divisors $\{a_i = 0 \}$. However, the map of orbifolds $(C \times _{\mathbb {P}^3_{\mathbb {Q}}} B, 0) \to (B, \frac {1}{2} \tilde {D})$ is étale in codimension $1$ in the sense that the map $C \times _{\mathbb {P}^3_{\mathbb {Q}}} B \to B$ is unramified at all divisors except those in $\tilde {D}$ and that the ramification degree at every divisor in $\tilde {D}$ is equal to $2$. Our result thus agrees with the orbifold version of this conjecture.

1.3 Notation and conventions

For every field $K$ fix a separable closure $\overline {K}$.

Given a number field $K$ we let $\Omega _K$ be the set of its places. Given a place $v \in \Omega _K$ we denote by $K_v$ the associated local field and $\mathcal {O}_v$ the valuation ring. If $v$ is finite, we let $\mathbb {F}_v$ be the residue field.

We identify $\mathbb {Z}/n \mathbb {Z}$ with the subgroup of $\mathbb {Q}/\mathbb {Z}$ generated by ${1}/{n}$.

For any group $G$ and number $n \in \mathbb {N}$ we use $G[n]$ to denote the $n$-torsion. If $G$ is discrete and $p$ is a prime number, then let $G(p')$ be the prime to $p$ torsion. If $G$ naturally has a profinite topology, the only such $G$ we consider are étale fundamental groups, then let $G(p')$ be the maximal prime to $p$ quotient.

All cohomology is étale/Galois cohomology and all fundamental groups are étale fundamental groups. We omit the base point if it is not relevant. For any abelian group $B$, we identify elements of $H^{1}(X, B)$ with the corresponding maps $\pi _1(X) \to B$.

For any number field $K$, we let $I_K$ be the monoid of integral ideals of $K$. For any extension $L/K$ of number fields and ideal $\mathfrak {a} \in I_L$ we let $\mathbf {N}_{L/K}(\mathfrak {a})$ be the relative norm. We write $\mathbf {N}(\cdot ) := \mathbf {N}_K(\cdot ) := \mathbf {N}_{K/\mathbb {Q}}(\cdot )$ for the absolute norm.

For any ideal $\mathfrak {n}$ of $K$ and $z \in \mathbb {C}$ we define $\tau _z(\mathfrak {n})$ as the coefficients of the Dirichlet series $\zeta _K(s)^z$. It is the unique multiplicative function such that for all $k \in \mathbb {N}$

\[ \tau_z(\mathfrak{p}^k) = \binom{z + k - 1}{k}. \]

In particular, for an integer $n$, $\tau (n) := \tau _2(n)$ is the number of divisors of $n$. We frequently use that for $z \geq 0$

\[ \sum_{\mathbf{N}(\mathfrak{n}) \leq N} \tau_z(\mathfrak{n}) \ll_{z,K} N (\log N)^{z -1}. \]

We also use the divisor bound, i.e. that for all $\varepsilon > 0$ we have $\tau _{z}(\mathfrak {n}) \ll _{z, K, \varepsilon } \mathbf {N}(\mathfrak {n})^{\varepsilon }$.

For any number $n$ we define $\omega (n) := \sum _{p \mid n} 1$ as the amount of prime ideals dividing $\mathfrak {n}$, not counted with multiplicity.

For any number $n \in \mathbb {N}$ define $\textrm {rad}(n) := \prod _{p \mid n} p$ as the largest square-free number dividing $n$.

A prime $\mathfrak {p}$ of $K$ is totally split if it completely splits for the extension $K/\mathbb {Q}$. This is equivalent to $\mathbf {N}_K(\mathfrak {p})$ being a prime number.

For any ideal $\mathfrak {m}$ a Hecke character $\chi$ of finite modulus $\mathfrak {m}$ is a character of the group $\{\mathfrak {a} \in I_K : (\mathfrak {a}, \mathfrak {m}) = 1\}/\{ (\alpha ) \in I_K : \alpha \equiv 1 \ (\mathrm {mod}\ \mathfrak {m})\}$. The conductor of $\chi$ is the minimal ideal $\mathfrak {q} \mid \mathfrak {m}$ such that there exists a Hecke character $\chi '$ of modulus $\mathfrak {q}$ such that $\chi (\mathfrak {a}) = \chi '(\mathfrak {a})$ for all $\mathfrak {a}$ coprime to $\mathfrak {m}$.

2. Algebraic preliminaries

2.1 Cyclic algebras

We recall the notion of cyclic algebras. Let $K$ be a field, $n \in \mathbb {N}$, $\chi \in H^{1}(K, \mathbb {Z}/n\mathbb {Z})$ and $b \in K^{\times }$. Then $b$ induces an element in $H^{1}(K, \mu _n ) \cong K^\times /K^{\times n}$ by Kummer theory. The associated cyclic algebra is the cup product

\[ (\chi, b) := \chi \cup b \in H^{2}(K, \mu_n) \subset \operatorname{Br}(K). \]

If $\mu _n \subset K$, then, after fixing a primitive $n$th root of unity $\zeta$, we can assign to each $a \in K^{\times }$ a character $\chi _a$ via the induced isomorphism $\mu _n \cong \mathbb {Z}/n \mathbb {Z}$. The induced cyclic algebra is

\[ (a,b)_\zeta := (\chi_a, b). \]

We denote the quaternion algebra $(a,b) := (a,b)_{-1}$. For more information and another construction of cyclic algebras we refer the reader to [Reference Gille and SzamuelyGiSz06, §§ 2.5 and 4.7].

In the special case when $K$ is a local field corresponding to a place $v$ of a number field $k$ we have the invariant map ${\rm inv}_v:\operatorname {Br}(K) \to \mathbb {Q}/\mathbb {Z}$ coming from local class field theory. This map is always an injection and it is an isomorphism if $v$ is finite. We define the quadratic Hilbert symbol as

\[ (a,b)_{v} := {\rm inv}_v((a,b)) \in \mathbb{Z}/2\mathbb{Z}. \]

This is the usual quadratic Hilbert symbol, but taking values in $\mathbb {Z}/2\mathbb {Z}$ instead of $\mu _2$.

2.2 Brauer–Manin obstruction

For any scheme $X$, its (cohomological) Brauer group is $\operatorname {Br}(X) := H^2(X, \mathbb {G}_m)$. Let $X$ be a variety over a number field $k$. We recall [Reference Colliot-Thélène and SkorobogatovCTS21, § 12.3] that there exists a Brauer–Manin pairing

\[ X(\mathbb{A}_k) \times \operatorname{Br}(X) \to \mathbb{Q}/\mathbb{Z}: ((x_v)_{v},A) \to \sum_{v \in \Omega_k} {\rm inv}_v( A(x_v)), \]

where $X(\mathbb {A}_k)$ is the set of adelic points of $X$. The left kernel of this pairing is called the Brauer–Manin set $X(\mathbb {A}_k)^{\operatorname {Br}}$, it contains the set of rational points $X(k)$, the right kernel contains the constant Brauer elements $\operatorname {Br}_0(X) := \textrm {Im}(\operatorname {Br}(k) \to \operatorname {Br}(X))$. If $X(\mathbb {A}_k) \neq \emptyset$ but $X(\mathbb {A}_k)^{\operatorname {Br}}= \emptyset$, we say that $X$ has a Brauer–Manin obstruction to the Hasse principle.

A homomorphism of groups $B \to \operatorname {Br}(X)$ induces a pairing

\[ X(\mathbb{A}_k) \times B \to \mathbb{Q}/\mathbb{Z}. \]

We call the left kernel of this pairing the Brauer–Manin set induced by $B$ and denote it by $X(\mathbb {A}_k)^B$. We have $X(\mathbb {A}_k)^{\operatorname {Br}} \subseteq X(\mathbb {A}_k)^B$ with equality if $B \to \operatorname {Br}(X)/\operatorname {Br}_0(X)$ is surjective. For $A \in \operatorname {Br}(X)$ let $\langle A \rangle \subset \operatorname {Br}(X)$ be the subgroup generated by $A$. We write $X(\mathbb {A}_k)^{A} := X(\mathbb {A}_k)^{\langle A \rangle }$ and call this the Brauer–Manin set induced by $A$

The algebraic Brauer group of $X$ is $\operatorname {Br}_1(X) := \textrm {Ker}(\operatorname {Br}(X) \to \operatorname {Br}(X_{\overline {k}}))$. The quotient $\operatorname {Br}(X)/ \operatorname {Br}_1(X)$ is the transcendental Brauer group of $X$.

2.3 Prolific algebras

To decide whether a quartic diagonal surface has a Brauer–Manin obstruction we use a method developed by Bright [Reference BrightBri15]. We want to slightly modify this method. In the original, one needs to know a priori that certain algebras are non-constant. We give a variation of the method where this is not necessary (but will follow a posteriori).

Let $X$ be a smooth variety over a number field $k$ and $v$ a finite place of $k$. Let $p$ be the characteristic of $\mathbb {F}_v$ and let $\mathcal {X}$ be a smooth $\mathcal {O}_v$-integral model of $X$, i.e. $\mathcal {X}$ is a smooth finite type separated $\mathcal {O}_v$-scheme and there is a fixed isomorphism of the generic fiber $\mathcal {X}_k$ with $X$. We also assume that the special fiber $\mathcal {X}_{\mathbb {F}_v}$ is connected and, thus, irreducible. Any integral model $\mathcal {X}$ can be turned into such a model by taking an open subscheme of the smooth locus $\mathcal {X}^{\textrm {sm}}$. Lastly, we assume that $\mathcal {X}(\mathbb {F}_v) \neq \emptyset$. Then $\mathcal {X}_{\mathbb {F}_v}$ has to be geometrically irreducible and $X(k_v) \supset \mathcal {X}(\mathcal {O}_v) \neq \emptyset$ by Hensel's lemma. Thus, $\operatorname {Br}_0(X_{k_v}) = \operatorname {Br}(k_v)$.

The following is a slight variation of [Reference BrightBri15, Definition 7.1].

Definition 2.1 We say that a morphism $B \xrightarrow {\alpha } \operatorname {Br}(X)$ is prolific at $v$ if the map

\[ {\rm inv}^B_v: X(k_v) \to B^\vee := \textrm{Hom}(B, \mathbb{Q}/ \mathbb{Z}) : (b, P) \to {\rm inv}_v(b(P)) \]

is surjective. An element $A \in \operatorname {Br}(X)$ of order $n$ is said to be prolific if $\mathbb {Z}/ n \mathbb {Z} \to \operatorname {Br}(X): {1}/{n} \to A$ is prolific.

Remark 2.2 If $B$ is prolific, then $B \to \operatorname {Br}(X)/\operatorname {Br}_0(X)$ is injective. Indeed, even the map $B \to \operatorname {Br}(X_{k_v})/ \operatorname {Br}(k_v)$ has to be injective. This is because the composition of ${\rm inv}^B_v$ with $B^\vee \to \alpha ^{-1}(\operatorname {Br}(k_v))^\vee$ always has to be constant, so $\alpha ^{-1}(\operatorname {Br}(k_v))^\vee$ is $0$ if $B$ is prolific. In particular, if $B$ is prolific in our sense, then $\alpha (B) \subset \operatorname {Br}(X)/\operatorname {Br}_0(X)$ is prolific in the sense of Bright.

The crucial property of prolific $B$ is the following [Reference BrightBri15, Proposition 7.3].

Proposition 2.3 If $B$ is prolific, then it induces no Brauer–Manin obstruction to the Hasse principle.

Let $\alpha : B \to \operatorname {Br}(X)$ be a map of abelian groups with $B$ finite and $p \nmid |B|$. We study whether $B$ is prolific by relating ${\rm inv}^B_v$ to another map.

We can compose $\alpha$ with the residue map $\partial _v: \operatorname {Br}(X)(p') \to \operatorname {H}^1(\mathcal {X}_{\mathbb {F}_v}, \mathbb {Q}/\mathbb {Z})(p')$ coming from the Gysin sequence [Reference Colliot-Thélène and SkorobogatovCTS21, Theorem 3.7.1]. This gives us a morphism $\partial _v \alpha : B \to \operatorname {H}^1(\mathcal {X}_{\mathbb {F}_v}, \mathbb {Q}/\mathbb {Z})(p')$. By duality and because $p \nmid |B|$ this corresponds to a map $\pi _1(\mathcal {X}_{\mathbb {F}_v}) \to B^\vee$ which we denote by $\partial ^v \alpha \in \operatorname {H}^1(\mathcal {X}_{\mathbb {F}_v}, B^\vee )$.

The following lemma gives the relation between $\partial ^v \alpha$ and ${\rm inv}_v^A$. For $x \in \mathcal {X}(\mathcal {O}_v)$ use the notation $x \bmod { \mathfrak {m}_v}$ for the image of $x$ under the composition

\[ \mathcal{X}(\mathcal{O}_v) \to \mathcal{X}(\mathbb{F}_v) = \mathcal{X}_{\mathbb{F}_v}(\mathbb{F}_v). \]

For any $\mathbb {F}_v$-variety $Z$ and $z \in Z(\mathbb {F}_v)$ we let $\textrm {Frob}_z$ be the image of the Frobenius $\textrm {Frob}_v$ under the map $z_*: \textrm {Gal}(\overline {\mathbb {F}}_v/\mathbb {F}_v) \to \pi _1(Z)$. This is only well-defined up to conjugacy. However, $B^\vee$ is abelian so $\partial ^v \alpha ( \textrm {Frob}_{x \bmod { \mathfrak {m}_v}})$ is well-defined.

Lemma 2.4 For all $x \in \mathcal {X}(\mathcal {O}_v) \subset X(k_v)$ we have

\[ {\rm inv}^B_v(x) = \partial^v \alpha( \textrm{Frob}_{x \bmod{ \mathfrak{m}_v}}). \]

Proof. We have to show that for all $b \in B$

\[ {\rm inv}_v(\alpha(b)(x)) = \partial_v \alpha(b)(\textrm{Frob}_{x \bmod{ \mathfrak{m}_v}}). \]

This follows from functoriality of the residue map [Reference Colliot-Thélène and SkorobogatovCTS21, Theorem 3.7.4], in particular the commutativity of the following diagram.

The composition of the bottom row is ${\rm inv}_v$ by combining [Reference Colliot-Thélène and SkorobogatovCTS21, Theorem 1.4.10, Lemma 2.3.3] and the definition of ${\rm inv}_v$.

This lemma allows us to relate whether $b^\vee \in B^\vee$ lies in the image of ${\rm inv}^B_v$ to the existence of points on certain varieties over $\mathbb {F}_v$.

Evaluation at the Frobenius induces an isomorphism $H^{1}(\mathbb {F}_v, B^\vee ) \cong B^\vee$. Denote the character corresponding to $b^\vee \in B^\vee$ under this isomorphism by $\chi _{b^\vee } \in H^{1}(\mathbb {F}_v, B^\vee )$. Consider now $\partial ^v \alpha - \chi _{b^\vee } \in \operatorname {H}^1( \mathcal {X}_{\mathbb {F}_v}, B^\vee )$. This corresponds to a $B^\vee$-torsor which we denote by $\pi _{b^\vee }: Y_{b^\vee } \to \mathcal {X}_{\mathbb {F}_v}$. Over $\overline {\mathbb {F}}_v$ all of the $Y_{b^\vee }$ are isomorphic. Denote this $\overline {\mathbb {F}}_v$-variety by $\overline {Y}$.

Corollary 2.5 For $x \in \mathcal {X}^{\textrm {sm}} (\mathcal {O}_v)$ we have ${\rm inv}^B_v(x) = b^\vee$ if and only if $x \bmod \mathfrak {m}_v \in \pi _{b^\vee }(Y_{b^\vee }(\mathbb {F}_v))$. Such an $x$ exists if and only if $Y_{b^\vee }(\mathbb {F}_v) \neq \emptyset.$

Proof. Let $P := x \bmod \mathfrak {m}_v$. We have $P \in \pi _{b^\vee }(Y_{b^\vee }(\mathbb {F}_v))$ if and only if $\pi _{b^\vee }^{ -1}(P)$ has a $\mathbb {F}_v$-point. This fiber is the $B^\vee$-torsor corresponding to the composition

\[ \pi_1(\mathbb{F}_v) = \textrm{Gal}(\overline{\mathbb{F}}_v/\mathbb{F}_v) \xrightarrow{P_*} \pi_1(\mathcal{X}_{\mathbb{F}_v}) \xrightarrow{\partial^v \alpha - \chi_{b^\vee} } B^\vee. \]

A torsor over a field has a rational point if and only if it splits. Thus, this torsor has a $\mathbb {F}_v$-point if and only if $(\partial ^v \alpha - \chi _{b^\vee })(\textrm {Frob}_P) = 0$. In other words $\partial ^v \alpha (\textrm {Frob}_P) = b^\vee$ and we are done by Lemma 2.4.

We are now interested in when $Y_{b^\vee }(\mathbb {F}_v) \neq \emptyset$. A necessary condition is that one (and, thus, any) connected component is geometrically connected. This condition is sufficient as long as $|\mathbb {F}_v|$ is sufficiently large with respect to the (compactly supported) Betti numbers of $\overline {Y}$ by the Weil conjectures [Reference DeligneDel80]. The following lemma describes when this happens.

Lemma 2.6 Applying the snake lemma to the following commutative diagram with exact rows

defines a morphism $\delta ^{b^\vee }: \textrm {Gal}(\overline {\mathbb {F}}_v/ \mathbb {F}_v) \to \textrm {coker}(\overline {\partial ^v \alpha })$. It is equal to $\delta ^{b^\vee } = \delta ^0 - \chi _{b^\vee }$.

In particular, the connected components of $Y_{b^\vee }$ are geometrically connected if and only if $\delta _{b^\vee } = 0$.

Proof. The first statement follows from standard homological algebra. To prove the second statement we note that the set of connected components of $Y_{b^\vee }$ has a transitive $B^\vee$-action with stabilizer $\textrm {im}(\partial ^v \alpha - \chi _{b^\vee })$, analogously the set of connected components of $\overline {Y}$ has a transitive $B^{\vee }$ action with stabilizer $\textrm {im}(\overline {\partial ^v \alpha })$. Every connected component is thus geometrically connected if and only if the natural map $\textrm {im}(\overline {\partial ^v \alpha }) \to \textrm {im}(\partial ^v \alpha - \chi _{b^\vee })$ is an isomorphism. The snake lemma implies that this happens if and only if $\delta ^{b^\vee } = 0$.

Remark 2.7 (i) The morphism $\delta ^{b^\vee }$ is 0 if and only

\[ \delta^{b^\vee}(\textrm{Frob}_v) = \delta^0(\textrm{Frob}_v) - b^\vee = 0 \in \textrm{coker}(\overline{\partial^v \alpha}). \]

The set of such $b^\vee$ is thus an $\textrm {im}(\overline {\partial ^v \alpha })$-coset of $B^\vee$.

(ii) If $\overline {Y}$ is connected, then $\overline {\partial ^v \alpha }$ is surjective so $\delta ^{b^\vee }$ will always be 0.

(iii) Dually, the boundary map $\delta _{b^\vee }$ which one gets by applying the snake lemma to the following diagram has to be zero.

Here $b^\vee$ is identified with the composition $B \xrightarrow {b^\vee } \mathbb {Q}/\mathbb {Z} \cong \operatorname {H}^1(\mathbb {F}_v, \mathbb {Q}/\mathbb {Z})$.

If $\mathcal {X}_{\mathbb {F}_v}$ is the smooth locus of a cone over a curve, we can say more. What follows is analogous to [Reference BrightBri15, Theorem 6.5]. We remark that the proof written there is incomplete. It is stated that fibrations with fiber $\mathbb {A}^1_{\mathbb {F}_v}$ induce isomorphisms on fundamental groups. This is already false for the fibration $\mathbb {A}^1_{\mathbb {F}_v} \to \operatorname {Spec}(\mathbb {F}_v)$ due to the existence of Artin–Schreier extensions of $\mathbb {A}^1_{\mathbb {F}_v}$. The statement is probably true for the maximal prime to $p$ quotient but we just need the following special case.

Lemma 2.8 Let $K$ be a field of characteristic $p$. Let $X \subset \mathbb {P}^{n+1}_K$ be the cone over a smooth variety $Y \subset \mathbb {P}^n_K$. The projection $X^{\textrm {sm}} \to Y$ induces an isomorphism $\pi _1(X^{\textrm {sm}})(p') \cong \pi _1(Y)(p')$.

Proof. Let $P \in X$ be the cone point. Consider the blow-up $X'$ at this point with exceptional divisor $E$. It is a $\mathbb {P}^1$-bundle over $Y$ and $X^{\textrm {sm}} = X \setminus P = X' \setminus E$. Thus, $X^{\textrm {sm}}$ is the complement of a relative normal crossing divisor $E$ in the smooth proper $Y$-variety $X'$. The desired statement follows from [Reference GrothendieckGro71, Examples XIII.4.4].

In particular, if $\mathcal {X}_{\mathbb {F}_v}$ is the smooth locus of a cone over a smooth proper curve $C$ then $\pi _1(\mathcal {X}_{\mathbb {F}_v})(p') \to \pi _1(C)(p')$ is an isomorphism. This induces an isomorphism $H^1(C, \mathbb {Q}/\mathbb {Z})(p') \cong \operatorname {H}^1(\mathcal {X}_{\mathbb {F}_v}, \mathbb {Q}/\mathbb {Z})(p')$.

Theorem 2.9 Assume that $\mathcal {X}_{\mathbb {F}_v}$ is the smooth locus of a cone over a smooth proper curve $C$ of genus $g$ and

(2.1)\begin{equation} |\mathbb{F}_v| > (g' + \sqrt{g'^2 -1})^2 \quad \mbox{where } g' = |\textrm{im}(\overline{\partial^v \alpha})| (g-1) + 1. \end{equation}

The image of ${\rm inv}^B_v$ contains $b^\vee$ if and only if $\delta ^{0}(\textrm {Frob}_v) = b^\vee \in \textrm {coker}(\overline {\partial ^v \alpha }).$

Proof. By Lemma 2.6 and Remark 2.7(i) it suffices to show that if all of the connected components of $Y_{b^\vee }$ are geometrically connected then it has a $\mathbb {F}_v$-point. By Lemma 2.8 there exists an étale cover $D \to C$ such that $Y_{b^\vee } = D \times _C \mathcal {X}_{\mathbb {F}_v}$. We show that $D(\mathbb {F}_v) \neq \emptyset$. Every connected component of $D$ is a connected étale $\textrm {im}(\overline {\partial ^v \alpha })$-torsor over $C$. By the Hurwitz formula this is a curve of genus $g'$. The Weil conjectures [Reference DeligneDel80] together with condition (2.1) give us what we want.

Corollary 2.10 Assume that $B \to H^{1}(C_{\overline {\mathbb {F}}_v}, \mathbb {Q}/\mathbb {Z})(p')$ is injective, equivalently, that the associated étale cover of $C_{\overline {\mathbb {F}}_v}$ is connected, in the situation of Theorem 2.9. Then $B$ is prolific and thus induces no Brauer–Manin obstruction to the Hasse principle.

Proof. Dually, the map $\overline {\partial ^v \alpha }$ is surjective so $\textrm {coker}(\overline {\partial ^v \alpha }) = 0$ and the condition in the theorem is automatically satisfied.

We end this subsection with a lemma which allows us to compute $\partial _v$ for cyclic algebras.

Lemma 2.11 Let $\chi \in H^{1}(k(X), \mathbb {Z}/ n \mathbb {Z})$ and $a \in H^{1}(k(X), \mu _n)$ such that $(\chi, a) \in \operatorname {Br}(X) \subset \operatorname {Br}(k(X))$. Let $v \not \in S$ be a finite place of $k$ coprime to $n$ such that $\mathcal {X}_{\mathbb {F}_v}$ is irreducible. In this case the valuation $v$ can be extended to $k(X)$. Assume that $\chi$ is unramified at $\mathcal {X}_{\mathbb {F}_v}$. Let $\chi |_{\mathbb {F}_v} \in H^{1}(\mathbb {F}_v(\mathcal {X}_{\mathbb {F}_v}), \mathbb {Z}/n\mathbb {Z})$ be the induced character. Then

(2.2)\begin{equation} \partial_v((\chi, a)) = v(a) \chi|_{\mathbb{F}_v}. \end{equation}

Proof. This is a combination of the second bullet point of [Reference Colliot-Thélène and SkorobogatovCTS21, p. 37] and applying [Reference Colliot-Thélène and SkorobogatovCTS21, Lemma 2.33] to compare the different notions of residue.

3. Computations for the Brauer–Manin obstruction

Let $X_{\mathbf {a}} \subset \mathbb {P}^3_{\mathbb {Q}}$ be the smooth surface defined by the equation

(3.1)\begin{equation} a_0 X_0^4 + a_1 X_1^4 + a_2 X_2^4 + a_3 X_3^4 = 0 \end{equation}

for $\mathbf {a} := (a_0, a_1, a_2, a_3) \in (\mathbb {Q}^{\times })^4$. Write $\theta _{\mathbf {a}} := a_0 a_1 a_2 a_3$. Two such surfaces are equivalent if one is obtained from the other by permuting the coefficients, multiplying the coefficients by fourth powers and multiplying them by a common factor. There is an obvious isomorphism between any two equivalent surfaces which permutes the coordinates and multiplies them by a constant.

3.1 Constructing Brauer group elements

Assume that $X_{\mathbf {a}}$ has local points everywhere. The following proposition gives two elements of $\operatorname {Br}(X_{\mathbf {a}})$. The first of these is due to Bright [Reference BrightBri11, Lemma 2.1] and the second is, written somewhat differently, due to Swinnerton-Dyer [Reference Swinnerton-DyerSwi00, (43)]. We give a different argument that these Brauer elements are unramified.

Proposition 3.1

  1. (i) Let $Y_{\mathbf {a}}$ be the smooth quadric surface defined by

    \[ a_0 Y_0^2 + a_1 Y_1^2 + a_2 Y_2^2 + a_3 Y_3^2 = 0 \]
    and let $\alpha : X_{\mathbf {a}} \to Y_{\mathbf {a}}$ the morphism given by $Y_i = X_i^2$. Because $X_{\mathbf {a}}$ has local points everywhere, $Y_{\mathbf {a}}$ must have local points everywhere and thus has a rational point by Hasse–Minkowski. Let $f$ be an equation defining the tangent plane at this rational point, then
    \[ \mathcal{A} := \bigg(\frac{\alpha^*f}{X_3^2}, \theta_{\mathbf{a}}\bigg) \in \operatorname{Br}(X_{\mathbf{a}}). \]
  2. (ii) If $\theta _{\mathbf {a}}\in \mathbb {Q}^{\times 2}$, let $\{k,\ell,m,n\} = \{0,1,2,3\}$ and let $Z_{\mathbf {a}}^{k \ell m}$ be the quadric curve defined by

    \[ a_k \mathbf{N}_{\mathbb{Q}(\sqrt{-({a_\ell}/{a_k})})}\bigg(Z_k + \sqrt{-\frac{a_\ell}{a_k}} Z_\ell\bigg)+ a_m Z_m^2 = a_k Z_k^2 + a_\ell Z_\ell^2 + a_m Z_m^2 = 0. \]
    For any choice of $\sqrt {\theta _{\mathbf {a}}}$ let $\gamma _{\sqrt {\theta _{\mathbf {a}}}}: X_{\mathbf {a}} \to Z_{\mathbf {a}}^{k \ell m}$ be the morphism defined by,
    \begin{align*} & Z_k = X_k^2 X_m^2 - \frac{\sqrt{\theta_{\mathbf{a}}}}{a_k a_m} X_\ell^2 X_n^2,\\ & Z_\ell = \frac{\sqrt{\theta_{\mathbf{a}}}}{a_\ell a_m} X_k^2 X_n^2 + X_\ell^2 X_m^2,\\ & Z_m = X_m^4 + \frac{a_n}{a_m} X_n^4. \end{align*}
    A priori this is only a rational map, but it can be extended to a morphism. By Hasse and Minkowski $Z_{\mathbf {a}}^{k \ell m}$ has a rational point. Let $h$ be an equation defining the tangent line at this point, then
    \[ \mathcal{B}_{k \ell m ; \sqrt{\theta_{\mathbf{a}}}} := \bigg(\gamma_{\sqrt{\theta_{\mathbf{a}}}}^*\frac{h}{Z_m}, -a_k a_m \sqrt{\theta_{\mathbf{a}}}\bigg) \in \operatorname{Br}(X_{\mathbf{a}}). \]

These algebras are independent of the choices made as elements of $\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q})$.

Proof. By the Grothendieck purity theorem [Reference GrothendieckGro68, III: Theorem 6.1] it suffices to prove that each of these Brauer elements is unramified at all the divisors of $X_{\mathbf {a}}$. Note that $\mathcal {A}$ is the pullback along $\alpha$ of the following Brauer element described in [Reference Colliot-Thélène and XuCTX09, § 5.8]:

(3.2)\begin{equation} \bigg(\frac{f}{Y_3}, a_0 a_1 a_2 a_3\bigg) \in \operatorname{Br}(Y_{\mathbf{a}} \setminus \{Y_3 = 0 \}). \end{equation}

This is independent of the choice of $f$ as an element of $\operatorname {Br}(Y_{\mathbf {a}} \setminus \{Y_3 = 0 \})/\operatorname {Br}(\mathbb {Q})$ which shows that $\mathcal {A}$ is well-defined as an element of $\operatorname {Br}(\mathbb {Q}(X_{\mathbf {a}}))/\operatorname {Br}(\mathbb {Q})$. It follows that $\mathcal {A}$ is unramified except possibly at the divisor $\{X_3 = 0\}$. But $f$ is ramified at $\{X_3 = 0\}$ of order 2 so by [Reference Garibaldi, Merkurjev and SerreGMS03, Proposition 8.2] $\mathcal {A}$ is also unramified at this divisor.

We now show that $\gamma _{\sqrt {\theta _{\mathbf {a}}}}$ is indeed a morphism. To prove this we may work over the algebraic closure $\overline {\mathbb {Q}}$ by Galois descent. The given equations define $\gamma$ except when $X_m^2 = \sqrt {-({a_n}/{a_m})} X_n^2$ and $X_k^2 = -({\sqrt {\theta _{\mathbf {a}}}}/{a_k a_n}) \sqrt {-({a_n}/{a_m})} X_\ell ^2$ for any choice of the square root $\sqrt {-({a_n}/{a_m})}$. Note that this includes the cases $X_k = X_\ell = 0$, $X_n = X_m = 0$. Multiplying all the coordinates by $(X_k^2 - ({\sqrt {\theta _{\mathbf {a}}}}/{a_k a_n}) \sqrt {-({a_n}/{a_m})} X_\ell ^2)/(X_m^2 - \sqrt {-({a_n}/{a_m})} X_n^2)$ gives coordinates which are defined in this case. Indeed,

(3.3) \begin{align} \frac{X_k^2 X_m^2 - ({\sqrt{\theta_{\mathbf{a}}}}/{a_k a_m}) X_\ell^2 X_n^2}{X_m^2 - \sqrt{-({a_n}/{a_m})} X_n^2} & = X_k^2 + X_n^2\frac{\sqrt{-({a_n}/{a_m})}X_k^2 - ({\sqrt{\theta_{\mathbf{a}}}}/{a_k a_m}) X_\ell^2}{X_m^2 - \sqrt{-({a_n}/{a_m})}X_n^2} \nonumber\\ & = X_k^2 + \frac{a_m}{a_k} \sqrt{-\frac{a_n}{a_m}}X_n^2 \frac{X_m^2 + \sqrt{-({a_n}/{a_m})}X_n^2}{X_k^2 - ({\sqrt{\theta_{\mathbf{a}}}}/{a_k a_n}) \sqrt{-({a_n}/{a_m})} X_\ell^2}, \nonumber\\ \frac{({\sqrt{\theta_{\mathbf{a}}}}/{a_\ell a_m}) X_k^2 X_n^2 + X_\ell^2 X_m^2}{X_m^2 - \sqrt{-({a_n}/{a_m})} X_n^2} & = X_\ell^2 + X_n^2\frac{\sqrt{-({a_n}/{a_m})} X_\ell^2 + ({\sqrt{\theta_{\mathbf{a}}}}/{a_\ell a_m}) X_k^2 }{X_m^2 - \sqrt{-({a_n}/{a_m})}X_n^2}\nonumber\\ & = X_\ell^2 + \frac{\sqrt{\theta_{\mathbf{a}}}}{a_k a_\ell } X_n^2 \frac{X_m^2 + \sqrt{-({a_n}/{a_m})}X_n^2}{X_k^2 - ({\sqrt{\theta_{\mathbf{a}}}}/{a_k a_n}) \sqrt{-({a_n}/{a_m})}X_\ell^2}, \nonumber\\ \frac{X_m^4 + ({a_n}/{a_m})X_n^4}{X_m^2 - \sqrt{-({a_n}/{a_m})} X_n^2} & = X_m^2 + \sqrt{-\frac{a_n}{a_m}} X_n^2. \end{align}

The divisor of $h$ is a square so $\mathcal {B}_{k \ell m; \sqrt {\theta _{\mathbf {a}}}}$ can only be ramified at the divisors contained in $\gamma ^*\{Z_m = 0\}$. By the definition of $\gamma$ and (3.3) this is the union of

\[ \bigg\{X_m^2 = \sqrt{-\frac{a_n}{a_m}} X_n^2,\ X_k^2 = \frac{\sqrt{\theta_{\mathbf{a}}}}{a_k a_n} \sqrt{-\frac{a_n}{a_m}} X_\ell^2\bigg\} \]

for the two different choices of $\sqrt {-({a_m}/{a_n})}$. So $-a_k a_m \sqrt {\theta _{\mathbf {a}}}$ is a square in the residue field.

If we have two choices of tangent lines defined by $g_1, g_2$, then their quotient $g_1/g_2$ is a rational function with square divisor and is thus a square up to multiplication by a constant. The associated quaternion algebras thus differ by a constant algebra.

Remark 3.2 The algebras $\mathcal {A}, \mathcal {B}_{k \ell m; \sqrt {\theta _{\mathbf {a}}}}$ exist more generally over other fields $K$ as long as the quadrics $Y, Z_{\mathbf {a}}^{k \ell m}$ have $K$-points. For the same reason as over $\mathbb {Q}$ they are still well-defined as elements of $\operatorname {Br}(X_{\mathbf {a}})/ \operatorname {Br}(K)$. We use this in what follows to see that $\mathcal {A}$ as an element of $\operatorname {Br}(X_{\mathbb {Q}_p})/\operatorname {Br}(\mathbb {Q}_p)$ can be computed by taking a tangent line at a $\mathbb {Q}_p$-point of $Y_{\mathbf {a}}$. It also implies that $\overline {\partial _v \mathcal {A}} \in \operatorname {H}^1(\mathcal {X}_{\overline {\mathbb {F}}_p}, \mathbb {Q}/\mathbb {Z})$ can be computed by taking the tangent line at a $K$-point of $Y_{\mathbf {a}}$ for an unramified $K/\mathbb {Q}_p$ extension. Analogously for $\mathcal {B}_{k \ell m; \sqrt {\theta _{\mathbf {a}}}}$.

3.2 Local computations

In this subsection we compute the local invariants of $\mathcal {A}$ utilizing the methods of § 2.3. Let us first describe when $X_{\mathbf {a}}$ has a $\mathbb {Q}_p$-point.

Lemma 3.3 Let $p$ be an odd prime number, then $X_{\mathbf {a}}( \mathbb {Q}_p) \neq \emptyset$ only if one of the following two conditions is true:

  1. (i) there exist $k \neq \ell$ such that $-({a_k}/{a_\ell }) \in \mathbb {Q}_p^4$;

  2. (ii) there exist pairwise distinct $k, \ell, m$ such that $v_p(a_k) \equiv v_p(a_\ell ) \equiv v_p(a_m) \pmod {4}$.

Conversely, $X_{\mathbf {a}}(\mathbb {Q}_p) \neq \emptyset$ if the first condition holds or if the second one holds and $p > 33$.

Proof. If $X_{\mathbf {a}}(\mathbb {Q}_p) \neq \emptyset$, then by fixing a solution and then taking an equivalent surface while appropriately changing the solution we may assume that $a_0 \in \mathbb {Z}_p^{\times }$, $a_1, a_2, a_3 \in \mathbb {Z}_p$ and that there exist $x_0, x_1, x_2, x_3 \in \mathbb {Z}_p^{\times }$ such that $a_0 x_0^4 + a_1 x_1^4 + a_2 x_2^4 + a_3 x_3^4 = 0$. By reducing modulo $p$ we see that at least one of the other coefficients is a $p$-adic unit. If there is exactly one other coefficient $a_k$ which is a $p$-adic unit, then $-({a_k}/{a_0}) \equiv ({x_k}/{x_0})^4 \not \equiv 0 \pmod {p}$, so by Hensel's lemma $-({a_k}/{a_0}) \in \mathbb {Q}_p^{\times 4}$ and the first condition holds. Otherwise the second condition holds.

If the first condition holds, then without loss of generality $k = 0, \ell = 1$ and $[1: \sqrt [4]{-a_1/a_0}: 0: 0] \in X_{\mathbf {a}}(\mathbb {Q}_p)$. For the second condition take an equivalent surface such that $a_k, a_\ell, a_m \in \mathbb {Z}_p^{\times }$. In this case the equation $a_k X_k^4 + a_\ell X_\ell ^4 + a_m X_m^4 = 0$ defines a smooth proper curve of genus $3$ which has a $\mathbb {F}_p$-point if $p > 33$ because of the Weil conjectures [Reference DeligneDel80]. It thus has a $\mathbb {Z}_p$-point by Hensel's lemma.

Proposition 3.4 Let $\{k, \ell, m, n\} = \{0,1,2,3\}$. If there exists a prime $p > 16{\,}897$ such that $v_p(a_n) = 1$ and $p \nmid a_k a_\ell a_m$, then $X_{\mathbf {a}}$ has no Brauer–Manin obstruction to the Hasse principle.

Proof. This is the same as [Reference BrightBri15, Corolllary 7.10] except that it holds for more primes. The proof is the same except that we can bound $|\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q})|$ by $32$ because in this case $\operatorname {Br}_1(X_{\mathbf {a}}) = \operatorname {Br}(X_{\mathbf {a}})$ due to [Reference Ieronymou and SkorobogatovIS15, Theorem 1.1] and [Reference Gvirtz and SkorobogatovGvSk22, Main theorem]. In addition $|\operatorname {Br}_1(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q})| \leq 32$ by [Reference BrightBri02, Appendix A].

Lemma 3.5 Let $\{k, \ell, m, n\} = \{0,1,2,3\}$ and $p > 97$ an odd prime. If for all $i \neq j \in \{0, 1, 2, 3 \}$ we have $a_i a_j \not = 1,-1,2,-2 \in \mathbb {Q}^{\times }/ \mathbb {Q}^{\times 2}$, $\theta _{\mathbf {a}}\in \mathbb {Q}^{\times 2}$, $v_p(a_n) \equiv 2 \pmod {4}$ and $p \nmid a_k a_\ell a_m$, then $X_{\mathbf {a}}$ has no Brauer–Manin obstruction to the Hasse principle.

Proof. In this situation $\operatorname {Br}_1(X_{\mathbf {a}}) = \operatorname {Br}(X_{\mathbf {a}})$ because of [Reference Ieronymou and SkorobogatovIS15, Theorem 1.1] and [Reference Gvirtz and SkorobogatovGvSk22, Main theorem]. The only possible case of [Reference BrightBri02, Appendix A] is A131, hence $\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q}) \cong \mathbb {Z}/ 2\mathbb {Z}$. We prove that $\mathcal {B}_{k \ell m; \sqrt {\theta _{\mathbf {a}}}}$ is prolific. This suffices because in that case $\mathcal {B}_{k \ell m; \sqrt {\theta _{\mathbf {a}}}}$ generates $\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q})$.

We apply Corollary 2.10. Let $\mathcal {X}$ be the smooth locus of the integral model given by the same equation. The special fiber $\mathcal {X}_{\mathbb {F}_p}$ is the smooth locus of the cone over the curve $C: a_k X_k^4 + a_\ell X_\ell ^4 + a_m X_m^4 = 0$. To compute $\partial _p \mathcal {B}_{k \ell m ; \sqrt {\theta _{\mathbf {a}}}}$ we may work over the maximal unramified extension of $\mathbb {Q}_p$ by Remark 3.2. We may thus assume that $a_k = a_\ell = a_m = 1$ and take $h = Z_k + \sqrt {-1} Z_m = 0$ as the equation defining a tangent line of $Z_{\mathbf {a}}^{k \ell m}$. Then

\[ \gamma^*_{\sqrt{\theta_{\mathbf{a}}}} \frac{h}{Z_m} \equiv \frac{X_k^2 X_m^2 + \sqrt{-1}X_m^4}{X_m^4} \equiv \frac{X_k^2}{X_m^2} + \sqrt{-1} \pmod{p}. \]

Lemma 2.11 implies that $\partial _p \mathcal {B}_{k \ell m ; \sqrt {\theta _{\mathbf {a}}}} = {X_k^2}/{X_m^2} + \sqrt {-1}$ since $v_p(\theta _{\mathbf {a}})$ is odd.

We check that the étale cover of $C$ defined by $Y^2 = X_k^2 + \sqrt {-1} X_m^2$ is geometrically irreducible using the Magma function IsAbsolutelyIrreducible(). The full script can be found on the authors webpage. Hence, $\mathcal {B}_{k \ell m ; \sqrt {\theta _{\mathbf {a}}}}$ is prolific by Corollary 2.10.

We say that the algebra $\mathcal {A}$ is normalized if the coefficients of $f$ in the definition are integral and coprime. It is clear that we may always assume that $\mathcal {A}$ is normalized. More generally, for any prime $p$ we say that $\mathcal {A}$ is $p$-normalized if all the coefficients are $p$-adic integers and at least one of them is a unit. Thus, $f$ and $\alpha ^* f$ will be non-zero polynomials when reduced modulo $p$ which we denote by $\tilde {f}$ and $\alpha ^* \tilde {f}$, respectively. These normalised algebras are useful because we are then able to write more explicit formulas for the invariant.

Lemma 3.6 Let $\{k, \ell, m, n\} = \{0,1,2,3\}$. Let $p$ be an odd prime such that $p \nmid a_k a_\ell a_m$, $v_p(a_n) = 0,2$, $X_{\mathbf {a}}(\mathbb {Q}_p) \neq \emptyset$. If $\mathcal {A}$ is $p$-normalized, then ${\rm inv}_p(\mathcal {A}(\cdot )) = 0$.

Proof. Let $\mathcal {X}$ be the integral model given by the same equation. It follows from looking at primitive solutions that $\mathcal {X}^{\textrm {sm}}(\mathbb {Z}_p) = X_{\mathbf {a}}(\mathbb {Q}_p)$. The special fiber $\mathcal {X}^{\textrm {sm}}_{\mathbb {F}_v}$ is irreducible, so $v_p$ can be extended to $\mathbb {Q}_p(X)$. By Lemma 2.4 it suffices to show that $\partial _p \mathcal {A} = 0$. Since $\mathcal {A}$ is normalized $\alpha ^* f$ will be non-zero when reduced modulo $p$. The reduction $\alpha ^* \tilde {f}$ cannot be zero on the special fiber because it has smaller degree than the defining equation so $v_p({\alpha ^* f}/{X_3^2}) = 0$. We are done by applying Lemma 2.11.

Lemma 3.7 Let $\{k, \ell, m, n\} = \{0,1,2,3\}$. Let $p$ be an odd prime such that $v_p(a_m), v_p(a_n) \in \{1,3\}$, $p \nmid a_k a_\ell$ and $X_{\mathbf {a}}(\mathbb {Q}_p) \neq \emptyset$.

If $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^{\times 2}$ and $-({a_k}/{a_\ell }) \in \mathbb {Q}_p^{\times 4}$, then $\mathcal {A} \not \in \operatorname {Br}_0(X)$. If on top of this $\mathcal {A}$ is $p$-normalized, then for all $P \in X_{\mathbf {a}}(\mathbb {Q}_p)$,

\[ {\rm inv}_p(\mathcal{A}(P)) = \begin{cases} 0 & \textrm{if } \alpha^*\tilde{f}(P) = 0 \\ \frac{1}{2} & \textrm{if } \alpha^*\tilde{f}(P) \neq 0. \end{cases} \]

In particular, if $p \equiv 1 \pmod {4}$, then $\mathcal {A}$ is prolific and if $p \equiv 3 \pmod {4}$ then ${\rm inv}_p(\mathcal {A}(\cdot ))$ is constant.

Proof. Because $-({a_k}/{a_\ell }) \in \mathbb {Q}_p^{\times 4}$ and $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^2$ we must have $-({a_m}/{a_n}) \not \in \mathbb {Q}_p^{\times 2}$. Let $\mathcal {X}$ be the integral model defined by the same equation. By looking at primitive solutions modulo $p^4$ we see that $\mathcal {X}^{\textrm {sm}}(\mathbb {Z}_p) = X_{\mathbf {a}}(\mathbb {Q}_p)$. Let $\mathcal {Y}$ be the integral model of $Y_{\mathbf {a}}$ given by the same equation. For the same reason $\mathcal {Y}^{\textrm {sm}}(\mathbb {Z}_p) = Y_{\mathbf {a}}(\mathbb {Q}_p)$.

Take a $b \in \mathbb {Z}_p^\times$ such that $-({a_k}/{a_\ell }) = b^4$. The special fiber $\mathcal {X}^{\textrm {sm}}_{\mathbb {F}_p}$ is equal to $\{X_k^4 = b^4 X_\ell ^4\} \setminus \{X_k = 0 = X_\ell \}$. This has four geometrically connected components, $\{X_k = \sqrt {-1}^t b X_\ell \}$ for $t = 0,1,2,3$. It thus has four or three connected components for $p \equiv 1 \pmod {4}$ or $p \equiv 3 \pmod {4}$, respectively.

The plane defined by $f$ is tangent to a $\mathbb {Q}$-point of $Y_{\mathbf {a}}$, so it is tangent to a $\mathbb {Z}_p$-point of $\mathcal {Y}^{\textrm {sm}}$. The reduction of this plane modulo $p$ is defined by $\tilde {f}$ because $\mathcal {A}$ is normalized. However, $\mathcal {Y}^{\textrm {sm}}_{\mathbb {F}_p} = \{Y_k^2 = b^4Y_\ell ^2\} \setminus \{Y_k = 0 = Y_\ell \}$ is a disjoint union of $2$ planes, so $\{ \tilde {f} = 0\}$ must be one of these two planes. Over each of these two planes lie two geometric components of $\mathcal {X}^{\textrm {sm}}_{\mathbb {F}_p}$.

Let $D$ be one of the connected components of $\mathcal {X}^{\textrm {sm}}_{\mathbb {F}_p}$ and $D^c$ the complement in $\mathcal {X}^{\textrm {sm}}_{\mathbb {F}_p}$. Let $\mathcal {X}^D := \mathcal {X}^{\textrm {sm}} \setminus D^c$. The special fiber $\mathcal {X}^D_{\mathbb {F}_p} = D$ is connected. By Lemma 2.4 it suffices to compute the residue $\partial _D \mathcal {A}$ for $\mathcal {X}^D$. We can extend $v_p$ to a valuation $v_D$ of $\mathbb {Q}_p(X)$. Since $v_p(\theta _{\mathbf {a}})$ is even and $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^{\times 2}$ we can apply Lemma 2.11 to find that

\[ \partial_D \mathcal{A} = \frac{1}{2} v_D\bigg(\frac{\alpha^* f}{X_3^2}\bigg) \in \operatorname{H}^1(\mathbb{F}_p, \mathbb{Z}/2\mathbb{Z}) \subset \operatorname{H}^1(D, \mathbb{Q}/\mathbb{Z}). \]

If $D \not \subset \{\alpha ^*\tilde {f} = 0\}$, then $v_D({\alpha ^*f}/{X_3^2}) = 0$ so $\partial _p \mathcal {A} = 0$.

If we change $\mathcal {A}$ by a constant algebra, then $\partial _D \mathcal {A}$ changes by a constant independent of $D$. It thus suffices to show that for a single choice of $f$ we have $\partial _D \mathcal {A} = \frac {1}{2}$ if $D \subset \{\alpha ^*\tilde {f} = 0\}$. By Remark 3.2 we may take $f = Y_k - b^2Y_\ell$ and, thus, $\alpha ^* f = X_k^2 - b^2X_\ell ^2$. Then

\[ \frac{X_k^2 - b^2X_\ell^2}{X_3^2} = -\frac{a_m X_n^4 + a_n X_n^4}{a_k(X_k^2 + b^2 X_\ell^2)X_3^2} = -p^i \frac{({a_m}/{p^i}) X_m^4 + ({a_n}/{p^i}) X_n^4}{a_k(X_k^2 +b^2 X_\ell^2)X_3^2}, \]

where $i = \min (v_p(a_m), v_p(a_n))$, so $i$ is odd, ${a_m}/{p^i}, {a_n}/{p^i} \in \mathbb {Z}_p$ and at least one of them is a unit. Because $D \not \subset \{X_k^2 = - b^2X_\ell ^2\}, \{({a_m}/{p^i}) X_m^4 + ({a_n}/{p^i})X_n^4 = 0\}$ we get $\partial _D(\mathcal {A}) = {i}/{2} = \frac {1}{2}$.

Note that $\partial _D( \mathcal {A})$ changes if we change $D$, so $\mathcal {A} \not \in \operatorname {Br}_0(X)$. The last statement follows by Hensel's lemma because $\alpha ^*\{ Y_k = -b^2 Y_\ell \} = \{X_k^2 = - b^2 X_\ell ^2\}$ has $\mathbb {F}_p$-points if and only if $p \equiv 3 \pmod {4}$.

We are now interested in how the invariant changes as the surface changes. As there is no uniform formula for $\mathcal {A}$ as shown in Corollary 6.3 we are only able to understand this for certain limited changes. Given $\mathbf {u} = (u_0, u_1, u_2, u_3) \in (\mathbb {Q}^{\times })^4$ we let $\mathbf {a} \mathbf {u}^2 =(a_k u_k^2)$. Choose a representation $\mathcal {A} = ({\alpha ^*f}/{X_3^2}, \theta )$ with $f = \sum _{i = 0}^3 y_i a_i Y_i$ tangent to the point $[y_0: y_1: y_2 : y_3] \in Y_{\mathbf {a}}(\mathbb {Q}).$ We then get a point $[{y_0}/{u_0}:{y_1}/{u_1}:{y_2}/{u_2}:{y_3}/{u_3}] \in Y_{\mathbf {a} \mathbf {u}^2}(\mathbb {Q})$ with a tangent plane defined by $f_{\mathbf {u}} = \sum _{i = 0}^3 y_i u_i a_i Y_i^2$. This defines an element $\mathcal {A}_{\mathbf {u}} = ({\alpha ^* f_{\mathbf {u}}}/{X_3^2}) \in \operatorname {Br}(X_{\mathbf {a}\mathbf {u}^2})$.

Note that if $\mathcal {A}$ is $p$-normalized and $u_i \in \mathbb {Z}_p^{\times }$, then $\mathcal {A}_{\mathbf {u}}$ will also be $p$-normalized. From this we can prove the following proposition.

Proposition 3.8 Let $\{k, \ell, m, n\} = \{0,1,2,3\}$. Let $p$ be an odd prime such that $v_p(a_k),v_p(a_\ell )$ have the same parity and $v_p(a_m), v_p(a_n)$ have the other parity.

If $\theta _{\mathbf {a}}\in \mathbb {Q}_p^{\times 2}$, then ${\rm inv}(\mathcal {A}(\cdot )) = 0$. Otherwise $\mathcal {A} \not \in \operatorname {Br}_0(X)$.

If $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^{\times 2}$ and $p \equiv 1 \pmod {4}$, then $\mathcal {A}$ is prolific so induces no obstruction to the Brauer–Manin obstruction.

If $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^{\times 2}$, $p \equiv 3 \pmod {4}$ and $-({a_k}/{a_\ell }) \in \mathbb {Q}_p^{\times 4}$, then ${\rm inv}(\mathcal {A}(\cdot ))$ is constant and for all $\mathbf {u} \in (\mathbb {Z}_p)^{\times 4}$ we have

\[ {\rm inv}(\mathcal{A_{\mathbf{u}}}(\cdot)) = {\rm inv}(\mathcal{A}(\cdot)) + \frac{({u_k u_\ell}/{p}) - 1}{4}. \]

Recall from Lemma 3.3 that because $X_{\mathbf {a}}(\mathbb {Q}_p) \neq \emptyset$ either $-({a_k}/{a_\ell }) \in \mathbb {Q}_p^{\times 4}$ or $-({a_m}/{a_n}) \in \mathbb {Q}_p^{\times 4}$, the second case can also be handled via this proposition via permuting the indices.

Proof. If $\theta _{\mathbf {a}}\in \mathbb {Q}_p^2$, then $\mathcal {A} = 1$ as an element of $\operatorname {Br}(X_{\mathbb {Q}_p})$ so ${\rm inv}_p(\mathcal {A}(\cdot ))$ will be constant.

By taking an equivalent surface we may assume that $p \nmid a_k a_\ell$, $v_p(a_m), v_p(a_n) \in \{1, 3\}$ and $-({a_k}/{a_\ell }) \in \mathbb {Q}_p^{\times 4}$. We may then also assume that $\mathcal {A}$ is $p$-normalized. The only thing to show after applying Lemma 3.7 is the formula for ${\rm inv}(\mathcal {A_{\mathbf {u}}}(\cdot ))$. As in the lemma there exists a $t$ such that $\alpha ^* \tilde {f} = X_k^2 - (-1)^{t} X_\ell ^2$. Let $\mathcal {X}_{\mathbf {a} \mathbf {u}^2}$ be the obvious integral model. By definition, the reduction of $\alpha ^* \tilde {f}$ is equal to $\alpha ^* \tilde {f}_\mathbf {u} = u_k X_k^2 - (-1)^t u_\ell X_\ell ^2$. The desired statement follows after applying Lemma 3.7 to $X_{\mathbf {a} \mathbf {u}^2}.$

If there exists $v_k \in \mathbb {Q}_p$ such that $u_k = v_k^2$ for all $k$, then there exists a bijection $X_{\mathbf {a}}(\mathbb {Q}_p) \to X_{\mathbf {a} \mathbf {u}^2}(\mathbb {Q}_p): \mathbf {x} \to \mathbf {x} \mathbf {v}^{-1} = [x_0 v_0^{-1}: x_1 v_1^{-1}: x_2 v_2^{-1}: x_3 v_3^{-1}]$. It follows directly from the definitions that

(3.4)\begin{equation} {\rm inv}(\mathcal{A_{\mathbf{u}}}(\mathbf{x} \mathbf{v}^{-1})) = {\rm inv}(\mathcal{A}(\mathbf{x})). \end{equation}

A fact we need is as follows.

Lemma 3.9 If $a_k \in \mathbb {Z}_p^{\times }$ and $u_k \in \mathbb {Z}_p$ for all $k$ and if $u_k$ is a unit for all but one $k$, then $\mathcal {A}$ is $p$-normalized if and only if $\mathcal {A}_{\mathbf {u}}$ is $p$-normalized.

Proof. Assume that $u_0$ is the non-unit. Let $[y_0: y_1: y_2 : y_3] \in Y_{\mathbf {a}}(\mathbb {Q}_p)$ be such that $f = \sum _{i=0}^3 y_i a_i Y_i$ and, thus, $f_{\mathbf {u}} = \sum _{i=0}^3 y_i u_i a_i Y_i$. If $\mathcal {A}$ is $p$-normalized and $\mathcal {A}_{\mathbf {u}}$ is not, then $v_p(y_0) = 0$ but $v_p(y_i) \geq 1$ for $i \neq 1$. On the other hand, if $\mathcal {A}_{\mathbf {u}}$ is $p$-normalized but $\mathcal {A}$ is not, then $v_p(y_0) = - v_p(u_0) \leq -1$ but $v_p(y_i) \geq 1$ for $i \neq 1$. Both are impossible because $\sum _{i = 0}^3 a_i y_i^2 = 0$.

We require the following example.

Lemma 3.10 The surface $X:X_0^4 - 16X_1^4 + 7X_2^4 + 7X_3^4 = 0$ has local points everywhere and $\mathcal {A}$ is locally nowhere prolific.

Proof. It has local solutions because it has a rational point $[2:1:0:0] \in X(\mathbb {Q})$.

The quadric $Y$ has a rational point $[4:1:0:0]$ so we have a normalized $\mathcal {A} = (1 + 4({X_1^2}/{X_0^2}), -1)$. The left-hand side is always positive so ${\rm inv}_{\infty }(\mathcal {A}(\cdot )) = 0$. By Lemma 3.6 ${\rm inv}_{p}(\mathcal {A}(\cdot )) = 0$ for $p \neq 2,7$. Because $7 \equiv 3 \pmod {4}$ Lemma 3.7 implies that ${\rm inv}_{7}(\mathcal {A}(\cdot ))$ is constant. Lastly, by looking at primitive solutions $\mathbf {x} = [x_0: x_1: x_2: x_3] \in X(\mathbb {Q}_2)$ modulo $4$ we see that $x_0 \in \mathbb {Z}_2^{\times }$ so we have $1 + 4({x_1^2}/{x_0^2}) \equiv 1 \pmod {4}$. The invariant ${\rm inv}_{2}(\mathcal {A}(\mathbf {x}))$ is thus always equal to $(1 + 4({x_1^2}/{x_0^2}),-1)_2 = (1, -1)_2 = 0$.

4. Analytic preliminaries

We want to apply a generalization of the method of [Reference Heath-BrownHea93] to multiple different sums: to do this we prove a general theorem describing this generalization.

4.1 Frobenian multiplicative functions

We start by slightly generalizing the notion of frobenian multiplicative functions [Reference Loughran and MatthiesenLM19, Definition 2.1] to number fields. These are closely related to the frobenian functions of Serre [Reference SerreSer12, § 3.3].

Definition 4.1 Let $K$ be a number field and let $S$ be a finite set of primes of $K$. A multiplicative function $\rho : I_K \to \mathbb {C}$ is a $S$-frobenian multiplicative function if it satisfies the following properties.

  1. (i) There exists a $H \in \mathbb {N}$ such that $|\rho (\mathfrak {p}^k)| \leq H^k$ for all primes $\mathfrak {p}$ and $k \in \mathbb {N}$.

  2. (ii) For all $\varepsilon > 0$ there exists a constant $C_\varepsilon$ such that $|\rho (\mathfrak {n})| \leq C_{\varepsilon }\mathbf {N}(\mathfrak {n})^{\varepsilon }$

  3. (iii) The reduction of $\rho$ to the totally split primes is $S$-frobenian, i.e. there exists a Galois extension $L/K$ with Galois group $\Gamma$ such that $S$ contains all the primes which ramify in this extension and a class function $\varphi : \Gamma \to \mathbb {C}$ such that for all totally split primes $\mathfrak {p} \not \in S$,

    \[ \rho(\mathfrak{p}) = \varphi(\textrm{Frob}_{\mathfrak{p}}). \]

The mean of a frobenian multiplicative function is the mean of $\varphi$, i.e.

\[ m(\rho) = \frac{1}{|\Gamma|} \sum_{\gamma \in \Gamma} \varphi(\gamma). \]

Remark 4.2 We only require that the restriction to the set of totally split primes is frobenian because the non-totally split primes end up only affecting the constant in the asymptotics.

The mean is independent of $L$ since this is true for class functions.

Hecke characters with finite modulus are frobenian multiplicative by class field theory.

For any two functions $f,g: I_K \to \mathbb {C}$ let $f * g$ be their Dirichlet convolution.

Lemma 4.3 If $\rho _1, \rho _2$ are $S$-frobenian multiplicative functions defined by Galois extensions $L_1, L_2$, then their product $\rho _1 \rho _2$ and their convolution $\rho _1 * \rho _2$ are $S$-frobenian multiplicative functions defined by the Galois extension $L_1 L_2$. The implicit constants of $\rho _1 \rho _2$ and $\rho _1 * \rho _2$ are bounded by those of $\rho _1, \rho _2$. The mean of $\rho _1 * \rho _2$ is $m(\rho _1 * \rho _2) = m(\rho _1) + m(\rho _2)$.

Proof. We first show the required bounds, let $H_1, C_{1, \varepsilon }$ and $H_2, C_{2, \varepsilon }$ be the required constants for $\rho _1$ and $\rho _2$, respectively. Then by the divisor bound

\begin{align*} |\rho_1 \rho_2 (\mathfrak{p}^k)| &= |\rho_1(\mathfrak{p}^{k})| |\rho_2(\mathfrak{p}^{k})| \leq (H_1 H_2)^k, \\ |\rho_1 \rho_2 (\mathfrak{n})| &= |\rho_1(\mathfrak{n})| |\rho_2(\mathfrak{n})| \leq C_{1, \varepsilon} C_{2, \varepsilon} \mathbf{N}(\mathfrak{n})^{2 \varepsilon} , \\ |\rho_1 * \rho_2(\mathfrak{p}^k)| &\leq \sum_{i = 0}^k |\rho_1(\mathfrak{p}^{i})||\rho_2(\mathfrak{p}^{k-i})| \leq \sum_{i = 0}^k H_1^{i} H_2^{k - i} \leq (H_1 + H_2)^k , \\ |\rho_1 * \rho_2(\mathfrak{n})| &\leq \sum_{\mathfrak{d}| \mathfrak{n}} |\rho_1(\mathfrak{d})| \bigg|\rho_2\bigg(\frac{\mathfrak{n}}{\mathfrak{d}}\bigg)\bigg| \leq \tau(\mathfrak{n}) C_{1, \varepsilon} C_{2, \varepsilon} \mathbf{N}(\mathfrak{d})^{\varepsilon} \mathbf{N}\bigg(\frac{\mathfrak{n}}{\mathfrak{d}}\bigg)^{\varepsilon} \ll_{\varepsilon} C_{1, \varepsilon} C_{2, \varepsilon} \mathbf{N}(\mathfrak{n})^{2 \varepsilon}. \end{align*}

If $\mathfrak {p}$ is totally split, then $\rho _1 \rho _2(\mathfrak {p}) = \rho _1(\mathfrak {p}) \rho _2(\mathfrak {p}), \rho _1 * \rho _2(\mathfrak {p}) = \rho _1(\mathfrak {p}) + \rho _2(\mathfrak {p})$. The class function of $\rho _1 \rho _2$ and $\rho _1 * \rho _2$ is thus the product and the sum of the class functions, respectively, of $\rho _1$ and $\rho _2$.

We require the following lemma.

Lemma 4.4 Let $\rho$ be a frobenian multiplicative function defined by a Galois extension $L/K$ and $\chi$ a Hecke character with finite modulus of $K$, then $m(\chi \rho ) \neq 0$ only if $\chi$ corresponds via class field theory to a character of $\textrm {Gal}(L/K)$.

Proof. For $K = \mathbb {Q}$ this is the first part of the proof of [Reference Loughran and MatthiesenLM19, Lemma 2.4]. The argument generalizes to general number fields. To spell out the details, let $L'/L$ be an extension such that $L'/K$ is Galois and $\chi$ corresponds via class field theory to a character of $\Gamma := \textrm {Gal}(L'/K)$, which we also denote by $\chi$. Let $\varphi : \Gamma \to \mathbb {C}$ be the class function defining $\rho$. The subgroup $N := \textrm {Gal}(L'/ L) \subset \Gamma$ is normal and $\varphi$ is invariant under translation by $N$. Then

\[ |\Gamma|m(\chi \rho) = \sum_{g \in \Gamma} \chi(g) \varphi(g) = \sum_{gN \in \Gamma/N} \varphi(gN) \sum_{h \in gN} \chi(h) = \sum_{gN \in \Gamma/N} \varphi(gN) \sum_{h' \in N} \chi(gh'). \]

The sum $\sum _{h' \in N} \chi (gh')$ is $0$ unless $\chi$ is trivial on $N$. In that case $\chi$ is a character of $\Gamma /N = \textrm {Gal}(L/K)$ as desired.

It will be easier to deal with frobenian multiplicative functions over $\mathbb {Q}$.

Definition 4.5 For any finite field extension $K'/K$ and any frobenian multiplicative function $\rho$ over $K'$ we define the induced frobenian multiplicative function $\textrm {Ind}^{K}_{K'}(\rho )$ for all $\mathfrak {n} \in I_K$ by

\[ \textrm{Ind}^{K}_{K'}(\rho)(\mathfrak{n}) := \sum_{\substack{\mathfrak{N} \in I_{K'} \\ \mathbf{N}_{K'/K}(\mathfrak{N}) = \mathfrak{n}}} \rho(\mathfrak{N}). \]

It follows directly from the definition that

\[ \sum_{\mathbf{N}_{K'/\mathbb{Q}}(\mathfrak{N}) \leq x} \rho(\mathfrak{N}) = \sum_{\mathbf{N}_{K/\mathbb{Q}}(\mathfrak{n}) \leq x} \textrm{Ind}^{K}_{K'}(\rho)(\mathfrak{n}). \]

We can thus reduce the study of such sums for general frobenian multiplicative functions to those over $\mathbb {Q}$. Let us show that these are indeed frobenian multiplicative.

Lemma 4.6 If $\rho$ is a $S$-frobenian multiplicative function over $K'$, then $\textrm {Ind}^{K}_{K'}(\rho )$ is a $S'$-frobenian multiplicative function of the same mean with

\[ S' := \{\mathfrak{p} : \mathfrak{p} \mbox{ ramifies in } K' \mbox{ or lies under an element of } S \}. \]

The required constants of $\textrm {Ind}^{K}_{K'}(\rho )$ are bounded in terms of those of $\rho$ and $[K' : K]$.

Proof. The multiplicativity is clear. To show that it satisfies the desired inequalities we note that $\sum _{\mathbf {N}_{K'/K}(\mathfrak {N}) = \mathfrak {n}} 1$ is multiplicative, and is smaller than $\tau _{[K':K]}(\mathfrak {n})$ as can be seen by looking at prime powers. By the divisor bound we obtain

(4.1)\begin{align} |\textrm{Ind}^{K}_{K'}(\rho)(\mathfrak{p}^k)| &\leq \sum_{\substack{\mathfrak{N} \in I_{K'} \\ \mathbf{N}_{K'/K}(\mathfrak{N}) = \mathfrak{p}^k}} |\rho(\mathfrak{N})| \leq \tau_{[K':K]}(\mathfrak{p}^k) H^k \leq [K':K]^k H^k , \end{align}
(4.2)\begin{align} |\textrm{Ind}^{K}_{K'}(\rho)(\mathfrak{n})| &= \sum_{\substack{\mathfrak{N} \in I_{K'} \\ \mathbf{N}_{K'/K}(\mathfrak{N}) = \mathfrak{n}}} |\rho(\mathfrak{N})| \leq \tau_{[K':K]}(\mathfrak{n}) C_{\varepsilon} \mathbf{N}(\mathfrak{n})^{\varepsilon} \ll_{\varepsilon, [K': K]} C_{\varepsilon} \mathbf{N}(\mathfrak{n})^{2 \varepsilon}. \end{align}

It remains to show that the value of $\textrm {Ind}^{K}_{K'}(\rho )$ at the totally split primes is determined by a frobenian function of the same mean. By enlarging $L/K'$ we may assume that $L/K$ is Galois. Let $G := \textrm {Gal}(L/K)$, $H := \textrm {Gal}(L/K')$. Assume that $\rho$ has an associated class function $\varphi$. For any totally split prime ideal $\mathfrak {p} \not \in S'$ of $K$ we have

\[ \textrm{Ind}^{K}_{K'}(\rho)(\mathfrak{p}) = \sum_{\mathbf{N}_{K'/K}(\mathfrak{q}) = \mathfrak{p}} \rho(\mathfrak{q}) = \sum_{\mathbf{N}(\mathfrak{q}) = \mathbf{N}(\mathfrak{p})} \varphi(\textrm{Frob}_{\mathfrak{q}}). \]

Consider the set of prime ideals $\mathfrak {P} \in I_L$ lying over $\mathfrak {p}$. This set has a transitive $G$-action. Let us show that the prime ideal $\mathfrak {q} = \mathfrak {P} \cap \mathcal {O}_{K'}$ has the property $\mathbf {N}(\mathfrak {q}) = \mathbf {N}(\mathfrak {p})$ if and only if $\textrm {Frob}_{\mathfrak {P}/\mathfrak {p}} \in H$. If $\mathbf {N}(\mathfrak {q}) = \mathbf {N}(\mathfrak {p})$,3 then $\textrm {Frob}_{\mathfrak {P}/\mathfrak {p}} = \textrm {Frob}_{\mathfrak {P}/\mathfrak {q}} \in H$. On the other hand if $\textrm {Frob}_{\mathfrak {P}/\mathfrak {p}} \in H$ then $\textrm {Frob}_{\mathfrak {P}/\mathfrak {p}}$ keeps $K'$ invariant so fixes $\mathbb {F}_{\mathfrak {q}}$, hence $\mathbf {N}_{K'/K}(\mathfrak {q}) = \mathfrak {p}$.

Let $f := [\mathbb {F}_{\mathfrak {P}}: \mathbb {F}_{\mathfrak {p}}] = [\mathbb {F}_{\mathfrak {P}}: \mathbb {F}_{\mathfrak {q}}]$. This is independent of $\mathfrak {P}$ because $L/K$ is Galois. The set of prime ideals $\mathfrak {P} \in I_L$ lying over $\mathfrak {p}$, respectively $\mathfrak {q}$ has size $|G|/f$, respectively $|H|/f$. Thus

\[ \sum_{\mathbf{N}(\mathfrak{q}) = \mathbf{N}(\mathfrak{p})} \varphi(\textrm{Frob}_{\mathfrak{q}}) = \frac{f}{|H|} \sum_{\mathfrak{P}, \textrm{Frob}_{\mathfrak{P}/\mathfrak{p}} \in H} \varphi(\textrm{Frob}_{\mathfrak{P}/\mathfrak{q}}). \]

The $\textrm {Frob}_{\mathfrak {P}/\mathfrak {p}}$ are equidistributed over the conjugacy class $C_{\mathfrak {p}}$ of $\textrm {Frob}_{\mathfrak {p}}$, i.e. there are $|G|/f |C_{\mathfrak {p}}|$ primes $\mathfrak {P}$ with the same $\textrm {Frob}_{\mathfrak {P}/\mathfrak {p}}$. This implies that

\[ \textrm{Ind}^{K}_{K'}(\rho)(\mathfrak{p}) = \frac{|G|}{|H| |C_{\mathfrak{p}}|} \sum_{h \in H \cap C_{\mathfrak{p}}} \varphi(h) = \textrm{Ind}^G_H(\varphi)(\textrm{Frob}_{\mathfrak{p}}). \]

Where $\textrm {Ind}^G_H(\varphi )$ is the induced class function and the last equality is by definition. This class function has the same mean as $\varphi$.

That the class function of the induced frobenian multiplicative function is the induction of the class function of the original frobenian multiplicative function justifies the term induced.

Definition 4.7 The $L$-function of a frobenian multiplicative function $\rho$ for $\textrm {Re}(s) > 1$ is given by the Dirichlet series

\[ L(\rho, s) := \sum_{\mathfrak{n} \in I_K} \frac{\rho(\mathfrak{n})}{\mathbf{N}(\mathfrak{n})^s} = \prod_\mathfrak{p} \bigg(1 + \frac{\rho(\mathfrak{p})}{\mathbf{N}(\mathfrak{p})^s} + \frac{\rho(\mathfrak{p}^2)}{\mathbf{N}(\mathfrak{p})^{2s}} + \cdots\bigg). \]

This converges absolutely in this region because $|\rho (\mathfrak {n})| \leq C_{\varepsilon }\mathbf {N}(\mathfrak {n})^{\varepsilon }$ for all $\varepsilon > 0$.

The normalized $L$-function is defined as $L_n(\rho, s) = L(\rho, s)\zeta (s)^{-m(\rho )}$.

We remark that the $L$-function of the induction of a frobenian multiplicative function is equal to the $L$-function of the original frobenian multiplicative function.

The normalized $L$-function can be analytically continued slightly to the left of the line $\textrm {Re}(s) > 1$ as in [Reference Loughran and MatthiesenLM19, Proposition 2.7]. By controlling how far to the left it can be extended we will be able to derive a Selberg–Delange type asymptotic formula for $\sum _{\mathbf {N}(\mathfrak {n}) \leq x} \rho (\mathfrak {n})$. To control this we need the notion of the conductor of a frobenian multiplicative function.

Definition 4.8 The class function of a frobenian multiplicative function can be written as a sum of characters of $\Gamma$. We define the conductor $q(\rho )$ of a frobenian multiplicative function as the maximum taken over all characters $\chi$ whose coefficient is non-zero of the conductor of the Artin $L$-series $L(\chi, s)$. This is $\mathbf {N}_{K}(\mathfrak {f}_\chi ) d_K^{\chi (1)}$, where $\mathfrak {f}_\chi$ is the Artin conductor of $\chi$ and $d_K$ is the discriminant of $K$.

The conductor of a frobenian multiplicative function is bounded by the discriminant of $L$. We have the inequality $q(\textrm {Ind}_{K'}^K(\rho )) \leq q(\rho )$ because the Artin $L$-series of an induced character is equal to the $L$-series of the original character.

We start with the mean $0$ case.

Lemma 4.9 Let $A > 0$ and $\rho$ a frobenian multiplicative function of mean $0$ and conductor $q := q(\rho ) \leq (\log x)^{A}$. For all $x \geq 2$ there exists a $c > 0$ such that

\[ \sum_{\mathbf{N}(\mathfrak{n}) \leq x} \rho(\mathfrak{n}) \ll e^{O(|S|)} x e^{-c\sqrt{\log x}}. \]

The implicit constants depend on $n := [K: \mathbb {Q}], \Gamma, C_{\epsilon },H, A$, but for the rest they are independent of $\rho$.

Proof. As $\varphi$ is a class function we can write it as a sum over the irreducible characters of $\Gamma$:

\[ \varphi = \sum_{\chi} \lambda_\chi \chi. \]

We have the trivial inequality $|\lambda _\chi | \leq \chi (1)H = O_{\Gamma, H}(1)$.

This leads to the equality

\[ L(\rho, s) = G(s)\prod_{\chi} L(\chi, s)^{\lambda_\chi}, \]

where $L(\chi, s)$ is the Artin $L$-function of $\chi$ and

\[ G(s) = \prod_{\mathfrak{p}} \bigg(1 + \frac{\rho(\mathfrak{p})}{\mathbf{N}(\mathfrak{p})^s} + \frac{\rho(\mathfrak{p}^2)}{\mathbf{N}(\mathfrak{p})^{2s}} + \cdots\bigg) \prod_{\chi}\textrm{det}\bigg(1- \frac{\pi_{\chi}(\textrm{Frob}_{\mathfrak{p}})}{\mathbf{N}(\mathfrak{p})^s}\bigg)^{\lambda_\chi}. \]

Here $\pi _\chi$ is the representation corresponding to $\chi$.

Let us consider the local factors of $G(s)$, there are three cases. We use the standard notation $s = \sigma + it$.

  1. (i) If $\mathfrak {p} \in S$, then the contribution of the factors $\textrm {det}(1- {\pi _{\chi }(\textrm {Frob}_{\mathfrak {p}})}/{\mathbf {N}(\mathfrak {p})^s})$ is $e^{O_{n, \Gamma, H}(|S|)}$. For the other factors we have a bound for any $\varepsilon > 0$ of

    \[ 1 + \frac{|\rho(\mathfrak{p})|}{\mathbf{N}(\mathfrak{p})^{\sigma}} + \cdots \leq 1 + C_{\varepsilon} \frac{\mathbf{N}(\mathfrak{p})^{\varepsilon}}{\mathbf{N}(\mathfrak{p})^{\sigma}} + \cdots = 1 + C_{\varepsilon} \mathbf{N}(\mathfrak{p})^{\varepsilon - \sigma}(1 - \mathbf{N}(\mathfrak{p})^{\varepsilon - \sigma})^{-1}. \]

    The total contribution of these parts in the vertical strip $\sigma > \frac {3}{4}$ is thus $e^{ O_{n,\Gamma, H, C_{\varepsilon }}(|S|)}$.

  2. (ii) The contribution of the primes $\mathfrak {p}$ which are not totally split is absolutely convergent for $\sigma > \frac {1}{2}$ and in the vertical strip $\sigma > \frac {3}{4}$ is $O_{n, \Gamma, H, C_{\varepsilon }}(1)$.

  3. (iii) For the other primes $\mathfrak {p}$ we have, by definition, $\rho (\mathfrak {p}) = \sum _\chi \lambda _\chi \textrm {Tr}(\pi _{\chi }(\textrm {Frob}_{\mathfrak {p}}))$ so the associated factor is

    \[ 1 + O_{\Gamma, H}\bigg(\frac{|\rho(\mathfrak{p})| + |\rho(\mathfrak{p}^2)|}{\mathbf{N}(\mathfrak{p})^{2 \sigma}} + \frac{|\rho(\mathfrak{p})| + |\rho(\mathfrak{p}^2)| + |\rho(\mathfrak{p}^3)|}{\mathbf{N}(\mathfrak{p}^{3\sigma})} + \cdots \bigg). \]
    For all $\sigma > \varepsilon > 0$, this is bounded by
    \[ 1 + C_{\varepsilon} O_{\Gamma, H}\bigg(\frac{\mathbf{N}(\mathfrak{p})^{2 \varepsilon}}{\mathbf{N}(\mathfrak{p})^{2 \sigma}} + \cdots \bigg) = 1 + C_{\varepsilon}\mathbf{N}(\mathfrak{p})^{2\varepsilon - 2\sigma} O_{\Gamma}\big((1 - \mathbf{N}(\mathfrak{p})^{ \epsilon - \sigma})^{-1}\big). \]
    The total contribution of all of these primes is thus absolutely convergent for $\sigma > \frac {1}{2}$ and is $O_{n, \Gamma, H,C_{\varepsilon }}( 1)$ in the vertical strip $\sigma > \frac {3}{4}$.

Combining everything we see that $G(s) = e^{O_{n, \Gamma, H, C_{\varepsilon }}(|S|)}$ in the vertical strip $\sigma > \frac {3}{4}$.

By the Brauer-induction theorem [Reference SerreSer77, Theorem 18] we may assume that the $\chi$ are Hecke characters such that the $L$-series $L(\chi, s)$ has conductor at most $q$ at the cost of changing $\lambda _\chi$. However, $\lambda _\chi$ will still be $O_{\Gamma, H}(1)$.

Let $T = e^{\sqrt {\log x}}$. Choose $1 > 1 - \delta \geq \frac {3}{4}$ such that $s$ has at least a distance $\delta$ from every zero of $L(\chi, s)$ for all $s$ in the region

(4.3)\begin{equation} \sigma \geq 1 -\delta,\quad |t| \leq T. \end{equation}

We can choose $\delta$ such that $\delta \gg _A q^{-{1}/{2A}} \geq 1/\log T = 1/\sqrt {\log x}$ by the standard zero-free region of Hadamard and de la Vallée Poussin and Siegel's theorem for Hecke characters [Reference FogelsFog62]. We have the following inequality in the region (4.3):

(4.4)\begin{equation} | \log L(\chi, s)| \ll_A \log \log q t \ll_A \log \log T. \end{equation}

The case of Dirichlet characters is [Reference Montgomery and VaughanMV07, Theorem 11.4]. The proof for general Hecke characters is the same.

Exponentiating we obtain

\[ |L(\chi, s)^{\lambda_\chi}| = (\log T )^{O_{A, \Gamma, H}(1)} = (\log x )^{{O_{A, \Gamma, H}(1)}} \]

and, thus,

(4.5)\begin{equation} |F(s)| = e^{O_{n, \Gamma, H, C_{\epsilon}}(|S|)} (\log x)^{O_{A, \Gamma, H}(1)}. \end{equation}

We now apply Perron's formula [Reference TenenbaumTen95, Theorem II.3.3] for $\kappa = 1 + {1}/{\log x}$:

\[ \sum_{\mathbf{N}(\mathfrak{n}) \leq x} \rho(\mathfrak{n}) \log \bigg(\frac{x}{\mathbf{N}(\mathfrak{n})}\bigg) = \frac{1}{2 \pi i} \int_{\kappa - i \infty}^{\kappa + i\infty} F(s) x^s \frac{ds}{s^2}. \]

The function $F(s)$ has an analytic continuation to the region (4.3). We can, thus, change the contour to be the straight lines between $\kappa - i \infty$, $\kappa - iT$, $1 - {\delta }/{\log (1 + T)} - iT$, $1 - {\delta }/{\log (1 + T)} + iT$, $\kappa + iT$, $\kappa + i\infty$. The integrals over everything except the line between $1 - {\delta }/{\log (1 + T)} - iT$, $1 - {\delta }/{\log (1 + T)} + iT$ are

\[ \ll e^{O_{n, \Gamma, C_{\epsilon}}(|S|)} (\log x)^{O_{A, \Gamma, H}(1)} \frac{x^{\kappa}}{T} \ll_{A, \Gamma, H} e^{O_{n,\Gamma, C_{\epsilon}}(|S|)} x e^{- ({1}/{2}) \sqrt{ \log x}}. \]

The contribution of this last line segment is

\[ \ll e^{O_{n,\Gamma, H, C_{\epsilon}}(|S|)} (\log x)^{O_{A, H}(1)} \frac{x^{1 - \delta}}{\delta} \ll_{A, \Gamma, H} e^{O_{n,\Gamma, H, C_{\epsilon}}(|S|)} x e^{- ({1}/{2}) \sqrt{ \log x}}. \]

Equation (4.9) follows from partial summation.

To deal with the general case we require a uniform upper bound for absolute values of frobenian multiplicative functions.

Lemma 4.10 Let $\rho$ be a frobenian multiplicative function with implicit constants $C_{\varepsilon }$, then

\[ \sum_{\mathbf{N}(\mathfrak{n}) \leq x} |\rho(\mathfrak{n})| \ll_{[K : \mathbb{Q}]} C_{{1}/{6}} x (\log x)^{[K : \mathbb{Q}] H}. \]

Proof. Consider the induced frobenian multiplicative function $\textrm {Ind}^{\mathbb {Q}}_{K}(\rho )$. We may take its $H$ constant to be $H [K :\mathbb {Q}]$ by (4.1) and its $C_{{1}/{3}}$ constant may be taken to be $\ll C_{{1}/{6}}$. The proof thus reduces to proving that if $\rho$ is a frobenian multiplicative function over $\mathbb {Q}$, then

\[ \sum_{n \leq x} |\rho(n)| \ll C_{1/3} x (\log x)^{H}. \]

Write $n = m k^2$ with $m$ square-free. We claim that $|\rho (n)| \leq C_{1/3} \tau _H(m) k$. To prove this we reduce to the case when $n$ is a prime power. If $n = p$, then $|\rho (p)| \leq H = \tau _H(p)$. If $n = p^i$ for $i > 1$, then $|\rho (p^i)| \leq C_{1/3} p^{i/3} \leq p^{\lfloor i \rfloor /2} = \sqrt {k}$.

We thus get an upper bound

\[ \leq \sum_{m k^2 \leq x} |\rho(m k^2)| \leq C_{1/3} \sum_{m k^2 \leq x} \tau_H(m) k \leq C_{1/3 } x \sum_{m \leq x} \frac{\tau_H(m)}{m} \leq C_{1/3} x (\log x)^{H}. \]

Corollary 4.11 If $\rho$ is a frobenian multiplicative function of mean $0$, then for all $C, \varepsilon > 0$ and $x \geq 2$

\[ \sum_{\mathbf{N}(\mathfrak{n}) \leq x} \rho(\mathfrak{n}) \ll_{\varepsilon, C, [K: \mathbb{Q}], H} q(\rho)^{\varepsilon} e^{|S|}x (\log x)^{-C}. \]

Proof. If $q(\rho ) \leq (\log x)^{([K :\mathbb {Q}] H + C)\varepsilon ^{-1}}$, then this follows from Lemma 4.9. Otherwise it follows from Lemma 4.10.

We can now prove an asymptotic formula for frobenian multiplicative functions of arbitrary means. To state the result we need the complex-valued functions $\gamma _j(z)$ from [Reference TenenbaumTen95, II.5]. We then define

(4.6)\begin{equation} \lambda_k(\rho) := \frac{1}{\Gamma(m(\rho)-k)} \sum_{h + j = k} \frac{1}{h! j!} L_n^{(h)}(\rho,1) \gamma_j(m(\rho)). \end{equation}

In particular, $\lambda _0(\rho ) = L_n(\rho,1)$.

Theorem 4.12 Let $\rho$ a frobenian multiplicative function of conductor $q := q(\rho )$ and $N \in \mathbb {Z}_{\geq -1}$. For all $x \geq 2$

(4.7)\begin{equation} \sum_{\mathbf{N}(\mathfrak{n}) \leq x} \rho(\mathfrak{n}) = x (\log x)^{m(\rho) - 1} \bigg(\sum_{k = 0}^N \frac{\lambda_k(\rho)}{(\log x)^k} + e^{O(|S|)}O_{\varepsilon} (q^{\varepsilon}(\log x)^{-N- 1})\bigg). \end{equation}

The implicit constants depend on $N, [K: \mathbb {Q}], \Gamma, C_{\epsilon },H, m(\rho )$, but for the rest are independent of $\rho$.

Proof. By considering the induced frobenian multiplicative function we may assume that $K = \mathbb {Q}$, note that $|S|$ may increase at most by the amount of prime divisors $\omega (q)$ of the conductor $q$, but by the divisor bound $e^{O(\omega (q))} \leq \tau _{O(1)}(q) \ll _{\varepsilon } q^{\varepsilon }$.

The decomposition of $L$-series $L(\rho,s) = L_n(\rho, s) \zeta (s)^{m(\rho )}$ corresponds to writing $\rho$ as a convolution $\rho _n * \tau _{m(\rho )}$. Here $\rho _n = \rho * \tau _{-m(\rho )}$. The function $\tau _{-m(\rho )}$ is frobenian multiplicative of mean $-m(\rho )$ by the divisor bound, so $\rho _n$ is frobenian multiplicative of mean $0$ by Lemma 4.3.

By the hyperbola method we have

(4.8)\begin{align} \sum_{n \leq x} \rho(n) &= \sum_{n \leq \sqrt{x}} \rho_n(n) \sum_{m \leq {x}/{n}} \tau_{m(\rho)}(m) + \sum_{m \leq \sqrt{x}} \tau_{m(\rho)}(m) \sum_{n \leq {x}/{m}} \rho_n(n) \nonumber\\ &\quad -\sum_{n \leq \sqrt{x}} \rho_n(n) \sum_{m \leq \sqrt{x}} \tau_{m(\rho)}(m). \end{align}

It follows from the Selberg–Delange method [Reference TenenbaumTen95] that for any $M \in \mathbb {N}, y \geq 2$,

(4.9)\begin{align} \sum_{m \leq y} \tau_{m(\rho)}(m) &= y (\log y)^{m(\rho) - 1} \bigg(\sum_{j = 0}^M \frac{\gamma_j(m(\rho))}{\Gamma(m(\rho) - j) j! (\log y)^j}\bigg) \nonumber\\ &\quad + O_M((\log y)^{m(\rho) -M - 2}). \end{align}

From this, Corollary 4.11, Lemma 4.10 and partial summation, we deduce that the contribution of the latter two sums is

\[ \ll_{ B, \varepsilon, H, n} q^{\varepsilon} x (\log x)^{m(\rho) - 1 - B }. \]

This is part of the error term of (4.7) if we choose $B= N + 1$.

Filling (4.9) into the first sum of (4.8) gives an error term

\[ \ll x (\log x)^{m(\rho) - M - 2} \sum_{n \leq \sqrt{x}} \frac{|\rho_n(n)|}{n} \ll x (\log x)^{m(\rho) - M - 2} (\log x)^{H + |m(\rho)|}. \]

With the choice $M = H + 2\lceil |m(\rho )| \rceil + N$ this is part of the error term of (4.7).

The remaining part to deal with is the sum over the following terms:

\[ \frac{\gamma_j(m(\rho)) x}{\Gamma(m(\rho) - j) j!} \sum_{n \leq \sqrt{x}} \frac{\rho_n(n)}{n} \bigg(\log \frac{x}{n}\bigg)^{m(\rho) - 1 - j}. \]

We expand $(\log ({x}/{n}))^{m(\rho ) - 1 - j} = (\log x- \log n)^{m(\rho ) - 1 - j}$ to obtain

\[ \bigg(\log \frac{x}{n}\bigg)^{m(\rho) - 1 - j} = \sum_{h = 0}^{\infty} (-1)^h \binom{m(\rho) - 1 - j}{h}(\log x)^{m(\rho) - 1 - j - h} (\log n)^h. \]

Using the identity $\binom {m(\rho ) - 1 - j}{h} = {\Gamma (m(\rho ) - j)}/{\Gamma (m(\rho ) - j - h) h!}$, we have the sum over $j,h$ of

\[ \frac{\gamma_j(m(\rho)) x (\log x)^{m(\rho) - 1 - j - h}}{\Gamma(m(\rho) - j - h) j! h!} \sum_{n \leq \sqrt{x}} \frac{\rho_n(n)}{n} (\log n)^{h}. \]

The sum over $n$ converges by Corollary 4.11 and partial summation. The total series is thus equal to $L_n^{(h)}(\rho,1)$ by Abel's theorem. The resulting error bounds are

\begin{align*} \sum_{n > \sqrt{x}} \frac{\rho_n(n)}{n} (\log n)^{h} & \ll_{C, \varepsilon} (h + 1) e^{|S|} q^{\varepsilon} (\log x)^{-C + h}, \\ \sum_{n = 1}^{\infty} \frac{\rho_n(n)}{n} (\log n)^{h} & \ll_{C, \varepsilon} (h + 1)e^{|S|} q^{\varepsilon}. \end{align*}

We use the first bound if $j + h \leq N$ and the second bound if $j + h > N$. Taking $C \geq 2 N + 1$, using the bounds $\Gamma (m(\rho ) - j - h) \gg 1$ and [Reference TenenbaumTen95, II.5.(7)] proves the theorem.

Remark 4.13 In Corollary 4.11 and Theorem 4.12 we may count in $S$ only those $p \in S$ which do not divide the conductor. This is because for all $\varepsilon > 0$ we have $e^{O(\omega (q(\rho )))} \ll _{\varepsilon } q(\rho )^{\varepsilon }$ by the divisor bound.

4.2 Linked sums

We first introduce relevant definitions.

Definition 4.14 Let $I$ be an index set. A subset $U_{\mathbb {R}} \subset \mathbb {R}_{\geq 0}^{I}$ is downward-closed if $(x_i)_{i \in I} \in U_{\mathbb {R}}$ implies that $\{(y_i)_{i \in I} \in \mathbb {R}_{\geq 0}^{I}: y_i \leq x_i \mbox { for all } i \in I \} \subset U$.

Similarly, let $K_i$ be a number field for all $i \in I$. A subset $U \subset \prod _{i \in I} I_{K_i}$ is downward-closed if $(\mathfrak {a}_i)_{i \in I} \in U$ implies that $\{(\mathfrak {b}_i)_{i \in I} \in \prod _{i} I_{K_i}: \mathbf {N}_{K_i}(\mathfrak {b}_i) \leq \mathbf {N}_{K_i}(\mathfrak {a}_i) \mbox { for all } i \in I \} \subset U$.

The length of a downward-closed set $U_{\mathbb {R}} \subset \mathbb {R}_{\geq 0}^{I}$, respectively $U \subset \prod _i I_{K_i}$ is

\[ L_{U_{\mathbb{R}}} := \sup_{(x_i)_{i \in I} \in U_{\mathbb{R}}} \prod x_i,\qquad L_U := \sup_{(\mathfrak{a}_i)_{i \in I} \in U} \prod_{i \in I} \mathbf{N}_{K_i}(\mathfrak{a}_i). \]

We use the following notation.

Definition 4.15 Let $I$ be an index set, and $X_i$ a set for all $i \in I$. Let $U \subset \prod _{i \in I} X_i$, $J \subset I$ and $x_j \in X_j$ for all $j \in J$. We define the slice of $U$ with respect to the $x_j$ as

\[ U[(x_j)_{j \in J}] := U[(x_j \in X_j)_{j \in J}] := U \cap \bigg(\prod_{j \in J}\{ x_j \} \times \prod_{i \in I \setminus J} X_i \bigg). \]

Some basic properties of downward-closed sets are as follows.

Proposition 4.16 Let $U \subset \prod _{i \in I} I_{K_i}$ be a downward-closed set.

  1. (i) If $I$ is finite, then $L_U \ll _{K_i} \# U \ll _{K_i} L_U (\log L_U)^{|I| -1}$.

  2. (ii) Let $J \subset I$ and $\mathfrak {a}_j \in I_{K_j}$ for $j \in J$. The slice $U[ (\mathfrak {a}_j)_{j \in J}]$ is then also downward-closed. Similarly, if $U_{\mathbb {R}} \subset \mathbb {R}_{\geq 0}^I$ is downward-closed and $x_j \in \mathbb {R}_{\geq 0}$, then the slice $U[(x_j)_{j \in J}]$ is downward-closed.

Proof. The first statement is clear if $U$ is infinite.

If it is finite, choose $\mathfrak {a_i}$ such that $L_U = \prod _{i \in I} \mathbf {N}(\mathfrak {a_i})$. Recall that for any number field $K$ there exists a constant $C_K$ such that $\#\{ \mathfrak {n} \in I_{K}: \mathbf {N}(\mathfrak {n}) \leq T\} \sim C_K T$ as $T \to \infty$. Now note that $U$ contains the set $\{(\mathfrak {u}_i)_{i \in I} : \mathbf {N}(\mathfrak {u_i}) \leq \mathbf {N}(\mathfrak {a_i}) \}$ which has size $\gg \prod _{i \in I} \mathbf {N}(\mathfrak {a_i}) = L_U$. It is also contained in the set $\{(\mathfrak {u}_i)_{i \in I} : \prod _{i \in I}\mathbf {N}(\mathfrak {u_i}) \leq L_U \}$ which is of size $\ll L_U (\log L_U)^{|I| -1}$.

The latter statements follow directly from the definitions.

We require a lemma that positive sums over downward-closed sets can be approximated by integrals. Note that any downward-closed set $U_\mathbb {R} \subset \mathbb {R}_{\geq 0}^n$ defines a downward-closed set $U := \{ (\mathfrak {u}_i)_{i \in I} \in \prod _{i = 1}^n I_{K_i}: (\mathbf {N}(\mathfrak {u}_i))_{i \in I} \in U_\mathbb {R} \}$.

Lemma 4.17 For $i = 1, \ldots, n$ let $f_i: I_{K_i} \to \mathbb {R}_{\geq 0}$ be a non-negative arithmetic function and $g_i,h_i: \mathbb {R}_{\geq 0} \to \mathbb {R}_{\geq 0}$ non-negative real functions such that for all $x > 0$ we have $|\int _0^x g_i(x_i)\,d x_i - \sum _{\mathbf {N}(\mathfrak {u}_i) \leq x} f_i(\mathfrak {u}_i)| \leq \int _0^x h_i(x_i)\,d x_i$. Then

\[ \bigg|\int_{\mathbf{x} \in U_{\mathbb{R}}} \prod_{i = 1}^n g_i(x_i)\,dx_i - \sum_{\mathbf{u} \in U} \prod_{i = 1}^n f_i(\mathfrak{u}_i)\bigg| \leq \sum_{J \subsetneq \{1, \ldots, n\}} \int_{U_{\mathbb{R}}} \prod_{j \in J} g_j(x_j)\,dx_j \prod_{i \in \{1, \ldots, n\} \setminus J} h_i(x_i)\,d x_i. \]

Proof. We prove this via induction on $n$: if $n = 1$, then the result is equal to the assumption; otherwise, we can write

\[ \sum_{\mathbf{u} \in U} \prod_{i \in I} f_i(\mathfrak{u}_i) = \sum_{\mathfrak{u}_1} f_1(\mathfrak{u}_1) \sum_{\mathbf{u} \in U[\mathfrak{u}_1]} \prod_{i \in I} f_i(\mathfrak{u}_i). \]

By the induction hypothesis we can approximate the sum over the slice $U[\mathfrak {u}_1]$, because it is also downward-closed. The error term is

\begin{align*} & \sum_{\mathfrak{u}_1} f_1(\mathfrak{u}_1) \sum_{J \subsetneq \{2, \ldots, n\}} \int_{U_{\mathbb{R}}[\mathbf{N}(\mathfrak{u}_1)]} \prod_{j \in J} g_j(x_j) \,dx_j \prod_{i \in \{2, \ldots, n\} \setminus J} h_i(x_i) \,d x_i \nonumber\\ &\quad = \sum_{J \subsetneq \{2, \ldots, n\}} \int \sum_{\mathbf{N}(\mathfrak{u}_1) \in U_{\mathbb{R}}[x_i : i \in \{2, \ldots, n\}]} f_1(\mathfrak{u}_1) \prod_{j \in J} g_j(x_j) \,dx_j \prod_{i \in \{2, \ldots, n\} \setminus J} h_i(x_i)\,d x_i. \end{align*}

As $U_{\mathbb {R}}[x_i : i \in \{2, \ldots, n\}]$ is downward-closed, there exists an $x > 0$ such that it is equal to $[0, x]$ or $[0,x)$. Applying the inequality $\sum _{\mathbf {N}(\mathfrak {u}_1) \leq x} f_1(\mathfrak {u}_1) \leq \int _0^x g_1(x_1)\,d x_1 +\int _0^x h_1(x_1)\,dx_1$ we obtain

(4.10)\begin{equation} \sum_{ \{2, \ldots, n\} \neq J \subsetneq \{1, \ldots, n\}} \int_{U_{\mathbb{R}}} \prod_{j \in J} g_j(x_j)\,dx_j \prod_{i \in \{1, \ldots, n\} \setminus J} h_i(x_i)\,d x_i. \end{equation}

The main term is given by

\[ \sum_{\mathfrak{u}_1} f_1(\mathfrak{u}_1) \int_{U_{\mathbb{R}}[\mathbf{N}(\mathfrak{u}_1)]} \prod_{i = 2}^n g_i(x_i)\,dx_j = \int \bigg(\sum_{\mathbf{N}(\mathfrak{u}_1) \in U_{\mathbb{R}}[x_i : i \in \{2, \ldots, n\}]} f_1(\mathfrak{u}_1)\bigg) \prod_{i = 2}^s g_i(x_i)\, dx_i. \]

Using again that $U_{\mathbb {R}}[x_i : i \in \{2, \ldots, n\}]$ is downward-closed, we can approximate the sum over $\mathfrak {u}_1$ using the assumption. The resulting error term is

\[ \int_{U_{\mathbb{R}}} h_1(x_1)\,d x_1 \prod_{i = 2}^s g_i(x_i)\,dx_i, \]

which together with (4.10) gives the required error term. The main term is

\[ \int_{\mathbf{x} \in U_{\mathbb{R}}} \prod_{i = 1}^n g_i(x_i)\,dx_i \]

as desired.

Remark 4.18 An interesting special case is when $g_i = 0$. In this case the lemma says that if $|\sum _{\mathbf {N}(\mathfrak {u}_i) \leq x} f_i(\mathfrak {u}_i)| \leq \int _0^x h_i(x_i)$, then

\[ \bigg|\sum_{\mathbf{u} \in U} \prod_{i = 1}^n f_i(\mathfrak{u}_i)\bigg| \leq \int_{\mathbf{x} \in U_{\mathbb{R}}} \prod_{i = 1}^n h_i(x_i)\,dx_i. \]

We apply Lemma 4.17 to the case when $U$ is defined by inequalities of the type $\prod _i x_i^{\alpha _i} \leq T$ for $\alpha _i \geq 0$. In that case it can be reinterpreted as a form of the hyperbola method on toric varieties [Reference Pieropan and SchindlerPS20] which only works if the function $f(x_1, \ldots, x_n)$ can be written as a product of single variable function $f_i(x_i)$. However, unlike [Reference Pieropan and SchindlerPS20, Theorem 1.1], it can also be applied when $\sum _{x_i \leq T} f_i(x_i) \sim C(\log T)^M$.

The functions which will link variables are as follows.

Definition 4.19 Let $K, L$ be number fields, $A > 0$ and $q \in \mathbb {N}$, an $(A,q)$-oscillating bilinear character is a bilinear morphism $\alpha :I_K \times I_L \to \mathbb {C}$ such that for all $\mathfrak {a} \in I_K, \mathfrak {b} \in I_{L}$ the character $\alpha (\mathfrak {a}, \cdot )$, respectively $\alpha (\cdot, \mathfrak {b})$:

  1. (i) is either $0$ or a Hecke character of finite modulus $\mathfrak {m}$ such that $\mathbf {N}_L(\mathfrak {m}) \leq q \mathbf {N}_K(\mathfrak {a})^{A}$, respectively $\mathbf {N}_K(\mathfrak {m}) \leq q \mathbf {N}_L(\mathfrak {b})^{A}$;

  2. (ii) is non-principal if there exists a $p \nmid q$ such that $v_p(N_K(\mathfrak {a})) = 1$, respectively $v_p(N_L(\mathfrak {b})) = 1$.

The oscillating bilinear characters used in this paper are described in the following lemma. We use the notation $\bigl (\frac {\cdot }{\cdot }\bigr )$ for the Jacobi symbol of $\mathbb {Q}$ and $\bigl (\frac {\cdot }{\cdot }\bigr )_{4}$ for the $4$th power residue symbol of $\mathbb {Q}(i)$. Note that for all $n \in \mathbb {N}, \mathfrak {m} \in I_{\mathbb {Q}(i)}$ we have $\bigl (\frac {n}{\mathbf {N}(\mathfrak {m})}\bigr ) = \bigl (\frac {n^2}{\mathfrak {m}}\bigr )_{4}$. This can be seen since they are both bilinear and take the same value if $\mathfrak {m}$ is prime.

Lemma 4.20 The following functions $\alpha$ and their inverses are $(6,2^{24})$-oscillating bilinear characters. The functions take the value $0$ if the following description is not defined:

  1. (i) $(m,n) \in \mathbb {N}^2 \to \bigl (\frac {n}{m}\bigr )$;

  2. (ii) $(\mathfrak {m},n) \in I_{\mathbb {Q}(i)} \times \mathbb {N} \to \bigl (\frac {n}{\mathfrak {m}}\bigr )_{4}$;

  3. (iii) $(\mathfrak {m},n) \in I_{\mathbb {Q}(i)} \times \mathbb {N} \to \bigl (\frac {\mathbf {N}(\mathfrak {m})}{n}\bigr )$;

  4. (iv) $(\mathfrak {m},n) \in I_{\mathbb {Q}(i)} \times \mathbb {N} \to \bigl (\frac {n}{\mathbf {N}(\mathfrak {m})}\bigr ) = \bigl (\frac {n^2}{\mathfrak {m}}\bigr )_{4}$;

  5. (v) $(\mathfrak {m},n) \in I_{\mathbb {Q}(i)} \times \mathbb {N} \to \bigl (\frac {n}{\mathfrak {m}}\bigr )_{4} \bigl (\frac {\mathbf {N}(\mathfrak {m})}{n}\bigr )$;

  6. (vi) $(\mathfrak {m},\mathfrak {n}) \in I_{\mathbb {Q}(i)}^2 \to \bigl (\frac {\mathbf {N}(\mathfrak {n})}{\mathfrak {m}}\bigr )_{4}$;

  7. (vii) $(\mathfrak {m},\mathfrak {n}) \in I_{\mathbb {Q}(i)}^2 \to \bigl (\frac {\mathbf {N}(\mathfrak {n})}{\mathfrak {m}}\bigr )_{4} \bigl (\frac {\mathbf {N}(\mathfrak {m})}{\mathfrak {n}}\bigr )_{4}$;

  8. (viii) $(\mathfrak {m},\mathfrak {n}) \in I_{\mathbb {Q}(i)}^2 \to \bigl (\frac {\mathbf {N}(\mathfrak {n})}{\mathfrak {m}}\bigr )_{4} \bigl (\frac {\mathbf {N}(\mathfrak {m})}{\mathfrak {n}}\bigr )_{4}^{-1}$.

Even more is true. If $p$ is a prime such that $v_p(\mathbf {N}(\mathfrak {u})) = 1$ for $\mathfrak {u} \in \mathbb {N}, I_{\mathbb {Q}(i)}$, then $p$ divides the norm of the conductor of the Hecke characters $\alpha (\mathfrak {u}, \cdot ), \alpha (\cdot, \mathfrak {u})$.

Proof. That $\alpha$ is bilinear is clear. In what follows, we write $\mathfrak {u}$ for one of $n, m, \mathfrak {n}, \mathfrak {m}$ and $\mathfrak {v}$ for the other variable of $\alpha$ to keep the arguments uniform. If we fix the variable $\mathfrak {v}$, then $\alpha (\mathfrak {v}, \cdot )$, respectively $\alpha (\cdot, \mathfrak {v})$ is a product of a Hecke characters modulo $\mathfrak {v}$ or a Hecke character of the field extension $\mathbb {Q}(\sqrt {-1}, \sqrt [4]{\mathbf {N}(\mathfrak {v})})$ which has discriminant dividing $2^{24}\mathbf {N}(\mathfrak {v})^6$. The norm of the modulus of this Hecke character must divide this discriminant. It only remains to show the non-principality.

We first handle the case that $\mathfrak {u}$ is an odd totally split prime $\mathfrak {p}$. It then suffices to show that there exists a single $\mathfrak {v} \equiv 1 \pmod {2^{24}}$ such that the value of $\alpha (\mathfrak {p}, \mathfrak {v})$, respectively $\alpha (\mathfrak {v},\mathfrak {p})$ is not $1$. Except in the cases 5, 6 and 8 this can be done by choosing $\mathfrak {v}$ as a totally split prime and applying Chebotarev density.

Consider now the cases of 5, 6 and 8 where we have to choose an ideal $\mathfrak {v} \in I_{\mathbb {Z}[i]}$. Start by finding a prime $\mathfrak {q} \equiv 1 \pmod {2^{24}}$ such that $\bigl (\frac {\mathbf {N}(\mathfrak {p})}{\mathfrak {q}}\bigr )_{4}$ is equal to $i$ or $-i$, i.e. $\mathbf {N}(\mathfrak {p})$ is not a square modulo $\mathbf {N}(\mathfrak {q})$. This is possible because of Dirichlet's theorem. Then $\alpha (\mathfrak {p}, \mathfrak {v})$, respectively $\alpha (\mathfrak {v},\mathfrak {p})$ is not equal to $1$ for one of two choices $\mathfrak {v} = \mathfrak {q}, \overline {\mathfrak {q}}$.

The only remaining case is to find some $n \in \mathbb {N}$ such that $\bigl (\frac {n}{\mathfrak {p}}\bigr )_{4} \bigl (\frac {\mathbf {N}(\mathfrak {p})}{n}\bigr ) \neq 1$. For this choose $n \equiv 1 \pmod {2^{24}}$ which is a not a square modulo $\mathbf {N}(\mathfrak {p})$. By quadratic reciprocity

\[ \bigg(\frac{n}{\mathfrak{p}}\bigg)_{4} \bigg(\frac{\mathbf{N}(\mathfrak{p})}{n}\bigg) = \bigg(\frac{n}{\mathfrak{p}}\bigg)_{4} \bigg(\frac{n}{\mathbf{N}(\mathfrak{p})}\bigg) = \bigg(\frac{n^3}{\mathfrak{p}}\bigg)_{4} \neq 1. \]

To deduce the general case assume that $p$ is an odd prime such that $v_p(\mathbf {N}(\mathfrak {m})) = 1$. In this case we can write $\mathfrak {m} = \mathfrak {p} \mathfrak {m}'$ such that $\mathbf {N}(\mathfrak {p}) = p$ and $p, \mathbf {N}(\mathfrak {m}')$ are coprime. In this case $\alpha (\mathfrak {m}, \cdot ) = \alpha (\mathfrak {p}, \cdot ) \alpha (\mathfrak {m}', \cdot )$ is a product of a non-principal Hecke character such that $p$ divides the conductor and a Hecke character whose conductor is coprime to $p$. The conductor of such a product has to be divisible by $p$. The case $v_p(\mathbf {N}(\mathfrak {n})) = 1$ is completely analogous.

For these oscillating bilinear character we prove an analogue of [Reference Heath-BrownHea93, Lemma 4].

Proposition 4.21 Let $K, L$ be number fields. Let $a_{\mathfrak {m}}, b_{\mathfrak {n}}$ be complex numbers of absolute value at most $1$ where $\mathfrak {m}, \mathfrak {n}$ ranges over $I_K, I_L$, respectively. Let $U \subset I_K \times I_L$ be a downward-closed set and $\alpha$ an $(A, q)$-oscillating bilinear character. There exists a $\delta > 0$ depending only on $K, L, A$ such that for all $M,N \geq 2$,

\[ \mathop{\sum\sum}_{\substack{(\mathfrak{m}, \mathfrak{n}) \in U \\ \mathbf{N}(\mathfrak{m}) \leq M \\ \mathbf{N}( \mathfrak{n}) \leq N}} a_{\mathfrak{m}} b_{\mathfrak{n}} \alpha(\mathfrak{m}, \mathfrak{n}) \ll_{K, L, A} qMN(\log M N)^{\delta^{-1}}( N^{-\delta}+ M^{-\delta}). \]

The implicit constant are, in particular, independent of $U$.

Proof. Write $B(M, N)$ for the left-hand side of the expression in Proposition 4.21. We may assume by symmetry that $M \leq N$. Let $k$ be a natural number to be chosen later. By applying Hölder's inequality to the sum over $\mathfrak {m}$ and switching the order of summation we obtain

\begin{align*} |B(M,N)|^{k} &\ll M^{k-1} \sum_{\mathbf{N}(\mathfrak{m}) \leq M} \bigg\lvert\sum_{\substack{(\mathfrak{m}, \mathfrak{n}) \in U \\ \mathbf{N}(\mathfrak{n}) \leq N}} b_{\mathfrak{n}} \alpha(\mathfrak{m}, \mathfrak{n})\bigg\rvert^{k} \\ &\leq M^{k-1} \sum_{\mathbf{N}(\mathfrak{n}_1), \ldots, \mathbf{N}(\mathfrak{n}_k) \leq N} \bigg\lvert \sum_{\substack{(\mathfrak{m}, \mathfrak{n}_i) \in U \\ \mathbf{N}(\mathfrak{m}) \leq M}} c_{\mathfrak{m}} \alpha(\mathfrak{m}, \mathfrak{n}_1 \cdots \mathfrak{n}_k) \bigg\rvert, \end{align*}

where $|c_\mathfrak {m}| = 1$ is the complex number such that $c_\mathfrak {m} (\sum b_{\mathfrak {n}} \alpha (\mathfrak {m}, \mathfrak {n}))^{k}$ is a positive real.

At the cost of a factor of $k$ we may sum only over those tuples such that $\mathbf {N}(\mathfrak {n}_1), \ldots, \mathbf {N}(\mathfrak {n}_{k -1}) \leq \mathbf {N}(\mathfrak {n}_k) \leq N$. Note that the condition $(\mathfrak {m}, \mathfrak {n}_i) \in U$ is then implied by $(\mathfrak {m}, \mathfrak {n}_k) \in U$ because $U$ is downward-closed. Write $\mathfrak {l} = \mathfrak {n}_1 \cdots \mathfrak {n}_{k-1}$. The amount of tuples giving the same $\mathfrak {l}$ is

\[ \sum_{\substack{\mathbf{N}(\mathfrak{n}_1), \ldots, \mathbf{N}(\mathfrak{n}_{k-1}) \leq \mathbf{N}(\mathfrak{n}_k) \\ \mathfrak{n}_1 \cdots \mathfrak{n}_{k-1} = \mathfrak{l}}} 1 \leq \tau_{k-1}(\mathfrak{l}). \]

Applying Cauchy–Schwarz gives that $|B(M,N)|^{2k}$ is bounded by

\begin{align*} & k^2 M^{2k-2} \bigg(\sum_{\mathbf{N}(\mathfrak{n}_k) \leq N} \sum_{\mathbf{N}(\mathfrak{l}) \leq N^{k-1}} \tau_{k-1}(\mathfrak{l})^2\bigg) \bigg(\sum_{\mathbf{N}(\mathfrak{n}_k) \leq N} \sum_{\mathbf{N}(\mathfrak{m}_1), \mathbf{N}(\mathfrak{m}_2) \leq M} \bigg\lvert \sum_{\mathfrak{l}} \alpha(\mathfrak{m}_1 , \mathfrak{l}) \overline{\alpha(\mathfrak{m}_2 , \mathfrak{l})}\bigg\rvert\bigg) \\ &\quad \ll k^2 M^{2k - 2} N^{k + 1} (\log N)^{k^2 -2 k} \sum_{\mathbf{N}(\mathfrak{m}_1), \mathbf{N}(\mathfrak{m}_2) \leq M} \bigg\lvert \sum_{\mathbf{N}(\mathfrak{l}) \leq N^{k-1}} \alpha(\mathfrak{m}_1 , \mathfrak{l}) \overline{\alpha(\mathfrak{m}_2 , \mathfrak{l})}\bigg\rvert. \end{align*}

Every pair $\mathfrak {m}_1, \mathfrak {m}_2$ such that $v_p(\mathbf {N}_K(\mathfrak {m}_1 \mathfrak {m}_2)) \neq 1$ for all $p \nmid q$ has the property that their product can be written (non-uniquely) as $\mathfrak {m}_1 \mathfrak {m}_2 = \mathfrak {u} \mathfrak {v}^2 \mathfrak {w}^3$ with $\mathbf {N}(\mathfrak {u})| q$. The number of such pairs is by the divisor bound

\[ \ll \sum_{\substack{\mathbf{N}(\mathfrak{u} \mathfrak{v}^2 \mathfrak{w}^3) \leq M^2\\ \mathbf{N}(\mathfrak{u}) \mid q}} \tau(\mathfrak{u} \mathfrak{v}^2 \mathfrak{w}^3) \ll \sum_{\mathbf{N}(\mathfrak{u}) \mid q} \mathbf{N}(\mathfrak{u})^{{1}/{2}} \sum_{\mathbf{N}(\mathfrak{v}) \mathbf{N}(\mathfrak{w})^{{3}/{2}} \leq M} \mathbf{N}(\mathfrak{v} \mathfrak{w})^{{1}/{2}}\ll q M^{{3}/{2}}. \]

The total contribution of this case is, thus, $\ll k^2 q M^{2k-{1}/{2}} N^{2k} (\log N)^{k^2-2k}$.

We claim that in the other cases $\alpha (\mathfrak {m}_1 , \cdot ) \overline {\alpha (\mathfrak {m}_2, \cdot })$ is a non-principal character. As $\alpha$ is an oscillating bilinear character the norm of its modulus is at most $q \mathbf {N}_K(\mathfrak {m}_1 \mathfrak {m_2})^{A}$. The sum over $\mathfrak {l}$ is, in this case,

\[ \ll q^{{1}/{(d_L + 1})} \mathbf{N}_K(\mathfrak{m}_1 \mathfrak{m_2})^{{A}/{(d_L +1)}} (\log M)^{d_L}N^{(k-1)({(d_L-1)}/{(d_L + 1)})} \]

by Polya–Vinogradov for number fields [Reference LandauLan18].

The total contribution of this part is, thus,

\[ \ll k^2 q M^{2k+ {2 A}/{(d_L + 1)}} (\log M)^{d_L} N^{{( 2k d_L + 2)}/{(d_L + 1})} (\log N)^{k^2 -2k}. \]

Choosing $k$ sufficiently large such that $ {(2k d_L + 2 + 2A)}/{(d_L + 1)} \leq 2k - \frac {1}{2}$ and $d_L \leq 2k$, taking $\delta = {1}/{4k}$ and using that $M \leq N$ gives the desired bound.

It remains to prove the claim. We may assume that there exists a prime $p \nmid q$ such that $v_{p}(\mathbf {N}(\mathfrak {m}_1)) = 1$ and $p \nmid \mathbf {N}(\mathfrak {m}_2)$. There exists a natural number $d$ such that $\overline {\alpha (\mathfrak {m}_2 , \cdot )} = \alpha (\mathfrak {m}_2 , \cdot )^d$ because it is a Hecke character. We thus have to show that $\alpha (\mathfrak {m}_1 \mathfrak {m}_2^d, \cdot )$ is non-principal. However, $v_p(\mathbf {N}(\mathfrak {m}_1 \mathfrak {m}_2^d)) = 1$ so this is true by definition.

We now consider the following situation. Let $f: \prod _{i \in I} I_{K_i} \to \mathbb {C}$ be a function and $U \subset \prod _i I_{K_i}$ a downward-closed set. Our goal is to evaluate $\sum _{\mathbf {u} \in U} f(\mathbf {u})$ uniformly in $U$. We require the following additional data.

  1. (i) Two integers $q_{\textrm {osc}}, q_{\textrm {frob}}$ and a constant $A$.

  2. (ii) A relation $R \subset I \times I$ which is symmetric, i.e. $(i, j) \in R$ implies $(j, i) \in R$, and such that $(i, i) \not \in R$ for all $i \in I$. We say that $i$ is linked to $j$ or that the variable $\mathfrak {u}_i$ is linked to $\mathfrak {u}_j$ if $(i,j) \in R$.

  3. (iii) A set of indices $i$ which we call frobenian. We also say that the variable $\mathfrak {u}_i$ is frobenian.

These data have to satisfy the following properties.

  1. (i) If $i$ is linked to $j$, then for all tuples $\mathbf {u}_{\hat {i j}} := (\mathfrak {u}_k)_{k \in I, k \neq i,j}$ there are three functions

    \begin{align*} & a(\cdot ;{\mathbf{u}_{\hat{i j}}}): I_{K_i} \to \mathbb{C}: \mathfrak{u}_i \to a(\mathfrak{u}_i ; \mathbf{u}_{\hat{i j}}), \\ & b(\cdot ;{\mathbf{u}_{\hat{i j}}}): I_{K_j} \to \mathbb{C}: \mathfrak{u}_j \to b(\mathfrak{u}_j ; \mathbf{u}_{\hat{i j}}), \\ & \alpha(\cdot, \cdot ; \mathbf{u}_{\hat{i j}}): I_{K_i} \times I_{K_j} \to \mathbb{C}: (\mathfrak{u}_i, \mathfrak{u}_j) \to \alpha(\mathfrak{u}_i, \mathfrak{u}_j; \mathbf{u}_{\hat{i j}}) \end{align*}
    that exist such that $f((\mathfrak {u}_{\ell })_{\ell \in I}) = a(\mathfrak {u}_i ; \mathbf {u}_{\hat {i j}}) b(\mathfrak {u}_j ; \mathbf {u}_{\hat {i j}}) \alpha (\mathfrak {u}_i, \mathfrak {u}_j; \mathbf {u}_{\hat {i j}})$. We also require that the function $\alpha (\cdot, \cdot ; \mathbf {u}_{\hat {i j}})$ has to be an $(A, q_{\textrm {osc}})$-oscillating bilinear character and $|a(\mathfrak {u}_i ; \mathbf {u}_{\hat {i j}})|, |b(\mathfrak {u}_j ; \mathbf {u}_{\hat {i j}})| \leq 1$. Less formally, the only interaction between $\mathfrak {u}_i$ and $\mathfrak {u}_j$ in $f$ has to come from an oscillating bilinear function.
  2. (ii) If $i$ is frobenian, then for all tuples $\mathbf {u}_{\hat {i}} := (\mathfrak {u}_k)_{k \in I, k \neq i}$ let

    \[ S(\mathbf{u}_{\hat{i}}) = \{p \mid q_{\textrm{frob}} \} \cup \{ p \mid \mathbf{N}(\mathfrak{u}_k) : k \neq i \}. \]
    For all $\mathbf {u}_{\hat {i}}$ there has to exist a constant $|c(\mathbf {u}_{\hat {i}})| \leq 1$ and a $S(\mathbf {u}_{\hat {i}})$-frobenian multiplicative function $\rho (\cdot ;\mathbf {u}_{\hat {i}})$ such that $f((\mathfrak {u}_{\ell })_{\ell \in I}) = c(\mathbf {u}_{\hat {i}}) \rho (\mathfrak {u}_i ;\mathbf {u}_{\hat {i}})$. The frobenian multiplicative function $\rho (\cdot ;\mathbf {u}_{\hat {i}})$ has to have conductor at most $q_{\textrm {frob}} \prod _{j \mbox { linked to } i} \mathbf {N}(\mathfrak {u}_j)^A$ and its mean is $0$ if there exists an index $j$ linked to $i$ such that $\mathfrak {u}_j \neq 1$.

Given these data, we can prove that some of the variables may be taken to be small or even equal to $1$. This allows us to reduce the number of variables we have to consider.

Theorem 4.22 Let $f, U$ be as before. For all $\varepsilon, \delta, C_1 > 0$ there exists a $C_2 > 0$ such that

\[ \sum_{\mathbf{u} \in U} f(\mathbf{u}) = \sum_{\substack{\mathbf{u} \in U \\ \mathbf{u} \text { satisfies (4.11)}}} f(\mathbf{u}) + O\big(q_{\text{osc}}q_{\text{frob}}^{\varepsilon}L_U (\log L_U)^{-C_1}\big), \]

where the condition in the sum is

(4.11) \begin{equation} \begin{aligned} & \textrm{for every pair of linked variables } \mathfrak{u}_i, \mathfrak{u}_j\textrm{ at most one is } > (\log L_U)^{C_2}, \\ & \textrm{if } \mathfrak{u}_i \textrm{ is a frobenian variable and } \mathbf{N}(\mathfrak{u}_i) > e^{(\log L_U)^{\delta}}, \textrm{ then every } \\ & \textrm{variable linked to } \mathfrak{u}_i \textrm{ is equal to } 1. \end{aligned} \end{equation}

The implicit constants and $C_2$ depend only on $|I|, A, K_i, \varepsilon, \delta, C_1$.

Proof. Choose quantities $\frac {1}{2} = V_0 < 1 = V_1 < V_2 < \cdots < V_N = L_U$ such that $V_{n+1} \leq 2 V_{n}$ for all $n$ and such that there exists $m_1, m_2$ such that $V_{m_1} = (\log L_U)^{C_2}, V_{m_2} = e^{(\log L_U)^{\delta }}$. We can choose such quantities with $N = O(\log L_U)$. For every tuple $\mathbf {V} = (V_{n_i})_{i \in I}$ we can then consider the subsum

\[ S(\mathbf{V}) = \sum_{\substack{\mathbf{u} \in U \\ V_{n_i - 1} < \mathbf{N}(\mathfrak{u}_i) \leq V_{n_i}}} f(\mathbf{u}). \]

We bound $S(\mathbf {V})$ for the tuples $\mathbf {V}$ which fail (4.11). As there are at most $O((\log L_U)^{|I|})$ of such tuples it suffices to prove that for such $\mathbf {V}$

\[ S(\mathbf{V}) \ll_{\varepsilon} q_{\textrm{osc}}q_{\textrm{frob}}^{\varepsilon} L_U(\log L_U)^{-C_1 - |I|}. \]

Note that the sum is empty if $\prod _{i \in I} V_{n_i - 1} > L_U$ so $S(\mathbf {V}) = 0$ in that case. We may thus assume that $\prod _{i \in I} V_{n_i} \leq 2^{|I|} L_U$.

If there are two linked variables $\mathfrak {u}_j, \mathfrak {u}_k$ such that $V_{n_j - 1}, V_{n_k - 1} \geq (\log L_U)^{C_2}$, then by assumption we have

\[ S(\mathbf{V}) \leq \sum_{\substack{\mathbf{u}_{\hat{j k}} \\ V_{n_i - 1} < \mathbf{N}(\mathfrak{u}_i) \leq V_{n_i}}} \bigg|\mathop{\sum \sum}_{\substack{(\mathfrak{u}_j, \mathfrak{u}_k) \in U[\mathbf{u}_{\hat{j k}}]\\ V_{n_j - 1} < \mathbf{N}(\mathfrak{u}_j) \leq V_{n_j} \\ V_{n_k - 1} < \mathbf{N}(\mathfrak{u}_k) \leq V_{n_k} }} a(\mathfrak{u}_j ; \mathbf{u}_{\hat{j k}}) b(\mathfrak{u}_k ; \mathbf{u}_{\hat{j k}}) \alpha(\mathfrak{u}_j, \mathfrak{u}_k; \mathbf{u}_{\hat{j k}})\bigg|. \]

The slice $U[\mathbf {u}_{\hat {j k}}]$ is downward-closed, so by Proposition 4.21 there exists a $\delta ' > 0$ such that the inner sum is

\[ \ll_{A, K_j, K_k} V_{n_j} V_{n_k} \log(V_{n_j} V_{n_k})^{\delta'^{-1}} \big(V_{n_j}^{- \delta'} + V_{n_k}^{-\delta'}\big). \]

Hence,

\[ S(\mathbf{V}) \ll_{A, K_i} q_{\textrm{osc}}\prod_{i \in I} V_{n_i} \log(V_{n_j} V_{n_k})^{\delta'^{-1}} \big(V_{n_j}^{- \delta'} + V_{n_k}^{-\delta'} \big) \ll q_{\textrm{osc}} L_U \log( L_U)^{\delta'^{-1} -C_2 \delta'}. \]

The desired inequality is true as long as one takes $C_2$ sufficiently large.

If $\mathfrak {u}_j$ is a frobenian variable such that $V_{n_j -1 } \geq e^{(\log L_U)^{\delta }}$ and for at least one linked variable $\mathfrak {u}_{\ell }$ we have $V_{n_\ell } \neq 1$, then we can consider two cases. We may assume that $e^{(\log L_U)^{\delta }} \geq (\log L_U)^{C_2}$ by increasing the implied constant in the theorem.

Either there exists a variable $\mathfrak {u}_k$ linked to $\mathfrak {u}_j$ such that $V_{n_k - 1} \geq (\log L_U)^{C_2}$. In this case we are in the previous situation. In the other case, we have by assumption $V_{n_k} \leq (\log L_U)^{C_2}$ for all variables $\mathfrak {u}_k$ linked to $\mathfrak {u}_j$. We then have

\[ S(\mathbf{V}) \leq \sum_{\substack{\mathbf{u}_{\hat{j}} \\ V_{n_i - 1} < \mathbf{N}(\mathfrak{u}_i) \leq V_{n_i}}} |c(\mathbf{u}_{\hat{i}})| \bigg| \sum_{\substack{ \mathfrak{u}_j \in U[\mathfrak{u}_{\hat{j}}] \\ V_{n_j - 1} < \mathbf{N}(\mathfrak{u}_j) \leq V_{n_j}}} \rho(\mathfrak{u}_i ;\mathbf{u}_{\hat{i}})\bigg|. \]

By assumption $|c(\mathbf {u}_{\hat {i}})| \leq 1$ and $\rho (\mathfrak {u}_i ;\mathbf {u}_{\hat {i}})$ is a frobenian multiplicative function of conductor $\ll q_{\textrm {frob}} \prod _{k \mbox { linked to } j} V_{n_k}^{A}$. It has mean zero because $V_{n_{\ell }} \neq 1$ so there is a linked variable $\mathfrak {u}_{\ell } \neq 1$. Corollary 4.11 implies that for all $D > 0$,

\[ S(\mathbf{V}) \ll_{D, \varepsilon} \sum_{\substack{(\mathfrak{u}_i)_{i \neq j} \in U[\mathfrak{u}_j = 1] \\ V_{n_i - 1} < \mathbf{N}(\mathfrak{u}_i) \leq V_{n_i}}}e^{O(\prod_{i \neq j}\omega(\mathfrak{u_i}))} \bigg(q_{\textrm{frob}}\prod_{ \mathfrak{u}_k \mbox{ linked to } \mathfrak{u}_j} V_{n_k}^A\bigg)^{\varepsilon} V_{n_j}(\log V_{n_j})^{-D}. \]

Use upper bounds to get rid of the condition induced by $U$ and use that $e^{O(\omega (\mathfrak {u}_i))} \leq \tau _{O(1)}(\mathfrak {u}_i)$ to see that $\sum _{ V_{n_i - 1} < \mathbf {N}(\mathfrak {u}_i) \leq V_{n_i}}e^{O( \omega (\mathfrak {u}_i))} \ll L_U (\log L_U)^{O(1)}$ and, hence,

\begin{align*} S(\mathbf{V}) &\ll_{D, \varepsilon} q^{\varepsilon} V_{n_j} (\log L_U)^{-D \delta} (\log L_U)^{|I| A C_2 \varepsilon }\prod_{i \neq j} V_{n_i} (\log V_{n_i})^{O(1)} \\ &\ll q^{\varepsilon} L_U (\log L_U)^{|I| A C_2 \varepsilon + O(1) - D \delta}. \end{align*}

Picking $D$ sufficiently large gives the desired statement.

5. Counting

Let $S$ be the set of primes $p \leq 16{\,}897$, so all the statements in § 3.2 which are only true for significantly large enough primes will be true for primes $p \not \in S$. In this section we find asymptotic formulas for the size of the sets $N_{ = \square }^{\operatorname {Br}}(T), N_{ = -\square }^{\operatorname {Br}}(T), N_{ \neq \pm \square }^{\operatorname {Br}}(T)$ defined in the introduction.

We split these further into certain subsets. Let $\mathcal {P}_2(4)$ be the set of size $2$ subsets of $\{0,1,2,3\}$, this has $6$ elements. Consider the set

\[ \Phi := \{ \mathbf{A} = (A_0, A_1, A_2, A_3) \in \mathbb{N}^4 : p \mid A_0 A_1 A_2 A_3 \Rightarrow p \in S \mbox{ or } p^3 \mid A_0 A_1 A_2 A_3 \}. \]

For any $\mathbf {A} \in \Phi$ we define $m_{\mathbf {A}} := \textrm {rad}(A_0 A_1 A_2 A_3 \prod _{p \in S} p), \theta _{\mathbf {A}} := A_0 A_1 A_2 A_3$. Consider also a 4-tuple $\mathbf {M} = (M_0, M_1, M_2, M_3)$ of cosets of $(\mathbb {Z}/8 m_{\mathbf {A}} \mathbb {Z})^{\times 4}$ in $(\mathbb {Z}/8 m_{\mathbf {A}} \mathbb {Z})^{\times }$.

Remark 5.1 We may replace the set $\Phi$ by any subset and the rest of this section will still hold without any modification. For example, if we only want to count those $\mathbf {a}$ with $a_0, a_1, a_2, a_3$ coprime, then we can instead use the subset

\[ \Phi' := \{ \mathbf{A} = (A_0, A_1, A_2, A_3) \in \Phi : \gcd(A_0, A_1, A_2, A_3) = 1 \}. \]

We use the notation $\{k, \ell, m, n\} = \{0,1,2,3\}$ and $v_{k \ell } = v_{\ell k} := v_{\{k, \ell \}}$, similarly for any tuple indexed by $\mathcal {P}_2(4)$. We also write $t_{k \ell } = v_{k \ell } w_{k \ell }$, note that $v_{k \ell }$ and $w_{k \ell }$ are uniquely determined by $t_{k \ell }$.

We define $a_k := A_k u_k^2 \prod _{\{k, \ell \} \in \mathcal {P}_2(4)} v_{k \ell } w_{k \ell }$ and put $\mathbf {a} := (a_0, a_1, a_2, a_3)$. Note that due to the coprimality and square-freeness conditions the tuple $\mathbf {a}$ uniquely determines $\mathbf {A}, \mathbf {M}, \mathbf {u}, \mathbf {v}, \mathbf {w}$ and that $\theta _{\mathbf {A}} = \theta _{\mathbf {a}}= a_0 a_1 a_2 a_3 \bmod {\mathbb {Q}^{\times 2}}$. For each pair $\mathbf {A}, \mathbf {M}$ of such tuples we define

\[ N_{\mathbf{A}, \mathbf{M}}(T) := \left\{\begin{array}{@{}l@{}} (\mathbf{u}, \mathbf{v}, \mathbf{w}) \\ \mathbf{u} \in \mathbb{N}^4 \\ \mathbf{v}, \mathbf{w} \in \mathbb{N}^{\mathcal{P}_2(4)} \end{array} : \begin{array}{l@{}} \bigg|A_k u_k^2 \displaystyle\prod_{\{k, \ell\}} v_{k \ell} w_{k \ell}\bigg| \leq T, \\ \mu\bigg(m_\mathbf{A} \displaystyle\prod_{k} u_k \prod_{\ell \neq k} v_{k \ell} w_{k \ell}\bigg)^2 = 1, \\ p \mid v_{k \ell} \Rightarrow \theta_{\mathbf{A}} \in \mathbb{Q}_p^{\times 2}, p \mid w_{k \ell} \Rightarrow \theta_{\mathbf{A}} \not \in \mathbb{Q}_p^{\times 2}, \\ u_k^2 \displaystyle\prod_{\ell \neq k} v_{k \ell} w_{k \ell} \ (\mathrm{mod}\ 8 m_{\mathbf{A}}) \in M_k, \\ X_{\mathbf{a}}(\mathbb{Q}_p) \neq \emptyset \mbox{ for } p \nmid m_\mathbf{A} \end{array}\right\}. \]

The set $N_{\mathbf {A}, \mathbf {M}}(T)$ is only non-empty if $M_0 M_1 M_2 M_3$ is a square because $\prod _k u_k^2 \prod _{\{k, \ell \}} v_{k \ell }^2 w_{k \ell }^2 \in M_0 M_1 M_2 M_3$. Denote the set of pairs of tuples $\mathbf {A}, \mathbf {M}$ such that $M_0 M_1 M_2 M_3$ is a square by $\Psi$ and let $\Psi (T) := \{ (\mathbf {A}, \mathbf {M}) \in \Psi : |A_k| \leq T\}.$

We study the following subsets of $N_{\mathbf {A}, \mathbf {M}}(T)$:

\begin{align*} & N^{\operatorname{Br}}_{\mathbf{A}, \mathbf{M}}(T) := \{ (\mathbf{u}, \mathbf{v}, \mathbf{w}) \in N_{\mathbf{A}, \mathbf{M}}(T): X_{\mathbf{a}} \mbox{ has a BM obstruction} \}, \\ & N^{\mathcal{A}}_{\mathbf{A}, \mathbf{M}}(T) := \{ (\mathbf{u}, \mathbf{v}, \mathbf{w}) \in N_{\mathbf{A}, \mathbf{M}}(T): X_{\mathbf{a}} \mbox{ has a BM obstruction induced by } \mathcal{A} \}, \\ & N^{\textrm{loc}}_{\mathbf{A}, \mathbf{M}}(T) := \{ (\mathbf{u}, \mathbf{v}, \mathbf{w}) \in N_{\mathbf{A}, \mathbf{M}}(T) : p \mid w_{k \ell} \Rightarrow p \equiv 3\ {\rm (mod\ 4)} \}. \end{align*}

Let $M_{\mathbf {A}} := 11/16$ if $\theta _{\mathbf {A}} \in -\mathbb {Q}^{\times 2}$ and $M_{\mathbf {A}} := 15/32$ if $\theta _{\mathbf {A}} \not \in \pm \mathbb {Q}^{\times 2}$. It will later turn out that this is the mean of a certain frobenian multiplicative function.

5.1 Reductions

In this subsection we reduce finding an asymptotic formula for $N^{\operatorname {Br}}(T)$ to finding one for $N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$.

Lemma 5.2

\begin{align*} \# N_{= \square}^{\operatorname{Br}}(T) &= \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi(T) \\ \theta_{\mathbf{A}} \in \mathbb{Q}^{\times 2}}} \# N^{\operatorname{Br}}_{\mathbf{A}, \mathbf{M}}(T), \\ \# N_{= -\square}^{\operatorname{Br}}(T) &= \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi(T) \\ \theta_{\mathbf{A}} \in -\mathbb{Q}^{\times 2}}} \# N^{\operatorname{Br}}_{\mathbf{A}, \mathbf{M}}(T), \\ \# N_{\neq \pm \square}^{\operatorname{Br}}(T) &= \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi(T) \\ \theta_{\mathbf{A}} \not \in \pm \mathbb{Q}^{\times 2}}} \ \# N^{\operatorname{Br}}_{\mathbf{A}, \mathbf{M}}(T). \end{align*}

Proof. The sets in the right-hand side are disjoint and can be interpreted as a subset of the right-hand side because $\mathbf {A}, \mathbf {M}, \mathbf {u}, \mathbf {v}, \mathbf {w}$ are uniquely determined by $\mathbf {a}$. Elements counted by the right-hand side which are not counted in the left-hand side are tuples $\mathbf {a}$ such that there exists a prime $p \not \in S$ such that $v_p(a_0 a_1 a_2 a_3) = 1$ and such that $X_{\mathbf {a}}$ has a Brauer–Manin obstruction. These tuples cannot exist by Proposition 3.4.

The following lemma will allow us to bound $N_{= \square }^{\operatorname {Br}}(T)$.

Lemma 5.3 If $\theta _{\mathbf {A}} = A_0 A_1 A_2 A_3 \in \mathbb {Q}^{\times 2}$, then we have for $T > 2$

\[ \# N^{\operatorname{Br}}_{\mathbf{A}, \mathbf{M}}(T) \ll \frac{T^2 (\log T)^2}{|\theta_{\mathbf{A}}|^{{1}/{2}}}. \]

Proof. By Lemma 3.5 the left-hand side is bounded by the sum of the sizes of the following subsets of $N_{\mathbf {A}, \mathbf {M}}(T)$. The first is defined by $a_i a_j = 1,-1,2,-2 \in \mathbb {Q}^{\times }/ \mathbb {Q}^{\times 2}$ for $i \neq j$ and the second by $u_k = 1$ for all $k$. We may assume that $i =0, j =1$ when bounding the contribution of the first subset. Using that $t_{k \ell } = v_{k \ell } w_{k \ell }$ uniquely determines $v_{k \ell }, w_{k \ell }$ we see that the size of these subsets is bounded by a sum over $u_k$ and $t_{k \ell }$. If $a_0 a_1 = 1,-1,2,-2 \in \mathbb {Q}^{\times }/ \mathbb {Q}^{\times 2}$, then $t_{k \ell } = 1$ for ${k, \ell } \neq \{0, 1\}, \{2, 3\}$. We thus obtain an upper bound for the size of the first subset of

(5.1)\begin{equation} \ll \mathop{\sum_{t_{01}, t_{23}} \sum_{u_k}}_{|A_k| u_k^2 \prod_{\ell \neq k} t_{k \ell} \leq T} 1 \ll \frac{T^2}{|A_0 A_1 A_2 A_3|^{{1}/{2}}} \sum_{t_{01}, t_{23} \leq T} \frac{1}{t_{01} t_{23}} \ll \frac{T^2(\log T)^2}{|A_0 A_1 A_2 A_3|^{{1}/{2}}}. \end{equation}

The size of the second subset is bounded by the sum $\sum _{{A_k \prod _{\ell \neq k} t_{k \ell } \leq T}} 1$. By applying Lemma 4.17 this sum can be bounded by the integral

\[ 2^6\int_{t_{k \ell} \geq {1}/{2}}^{|A_k| \prod_{\ell \neq k} t_{k \ell} \leq T} \prod_{\{k, \ell\}} d t_{k \ell}. \]

Make the change of variables $x_k = t_{0 k}, y_k = \prod _{\ell \neq k} t_{k \ell }$ for $k = 1,2,3$. The inverse is given by $t_{k \ell } = \sqrt {y_k y_\ell x_m /y_m x_k x_\ell }$ for $\{k, \ell, m\} = \{1,2,3\}$. The associated Jacobian determinant is

\[ \begin{vmatrix} \dfrac{1}{2}\sqrt{\dfrac{y_2 x_3}{y_1 y_3 x_1 x_2}} & \dfrac{1}{2}\sqrt{\dfrac{y_1 x_3}{y_2 y_3 x_1 x_2}} & -\dfrac{1}{2}\sqrt{\dfrac{y_1 y_2 x_3}{y_3^3 x_1 x_2}} \\ \dfrac{1}{2}\sqrt{\dfrac{y_3 x_2}{y_1 y_2 x_1 x_3}} & -\dfrac{1}{2}\sqrt{\dfrac{y_1 y_3 x_2}{y_2^3 x_1 x_3}} & \dfrac{1}{2}\sqrt{\dfrac{y_1 x_2}{y_3 y_2 x_1 x_3 }} \\ -\dfrac{1}{2}\sqrt{\dfrac{y_2 y_3 x_1}{y_1^3 y_3 x_2 x_3}} & \dfrac{1}{2}\sqrt{\dfrac{y_3 x_1}{y_1 y_2 x_2 x_3}} & \dfrac{1}{2}\sqrt{\dfrac{y_2 x_1}{y_1 y_3 x_2 x_3}} \end{vmatrix} = - \dfrac{1}{2 \sqrt{ x_1 x_2 x_3 y_1 y_2 y_3}}. \]

Enlarging the integral, we obtain

(5.2)\begin{align} \sum_{{A_k \prod_{\ell \neq k} t_{k \ell} \leq T}} 1 &\ll \int_{x_k \geq {1}/{2}, y_k \geq {1}/{8}}^{x_1 x_2 x_3 \leq T/A_0, y_k \leq T/A_k} \frac{1}{\sqrt{x_1 x_2 x_3 y_1 y_2 y_3}} \prod_k d x_k \,dy_k \nonumber\\ &\ll \frac{T^2}{|A_0 A_1 A_2 A_3|^{{1}/{2}}}\int_{x_1, x_2 \geq {1}/{2}}^{x_1, x_2 \leq 4T} \frac{1}{x_1 x_2} d x_1 \,d x_2 \ll \frac{T^2( \log T)^2}{|\theta_{\mathbf{A}}|^{{1}/{2}}}. \end{align}

We require the following bounds, which are proven in § 5.3.

Lemma 5.4 If $\theta _{\mathbf {A}} \not \in \mathbb {Q}^{\times }$, then for all $\varepsilon, C > 0, T > 2$ and $i \neq j \in \{0,1,2,3\}$,

(5.3)\begin{align} \# \{ (\mathbf{u}, \mathbf{v}, \mathbf{w}) \in N^{\text{loc}}_{\mathbf{A}, \mathbf{M}}(T) : w_{k \ell} \leq (\log T^2)^C \text{ for all } \{k, \ell\} \} &\ll_{\varepsilon, C} \frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}, \end{align}
(5.4)\begin{align} \# \{ (\mathbf{u}, \mathbf{v}, \mathbf{w}) \in N^{\text{loc}}_{\mathbf{A}, \mathbf{M}}(T) : u_{k} \leq (\log T^2)^C \text{ for } k \neq i,j\} &\ll_{\varepsilon, C} \frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}. \end{align}

We can now show that most of the surfaces with a Brauer–Manin obstruction have a Brauer–Manin obstruction induced by $\mathcal {A}$.

Lemma 5.5 If $\theta _{\mathbf {A}} \not \in \mathbb {Q}^{\times 2}$, then we have for $T > 2$

\[ \# N^{\operatorname{Br}}_{\mathbf{A}, \mathbf{M}}(T) = \# N^{\mathcal{A}}_{\mathbf{A}, \mathbf{M}}(T) + O_{\varepsilon} \bigg(\frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \]

Proof. The difference between these two counts is bounded by the amount of surfaces in $N^{\operatorname {Br}}_{\mathbf {A}, \mathbf {M}}(T)$ such that $\mathcal {A}$ does not generate $\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q})$. This is bounded by the amount of these surfaces for which either $\mathcal {A} \in \operatorname {Br}(\mathbb {Q})$ or for which $\operatorname {Br}(X_{\mathbf {a}})/\operatorname {Br}(\mathbb {Q}) \not \cong \mathbb {Z}/ 2 \mathbb {Z}$. In the second case there exist $i \neq j$ such that $a_i a_j = 1,-1,2,-2 \in \mathbb {Q}^\times /\mathbb {Q}^{\times 2}$ by [Reference BrightBri02, Appendix A]. The contribution of this second case can thus be bounded as in the first part of the proof of Lemma 5.3 by (5.1).

A surface such that $\mathcal {A} \in \operatorname {Br}(\mathbb {Q})$ has by Proposition 3.7 the property that $w_{k \ell } = 1$ for all $\{k , \ell \}$. This contribution is thus bounded by the size of the subset of $N_{\mathbf {A}, \mathbf {M}}(T)$ defined by $w_{k \ell } = 1$. This is equal to the subset of $N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$ defined by $w_{k \ell } = 1$. This is a subset of the left-hand side of (5.3).

Note that for all places $v \in S \cup \{p \mid \theta _{\mathbf {A}}\} \cup \{ \infty \}$ the surfaces contained in $N_{\mathbf {A}, \mathbf {M}}(T)$ are equivalent over $\mathbb {Q}_v$. For $v \neq 2, \infty$ this follows from Hensel's lemma. For $v = 2$ it follows from the fact that $(\mathbb {Z}/2^{n+2}\mathbb {Z})^\times \cong \mathbb {Z}/2 \mathbb {Z} \times \mathbb {Z}/2^n \mathbb {Z}$ so the map $\mathbb {Z}_2^\times /\mathbb {Z}_2^{\times 4} \to (\mathbb {Z}/ 16 \mathbb {Z})^\times$ is an isomorphism. It is true for $v = \infty$ because $u_k, v_{k \ell }, w_{k \ell } > 0$. The following condition is thus either true for all these surfaces, or for none of them:

(5.5)\begin{equation} \mbox{for all places } v \in S \cup \{p \mid \theta_{\mathbf{A}}\} \cup \{ \infty\}, X_{\mathbf{a}}(\mathbb{Q}_v) \neq \emptyset, {\rm inv}_v(\mathcal{A}(\cdot)) \mbox{ is constant}. \end{equation}

If this condition fails, then there is certainly no Brauer–Manin obstruction induced by $\mathcal {A}$ so $N^{\mathcal {A}}_{\mathbf {A}, \mathbf {M}}(T) = 0$. Let $\eta (\mathbf {A}, \mathbf {M})$ be the indicator function of condition (5.5).

Lemma 5.6 If $\theta _{\mathbf {A}} \not \in \mathbb {Q}^{\times 2}$, then for $T> 2$

\[ \# N^{\mathcal{A}}_{\mathbf{A}, \mathbf{M}}(T) = \frac{\eta(\mathbf{A}, \mathbf{M})}{2} \# N^{\text{loc}}_{\mathbf{A}, \mathbf{M}}(T) + O_{\varepsilon} \bigg(\frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \]

Remark 5.7 This lemma can be interpreted as saying that $50\%$ percent of the surfaces for which $\mathcal {A}$ has locally everywhere a constant evaluation, i.e. is not prolific, have a Brauer–Manin obstruction induced by $\mathcal {A}$.

Proof. If $\eta (\mathbf {A}, \mathbf {M}) = 0$, then the left-hand side is $0$. We may thus assume that condition (5.5) is satisfied.

Note that $N^{\mathcal {A}}_{\mathbf {A}, \mathbf {M}}(T) \subset N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$ because of Proposition 3.8.

Let $\mathbf {u}, \mathbf {v}, \mathbf {w}$ be a triple of tuples in $N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$. The associated surface always has $\mathbb {Q}_p$-points for $p \mid m_\mathbf {A}$ by condition (5.5). That it is locally soluble at the other primes means by Lemma 3.3 that for all primes $p \mid v_{k \ell }w_{k \ell }$ one of $-({a_k}/{a_\ell }), -({a_m}/{a_n})$ is a fourth power in $\mathbb {Q}_p$. For a prime $p \mid w_{k \ell }$ at most one of these can be true because $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^2$. For every such tuple, there thus exists a unique factorization $w_{k \ell } = w^L_{k \ell } w^R_{k \ell }$ ($L$ stands for left and $R$ for right) where

\[ w^L_{k \ell} = \prod_{\substack{p \mid w_{k \ell} \\ -({a_k}/{ a_\ell}) \in \mathbb{Q}_p^{\times 4}}} p,\quad\quad w^R_{k \ell} = \prod_{\substack{p \mid w_{k \ell}\\ -({ a_m}/{ a_n}) \in \mathbb{Q}_p^{\times 4}}} p. \]

Note that for all primes $p \mid w^L_{k \ell }$ we have $p \equiv 3 \pmod {4}$, so since $-({a_k}/{a_\ell }) \in \mathbb {Z}_p$ the condition $-({a_k}/{ a_\ell }) \in \mathbb {Q}_p^{\times 4}$ is equivalent to $-({a_k}/{a_\ell }) \in \mathbb {Z}_p^{\times 2}$. Analogously for $w^R_{k\ell }$. In particular, this condition does not depend on $u_i$.

For every odd prime $p$ fix an injection $\psi _p:\mathbb {Z}_p^{\times }/ \mathbb {Z}_p^{\times 4} \to \mathbb {Z}/4 \mathbb {Z}$. Denote $\boldsymbol {\zeta } = (\zeta ^L, \zeta ^R, \xi ^L, \xi ^R) \in (\mathbb {Z}/4 \mathbb {Z})^4$. We can then uniquely factorize $v_{k \ell } = \prod _{\boldsymbol {\zeta } \in (\mathbb {Z}/4 \mathbb {Z})^4} v_{k \ell }^{\boldsymbol {\zeta }}$, where

\[ v_{k \ell}^{\boldsymbol{\zeta}} = \prod_{{\substack{p \mid v_{k \ell} \\ \psi_p(u_{k}/u_{\ell}) = \zeta^L,\quad \psi_p(u_{m}/u_{n}) = \zeta^R \\ \psi_p(t_{k m} /t_{\ell n}) = \xi^L,\quad \psi_p(t_{\ell m}/t_{k n}) = \xi^R }}}, \quad p. \]

Now note that because

(5.6)\begin{equation} -\frac{a_{k}}{a_{\ell}} = - \frac{A_k}{A_\ell} \frac{u_k^2}{u_\ell^2} \frac{t_{k m} t_{k n}}{t_{\ell m} t_{\ell n}}, \quad\quad -\frac{a_m}{a_n} = - \frac{A_m}{A_n} \frac{u_m^2}{u_n^2} \frac{t_{k m} t_{\ell m}}{t_{k n} t_{\ell n}} \end{equation}

there exists a subset $\Omega _p \subset (\mathbb {Z}/ 4 \mathbb {Z})^4$ for every prime $p$ with the following property. The surface $X_{\mathbf {a}}$ has a $\mathbb {Q}_p$-point for $p \mid v^{\boldsymbol {\zeta }}_{k \ell }$ if and only if $\boldsymbol {\zeta } \in \Omega _p$. The point of this factorization is that the conditions induced by $v_{k \ell }^{\boldsymbol {\zeta }}$ on $t_{i j}$ and on $u_i$ are independent of each other.

To summarize, we have constructed a bijection of $N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$ with the set $N^{\textrm {loc}'}_{\mathbf {A}, \mathbf {M}}(T)$

(5.7) \begin{equation} \left\{\begin{array}{@{}l} (\mathbf{u}, \mathbf{v}^{\boldsymbol{\zeta}}, \mathbf{w}^L, \mathbf{w}^R) \\ \mathbf{u} \in \mathbb{N}^4 \\ \mathbf{v}^{\boldsymbol{\zeta}} \in (\mathbb{N}^{\mathcal{P}_2(4)})^{(\mathbb{Z}/4\mathbb{Z})^4} \\ \mathbf{w}^L, \mathbf{w}^R \in \mathbb{N}^{\mathcal{P}_2(4)} \end{array} : \begin{array}{l@{}} \bigg|A_k u_k^2 \displaystyle\prod_{\{k, \ell\}} \prod_{\boldsymbol{\zeta} \in (\mathbb{Z}/4\mathbb{Z})^4} v^{\boldsymbol{\zeta}}_{k \ell} w^L_{k \ell} w^R_{k \ell}\bigg| \leq T, \\ \mu\bigg(m_\mathbf{A} \displaystyle\prod_{k} u_k \prod_{\ell \neq k} \prod_{\boldsymbol{\zeta} \in (\mathbb{Z}/4\mathbb{Z})^4} v^{\boldsymbol{\zeta}}_{k \ell} w^L_{k \ell} w^R_{k \ell}\bigg)^2 = 1, \\ p \mid v^{\boldsymbol{\zeta}}_{k \ell} \Rightarrow \theta_{\mathbf{A}} \in \mathbb{Q}_p^{\times 2}, \boldsymbol{\zeta} \in \Omega_p \mbox{ and } \psi_p\bigg(\dfrac{u_{k}}{u_{\ell}}\bigg) = \zeta^L, \\ \displaystyle\psi_p\bigg(\dfrac{u_{m}}{u_{n}}\bigg) = \zeta^R, \psi_p\bigg(\dfrac{t_{k m}}{t_{\ell n}}\bigg) = \xi^L, \psi_p\bigg(\dfrac{t_{\ell m}}{t_{k n}}\bigg) = \xi^R, \\ p \mid w^L_{k \ell} \Rightarrow \theta_{\mathbf{A}} \not \in \mathbb{Q}_p^{\times 2},\quad -\dfrac{a_k}{a_\ell} \in \mathbb{Z}_p^{\times 2}, \\ p \mid w^R_{k \ell} \Rightarrow \theta_{\mathbf{A}} \not \in \mathbb{Q}_p^{\times 2},\quad -\dfrac{a_m}{a_n} \in \mathbb{Z}_p^{\times 2}, \\ u_k^2 \displaystyle\prod_{\ell \neq k} \prod_{\boldsymbol{\zeta} \in (\mathbb{Z}/4\mathbb{Z})^4} v^{\boldsymbol{\zeta}}_{k \ell} w^L_{k \ell} w^R_{k \ell} \ (\mathrm{mod}\ 8 m_{\mathbf{A}}) \in M_k \end{array}\right\}. \end{equation}

For each triple of tuples $\mathbf {v}^{\boldsymbol {\zeta }}, \mathbf {w}^L, \mathbf {w}^R$ choose a tuple $\mathbf {u}$ counted by $N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$ and choose a normalized $\mathcal {\mathcal {A}}$ for the associated surface. The invariant map of $\mathcal {A}$ is constant at all places. For the places $v|m_{\mathbf {A}} \infty$ this is by condition (5.5). For primes $p \mid v_{k \ell }, w_{k \ell }$ this follows from Proposition 3.8. For other primes this is a consequence of Lemma 3.6.

Consider a different tuple $\mathbf {u}'$. We have an algebra $\mathcal {A}_{\mathbf {u}'^2 \mathbf {u}^{-2}}$ on the corresponding surface. This algebra is $p$-normalized for all primes $p \nmid m_{\mathbf {A}}, v_{k \ell }, w_{k \ell }^L, w_{k \ell }^R$ by Lemma 3.9 so for these primes ${\rm inv}_p(\mathcal {A}(\cdot )) = {\rm inv}_p(\mathcal {A}_{\mathbf {u}'^2 \mathbf {u}^{-2}}(\cdot )) = 0$ by Lemma 3.6. The same is true for $p \mid v_{k \ell }$ by Proposition 3.8. In addition, for places $p \mid m_{\mathbf {A}} \infty$ the formula (3.4) implies that ${\rm inv}_p(\mathcal {A}(\cdot )) = {\rm inv}_p(\mathcal {A}_{\mathbf {u}'^2 \mathbf {u}^{-2}}(\cdot ))$. For $p \mid w_{k \ell }^L, w_{k \ell }^R$ we again use Proposition 3.8 to find that

\begin{align*} & {\rm inv}_p(\mathcal{A}_{\mathbf{u}'^2 \mathbf{u}^{-2}}(\cdot)) = {\rm inv}_p(\mathcal{A}(\cdot)) + \frac{\bigl(\frac{u'_k u_k u'_\ell u_\ell}{p}\bigr) - 1}{4} \mbox{ if } p \mid w_{k \ell}^L,\\ & {\rm inv}_p(\mathcal{A}_{\mathbf{u}'^2 \mathbf{u}^{-2}}(\cdot)) = {\rm inv}_p(\mathcal{A}(\cdot)) + \frac{\bigl(\frac{u'_m u_m u'_n u_n}{p}\bigr) - 1}{4} \mbox{ if } p \mid w_{k \ell}^R. \end{align*}

Summing this over all $p$ we find that

\[ \sum_p {\rm inv}_p(\mathcal{A}_{\mathbf{u}'^2 \mathbf{u}^{-2}}(\cdot)) = \sum_p {\rm inv}_p(\mathcal{A}(\cdot)) + \frac{\prod_{\{k ,\ell\}}\bigl(\frac{u'_k u_k u'_\ell u_\ell}{w_{k \ell}^L}\bigr) \bigl(\frac{u'_m u_m u'_n u_n}{w_{k \ell}^R}\bigr)- 1}{4}. \]

There thus exists a $\delta := \delta (\mathbf {A}, \mathbf {M}, \mathbf {v}^{\boldsymbol {\zeta }}, \mathbf {w}^L, \mathbf {w}^R) \in \{1,-1\}$, such that the surface $X_{\mathbf {a}}$ has a Brauer–Manin obstruction if and only if

\[ \prod_{\{k, \ell\} \in \mathcal{P}_2(4)} \bigg(\frac{u_k u_\ell}{w^L_{k \ell}}\bigg) \bigg(\frac{u_m u_n}{w^R_{k \ell}}\bigg) = \delta. \]

The indicator function of having a Brauer–Manin obstruction is then given by

\[ \frac{1}{2} - \frac{\delta}{2} \prod_{\{k, \ell\} \in \mathcal{P}_2(4)} \bigg(\frac{u_k u_\ell}{w^L_{k \ell}}\bigg) \bigg(\frac{u_m u_n}{w^L_{k \ell}}\bigg). \]

Hence, to prove the lemma it suffices to bound

\[ \mathop{\sum_{\mathbf{v}^{\zeta}} \sum_{\mathbf{w}^L} \sum_{\mathbf{w}^R} \sum_{\mathbf{u}}}_{(\mathbf{u}, \mathbf{v}^{\zeta}, \mathbf{w}^L, \mathbf{w}^R) \in N^{\textrm{loc}'}_{\mathbf{A}, \mathbf{M}}(T)} \delta \prod_{\{k, \ell \}} \bigg(\frac{u_k u_\ell}{w^L_{k \ell}}\bigg) \bigg(\frac{u_m u_n}{w^R_{k \ell}}\bigg). \]

For each possible $4$-tuple $\mathbf {U}$ of cosets of $(\mathbb {Z}/8 m_{\mathbf {A}} \mathbb {Z})^{\times 4}$ in $(\mathbb {Z}/8 m_{\mathbf {A}} \mathbb {Z})^{\times }$ consider the subsum

\[ \mathop{\sum_{\mathbf{v}^{\zeta}} \sum_{\mathbf{w}^L} \sum_{\mathbf{w}^R} \sum_{\mathbf{u}}}_{\substack{(\mathbf{u}, \mathbf{v}^{\zeta}, \mathbf{w}^L, \mathbf{w}^R) \in N^{\textrm{loc}'}_{\mathbf{A}, \mathbf{M}}(T) \\ \mathbf{u} \in \mathbf{U}}} \delta \prod_{\{k, \ell \}} \bigg(\frac{u_k u_\ell}{w^L_{k \ell}}\bigg) \bigg(\frac{u_m u_n}{w^R_{k \ell}}\bigg). \]

We will bound this using Theorem 4.22 with $q_{\textrm {osc}} = O(1), A = 6$. As can be seen in the definition (5.7) the only conditions induced by $N^{\textrm {loc}'}_{\mathbf {A}, \mathbf {M}}(T)$ and $\delta$ for which $\mathbf {w}^L, \mathbf {w}^R$ interact with $\mathbf {u}$ is the coprimality condition and the modulo condition coming from $\mathbf {M}$. The coprimality condition is also contained in the Jacobi symbol and the interaction coming from $\mathbf {M}$ has been removed by the presence of $\mathbf {U}$. The variable $w_{k \ell }^L$, respectively $w_{k \ell }^R$, can thus be linked to $u_k$ and $u_\ell$, respectively to $u_m$ and $u_n$, by Lemma 4.20.

The downward-closed set we are summing over is defined by the inequalities

\[ |A_k|u_k^2 \prod_{\ell \neq k} \bigg(\prod_{\boldsymbol{\zeta}} v_{k \ell}^{\boldsymbol{\zeta}}\bigg) w^L_{k \ell} w^R_{k \ell} \leq T. \]

Taking the product of these inequalities shows that $L_U \leq |\theta _{\mathbf {A}}|^{-({1}/{2})} T^2$ which gives an error term in Theorem 4.22 of $\ll |\theta _{\mathbf {A}}|^{-({1}/{2})} T^2$. To obtain the total error term we sum over the possible $\mathbf {U}$, there are $\ll \tau _4(\theta _{\mathbf {A}})^4 \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$ of them due to the divisor bound.

After applying the theorem we can get rid of $\delta$ and the Jacobi symbols with a trivial bound. We can then sum back over the possible $\mathbf {U}$ and rewrite $v_{k \ell } = \prod _{\boldsymbol {\zeta } \in (\mathbb {Z}/4 \mathbb {Z})^4} v_{k \ell }^{\boldsymbol {\zeta }}$. What remains is a sum which counts the elements of a certain subset of $N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$. This subset is contained in the union of the two sets which are defined respectively by the following two conditions, the $C$ comes from the application of Theorem 4.22.

  1. (i) The first is when $u_k \leq (\log |\theta _{\mathbf {A}}|^{-({1}/{2})} T^2)^C$ for at least two $k$. In this case we can rewrite $w_{k \ell } = w^L_{k \ell } w^R_{k \ell }$ and get a subset of the left-hand side of (5.4).

  2. (ii) The second is when $u_k > (\log |\theta _{\mathbf {A}}|^{-({1}/{2})} T^2)^C$ for at least three $k$. We must then have $w^L_{k \ell },w^R_{k \ell } \leq (\log |\theta _{\mathbf {A}}|^{-({1}/{2})} T^2)^C$ for all $\{k, \ell \}$ by Theorem 4.22. We may rewrite $w_{k \ell } = w^L_{k \ell } w^R_{k \ell }$ and attempt to bound the size of the larger set defined by $w_{k \ell } \leq (\log T^2)^{2C}$. This set is exactly the left-hand side of (5.3).

In the following sections, we prove the following lemma.

Lemma 5.8 If $\theta _{\mathbf {A}} \not \in \mathbb {Q}^{\times 2}$, then there exists a constant $0 < Q_{\mathbf {A}} \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$ for all $\varepsilon > 0$ such that for $T > 2$ we have

\[ \# N^{\text{loc}}_{\mathbf{A}, \mathbf{M}}(T) = \frac{T^2 (\log T)^{6 M_{\mathbf{A}}} }{|\theta_{\mathbf{A}}|^{{1}/{2}}} (Q_{\mathbf{A}} + O_{ \varepsilon}(|\theta_{\mathbf{A}}|^{\varepsilon}(\log T)^{- M_{\mathbf{A}}})). \]

We can now prove the main result.

Proof of Theorem 1.2 Lemmas 5.2 and 5.3 together imply that

\[ \# N_{= \square}^{\operatorname{Br}}(T) = \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi(T) \\ \theta_{\mathbf{A}} \in \mathbb{Q}^{\times 2}}} O\bigg(\frac{T^2 (\log T)^2}{|\theta_{\mathbf{A}}|^{{1}/{2}}}\bigg). \]

Combining Lemmas 5.2, 5.5, 5.6 and 5.8 and recalling that $M_{\mathbf {A}}$ was defined to be $\frac {11}{16}$ if $\theta _{\mathbf {A}} \in - \mathbb {Q}^{\times 2}$ and $\frac {15}{32}$ if $\theta _{\mathbf {A}} \not \in \pm \mathbb {Q}^{\times 2}$ shows that

\begin{align*} \# N_{= -\square}^{\operatorname{Br}}(T) &= T^2 (\log T)^{{33}/{8}} \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi(T), \\ \theta_{\mathbf{A}} \in -\mathbb{Q}^{\times 2}}} \frac{\eta(\mathbf{A}, \mathbf{M})}{2 |\theta_{\mathbf{A}}|^{{1}/{2}}} \bigg(Q_{\mathbf{A}} + O_{\varepsilon}\bigg(\frac{|\theta_{\mathbf{A}}|^{\varepsilon}}{(\log T)^{{11}/{16}}}\bigg)\bigg), \\ \# N_{\neq \pm \square}^{\operatorname{Br}}(T) &= T^2(\log T)^{{45}/{16}} \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi(T) \\ \theta_{\mathbf{A}} \not \in \pm \mathbb{Q}^{\times 2}}} \frac{\eta(\mathbf{A}, \mathbf{M})}{2 |\theta_{\mathbf{A}}|^{{1}/{2}}} \bigg(Q_{\mathbf{A}} + O_{\varepsilon}\bigg(\frac{|\theta_{\mathbf{A}}|^{\varepsilon}}{(\log T)^{{15}/{32}}}\bigg)\bigg). \end{align*}

To prove Theorem 1.2 it remains to show that the three sums converge, and to a non-zero constant for the latter two. To see that the constant is non-zero it suffices to give a single example of a surface in which $\mathcal {A}$ is locally not prolific at all places, for example as it induces a Brauer–Manin obstruction, because in that case the corresponding $\eta (\mathbf {A}, \mathbf {M})$ is non-zero. For $\theta _{\mathbf {a}}\not \in \pm \mathbb {Q}^{\times 2}$ such an example is [Reference BrightBri11, Proposition 3.3] and for $\theta _{\mathbf {a}}\in - \mathbb {Q}^{\times 2}$ Lemma 3.10 provides the required example.

We show that the sums converge. By taking upper bounds it suffices to bound

\[ \sum_{\substack{(\mathbf{A}, \mathbf{M}) \in \Psi \\ |\theta_{\mathbf{A}}| \geq T}} |\theta_{\mathbf{A}}|^{-({1}/{2}) + \varepsilon} \]

for sufficiently small $\varepsilon$. The amount of $\mathbf {M}$ associated to each $\mathbf {A}$ is $\ll \tau _4(\theta _{\mathbf {A}})^4 \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$ by the divisor bound, so it suffices to bound

\[ \sum_{\substack{\mathbf{A} \in \Phi \\ |\theta_{\mathbf{A}}| > T}} |\theta_{\mathbf{A}}|^{-({1}/{2}) + 2\varepsilon}. \]

All the tuples $\mathbf {A} \in \Phi$ are such that $\theta _{\mathbf {A}}$ can be written (non-uniquely) as $g h_3^3 h_4^4 h_5^5$ such that for all $p \mid g$ we have $p \in S$ and $v_p(g) \leq 2$. There are thus only a finite number of possibilities for $g$. For every $\theta$ there exist only $\tau _4(\theta ) \ll _{\varepsilon } |\theta |^{\varepsilon }$ tuples $\mathbf {A} \in \Phi$ such that $\theta _{\mathbf {A}} = \theta$, so we obtain an upper bound

\begin{align*} \ll \sum_{\substack{g, h_3, h_4, h_5 \\ |g h_3^3 h_4^4 h_5^5| > T}} |g h_3^3 h_4^4 h_5^5|^{-({1}/{2}) + 3\varepsilon} &\ll_{\varepsilon} T^{-({1}/{6}) + 3\varepsilon} \sum_{h_4, h_5} h_4^{-2 + 12 \varepsilon + {2}/{3} - 12 \varepsilon} h_5^{-({5}/{2}) + 15 \varepsilon + {5}/{6} - 15 \varepsilon} \\ &\ll T^{-({1}/{6}) + 3\varepsilon}. \end{align*}

By taking $\varepsilon < 1/18$ we are done.

5.2 A sum of linked variables

The goal of this subsection is to simplify $\#N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$. For simplicity, we assume that $\theta _{\mathbf {A}} \not \in \mathbb {Q}^{\times 2}$. This also allows us to deal with the error terms in the lemmas of the previous subsection. For this, fix a downward-closed subset $U$ of the set defined by the inequalities:

(5.8)\begin{equation} |A_k| u_k^2 \prod_{\ell \neq k} v_{k \ell} w_{k \ell} \leq T. \end{equation}

By taking the product of the inequalities we see that $L_U \leq {T^2}/{|A_0 A_1 A_2 A_3|^{{1}/{2}}}$. Define $N^{\textrm {loc}, U}_{\mathbf {A}, \mathbf {M}}(T) := N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T) \cap U$. For a first reading it will be simplest to assume that $U$ is defined by (5.8) and thus that $N^{\textrm {loc}, U}_{\mathbf {A}, \mathbf {M}}(T) = N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$.

Let

\[ \Gamma_{\mathbf{A}} := (\mathbb{Z}/ 8 m_\mathbf{A})^{\times}/(\mathbb{Z}/ 8 m_\mathbf{A})^{\times 4}. \]

The condition induced by $\mathbf {M}$ is then encoded in the standard way as a sum of characters

\[ \frac{1}{|\Gamma_{\mathbf{A}}|^4}\prod_{k}\sum_{\chi_k \in \Gamma_{\mathbf{A}}^{\vee}} \overline{\chi}_k(M_k) \chi_k\bigg(u_k^2\prod_{\ell \neq k} v_{k \ell} w_{k \ell}\bigg). \]

Write $\boldsymbol {\chi } = (\chi _0, \chi _1, \chi _2, \chi _3)$. Substituting this into the main sum and switching the order of summation shows that

(5.9)\begin{equation} \#N^{\textrm{loc}, U}_{\mathbf{A}, \mathbf{M}}(T) = \frac{1}{|\Gamma_{\mathbf{A}}|^4} \sum_{\boldsymbol{\chi} \in (\Gamma_{\mathbf{A}}^{\vee})^4} \prod_{k} \overline{\chi}_k(M_k) S_{U}(\boldsymbol{\chi}, T), \end{equation}

where

(5.10)\begin{equation} S^{U}_{\mathbf{A}}(\boldsymbol{\chi}, T) := \mathop{\sum \sum \sum}_{(\mathbf{u}, \mathbf{v}, \mathbf{w}) \in \sqcup_{\mathbf{N}}N^{\textrm{loc}, U}_{\mathbf{A}, \mathbf{N}}(T)} \chi_k\bigg(u_k^2\prod_{\ell \neq k} v_{k \ell} w_{k \ell}\bigg). \end{equation}

If $N^{\textrm {loc}, U}_{\mathbf {A}, \mathbf {M}}(T) = N^{\textrm {loc}}_{\mathbf {A}, \mathbf {M}}(T)$, then we just write $S_{\mathbf {A}}(\boldsymbol {\chi }, T) := S^{U}_{\mathbf {A}}(\boldsymbol {\chi }, T)$. The goal of this section is to prove the following lemma.

Lemma 5.9 There exist frobenian multiplicative functions $\alpha (\cdot ; \theta _{\mathbf {A}}), \beta (\cdot, \theta _{\mathbf {A}})$ of conductor at most $16 |\theta _{\mathbf {A}}|^2$ such that for $T > 2$ and all $\mathbf {A}, \boldsymbol {\chi }, U$ we have

(5.11)\begin{align} S^{U}_{\mathbf{A}}(\boldsymbol{\chi}, T) &= \mathop{\sum \sum \sum}_{\substack{(\mathbf{u}, \mathbf{d}, \mathbf{w}) \in U \nonumber\\ \mu(\prod_k u_k \prod_{\{k, \ell\}} d_{k \ell} w_{k \ell})^2 = 1}} \prod_k \chi_k(u_k)^2 \prod_{\{k, \ell\}}\chi_k \chi_\ell(w_{k \ell} d_{k \ell})\alpha(w_{k \ell}; \theta_{\mathbf{A}}) \beta( d_{k \ell}; \theta_{\mathbf{A}}) \nonumber\\ &\quad + O_{\varepsilon}\bigg(\frac{T^2(\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \end{align}

The functions $\alpha (\cdot ; \theta _{\mathbf {A}}), \beta (\cdot, \theta _{\mathbf {A}})$ respectively have means $\frac {1}{2}, \frac {3}{16}$ if $\theta _{\mathbf {A}} \in - \mathbb {Q}^{\times 2}$ and means $\frac {1}{4}, \frac {7}{32}$ if $\theta _{\mathbf {A}} \not \in \pm \mathbb {Q}^{\times 2}$.

Before proving this lemma we need some preparations. To encode the condition that local points have to exist we apply Lemma 3.3. From now on assume that $k < \ell$ and $m < n$. If $p \nmid m_\mathbf {A}\prod _{\{k, \ell \}} v_{k \ell } w_{k \ell }$, then there are always local solutions. If $p \mid v_{k \ell } w_{k \ell }$, then there local solutions if and only if $-({a_k}/{a_\ell }) \in \mathbb {Z}_p^{\times 4}$ or $-({a_m}/{a_n}) \in \mathbb {Z}_p^{\times 4}$.

If $p \mid w_{k \ell }$, then this follows from the other conditions on $w_{k \ell }$, because in this case $p \equiv 3 \pmod {4}$ so $\mathbb {Z}_p^{\times 4} = \mathbb {Z}_p^{\times 2}$. However, also $\theta _{\mathbf {a}}\not \in \mathbb {Q}_p^{\times 2}$ so $-({a_k}/{a_\ell }) \not \equiv -({a_m}/{a_n}) \mod \mathbb {Z}_p^{\times 2}$. Hence, at least one of them must be a square. To encode the other conditions on $w_{k \ell }$ we utilize the function $\alpha (w; \theta )$ which is the indicator function of the set

\[ \{w : w \mbox{ square-free},\ p \mid w \Rightarrow p \equiv 3 \ (\mathrm{mod}\ 4),\ \theta \not \in \mathbb{Q}_p^{\times 2}\}. \]

This is a $\{2, p \mid \theta \}$-frobenian multiplicative function defined by $\mathbb {Q}(\sqrt {-1}, \sqrt {\theta _{\mathbf {a}}})$ and thus of conductor at most $16 |\theta |^2$. It has mean $\frac {1}{2}$ if $-\theta _{\mathbf {a}}\in \mathbb {Q}^{\times 2}$ and mean $\frac {1}{4}$ if $\theta _{\mathbf {a}}\not \in \pm \mathbb {Q}^{2 \times }$.

To encode the properties for $v_{k \ell }$ we first introduce some functions. Given $x, y, z \in \mathbb {Q}$ define $\varpi (v;x, y, z)$ as the indicator function of the set

\[ \{v \in \mathbb{Z} : \mu(v)^2 = 1,\ v \mbox{ coprime with } x, y,z, p \mid v \Rightarrow z \in \mathbb{Z}_p^{\times 2} \mbox{ and } x \mbox{ or } x y^2 z \in \mathbb{Q}_p^{\times 4}\}. \]

This is a multiplicative function in $v$ which encodes the condition on $v_{k \ell }$ as

\[ \begin{array}{ll} v = v_{k \ell}, & x = -\dfrac{a_k}{a_\ell} = -\dfrac{A_k v_{k m} w_{k m} v_{k n} w_{k n} u_k^2}{A_\ell v_{\ell m} w_{\ell m} v_{\ell n} w_{\ell n} u_\ell^2}, \\ y = y_{k n}^{\ell m} := \dfrac{v_{\ell m} w_{\ell m} u_\ell u_m}{v_{k n} w_{k n} u_k u_n}, & z = z_{k n}^{\ell m} := \dfrac{A_\ell A_m}{A_k A_n} \end{array} \]

due to (5.6). We think of $z$ as being constant. Using these functions we have the following formula for $S_U(\boldsymbol {\chi }, T)$:

(5.12)\begin{equation} \mathop{\sum_{\mathbf{v}} \sum_{\mathbf{w}} \sum_{\mathbf{u}}}_{\substack{(\mathbf{u}, \mathbf{v}, \mathbf{w}) \in U \\ \mu(\prod_k u_k \prod_{\{k, \ell\}} v_{k \ell} w_{k \ell})^2 = 1}} \prod_k \chi_k(u_k)^2 \prod_{k < \ell}\alpha(w_{k \ell} ; \theta_{\mathbf{A}}) \varpi\bigg(v_{k \ell} ; -\frac{a_k}{a_{\ell}}, y_{k n}^{\ell m}, z_{k n}^{\ell m}\bigg) \chi_k \chi_{\ell}(v_{k \ell} w_{k \ell}). \end{equation}

We now give a description of $\varpi (v_{k \ell } ; -({a_k}/{a_{\ell })}, y_{k n}^{\ell m}, z_{k n}^{\ell m})$ as a convolution of simpler functions. Let $\beta (x;z), \gamma (x;z), \gamma ^c(x; z), \delta (x;z)$ be the multiplicative functions which are $0$ if $2x$ is not square-free and such that for an odd prime $p$ they take the value described in the following table.

These are $\{p : p = 2, \mbox { or } v_p(z) \neq 0\}$-frobenian multiplicative functions of conductor at most the discriminant $\mathbb {Q}(i,\sqrt [4]{z})$ which is at most $2^{24}\prod _{v_p(z) \neq 0} p^6$. The functions $\beta (\cdot,z), \delta (\cdot, z)$ are defined by the extension $\mathbb {Q}(\sqrt {-1}, z)$ and thus have conductor at most $16 \prod _{v_p(z) \neq 0} p^2$. The means of these functions in the relevant cases are given in the table. The functions $\gamma$ and $\gamma ^c$ are bounded by $\delta$.

Lemma 5.10 The following expression is equal to $\varpi (v;x, y,z)$:

(5.13)\begin{align} & \mu(v)^2 \sum_{d e f f' \mathbf{N}(\mathfrak{g})\mathbf{N}(\mathfrak{g}') = v} \beta(d; z) \beta(e; z) \gamma(f; z) \gamma(f' ; z) \delta(\mathbf{N}(\mathfrak{g}); z) \gamma^c(\mathbf{N}(\mathfrak{g}') ; z) \nonumber\\ &\quad \times \bigg(\frac{x}{e}\bigg) \bigg(\frac{y}{f}\bigg) \bigg(\frac{x y}{f'}\bigg) \bigg(\frac{x}{\mathfrak{g}}\bigg)_{4} \bigg(\frac{x y^2}{\mathfrak{g}'}\bigg)_{4}, \end{align}

where $d, e,f,f'$ range over $\mathbb {N}$ and $\mathfrak {g},\mathfrak {g}'$ range over $I_{\mathbb {Q}[i]}$.

Proof. Both sides are multiplicative in $v$ and $0$ at higher prime powers so it suffices to check this equality at primes. This follows from a case analysis. If $p \equiv 1 \pmod {4}$, then it factors in $\mathbb {Z}[i]$ as $\mathfrak {p} \overline {\mathfrak {p}}$. The four characters of $\mathbb {F}_p^{\times }/\mathbb {F}_p^{\times 4}$ are $1, \bigl (\frac {\cdot }{p}\bigr ), \bigl (\frac {\cdot }{\mathfrak {p}}\bigr )_{4}, \bigl (\frac {\cdot }{\overline {\mathfrak {p}}}\bigr )_{4}$ and for any fixed choice of $z \in \mathbb {Z}_p^{\times 2}/\mathbb {Z}_p^{\times 4}$ one can check that the expression (5.13) for $p = v$ is exactly the representation of $\varpi (p; x,y,z)$ as a sum of characters. The same is true for $p \equiv 3$ but the characters for $\mathbb {F}_p^{\times }/\mathbb {F}_p^{\times 4}$ in that case are $1, \bigl (\frac {\cdot }{p}\bigr )$.

We can now prove Lemma 5.9. We remark that the frobenian multiplicative functions $\alpha (\cdot, \theta ), \beta (\cdot, \theta )$ are the same ones that appear in the statement of Lemma 5.9.

Proof. Substituting (5.13) in the sum (5.12) and switching the order of summation gives a sum of $46$ variables over a downward-closed set $V$. To describe this sum we rename some of the variables and associate to each of them a factor. The sum is over the product of these factors, together with the condition that these variables are pairwise coprime and square-free.

  1. (i) The $u_k$ variables are given a factor $\chi _k(u_k)^2$. This is bounded by $1$.

  2. (ii) The $w_{k \ell }$ have a factor $\alpha ( w_{k \ell }; \theta ) \chi _k \chi _{\ell }(w_{k \ell })$ which is bounded by $\alpha ( w_{k \ell }; \theta )$.

  3. (iii) Denote the $d$ variable associated to $v_{k \ell }$ by $d_{k \ell }$. It is given a factor

    \[ \beta(d_{k \ell} ; z_{k n}^{\ell m}) \chi_k \chi_{\ell}(d_{k \ell}). \]
    This is bounded by $\beta ( w_{k \ell }; z_{k n}^{\ell m})$.
  4. (iv) The $e$ variable associated to $v_{k \ell }$ will be denoted by $e_{k \ell }$. It has a factor

    \[ \beta(e_{k \ell}, z_{k n}^{\ell m}) \bigg(\frac{-A_k A_\ell t_{k m} t_{k n} t_{\ell m} t_{\ell n}}{e_{k \ell}}\bigg) \chi_k \chi_{\ell}(e_{k \ell}). \]
    This is bounded by $\beta ( w_{k \ell }; z_{k n}^{\ell m})$.
  5. (v) Denote the $f$ variable associated to $v_{k \ell }$ by $f_{k \ell }^{n m}$. The associated factor is

    \[ \gamma(f_{k \ell}^{m n}, z_{kn}^{\ell m}) \bigg(\frac{ -A_k A_\ell u_k u_\ell u_m u_n t_{k n} t_{\ell m} }{f_{k \ell}^{n m}}\bigg) \chi_k \chi_{\ell}(f_{k \ell}^{m n}). \]
    Note that $t_{k n}$ has the first index of the top and bottom of $f_{k \ell }^{n m}$ and $t_{\ell m}$ has the second index of the top and bottom. This is bounded by $\delta (f_{k \ell }^{n m}, z_{kn}^{\ell m})$.
  6. (vi) We denote the $f'$ variable associated to $v_{k \ell }$ by $f_{k \ell }^{m n}$. Its factor is

    \[ \gamma(f_{k \ell}^{m n}, z_{kn}^{\ell m}) \bigg(\frac{ -A_k A_\ell u_k u_\ell u_m u_n t_{k m} t_{\ell n}}{f_{k \ell}^{m n}}\bigg) \chi_k \chi_{\ell}(f_{k \ell}^{m n}). \]
    Again note that $t_{k m}$ has the first index of the top and bottom of $f_{k \ell }^{m n}$ and $t_{\ell n}$ has the second index of the top and bottom. It is bounded by $\delta (f_{k \ell }^{m n}, z_{kn}^{\ell m})$
  7. (vii) We rename the $\mathfrak {g}$ variable associated to $v_{k \ell }$ by $\mathfrak {g}_{k \ell }$. It has a factor

    \[ \delta(\mathbf{N}(\mathfrak{g}_{k \ell}); z_{kn}^{\ell m}) \bigg(\frac{-A_k t_{k m} t_{k n} u_k^2}{\mathfrak{g}_{k \ell}}\bigg)_{4} \bigg(\frac{A_\ell t_{\ell m} t_{\ell n} u_\ell^2}{ \mathfrak{g}_{k \ell}}\bigg)_{4}^{-1} \chi_{k} \chi_{\ell}(\mathbf{N}(\mathfrak{g}_{k \ell})). \]
    Its factor is bounded by $\delta (\mathbf {N}(\mathfrak {g}_{k \ell }); z_{kn}^{\ell m})$.
  8. (viii) The $\mathfrak {g}'$ associated to $v_{k \ell }$ will be denoted by $\mathfrak {g}_{k \ell }^{c}$, (the $c$ standing for complement):

    \[ \gamma^c(\mathbf{N}(\mathfrak{g}^{c}_{k \ell}); z_{kn}^{\ell m}) \bigg(\frac{-A_k t_{k m} t_{\ell m} u_m^2}{\mathfrak{g}_{k \ell}^c}\bigg)_{4} \bigg(\frac{A_\ell t_{k n} t_{\ell n} u_n^2}{ \mathfrak{g}_{k \ell}^c}\bigg)_{4}^{-1} \chi_{k} \chi_{\ell}(\mathbf{N}(\mathfrak{g}_{k \ell}^c)). \]
    This factor is bounded by $\delta (\mathbf {N}(\mathfrak {g}^{c}_{k \ell }); z_{kn}^{\ell m})$.

We now stop assuming that $k < \ell, m < n$ and write $d_{k \ell } = d_{\ell k}, f_{k \ell }^{m n} = f_{\ell k}^{n m}, \dots$. In this way, we can treat $f_{k \ell }^{n m}$ and $f_{k \ell }^{m n}$ uniformly. With this relabeling the downward-closed set $V$ is equal to

\[ \{\mathbf{u}, \mathbf{d}, \mathbf{e}, \mathbf{f}, \mathbf{g}, \mathbf{g}^c : (\mathbf{u}, (e_{k \ell} d_{k \ell} f_{k \ell}^{m n} f_{k \ell}^{n m} \mathbf{N}(\mathfrak{g}_{k \ell}) \mathbf{N}(\mathfrak{g}^c_{k \ell}))_{\{k, \ell\}}, \mathbf{w}) \in U \}. \]

It is then clear that the length of $V$ is $L_V = L_U \leq |\theta _{\mathbf {A}}|^{-({1}/{2})}T^2$.

We now apply Theorem 4.22 with $A = 6, q_{\textrm {osc}} = O(1), q_{\textrm {frob}} = O(1)|\theta _{\mathbf {A}}|^{O(1)}$. The resulting error term is $O_{\varepsilon }(|\theta _{\mathbf {A}}|^{-({1}/{2}) + \varepsilon } T^2)$. We use Lemma 4.20 to show that the following pairs of variables can be linked. Note that the coprimality condition between the linked variables is contained in the Jacobi or power residue symbols.

  1. (i) The variable $u_i$ is linked to $f_{k \ell }^{m n}$ and to $\mathfrak {g}_{k \ell }$ if $i = k, \ell$ and to $f_{k \ell }^{m n}$ and $\mathfrak {g}_{k \ell }^c$ if $i = m, n$.

  2. (ii) The variables $w_{i j}$ and $d_{i j}$ are both linked to $e_{k \ell }, \mathfrak {g}_{k \ell }$ and $\mathfrak {g}_{k \ell }^c$ if $\{i, j\} \neq \{k, \ell \}, \{m, n\}$. It is also linked to $f_{k \ell }^{m n}$ if $\{i, j\} = \{k, m\}, \{\ell, n\}$.

  3. (iii) The variable $e_{i j}$ is linked to $f_{k \ell }^{m n}$ if $\{i, j\} = \{k, n\}, \{\ell, m \}$. It is also linked to $\mathfrak {g}_{k \ell }, \mathfrak {g}_{k \ell }^c$ if $\{i, j\} \neq \{k, \ell \}, \{m, n\}$.

  4. (iv) The variable $f_{k \ell }^{m n}$ is linked to $f_{k m}^{n \ell }, f_{\ell n}^{m k}$, $f_{k n}^{\ell m}$ and $f_{\ell m}^{k n}$. It is also linked to $\mathfrak {g}_{ij}$ and $\mathfrak {g}_{ij}^c$ as long as $\{i, j\} \neq \{k, \ell \}, \{m, n\}$.

  5. (v) The variables $\mathfrak {g}_{i j}$ and $\mathfrak {g}_{i j}^c$ are both linked to $\mathfrak {g}_{k \ell }$ and $\mathfrak {g}_{k \ell }^c$ as long as $\{i, j\} \neq \{k, \ell \}, \{\ell, m\}$.

All the variables are also frobenian. Indeed, if we fix all but one variable, then the product of the factors is a product of the following quantities.

  1. (i) A constant dependent on the other variables, whose absolute value is bounded by $1$.

  2. (ii) Coprimality conditions which only increase the set of bad primes of the frobenian multiplicative function.

  3. (iii) Factors of the form $\bigl (\frac {n}{m}\bigr ) \bigl (\frac {m}{n}\bigr )$. These are Hecke characters modulo $4$ by quadratic reciprocity.

  4. (iv) A frobenian multiplicative function of conductor at most $O(|\theta _{\mathbf {A}}|^6)$ independent of the other variables.

  5. (v) The factor $\chi ^2_k$ or $\chi _k \chi _\ell$ which is a Dirichlet character modulo $8 m_{\mathbf {A}}$ independent of the other variables.

  6. (vi) A product of Jacobi or quartic residue characters of the same type as defined in Lemma 4.20 corresponding to a linked variable $y$. The norm of the modulus of this Hecke character is $O(\mathbf {N}(y)^6)$ by that lemma.

It is thus a frobenian multiplicative of conductor $O(1) |\theta _{\mathbf {A}}|^{O(1)}$ by Lemma 4.3.

Assume that there is a linked variable $y \neq 1$. Then the mean of this frobenian multiplicative function is $0$ by Lemma 4.4. Indeed, the product of the factors except for the Hecke character corresponding to $y$ is frobenian multiplicative and defined by a field with discriminant coprime to $\mathbf {N}(y)$ by the coprimality conditions. The Hecke character corresponding to $y$ cannot be defined by such a field since $\mathbf {N}(y)$ is square-free and thus divides the norm of the conductor of the corresponding Hecke character by Lemma 4.20.

Let us now consider the case when one of the (frobenian) variables $x$ is not equal to $1$. By Theorem 4.22 we may assume that all the linked variables are at most $e^{(2\log T)^{\varepsilon }}$. Note that $\alpha (\cdot ; z), \beta (\cdot ; z)$ and $\delta (\cdot ; z)$ depend only on the class of $z$ in $\mathbb {Q}^{\times }/\mathbb {Q}^{\times 2}$. Thus, we may replace $z_{kn}^{\ell m}$ by $\theta _{\mathbf {A}}$. Using upper bounds, including to get rid of the dependence of $U$, we see that the contribution of this is at most

(5.14)\begin{align} & \mathop{\sum_{\mathbf{w}} \sum_{\mathbf{d}} \sum_{\mathbf{e}} \sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{g}^c} \sum_{\mathbf{u}}}_{\substack{|A_k| u_k^2 \prod_{\ell \neq k} t_{k \ell} \leq T \nonumber\\ \mathbf{N}(y) \leq e^{(\log T^2)^{\varepsilon}} \text{ for } y \text{ linked to } x}} \prod_{\{k, \ell\}}\alpha(w_{k \ell}; \theta_{\mathbf{A}}) \beta(d_{k \ell}; \theta_{\mathbf{A}}) \beta(e_{k \ell}; \theta_{\mathbf{A}}) \nonumber\\ &\quad \times\delta(\mathbf{N}(\mathfrak{g}_{k \ell}); \theta_{\mathbf{A}}) \delta(\mathbf{N}(\mathfrak{g}^c_{k \ell}); \theta_{\mathbf{A}}) \prod_{m,n \neq k, \ell} \delta(f_{k \ell}^{m n}; \theta_{\mathbf{A}}). \end{align}

We can sum over the $u_k$ and enlarge the sum to obtain an upper bound

\begin{align*} & \frac{T^2}{|\theta_{\mathbf{A}}|^{{1}/{2}}} \mathop{\sum_{\mathbf{w}} \sum_{\mathbf{d}} \sum_{\mathbf{e}} \sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{g}^c}}_{\substack{\mathbf{N}(y) \leq e^{(\log T^2)^{\varepsilon}} \text{ for } y \text{ linked to } x \\ \mathbf{N}(z) \leq T \text{ for } z \text{ not linked to } x}} \prod_{\{k, \ell\}} \frac{\alpha(w_{k \ell}; \theta_{\mathbf{A}})}{w_{k \ell}} \frac{\beta(d_{k \ell}; \theta_{\mathbf{A}})}{d_{k \ell}} \frac{\beta(e_{k \ell}; \theta_{\mathbf{A}})}{e_{k \ell} } \\ &\quad \times\frac{\delta(\mathbf{N}(\mathfrak{g}_{k \ell}); \theta_{\mathbf{A}})}{\mathbf{N}(\mathfrak{g}_{k \ell})} \frac{\delta(\mathbf{N}(\mathfrak{g}^c_{k \ell}); \theta_{\mathbf{A}})}{\mathbf{N}(\mathfrak{g}^c_{k \ell})} \prod_{m,n \neq k, \ell} \frac{\delta(f_{k \ell}^{m n}; \theta_{\mathbf{A}})}{f_{k \ell}^{m n}}. \end{align*}

The functions $\alpha (\cdot ; \theta _{\mathbf {A}}), \beta (\cdot ; \theta _{\mathbf {A}}), \delta (\cdot ; \theta _{\mathbf {A}}), \delta (\mathbf {N}(\cdot ), \theta _{\mathbf {A}})$ are frobenian multiplicative of known means and conductor $O(|\theta _{\mathbf {A}}|^2)$. The mean of $\delta (\mathbf {N}(\cdot ), \theta _{\mathbf {A}})$ is twice that of $\delta (\cdot, \theta _{\mathbf {A}})$ because a prime $p$ splits in $\mathbb {Q}(i)$ if and only if $p \equiv 1 \pmod {4}$. Applying Proposition 4.12, partial summation and using that all of the frobenian multiplicative functions are bounded by $1$ we obtain an upper bound

\[ \ll_{\varepsilon} \frac{T^2 (\log T)^{W}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}} \sum_{\substack{y \text{ linked to } x \\ \mathbf{N}(y) \leq e^{(\log T^2)^{\varepsilon}}}} \frac{1}{\prod_y \mathbf{N}(y)} \ll_{\varepsilon} \frac{T^2 (\log T)^{W + N \varepsilon}}{|\theta_{\mathbf{A}}|^{{1}/{2} - (N + 1) \varepsilon}}, \]

where $N = 46$ is the total number of variables and $W$ is the sum of the means of the frobenian multiplicative functions corresponding to the unlinked variables, including $x$. We now compute $W$ for $x = \mathfrak {g}_{k \ell }, \mathfrak {g}_{k \ell }^c$ in the following table. By the notation $2 \cdot \frac {1}{2}$ we mean that there are two unlinked variables of that type and that the corresponding frobenian multiplicative function has mean $\frac {1}{2}$.

We can now assume that $\mathfrak {g}_{k \ell }, \mathfrak {g}_{k \ell }^c = 1$. Consider now the case that $x = f_{k \ell }^{m n} \neq 1$. All the $u_i$ are linked to $x$, by (5.14) we can thus give an upper bound for this contribution of

\[ \mathop{\sum_{\mathbf{w}} \sum_{\mathbf{d}} \sum_{\mathbf{e}} \sum_{\mathbf{f}} \sum_{\mathbf{u}}}_{\substack{|A_k| u_k^2 \prod_{\ell \neq k} w_{k \ell} d_{k \ell} e_{k \ell} f_{k \ell}^{m n} f_{k \ell}^{n m} \leq T \\ u_k \leq e^{(\log T^2)^{\varepsilon}}}} \prod_{\{k, \ell\}}\alpha(w_{k \ell}; \theta_{\mathbf{A}}) \beta(d_{k \ell}; \theta_{\mathbf{A}}) \beta(e_{k \ell}; \theta_{\mathbf{A}}) \prod_{m,n \neq k, \ell} \delta(f_{k \ell}^{m n}; \theta_{\mathbf{A}}). \]

Consider the Dirichlet convolution $\rho := \alpha * \beta * \beta * \delta * \delta$. One can check by looking at prime powers that $|\rho (\cdot ; \theta _{\mathbf {A}})| \leq 1$. Writing $t_{k \ell } = w_{k \ell } d_{k \ell } e_{k \ell } f_{k \ell }^{m n} f_{k \ell }^{n m}$ we obtain an upper bound

\[ \sum_{u_k \leq e^{(\log T^2)^{\varepsilon}}} \sum_{|A_k| u_k^2 \prod_{\ell \neq k} t_{k \ell} \leq T} \rho(t_{k \ell}; \theta_{\mathbf{A}}) \leq \sum_{u_k \leq e^{(\log T^2)^{\varepsilon}}} \sum_{|A_k| u_k^2 \prod_{\ell \neq k} t_{k \ell} \leq T} 1. \]

The inner sum was bounded in the proof of Lemma 5.3. To be precise, (5.2) gives an upper bound

\[ \ll \sum_{u_k \leq e^{(\log T^2)^{\varepsilon}}} \frac{T^2 (\log T)^2}{\prod_k u_k |\theta_{\mathbf{A}}|^{{1}/{2}}} \ll \frac{T^2 (\log T)^{2 + 4 \varepsilon}}{|\theta_{\mathbf{A}}|^{{1}/{2}}}. \]

We can thus assume that $f_{k \ell }^{m n} = 1$. Consider now the case that $x = e_{k \ell } \neq 1$. We can apply the same method as for $\mathfrak {g}_{k \ell }, \mathfrak {g}_{k \ell }^c$, but we may assume that $\mathfrak {g}_{k \ell } = \mathfrak {g}_{k \ell }^c = f_{k \ell }^{m n} = 1$ so they will not contribute to $W$. We compute $W$ in the following table.

We recall that $M_{\mathbf {A}} = \frac {11}{16}$ if $\theta _{\mathbf {A}} \in - \mathbb {Q}^{\times 2}$ and $M_{\mathbf {A}} = \frac {15}{32}$ if $\theta _{\mathbf {A}} = \pm \mathbb {Q}^{\times 2}$. Thus, in the above cases $W$ is always strictly smaller than $5 M_{\mathbf {A}}$. Combining everything and changing our $\varepsilon$ we find that

\begin{align*} S^{U}_{\mathbf{A}}(\boldsymbol{\chi}, T) &= \mathop{\sum \sum \sum}_{\substack{(\mathbf{u}, \mathbf{d}, \mathbf{w}) \in U \\ \mu(\prod_k u_k \prod_{\{k, \ell\}} d_{k \ell} w_{k \ell})^2 = 1}} \prod_k \chi_k(u_k)^2 \prod_{\{k, \ell\}}\chi_k \chi_\ell(w_{k \ell} d_{k \ell})\alpha(w_{k \ell}; \theta_{\mathbf{A}}) \beta( d_{k \ell}; \theta_{\mathbf{A}}) \\ &\quad + O_{\varepsilon}\bigg(\frac{T^2(\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg) \end{align*}

as desired.

5.3 The error terms

Proof of Lemma 5.4 By combining (5.9) and (5.11) we have for all downward-closed sets $U$

(5.15)\begin{equation} \#N^{\textrm{loc}, U}_{\mathbf{A}, \mathbf{M}}(T) \ll_{\varepsilon} \mathop{\sum \sum \sum}_{(\mathbf{u}, \mathbf{d}, \mathbf{w}) \in U} \prod_{\{k, \ell\}} \alpha(w_{k \ell}; \theta_{\mathbf{A}}) \beta( d_{k \ell}; \theta_{\mathbf{A}}) + \frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}. \end{equation}

We have rid ourselves of the coprimality and square-freeness with a trivial bound.

There are two sets $U$ we have to consider.

  1. (i) The $U$ we have to consider for (5.3) is defined by the inequalities $w_{k \ell } \leq (\log T^2)^{C}$ for all $\{k , \ell \}$. Summing over the $u_k$ and enlarging the sum gives

    \[ \ll_{\varepsilon} \frac{T^2(\log T)^{\varepsilon}}{|\theta_{\mathbf{A}}|^{{1}/{2}}} \sum_{w_{k \ell} \leq (\log T^2)^{C}} \prod_{\{k, \ell\}} \frac{\alpha( w_{k \ell}; \theta_{\mathbf{A}})}{w_{k \ell}} \sum_{d_{k \ell} \leq T} \prod_{\{k, \ell\}}\frac{\beta( d_{k \ell}; \theta_{\mathbf{A}})}{d_{k \ell}}. \]
    The functions $\alpha (\cdot ; \theta _{\mathbf {A}}), \beta (\cdot ; \theta _{\mathbf {A}})$ are frobenian multiplicative functions of conductor at most $16 |\theta _{\mathbf {A}}|^2$ and mean given in Lemma 5.9. By Theorem 4.12 and partial summation this is
    \[ \ll_{\varepsilon} |\theta_{\mathbf{A}}|^{12 \varepsilon- {1}/{2}} T^2 (\log T)^{W} (C \log \log T)^6 \ll_C |\theta_{\mathbf{A}}|^{12 \varepsilon- {1}/{2}} T^2 (\log T)^{5 M_{\mathbf{A}}}. \]
    Here $W = 6 \cdot \frac {3}{16} < \frac {55}{16} = 5 M_{\mathbf {A}}$ if $\theta _{\mathbf {A}} \in - \mathbb {Q}^{\times 2}$ and $W = 6 \cdot \frac {7}{32} < \frac {75}{32} = 5M_{\mathbf {A}}$ if $\theta _{\mathbf {A}} \not \in \pm \mathbb {Q}^{\times 2}$.
  2. (ii) We may assume that $i = 0, j = 1$ when proving (5.4), the condition defining $U$ is thus $u_0, u_1 \leq (\log T^2)^C$. Summing first over $u_2, u_3$ and then over the $w_{k \ell } \leq T/|A_k| u_k^2 d_{k \ell } \prod _{m \neq k, \ell } d_{k m} w_{k m}$ for all $(k, \ell ) = (0, 2), (1, 3)$ together with the trivial bound $\alpha (w_{k \ell }; \theta _{\mathbf {A}})\leq 1$ gives the following upper bound after enlarging the sum

    \[ \ll \frac{T^2}{|\theta_{\mathbf{A}}|^{{1}/{2}}} \sum_{u_0, u_1 \leq (\log T^2)^C} \frac{1}{u_0 u_1} \sum_{d_{k \ell}, w_{k \ell} \leq T} \prod_{\{k, \ell \} \neq \{0,2\}} \frac{\alpha(w_{k \ell}; \theta_{\mathbf{A}})}{ w_{k \ell}} \prod_{\{k, \ell\}} \frac{\beta( d_{k \ell}; \theta_{\mathbf{A}})}{d_{k \ell}}. \]
    Again, the functions $\alpha, \beta$ are frobenian of known means so using Theorem 4.12 and partial summation we obtain
    \[ \ll |\theta_{\mathbf{A}}|^{10 \varepsilon- {1}/{2}} T^2 (\log T)^{W} (C\log \log T)^2 \ll |\theta_{\mathbf{A}}|^{6 \varepsilon- {1}/{2}} T^2 (\log T)^{5 M_{\mathbf{A}}}. \]
    Here $W = 4 \cdot \frac {1}{2} + 6 \cdot \frac {3}{16} < \frac {55}{16} = 5 M_{\mathbf {A}}$ if $\theta _{\mathbf {A}} \in - \mathbb {Q}^{\times 2}$ and $W = 4 \cdot \frac {1}{4} + 6 \cdot \frac {7}{32} < \frac {75}{32} = 5M_{\mathbf {A}}$ if $\theta _{\mathbf {A}} \not \in \pm \mathbb {Q}^{\times 2}$.

We now look at $S_{\mathbf {A}}(\boldsymbol {\chi }, T)$. Let $\lambda (\cdot ; \theta )$ be the $S \cup \{p \mid \theta \}$-frobenian multiplicative function

\[ \lambda(t; \theta) := \mu(t)^2 \sum_{\substack{wd = t \\ (\theta \prod_{p \in S} p , t) = 1}} \alpha(w; \theta) \beta(d; \theta). \]

To see that it is frobenian multiplicative note that it is bounded by the convolution $\alpha (\cdot ; \theta ) *\beta (\cdot ; \theta )$, which is frobenian multiplicative due to Lemma 4.3, and equal to that convolution at all but finitely many primes $t$. Its mean $M_{\theta }$ is equal to the sum of the means of $\alpha (\cdot ; \theta ), \beta (\cdot ; \theta )$. This is $11/16$ if $-\theta \in \mathbb {Q}^{\times 2}$ and $15/32$ if $\pm \theta \not \in \mathbb {Q}^{\times 2}$. We remark that $M_{\theta _{\mathbf {A}}}\, \!{=}\, M_{\mathbf {A}}$.

It is bounded by $1$ because there exists at most one pair $d,w$ such that the term in the sum is non-zero. The extension $\mathbb {Q}(\sqrt {-1}, \sqrt {\theta })$ defines $\lambda (\cdot ; \theta )$ so it has conductor at most $16 |\theta |^2$.

Note that $\chi _k(x) = 0$ if $(\theta _{\mathbf {A}} \prod _{p \in S} S, x) \neq 1$ so substituting the definition (5.3) into (5.11) shows that

(5.16)\begin{align} S_{\mathbf{A}}(\boldsymbol{\chi}, T) &= \mathop{\sum_{\mathbf{u}} \sum_{\mathbf{t}}}_{|A_k| u_k^2 \prod_{\ell \neq k} t_{k \ell} \leq T} \mu\bigg(\prod_k u_k \prod_{\{k, \ell\}} t_{k \ell}\bigg)^2\prod_k\chi_k(u_k^2) \prod_{\{k,\ell\}} \chi_k \chi_\ell(t_{k \ell})\lambda(t_{k \ell}; \theta_{\mathbf{A}}) \nonumber\\ &\quad + O_{\varepsilon}\bigg(\frac{T^2(\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \end{align}

We first deal with the $\mu (\prod _k u_k \prod _{\{k, \ell \}} t_{k \ell })^2$ factor via Möbius inversion. Let $\kappa$ be the $10$-variable multiplicative function with Dirichlet series

(5.17)\begin{align} F(\mathbf{s}, \mathbf{s}') &= \sum_{\mathbf{u}} \sum_{\mathbf{t}} \frac{\mu(\prod_k u_k \prod_{\{k, \ell\}} t_{k \ell})^2}{ \prod_{k} u_k^{s_k}\prod_{\{k, \ell\}} t_{k \ell}^{s'_{k \ell}}} \prod_k \zeta^{-1}(s_k) \prod_{\{k, \ell\}}\zeta(s'_{k \ell})^{-1} \nonumber\\ &= \prod_p\bigg(1 + \sum_k p^{-s_k} + \sum_{\{k, \ell \}} p^{-s'_{k \ell}}\bigg) \prod_k(1 - p^{-s_k})\prod_{\{k, \ell\}} (1 - p^{-s'_{k \ell}}). \end{align}

This converges absolutely as long as $s_i + s_k, s_i + s'_{k \ell }, s'_{i j} + s'_{k \ell } > 1$ for all $i,j,k,\ell$. By the correspondence between arithmetic functions and Dirichlet series we have

\[ \mu\bigg(\prod_k u_k \prod_{\{k, \ell\}} t_{k \ell}\bigg)^2 = \sum_{\substack{\mathbf{f} \\ f_{k \ell}| t_{k \ell}}} \sum_{\substack{\mathbf{g} \\ g_{k}| u_{k}}} \kappa(\mathbf{f}, \mathbf{g}). \]

Substituting this into (5.16), rewriting $t_{k \ell } = f_{k \ell } t_{k \ell }, u_k = g_k u_k$, using that $\lambda (\cdot ; \theta _{\mathbf {A}})$ is multiplicative and supported on square-free integers and switching the order of summation shows that the main term of $S_{\mathbf {A}}(\boldsymbol {\chi }, T)$ is

(5.18)\begin{equation} \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{u}} \sum_{\mathbf{t}}}_{\substack{|A_k| g_k^2 u_k^2 \prod_{\ell \neq k} f_{k \ell} t_{k \ell} \leq T \\ (f_{k \ell}, t_{k \ell}) = 1}} \kappa(\mathbf{f}, \mathbf{g}) \prod_k\chi_k(g_k^2 u_k^2) \prod_{\{k, \ell\}}\chi_k \chi_\ell(f_{k \ell} t_{k \ell}) \lambda(f_{k \ell} ; \theta_{\mathbf{A}})\lambda(t_{k \ell}; \theta_{\mathbf{A}}). \end{equation}

We prove the following lemma to deal with $\chi _k$.

Lemma 5.11 If there exists a character $\chi \in \Gamma _{\mathbf {A}}^\vee [2]$ such that $\chi _k = \chi$ for all $k$, then

(5.19)\begin{align} S_{\mathbf{A}}(\boldsymbol{\chi}, T) = S_{\mathbf{A}}(\boldsymbol{1}, T) &= \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{u}} \sum_{\mathbf{t}}}_{\substack{|A_k| g_k^2 u_k^2 \prod_{\ell \neq k} f_{k \ell} t_{k \ell} \leq T \nonumber\\ (m_{\mathbf{A}}, \prod_k u_k g_k)= (f_{k \ell}, t_{k \ell}) = 1 }} \kappa(\mathbf{f}, \mathbf{g}) \prod_{\{k, \ell\}} \lambda(f_{k \ell} ; \theta_{\mathbf{A}})\lambda(t_{k \ell}; \theta_{\mathbf{A}}) \nonumber\\ &\quad + O_{\varepsilon}\bigg(\frac{T^2(\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \end{align}

For all other $\boldsymbol {\chi }$ we have

\[ S_{\mathbf{A}}(\boldsymbol{\chi}, T) = O_{\varepsilon}\bigg(\frac{T^2(\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \]

Proof. If there exists such a character $\chi$, then $\chi _k(g_k^2 u_k^2)= \chi ^2(g_k u_k)$ and $\chi _k \chi _\ell (f_{k \ell } t_{k \ell }) = \chi ^2(f_{k \ell } t_{k \ell })$. As $\chi ^2$ is principal it is the indicator function of the set $\{x \in \mathbb {Z}: (x,m_{\mathbf {A}}) =1\}$. The equality (5.19) follows from (5.18) after noting that the definition (5.3) of $\lambda$ already includes the necessary coprimality condition.

Assume now that no such $\chi$ exists. We claim that there exist $k \neq \ell$ such that $\chi _k \chi _{\ell }$ is non-principal. To prove the claim assume that for all $k \neq \ell$ the character $\chi _k \chi _{\ell }$ is principal. Then for all pairwise distinct $k, \ell, m$, we have $\chi _k = \chi _{\ell }^{-1} = \chi _m$. Hence, $\chi _k = \chi _m = \chi _k^{-1}$ for all $k, m$.

We may assume that $\chi _0 \chi _1$ is non-principal. First sum over $t_{0 1}$ in (5.18). The function $\chi _0 \chi _1(\cdot ) \lambda (\cdot, \theta _{\mathbf {A}})$ is a $\{p \mid m_{\mathbf {A}} f_{0 1} \}$-frobenian multiplicative function of conductor $\ll |\theta _{\mathbf {A}}|^{O(1)}$. The function $\lambda (t, \theta )$ is strictly positive if $t$ is odd square-free so $\chi _0 \chi _1(\cdot ) \lambda (\cdot, \theta _{\mathbf {A}})$ has mean zero by [Reference Loughran and MatthiesenLM19, Lemma 2.4]. Corollary 4.11 and the divisor bound thus imply that this sum is

\begin{align*} & \ll_{C, \varepsilon} |\theta_{\mathbf{A}}|^{\varepsilon} e^{\omega(f_{0 1})} T \min\bigg(\frac{\bigl(\log ({2T}/{|A_0| u_0^2 t_{0 2} t_{0 3} f_{0 1} f_{0 2} f_{0 3}})\bigr)^{-2C}}{|A_0| u_0^2 t_{0 2} t_{0 3} f_{0 1} f_{0 2} f_{0 3}},\\ &\frac{\bigl(\log ({2T}/{|A_1| u_1^2 t_{1 2} t_{1 3} f_{0 1} f_{1 2} f_{1 3}})\bigr)^{-2C}}{|A_1| u_1^2 t_{1 2} t_{1 3} f_{0 1} f_{1 2} f_{1 3}}\bigg) \\ & \ll_{C, \varepsilon} |\theta_{\mathbf{A}} f_{0 1}|^{\varepsilon} \frac{T \bigl(\log ({2T}/{u_0^2 |A_0| t_{0 2} t_{0 3} f_{0 1} f_{0 2} f_{0 3}})\bigr)^{-C} \bigl(\log ({2T}/{u_1^2 |A_1| t_{1 2} t_{1 3} f_{0 1} f_{1 2} f_{1 3}})\bigr)^{-C}}{u_0 u_1 f_{0 1}(|A_0 A_1| t_{0 2} t_{0 3} f_{0 2} f_{0 3} t_{1 2} t_{1 3} f_{1 2} f_{1 3})^{{1}/{2}}}. \end{align*}

Choose $C = 2$ and use the inequality $|\chi _k(\cdot )| \leq 1$. The sum over $u_i$ for $i = 0,1$ is bounded by the integral

\[ \int_{x \geq 0}^{x^2 \leq {T}/{A_i t_{i 2} t_{i 3} f_{i j} f_{i 2} f_{i 3}}} \frac{\bigl(\log ({2T}/{x^2 A_i t_{i 2} t_{i 3} f_{i j} f_{i 2} f_{i 3}})\bigr)^{-2}}{x}\, {dx} = \int_{y^2 \geq 2}^{\infty} \frac{1}{y(2 \log y)^2} \, {dy} \ll 1, \]

where we made the change of variables $y = {(2T)^{{1}/{2}}}/{x (A_i t_{i 2} t_{i 3} f_{i j} f_{i 2} f_{i 3})^{{1}/{2}}}$. Summing over the $u_k$ for $k = 2,3$ and using the trivial bound $\lambda (f_{k \ell }, \theta _{\mathbf {A}}) \leq 1$ gives a total upper bound

\[ \ll_{\varepsilon} \frac{T^2}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}} \bigg(\sum_{\mathbf{f}, \mathbf{g}}|\kappa(\mathbf{f}, \mathbf{g})| f_{0 1}^{\varepsilon - 1} \sum_{t_{k \ell} \leq T} \prod_{\{k, \ell\} \neq \{0, 1\}} \frac{\lambda(t_{k \ell}; \theta_{\mathbf{A}})}{t_{k \ell} f_{k \ell}} + (\log T)^{5 M_{\mathbf{A}}}\bigg). \]

The sum over $\mathbf {f}$ and $\mathbf {g}$ is bounded by a value of the series obtained by taking absolute values in the series $F(\boldsymbol {1}, \mathbf {s}')$, which was defined in (5.17), with $s'_{01} = 1 - \varepsilon$ and $s'_{k \ell } = 1$ for $\{k, \ell \} \neq \{0, 1\}$. This is $O(1)$ for $\varepsilon \leq \frac {1}{4}$ because the series is absolutely convergent there. By applying Theorem 4.12 to $\lambda (\cdot ; \theta _{\mathbf {A}})$ and using partial summation we can conclude that

\[ S_{\mathbf{A}}(\boldsymbol{\chi}, T) \ll_{\varepsilon} \frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}. \]

Lemma 5.11 implies that

(5.20)\begin{align} N^{\textrm{loc}}_{\mathbf{A}, \mathbf{M}}(T) &= \frac{1}{|\Gamma_{\mathbf{A}}|^4} \sum_{\chi \in \Gamma_{\mathbf{A}}^{\vee}[2]} \prod_{k} \overline{\chi}(M_k) S_{\mathbf{A}}(\boldsymbol{1}, T) + O_{\varepsilon} \bigg(\frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg) \nonumber\\ &= \frac{|\Gamma_{\mathbf{A}}^{\vee}[2]|}{|\Gamma_{\mathbf{A}}|^4} S_{\mathbf{A}}(\boldsymbol{1}, T) + O_{\varepsilon} \bigg(\frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \end{align}

The second equality is because $\prod _k M_k$ has to be a square so $\chi (\prod _k M_k) = 1$ if $\chi \in \Gamma _{\mathbf {A}}^{\vee }[2]$.

5.4 The main term

It remains to evaluate $S_{\mathbf {A}}(\boldsymbol {1}, T)$. By Theorem 4.12 we have the following asymptotic formula for all $\varepsilon > 0$

\[ \sum_{\substack{t \leq x \\ (t,f) = 1}} \lambda(t; \theta) = (K_{\theta, f} M_{\theta} + e^{O(f)}O_{\epsilon}(|\theta|^{\varepsilon}(\log x)^{-1})) x (\log x)^{M_{\theta} - 1}. \]

The constant $K_{\theta, f}$ is given by the Euler product

\[ K_{\theta, f} := \frac{1}{\Gamma(M_\theta) M_{\theta}} \prod_{p \not \in S, p \nmid f \theta} \bigg(1 + \frac{\lambda(p, \theta)}{p}\bigg) \prod_p \bigg(1 - \frac{1}{p}\bigg)^{M_{\theta}}. \]

In particular, $K_{\theta, f} \leq K_{\theta, 1}$. By applying Theorem 4.12 with $N = -1$ we see that $K_{\theta, 1} \ll _{\varepsilon } |\theta |^{\varepsilon }$.

It follows from the divisor bound that $e^{O(\omega (f))} = O_{\varepsilon }(f^{\varepsilon })$. Applying partial summations and noting that $M_{\theta } < 1$ shows that

(5.21)\begin{equation} \sum_{\substack{t \leq x \\ (t,f) = 1}} \frac{\lambda(t; \theta)}{t} = K_{\theta, f}(\log x)^{M_{\theta}} + O_{\varepsilon}((|\theta| f)^{\varepsilon}). \end{equation}

We use the shortened notation $M_{\mathbf {A}} := M_{\theta _{\mathbf {A}}},\, K_{\mathbf {A}, f} := K_{\theta _{\mathbf {A}}, f}$ which is consistent with our earlier definition of $M_{\mathbf {A}}$.

We now prove an asymptotic formula for the main term.

Lemma 5.12 There exists a positive constant $Q'_{\mathbf {A}} \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$ for all $\varepsilon$ such that for $T > 2$ we have

(5.22)\begin{equation} S_{\mathbf{A}}(\mathbf{1}, T) = \bigg(Q'_{\mathbf{A}} + O_{\varepsilon} \bigg(\frac{|\theta_{\mathbf{A}}|^{\varepsilon}}{(\log T)^{M_{\mathbf{A}}}}\bigg)\bigg) \frac{T^{2} (\log T)^{6 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2}}}. \end{equation}

Proof. We have to evaluate the right-hand side of (5.19):

(5.23)\begin{equation} \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{u}} \sum_{\mathbf{t}}}_{\substack{|A_k| g_k^2 u_k^2 \prod_{\ell \neq k} f_{k \ell} t_{k \ell} \leq T \\ (m_{\mathbf{A}}, \prod_k u_k g_k) =(f_{k \ell}, t_{k \ell}) = 1}} \kappa(\mathbf{f}, \mathbf{g}) \prod_{\{k, \ell\}} \lambda(f_{k \ell} ; \theta_{\mathbf{A}})\lambda(t_{k \ell}; \theta_{\mathbf{A}}) + O_{\varepsilon}\bigg(\frac{T^2 (\log T)^{5 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2} - \varepsilon}}\bigg). \end{equation}

We first sum over the $u_k$, this sum is $ ({\varphi (m_{\mathbf {A}})}/{m_{\mathbf {A}}}) ({T}/{|A_k|^{{1}/{2}} g_k \prod _{\ell \neq k} f_{k \ell } t_{k \ell }}) + O(\tau (m_{\mathbf {A}}))$, it is also $\ll {T}/{|A_k|^{{1}/{2}} g_k \prod _{\ell \neq k} f_{k \ell } t_{k \ell }}$. To deal with the resulting error terms it thus suffices to bound

\[ \frac{\tau(m_{\mathbf{A}})T^{{3}/{2}}}{|A_1 A_2 A_3|^{{1}/{2}}} \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{t}}}_{\substack{|A_k| g_k^2 \prod_{\ell \neq k} f_{k \ell} t_{k \ell} \leq T \\ (m_{\mathbf{A}}, \prod_k g_k) = (f_{k \ell}, t_{k \ell}) = 1}} \frac{|\kappa(\mathbf{f}, \mathbf{g})| g_0 \prod_{i \neq 0} \sqrt{f_{0 i} t_{0 i}}\prod_{\{k, \ell\}} \lambda( f_{k \ell}; \theta_{\mathbf{A}})\lambda( t_{k \ell}; \theta_{\mathbf{A}})}{\prod_k g_{k} \prod_{\{k, \ell\}} f_{k \ell} t_{k \ell}}. \]

Applying the trivial inequality $\lambda ( \cdot ; \theta _{\mathbf {A}}) \leq 1$, getting rid of the coprimality conditions by an upper bound, summing over $t_{0 1} \leq T/ g_0 t_{0 2} t_{03}\prod _{i \neq 0} f_{0 i}$ and enlarging the sum gives an upper bound

\[ \ll \frac{\tau(m_{\mathbf{A}})T^{2}}{|\theta_{\mathbf{A}}|^{{1}/{2}}} \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{t} \setminus \{t_{0 1}\}}}_{ t_{k \ell} \leq T} \frac{|\kappa(\mathbf{f}, \mathbf{g})|\prod_{\{k, \ell\}} \lambda( f_{k \ell}; \theta_{\mathbf{A}}) \lambda( t_{k \ell}; \theta_{\mathbf{A}})}{ \prod_k g_{k} \prod_{\{k, \ell\}} f_{k \ell} t_{k \ell}}. \]

Using the bound $\lambda ( f_{k \ell }; \theta _{\mathbf {A}}) \leq 1$ we see that the sums over $\mathbf {f}$ and $\mathbf {g}$ are bounded by the series obtained by taking absolute values in the series defining $F(\mathbf {1}, \mathbf {1})$ which was defined in (5.17). This series converges because $F$ is absolutely convergent at $(\mathbf {1}, \mathbf {1})$. The divisor bound gives $\tau (m_{\mathbf {A}}) \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$. In addition, the sum over each $t_{k \ell }$ gives a contribution $\ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }(\log T)^{M_{\mathbf {A}}}$ by (5.21). The total error term is thus, after changing $\varepsilon$, given by $\ll _{\varepsilon } T^{2}(\log T)^{5 M_{\mathbf {A}}}/ |\theta _{\mathbf {A}}|^{{1}/{2} - \varepsilon }$. We find that $S_{\mathbf {A}}(\mathbf {1}, T)$ is equal to

(5.24)\begin{align} & \bigg(\frac{\varphi(m_{\mathbf{A}})}{m_{\mathbf{A}}}\bigg)^4 \frac{T^{2}}{|\theta_{\mathbf{A}}|^{{1}/{2}}} \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}} \sum_{\mathbf{t}}}_{\substack{|A_k| g_k^2 \prod_{\ell \neq k} f_{k \ell} t_{k \ell} \leq T \nonumber\\ (m_{\mathbf{A}}, \prod_k g_k) = (f_{k \ell}, t_{k \ell}) = 1}} \frac{\kappa(\mathbf{f}, \mathbf{g})\prod_{\{k, \ell\}} \lambda( f_{k \ell}; \theta_{\mathbf{A}}) \lambda(t_{k \ell}; \theta_{\mathbf{A}})}{\prod_k g_{k} \prod_{\{k, \ell\}} f_{k \ell} t_{k \ell}} \nonumber\\ &\quad + O_{\varepsilon}(|\theta_{\mathbf{A}}|^{\varepsilon - {1}/{2}}T^2(\log T)^{5M_{\mathbf{A}}}). \end{align}

Denote the sum over $\mathbf {t}$ by $R_{\mathbf {A}}(T; \mathbf {f}, \mathbf {g})$. We first compare this sum with the slightly different sum

\[ R'_{\mathbf{A}}(T; \mathbf{f}) := \sum_{\substack{\mathbf{t}\\ \prod_{\ell \neq k} t_{k \ell} \leq T \\ (f_{k \ell}, t_{k \ell}) = 1}} \prod_{\{k, \ell\}}\frac{\lambda( t_{k \ell}; \theta_{\mathbf{A}})}{t_{k \ell}}. \]

The difference between these two sums can be bounded by the sum over $i \in \{0, 1, 2, 3\}$ of the subsum of $R'_{\mathbf {A}}(T, \mathbf {1})$ defined by the condition $T/|A_i| g_{i}^2 \prod _{j \neq i} f_{i j} \leq \prod _{\ell \neq i} t_{i \ell } \leq T$. Choose $j \neq i$ and sum over $t_{i j}$. Using the trivial inequality $\lambda (t_{i j}; \theta _{\mathbf {A}}) \leq 1$ and ignoring the coprimality condition with an upper bound shows that this gives a contribution of

\[ \ll \log \frac{T}{\prod_{\ell \neq i,j} t_{i \ell}} - \log \frac{T}{|A_i| g_{i}^2 \prod_{j \neq i} f_{i j} \prod_{\ell \neq i,j} t_{i \ell}} = \log |A_i| g_{i}^2 \prod_{j \neq i} f_{i j}. \]

Summing over the other $t_{k \ell } \leq T$ and applying (5.21) shows that $|R_{\mathbf {A}}(T; \mathbf {f}, \mathbf {g}) - R'_{\mathbf {A}}(T; \mathbf {f})|$ is

\[ \ll_{\varepsilon}|\theta_{\mathbf{A}}|^{5 \varepsilon}\log\bigg( |\theta_{\mathbf{A}}| \prod_{k} g_{k} \prod_{\{k, \ell\}} f_{k \ell}\bigg)(\log T)^{5 M_{\mathbf{A}}} \ll_{\varepsilon} \bigg(|\theta_{\mathbf{A}}| \prod_{k} g_{k} \prod_{\{k, \ell\}} f_{k \ell}\bigg)^{6 \varepsilon} (\log T)^{5 M_{\mathbf{A}}}. \]

To evaluate $R'_{\mathbf {A}}(T; \mathbf {f})$ we apply Lemma 4.17. By (5.21) we may take

\begin{align*} & g_{k \ell}(t_{k \ell}) := K_{\mathbf{A}, f_{k \ell}}\frac{d}{dt_{k \ell}} (\log t_{k \ell})^{M_{\mathbf{A}}}\quad \mbox{ if } t_{k \ell} \geq 1,\ 0 \mbox{ otherwise,} \\ & h_{k \ell}(t_{k \ell}) := O_{\varepsilon}((|\theta_{\mathbf{A}}| f_{k \ell})^{\varepsilon})\quad \mbox{ if } \frac{1}{2} \leq t_{k \ell} \leq 1,\ 0 \mbox{ otherwise.} \end{align*}

By enlarging the integrals we see that the error term is bounded by the sum over $J \subsetneq \mathcal {P}_2(4)$ of the integrals

\[ \prod_{\{k, \ell\} \in J} K_{\mathbf{A}, f_{k \ell}} \int_{1}^{4 T} \frac{d}{d t_{k \ell}}(\log t_{k \ell})^{M_{\mathbf{A}}}\,d t_{k \ell} \prod_{\{m, n\} \not \in J} O_{\varepsilon} ((|\theta_{\mathbf{A}}| f_{m n})^{\varepsilon}) \int_{{1}/{2}}^{1}d t_{m n}. \]

The contribution for $\{m, n\} \not \in J$ is $O_{\varepsilon }((|\theta _{\mathbf {A}}| \prod _{\{m, n\}}f_{m n})^{(6- |J|)\varepsilon })$ and the contribution of $\{k, \ell \} \in J$ is $O(|\theta _{\mathbf {A}}|^{|J|\varepsilon } (\log T)^{|J|M_{\mathbf {A}}})$. The total error term is, thus,

\[ O_{\varepsilon}\bigg(\bigg(|\theta_{\mathbf{A}}| \prod_{\{k, \ell\}}f_{k \ell}\bigg)^{6 \varepsilon} (\log T)^{5 M_{\mathbf{A}}}\bigg). \]

The main term is given by the integral

\[ \prod_{\{k, \ell\}} K_{\mathbf{A}, f_{k \ell}} \int_{1 \leq t_{k \ell}}^{\prod_{\ell \neq k} t_{k \ell} \leq T} \prod_{\{k, \ell\}} \frac{d}{d t_{k \ell}}(\log t_{k \ell})^{M_{\mathbf{A}}} \,d t_{k \ell}. \]

By making the change of variables $s_{k \ell } = ( \log t_{k \ell }/ \log T)^{M_\mathbf {A}}$ we see that this is equal to

\[ \prod_{\{k, \ell\}} K_{\mathbf{A}, f_{k \ell}} (\log T)^{6 M_{\mathbf{A}}} \int_{0 \leq s_{k \ell}}^{\sum_{\ell \neq k} s_{k \ell}^{1/M_{\mathbf{A}}} \leq 1} \prod_{\{k, \ell \}} \,d s_{k \ell}. \]

The integral is equal to a constant $0 < L_{\mathbf {A}} \leq 1$ depending on $M_\mathbf {A}$. To summarize

\[ R_{\mathbf{A}}(T; \mathbf{f}, \mathbf{g})= \prod_{\{k, \ell\}} K_{\mathbf{A}, f_{k \ell}} L_{\mathbf{A}} (\log T)^{6 M_{\mathbf{A}}} + O_{\varepsilon}\bigg(\bigg(|\theta_{\mathbf{A}}| \prod_{\{k, \ell\}}f_{k \ell} \prod_k g_k\bigg)^{\varepsilon} (\log T)^{5 M_{\mathbf{A}}}\bigg). \]

We claim that the expression one gets after filling this into (5.24) converges as $T \to \infty$. Using the inequalities $K_{\mathbf {A}, f_{k \ell }} \leq K_{\mathbf {A}, 1}, \lambda ( f_{k \ell }; \theta _{\mathbf {A}}) \leq 1$ it suffices to bound the sums

\[ K_{\mathbf{A},1}^6\mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}}}_{|A_k| g_k^2 \prod_{\ell \neq k} f_{k \ell} > T \mbox{ for some } k} \frac{|\kappa(\mathbf{f}, \mathbf{g})|}{\prod_k g_k^{1 - \varepsilon} \prod_{\{k, \ell\}} f_{k \ell}^{1 - \varepsilon}}. \]

for sufficiently small $\varepsilon$. By enlarging the sum it suffices to bound the same sum over either the region defined by $g_k > (T/ |\theta _{\mathbf {A}}|)^{{1}/{5}}$ for some $k$ or the region defined by $f_{k \ell } > (T/ |\theta _{\mathbf {A}}|)^{{1}/{5}}$ for some $\{k, \ell \}$. This sum is exactly the remainder of the series gotten by taking absolute values of the coefficients of $F(\mathbf {1} - \boldsymbol {\varepsilon }, \mathbf {1} - \boldsymbol {\varepsilon })$. This Dirichlet series converges absolutely in a large enough half open plane so that the sum is $O( {|\theta _{\mathbf {A}}|^{\varepsilon }}/{T^{\varepsilon }})$ for sufficiently small $\varepsilon$. We also recall that $K_{\mathbf {A},1} \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$. Hence, there exists a constant $0 < F_{\mathbf {A}}(\mathbf {1}, \mathbf {1}) \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$ such that

\[ \mathop{\sum_{\mathbf{f}} \sum_{\mathbf{g}}}_{\substack{|A_k| g_k^2 \prod_{\ell \neq k} f_{k \ell}\leq T }} R_{\mathbf{A}}(T; \mathbf{f}, \mathbf{g}) = F_{\mathbf{A}}(\mathbf{1},\mathbf{1}) L_{\mathbf{A}} (\log T)^{6 M_{\mathbf{A}}} + O_{\varepsilon}( |\theta_{\mathbf{A}}|^{\varepsilon} (\log T)^{5 M_{\mathbf{A}}} ). \]

Combining everything, we find that

\[ S_{\mathbf{A}}(\mathbf{1}, T) = \bigg(\bigg(\frac{\varphi(m_{\mathbf{A}})}{m_{\mathbf{A}}}\bigg)^4 F_{\mathbf{A}}(\mathbf{1}, \mathbf{1}) L_{\mathbf{A}} + O_{\varepsilon} \bigg(\frac{|\theta_{\mathbf{A}}|^{\varepsilon}}{(\log T)^{M_{\mathbf{A}}}}\bigg)\bigg) \frac{T^{2} (\log T)^{6 M_{\mathbf{A}}}}{|\theta_{\mathbf{A}}|^{{1}/{2}}}. \]

As $( {\varphi (m_{\mathbf {A}})}/{m_{\mathbf {A}}})^4 \leq 1, 0 < F_{\mathbf {A}}(\mathbf {1}, \mathbf {1}) \ll _{\varepsilon } |\theta _{\mathbf {A}}|^{\varepsilon }$ and $0 < L_{\mathbf {A}} \leq 1$ we are done.

Substituting (5.22) into (5.20) shows that

\[ N^{\textrm{loc}}_{\mathbf{A}, \mathbf{M}}(T) = \frac{|\Gamma_{\mathbf{A}}^{\vee}[2]|}{|\Gamma_\mathbf{A}|^4}\bigg(Q'_{\mathbf{A}} + O_{\varepsilon} \bigg(\frac{|\theta_{\mathbf{A}}|^{\varepsilon}}{(\log T)^{M_{\mathbf{A}}}}\bigg)\bigg) \frac{T^{2} (\log T)^{6 M_{\mathbf{A}}}}{{|\theta_{\mathbf{A}}|^{{1}/{2}}}}. \]

This proves Lemma 5.8 because $|\Gamma _{\mathbf {A}}^{\vee }[2]| \leq |\Gamma _{\mathbf {A}}|$.

6. Uniform formula

One of the difficulties in finding upper and especially lower bounds of $N^{\operatorname {Br}}(T)$ was that we do not have a uniform description of $\mathcal {A}$ as this algebra depends on choosing a rational point on a quadric. We prove in this section that no such uniform description can exist.

We follow the ideas of [Reference UematsuUem14]. Let $k$ be a field of characteristic $0$, $\pi : \mathfrak {X} \to \mathbb {P}_k^3$ the universal diagonal quartic, i.e. $\mathfrak {X} \subset \mathbb {P}_k^3 \times \mathbb {P}_k^3$ is given by

\[ a_0 X_0^4 + a_1 X_1^4 + a_2 X_2^4 + a_3 X_3^4 = 0. \]

Let $K := k(\mathbb {P}_k^3)$. Write $\lambda = -{a_1}/{a_0}$, $\mu = -{a_2}/{a_0}$ and $\nu = {a_0 a_3}/{a_1 a_2}$. We see that $K = k(\lambda,\mu,\nu )$ and the generic fiber $\mathfrak {X}_K$ of $\pi$ is the smooth $K$-variety defined by the equation

\[ X_0^4 - \lambda X_1^4 - \mu X_2^4 + \lambda \mu \nu X_3^4 = 0. \]

Here $\lambda, \mu, \nu$ were chosen to be compatible with [Reference UematsuUem16, § 2].

Uematsu defined a specialization map [Reference UematsuUem14, § 2] whose definition in this particular case we recall. Let $A \in \operatorname {Br}(\mathfrak {X}_K)$. Then there exists an open subscheme $U \subset \mathbb {P}^{3}_k$ and an element $\tilde {A} \in \operatorname {Br}(\mathfrak {X}_U)$ such that $\mathfrak {X}_U$ is smooth over $U$ and $A$ is the image of $\tilde {A}$ under the injection $\operatorname {Br}(\mathfrak {X}_U) \to \operatorname {Br}(\mathfrak {X}_K)$. For every $P \in U(k)$ we define the specialization of $A$ at $P$ as

\[ \mbox{sp}(A ; P) := \tilde{A}|_{\mathfrak{X}_P} \in \operatorname{Br}(\mathfrak{X}_P). \]

We deal with the algebraic and transcendental Brauer groups separately.

Proposition 6.1 If $\sqrt [4]{2}\not \in k(i, \sqrt {2})$, then $\operatorname {Br}(\mathfrak {X}_K)/\operatorname {Br}_1(\mathfrak {X}_K) = 0$.

Proof. For the even torsion we apply [Reference Gvirtz and SkorobogatovGvSk22, Theorem 3.8], the surface $\mathfrak {X}_K$ is split by the extension $K(i, \sqrt {2}, \sqrt [4]{-\lambda },\sqrt [4]{-\mu },\sqrt [4]{\nu })$ which does not contain $\sqrt [4]{2}$.

The odd torsion follows from [Reference Gvirtz and SkorobogatovGvSk22, Remark 3.10]. To spell out the details, let $p$ be an odd prime. The remark says that the action of $\textrm {Gal}(\overline {K}/\overline {k}(\lambda, \mu, \nu ))$ on $\operatorname {Br}(X_{\overline {K}})[p] \cong \mathbb {Z}[\sqrt {-1}]/p$ is given by multiplication with the character $\textrm {Gal}(\overline {K}/\overline {k}(\lambda, \mu, \nu )) \to \mu _n$ corresponding to the quartic extension given by adjoining the fourth root of $\lambda ^2 \mu ^2 \nu.$ The fixed elements of this action are thus fixed by multiplication with $\sqrt {-1}$. The only such element is $0$, so

\[ \operatorname{Br}(X_{\overline{K}})[p]^{\textrm{Gal}(\overline{K}/K)} \subset \operatorname{Br}(X_{\overline{K}})[p]^{\textrm{Gal} (\overline{K}/\overline{k}(\lambda, \nu, \mu))} = 0 \]

and $\operatorname {Br}(\mathfrak {X}_K)/\operatorname {Br}_1(\mathfrak {X}_K)[p]$ by definition injects into this group.

Theorem 6.2 We have $\operatorname {Br}_1(\mathfrak {X}_K) /\operatorname {Br}_0(\mathfrak {X}_K) = 0$ but $H^1(K, \textrm {Pic}(\mathfrak {X}_{\overline {K}})) \cong \mathbb {Z}/2\mathbb {Z}$.

Proof. The Hochschild–Serre spectral sequence [Reference MilneMil80, Theorem 2.20] $H^p(K, H^q(\mathfrak {X}_{\overline {K}}, \mathbb {G}_m)) \Rightarrow H^{p+q}(\mathfrak {X}_K, \mathbb {G}_m)$ induces an exact sequence

\[ \operatorname{Br}(K) \to \operatorname{Br}_1(\mathfrak{X}_K) \to \operatorname{H}^1(K, \textrm{Pic}(\mathfrak{X}_{\overline{K}})) \xrightarrow{d^{1,1}_{\mathfrak{X}_K}} H^3(K, \mathbb{G}_m). \]

It suffices to prove that the map $d^{1,1}_{\mathfrak {X}_K}$ is injective. The group $H^1(K, \textrm {Pic}(\mathfrak {X}_{\overline {K}}))$ has been computed by Bright [Reference BrightBri02, Chapter 3]. He only mentions number fields, but the same arguments work over general fields of characteristic $0$. The relevant case in Appendix A of [Reference BrightBri02] is $A222,\,\, B60,\,\, C56,\,\, D56$ or $E18$ depending on $K \cap \mathbb {Q}( \sqrt [4]{-1}) = k \cap \mathbb {Q}( \sqrt [4]{-1})$. In all cases $H^1(K, \textrm {Pic}(\mathfrak {X}_{\overline {K}})) \cong \mathbb {Z}/2\mathbb {Z}$.

Consider the open subscheme $V := \{ X_3 \neq 0\} \subset \mathfrak {X}_K$. It is the affine quartic surface defined by $x_0^4 - \lambda x_1^4 - \mu x_2^4 + \lambda \mu \nu = 0$. Let $U$ be the affine quadric surface defined by $y_0^2 - \lambda y_1^2 - \mu y_2^2 + \lambda \mu \nu = 0$ and $f: V \to U$ the map $y_i = x_i^2$. The functoriality of the Hochschild–Serre spectral sequence gives us the following commutative diagram.

We have $H^1(K,\textrm {Pic}(U_{\overline {K}})) \cong \mathbb {Z}/2\mathbb {Z}$ and the map $d^{1,1}_{U}$ is injective [Reference UematsuUem16, Proposition 2.2, Theorem 3.1]. Let $\phi \in \operatorname {H}^1(K,\textrm {Pic}(U_{\overline {K}}))$ be the generator. It suffices to find an element $\psi \in \operatorname {H}^1(K, \textrm {Pic}(\mathfrak {X}_{\overline {K}}))$ such that $\psi |_V = f^* \phi$ because then $d^{1,1}_{\mathfrak {X}_K} \psi = d^{1,1}_{U} \phi \neq 0$ which implies that $\psi$ generates $H^1(K,\textrm {Pic}(U_{\overline {K}})) \cong \mathbb {Z}/2\mathbb {Z}$.

Let $\alpha := \sqrt {\lambda }$, $\alpha ' := \sqrt {\mu }$, $\gamma := \sqrt {\nu }$ and $\beta := \alpha \gamma$. We define the following divisors of $\mathfrak {X}_K$:

\begin{align*} & L_1: X_0^2 = \alpha X_1^2, X_2^2 = \beta X_3^2, \\ & L_2: X_0^2 = \alpha X_1^2, X_2^2 = - \beta X_3^2, \\ & L'_1: X_0^2 = -\alpha X_1^2, X_2^2 = \beta X_3^2, \\ & L'_2: X_0^2 = -\alpha X_1^2, X_2^2 = - \beta X_3^2. \end{align*}

Let $H \in \textrm {Pic}(\mathfrak {X}_{\overline {K}})$ be the hyperplane class.

By [Reference UematsuUem16, Corollary 2.3] $f^* \phi$ is represented by the cocycle

\[ f^* \phi: \textrm{Gal}(K(\gamma)/K) \to \textrm{Pic}(V_{\overline{K}}): \textrm{id} \to 0, \sigma \to [L_1 \cap V], \]

where $\sigma$ is the generator of $\textrm {Gal}(K(\gamma )/K)$.

We claim that the following cochain is a cocycle

\[ \psi: \textrm{Gal}(K(\gamma)/K) \to \textrm{Pic}(\mathfrak{X}_{\overline{K}}): \textrm{id} \to 0, \sigma \to [L_1] - H. \]

If the claim holds, then $\psi |_V = f^* \phi$ as desired.

The claim consists of two parts, first that $[L_1]$ is $\textrm {Gal}(\overline {K}/K(\gamma ))$-invariant. The Galois conjugates of $L_1$ are $L_1$ and $L'_2$ so this is equivalent to $[L_1] - [L'_2] = 0$. This follows from

\begin{align*} & \textrm{div}(X_0^2 - \alpha X_1^2 - \alpha'X_2^2 +\alpha' \beta X_3^2) = L_1 + \{X_0^2 = \alpha' X_2^2, \alpha X_1^2 = \alpha' \beta X_3^2 \}, \\ & \textrm{div}(X_0^2 + \alpha X_1^2 - \alpha'X_2^2 - \alpha' \beta X_3^2) = L'_2 + \{X_0^2 = \alpha' X_2^2, \alpha X_1^2 = \alpha' \beta X_3^2 \}. \end{align*}

That the left-hand side contains the right-hand side is clear. They are thus equal because they both have degree $8$. The second part of the claim is that $[L_1] + \sigma [L_1] -2H = 0$. This follows from $[L_1] + \sigma [L_1] = [L_1] + [L_2] = [\textrm {div}(X_0^2 - \alpha X_1^2)] = 2H$.

This theorem allows us to prove in a precise sense that there exists no uniform formula for $\mathcal {A}$. Let $k = \mathbb {Q}, K =\mathbb {Q}(\lambda, \mu, \nu )$.

Corollary 6.3 There exists no $A \in \operatorname {Br}(\mathfrak {X}_K)$ such that

\[ \text{sp}(A; P) = \mathcal{A} \]

for all $P \in \mathbb {P}^3_\mathbb {Q}(\mathbb {Q})$ for which this specialization is defined.

Proof. By Proposition 6.1 and Theorem 6.2 it suffices to show that the subset of $\mathbb {P}^3_\mathbb {Q}(\mathbb {Q})$ on which $\mathcal {A}$ is not constant is Zariski-dense. Let $p$ be an odd prime. By Lemma 3.8 the Zariski-dense set $\{ [p a_0: pa_1: a_2: a_3] :p \nmid a_0 a_1 a_2 a_3 \}$ is contained in this subset.

Acknowledgements

First and foremost I want to thank Daniel Loughran for suggesting I study this question and for his extensive help with the problem and while writing this paper. I would also like to thank Martin Bright and Nick Rome for answering some of my questions. I am grateful to Alexei Skorobogatov for the proof of Proposition 6.1. I am also grateful to the anonymous reviewer for their helpful comments.

Footnotes

The author is supported by FWO-Vlaanderen (Research Foundation–Flanders) with grant number 11I0621N.

References

Bright, M. J., Computations on diagonal quartic surfaces, PhD thesis, University of Cambridge (2002).Google Scholar
Bright, M. J., The Brauer–Manin obstruction on a general diagonal quartic surface, Acta Arith. 147 (2011), 291302; MR 2773207.10.4064/aa147-3-8CrossRefGoogle Scholar
Bright, M. J., Bad reduction of the Brauer–Manin obstruction, J. Lond. Math. Soc. (2) 91 (2015), 643666; MR 3355119.CrossRefGoogle Scholar
Bright, M., Obstructions to the Hasse principle in families, Manuscripta Math. 157 (2018), 529550; MR 3858417.CrossRefGoogle Scholar
Bright, M. J., Browning, T. D. and Loughran, D., Failures of weak approximation in families, Compos. Math. 152 (2016), 14351475; MR 3530447.CrossRefGoogle Scholar
Colliot-Thélène, J.-L. and Skorobogatov, A. N., The Brauer-Grothendieck group, Ergebnisse der Mathematik und ihrer Grenzgebiete. 3. Folge. A Series of Modern Surveys in Mathematics, vol. 71 (Springer, Cham, 2021); MR 4304038.CrossRefGoogle Scholar
Colliot-Thélène, J.-L. and Xu, F., Brauer–Manin obstruction for integral points of homogeneous spaces and representation by integral quadratic forms, Compos. Math. 145 (2009), 309363; MR 2501421.10.1112/S0010437X0800376XCrossRefGoogle Scholar
Deligne, P., La conjecture de Weil. II, Publ. Math. Inst. Hautes Études Sci. 52 (1980), 137252; MR 601520.10.1007/BF02684780CrossRefGoogle Scholar
Fogels, E., Über die Ausnahmenullstelle der Heckeschen $L$-Funktionen, Acta Arith. 8 (1962/63), 307309; MR 156823.10.4064/aa-8-3-307-309CrossRefGoogle Scholar
Garibaldi, S., Merkurjev, A. and Serre, J.-P., Cohomological invariants in Galois cohomology, University Lecture Series, vol. 28 (American Mathematical Society, 2003); MR 1999383.Google Scholar
Gille, P. and Szamuely, T., Central simple algebras and Galois cohomology, Cambridge Studies in Advanced Mathematics, vol. 101 (Cambridge University Press, Cambridge, 2006); MR 2266528.10.1017/CBO9780511607219CrossRefGoogle Scholar
Grothendieck, A., Le groupe de Brauer: I, II, III, Dix exposés sur la cohomologie des schémas 3 (1968), 6787.Google Scholar
Grothendieck, A., Revêtements étales et groupe fondamental (SGA 1), Lecture Notes in Mathematics, vol. 224 (Springer, 1971).10.1007/BFb0058656CrossRefGoogle Scholar
Gvirtz, D., Loughran, D. and Nakahara, M., Quantitative arithmetic of diagonal degree $2$ K3 surfaces, Math. Ann. 384 (2022), 135209; MR 4476231.CrossRefGoogle Scholar
Gvirtz, D. and Skorobogatov, A. N., Cohomology and the Brauer groups of diagonal surfaces, Duke Math. J. 171 (2022), 12991347; MR 4408122.CrossRefGoogle Scholar
Harpaz, Y., Second descent and rational points of Kummer varieties, Proc. Lond. Math. Soc. (3) 118 (2019), 606648; MR 3932783.CrossRefGoogle Scholar
Harpaz, Y. and Skorobogatov, A. N., Hasse principle for Kummer varieties, Algebra Number Theory 10 (2016), 813841; MR 3519097.CrossRefGoogle Scholar
Heath-Brown, D. R., The size of Selmer groups for the congruent number problem, Invent. Math. 111 (1993), 171195; MR 1193603.CrossRefGoogle Scholar
Ieronymou, E. and Skorobogatov, A. N., Odd order Brauer–Manin obstruction on diagonal quartic surfaces, Adv. Math. 270 (2015), 181205; MR 3286534.CrossRefGoogle Scholar
Landau, E., Verallgemeinerung eines Pólyaschen satzes auf algebraische Zahlkörper, Nachr. Ges. Wiss. Gottingen, Math.-Phys. Kl. 1918 (1918), 478488.Google Scholar
Lehmann, B. and Tanimoto, S., On the geometry of thin exceptional sets in Manin's conjecture, Duke Math. J. 166 (2017), 28152869; MR 3712166.CrossRefGoogle Scholar
Loughran, D., The number of varieties in a family which contain a rational point, J. Eur. Math. Soc. (JEMS) 20 (2018), 25392588; MR 3852186.CrossRefGoogle Scholar
Loughran, D. and Matthiesen, L., Frobenian multiplicative functions and rational points in fibrations. J. Eur. Math. Soc. (JEMS), to appear. Preprint (2019), arXiv:1904.12845.Google Scholar
Loughran, D., Rome, N. and Sofos, E., A new conjecture on rational points in families, Preprint (2022), arXiv:2210.13559.Google Scholar
Loughran, D. and Smeets, A., Fibrations with few rational points, Geom. Funct. Anal. 26 (2016), 14491482; MR 3568035.CrossRefGoogle Scholar
Loughran, D., Takloo-Bighash, R. and Tanimoto, S., Zero-loci of Brauer group elements on semi-simple algebraic groups, J. Inst. Math. Jussieu 19 (2020), 14671507; MR 4138949.CrossRefGoogle Scholar
Milne, J. S., Étale cohomology, Princeton Mathematical Series, vol. 33 (Princeton University Press, Princeton, NJ, 1980); MR 559531.Google Scholar
Montgomery, H. L. and Vaughan, R. C., Multiplicative number theory. I. Classical theory, Cambridge Studies in Advanced Mathematics, vol. 97 (Cambridge University Press, Cambridge, 2007); MR 2378655.Google Scholar
Pieropan, M. and Schindler, D., Hyperbola method on toric varieties, Preprint (2020), arXiv:2001.09815.Google Scholar
Pieropan, M., Smeets, A., Tanimoto, S. and Várilly-Alvarado, A., Campana points of bounded height on vector group compactifications, Proc. Lond. Math. Soc. (3) 123 (2021), 57101; MR 4307130.CrossRefGoogle Scholar
Rome, N., The number of diagonal planar conics with a point (2022). Talk at ‘Rational points on higher-dimensional varieties’, International Centre for Mathematical Sciences, https://media.ed.ac.uk/playlist/dedicated/51612401/1_c3glvibl/1_wznl5zjb.Google Scholar
Serre, J.-P., Linear representations of finite groups, Graduate Texts in Mathematics, vol. 42 (Springer, New York–Heidelberg, 1977), translated from the second French edition by Leonard L. Scott; MR 0450380.CrossRefGoogle Scholar
Serre, J.-P., Lectures on $N_X (p)$, Chapman & Hall/CRC Research Notes in Mathematics, vol. 11 (CRC Press, Boca Raton, FL, 2012); MR 2920749.Google Scholar
Skorobogatov, A., Diagonal quartic surfaces, in Explicit methods in number theory, Oberwolfach Reports, vol. 6 (EMS Press, 2009), 1843–1920; MR 2664991.Google Scholar
Sofos, E. and Visse-Martindale, E., The density of fibres with a rational point for a fibration over hypersurfaces of low degree, Ann. Inst. Fourier (Grenoble) 71 (2021), 679709; MR 4353917.CrossRefGoogle Scholar
Swinnerton-Dyer, P., Arithmetic of diagonal quartic surfaces. II, Proc. Lond. Math. Soc. (3) 80 (2000), 513544; MR 1744774.CrossRefGoogle Scholar
Tenenbaum, G., Introduction to analytic and probabilistic number theory, Cambridge Studies in Advanced Mathematics, vol. 46 (Cambridge University Press, Cambridge, 1995); translated from the second French edition (1995) by C. B. Thomas; MR 1342300.Google Scholar
Uematsu, T., On the Brauer group of diagonal cubic surfaces, Q. J. Math. 65 (2014), 677701; MR 3230382.CrossRefGoogle Scholar
Uematsu, T., On the Brauer group of affine diagonal quadrics, J. Number Theory 163 (2016), 146158; MR 3459565.10.1016/j.jnt.2015.11.015CrossRefGoogle Scholar