1. Introduction
 Let w be a word on r letters, i.e. an element in the free group on the letters 
 $x_{1},\ldots,x_{r}$
. Let
$x_{1},\ldots,x_{r}$
. Let 
 $X_{1},\ldots,X_{r}$
 be random
$X_{1},\ldots,X_{r}$
 be random 
 $d\times d$
 unitary matrices, chosen independently at random according to the Haar probability measure, and consider the random matrix
$d\times d$
 unitary matrices, chosen independently at random according to the Haar probability measure, and consider the random matrix 
 $w(X_{1},\ldots,X_{r})$
, obtained by substituting
$w(X_{1},\ldots,X_{r})$
, obtained by substituting 
 $X_{i}$
 for
$X_{i}$
 for 
 $x_{i}$
 in w. For example, if
$x_{i}$
 in w. For example, if 
 $w=x_{1}x_{2}x_{1}^{-1}x_{2}^{-1}$
, then
$w=x_{1}x_{2}x_{1}^{-1}x_{2}^{-1}$
, then 
 $w(X_{1},X_{2})=X_{1}X_{2}X_{1}^{-1}X_{2}^{-1}$
. In this paper, we study the distribution of the characteristic polynomial of
$w(X_{1},X_{2})=X_{1}X_{2}X_{1}^{-1}X_{2}^{-1}$
. In this paper, we study the distribution of the characteristic polynomial of 
 $w(X_{1},\ldots,X_{r})$
. To set notation, given a
$w(X_{1},\ldots,X_{r})$
. To set notation, given a 
 $d\times d$
-matrix A and
$d\times d$
-matrix A and 
 $1\leq m\leq d$
, let
$1\leq m\leq d$
, let 
 $c_{m}(A)$
 be the coefficient of
$c_{m}(A)$
 be the coefficient of 
 $t^{d-m}$
 in the characteristic polynomial
$t^{d-m}$
 in the characteristic polynomial 
 $\det(t\cdot\mathrm{Id}-A)$
 of A. Note that
$\det(t\cdot\mathrm{Id}-A)$
 of A. Note that 
 $c_{m}(A)=(-1)^{m}\,\mathrm{tr}\big(\bigwedge\nolimits^{\!m}A\big)$
, where
$c_{m}(A)=(-1)^{m}\,\mathrm{tr}\big(\bigwedge\nolimits^{\!m}A\big)$
, where 
 $\bigwedge^{\!m}A:\bigwedge^{\!m}\mathbb{C}^{d}\rightarrow\bigwedge^{\!m}\mathbb{C}^{d}$
 is the mth exterior power of A. If A is unitary, all eigenvalues have absolute value 1, so we get the trivial bound
$\bigwedge^{\!m}A:\bigwedge^{\!m}\mathbb{C}^{d}\rightarrow\bigwedge^{\!m}\mathbb{C}^{d}$
 is the mth exterior power of A. If A is unitary, all eigenvalues have absolute value 1, so we get the trivial bound 
 $|c_{m}(A)|\leq\binom{d}{m}$
.
$|c_{m}(A)|\leq\binom{d}{m}$
.
Our main theorem is as follows.
Theorem 1.1. For every non-trivial word 
 $w\in F_{r}$
, there exists a constant
$w\in F_{r}$
, there exists a constant 
 $\epsilon(w)>0$
 such that
$\epsilon(w)>0$
 such that 
 \[\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|^{2})\leq\binom{d}{m}^{\!\!2(1-\epsilon(w))},\]
\[\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|^{2})\leq\binom{d}{m}^{\!\!2(1-\epsilon(w))},\]
 for every d and every 
 $1\leq m\leq d$
. In particular, we have
$1\leq m\leq d$
. In particular, we have 
 \[\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|)\leq\binom{d}{m}^{\!\!1-\epsilon(w)}.\]
\[\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|)\leq\binom{d}{m}^{\!\!1-\epsilon(w)}.\]
Remark 1.2. We make the following remarks.
- 
(1) In the proof of Theorem 1.1, we show that, if the length of w is  $\ell$
 and $\ell$
 and $d\geq(25\ell)^{7\ell}$
, then one can take $d\geq(25\ell)^{7\ell}$
, then one can take $\epsilon(w)=\frac{1}{72}(25\ell)^{-2\ell}$
. We believe $\epsilon(w)=\frac{1}{72}(25\ell)^{-2\ell}$
. We believe $\epsilon(w)^{-1}$
 can be taken to be a polynomial in $\epsilon(w)^{-1}$
 can be taken to be a polynomial in $\ell$
, for $\ell$
, for $d\gg_{\ell}1$
. $d\gg_{\ell}1$
.
- 
(2) On the other hand, it follows from [Reference Elkasapy and ThomET15, Theorem 5.2] that, for a fixed d, one has to take  $\epsilon(w)\lesssim e^{-\sqrt{\ell}}$
, for some arbitrarily long words, even for $\epsilon(w)\lesssim e^{-\sqrt{\ell}}$
, for some arbitrarily long words, even for $m=1$
. $m=1$
.
Theorem 1.1 relies on the following.
Theorem 1.3. For every 
 $m,\ell\in\mathbb{N}$
, every
$m,\ell\in\mathbb{N}$
, every 
 $d\geq m\ell$
, and every word
$d\geq m\ell$
, and every word 
 $w\in F_{r}$
 of length
$w\in F_{r}$
 of length 
 $\ell$
, one has
$\ell$
, one has 
 \begin{equation}\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|^{2})\leq(22\ell)^{m\ell}.\end{equation}
\begin{equation}\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|^{2})\leq(22\ell)^{m\ell}.\end{equation}
 In particular, if 
 $d\geq(22\ell)^{\ell}m$
, we have
$d\geq(22\ell)^{\ell}m$
, we have 
 \[\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|^{2}\big)\leq\binom{d}{m}.\]
\[\mathbb{E}(|c_{m}(w(X_{1},\ldots,X_{r}))|^{2}\big)\leq\binom{d}{m}.\]
In addition, we show that similar bounds hold for symmetric powers.
Theorem 1.4. For every 
 $\ell\in\mathbb{N}$
, every
$\ell\in\mathbb{N}$
, every 
 $d\geq m\ell$
, and every word
$d\geq m\ell$
, and every word 
 $w\in F_{r}$
 of length
$w\in F_{r}$
 of length 
 $\ell$
, one has
$\ell$
, one has 
 \[\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})\leq(16\ell)^{m\ell}.\]
\[\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})\leq(16\ell)^{m\ell}.\]
 In particular, if 
 $d\geq(16\ell)^{\ell}m$
, we have
$d\geq(16\ell)^{\ell}m$
, we have 
 \[\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})\leq\binom{d+m-1}{m}=\dim\mathrm{Sym}^{m}\mathbb{C}^{d},\]
\[\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})\leq\binom{d+m-1}{m}=\dim\mathrm{Sym}^{m}\mathbb{C}^{d},\]
and by the Cauchy–Schwarz inequality,
 \[|\mathbb{E}(\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r})))|\leq(\dim\mathrm{Sym}^{m}\mathbb{C}^{d})^{\frac{1}{2}}.\]
\[|\mathbb{E}(\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r})))|\leq(\dim\mathrm{Sym}^{m}\mathbb{C}^{d})^{\frac{1}{2}}.\]
Remark 1.5. Theorem 1.4 is an analogue of Theorem 1.3. It is also an analogue of Theorem 1.1 for m at most linear in d. In contrast to exterior powers, the methods of this paper are insufficient for finding bounds similar to Theorem 1.1 for 
 $|\mathbb{E}(\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r})))|$
, in the regime where m is superlinear in d.
$|\mathbb{E}(\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r})))|$
, in the regime where m is superlinear in d.
1.1 Related work
Word maps on unitary groups and their eigenvalues have been studied extensively in the past few decades.
 The case 
 $w=x$
, namely, the study of a Haar-random unitary matrix X, also known as the circular unitary ensemble (CUE), is an important object of study in random matrix theory (see, e.g., [Reference Anderson, Guionnet and ZeitouniAGZ10, Reference MeckesMec19] and the references therein). The joint density of the eigenvalues of X is given by the Weyl integration formula [Reference WeylWey39]. Schur’s orthogonality relations immediately imply that
$w=x$
, namely, the study of a Haar-random unitary matrix X, also known as the circular unitary ensemble (CUE), is an important object of study in random matrix theory (see, e.g., [Reference Anderson, Guionnet and ZeitouniAGZ10, Reference MeckesMec19] and the references therein). The joint density of the eigenvalues of X is given by the Weyl integration formula [Reference WeylWey39]. Schur’s orthogonality relations immediately imply that 
 $\mathbb{E}(|c_{m}(X)|^{2})=1$
 for all
$\mathbb{E}(|c_{m}(X)|^{2})=1$
 for all 
 $1\leq m\leq d$
. Various other properties of the characteristic polynomial of a random unitary matrix X have been studied extensively (see, e.g., [Reference Keating and SnaithKS00, Reference Hughes, Keating and O’ConnellHKO01, Reference Conrey, Farmer, Keating, Rubinstein and SnaithCFK+03, Reference Bump and GamburdBG06, Reference Diaconis and GamburdDG06, Reference Bourgade, Hughes, Nikeghbali and YorBHNY08, Reference Arguin, Belius and BourgadeABB17, Reference Chhaibi, Madaule and NajnudelCMN18, Reference Paquette and ZeitouniPZ17]).
$1\leq m\leq d$
. Various other properties of the characteristic polynomial of a random unitary matrix X have been studied extensively (see, e.g., [Reference Keating and SnaithKS00, Reference Hughes, Keating and O’ConnellHKO01, Reference Conrey, Farmer, Keating, Rubinstein and SnaithCFK+03, Reference Bump and GamburdBG06, Reference Diaconis and GamburdDG06, Reference Bourgade, Hughes, Nikeghbali and YorBHNY08, Reference Arguin, Belius and BourgadeABB17, Reference Chhaibi, Madaule and NajnudelCMN18, Reference Paquette and ZeitouniPZ17]).
 Diaconis and Shahshahani [Reference Diaconis and ShahshahaniDS94] have shown that, for a fixed 
 $m\in\mathbb{N}$
, the sequence of random variables
$m\in\mathbb{N}$
, the sequence of random variables 
 $\mathrm{tr}(X),\mathrm{tr}(X^{2}),\ldots,\mathrm{tr}(X^{m})$
 converges in distribution, as
$\mathrm{tr}(X),\mathrm{tr}(X^{2}),\ldots,\mathrm{tr}(X^{m})$
 converges in distribution, as 
 $d\rightarrow\infty$
, to a sequence of independent complex normal random variables. For the proof, which relies on the moment method, they computed the joint moments of those random variables and showed that
$d\rightarrow\infty$
, to a sequence of independent complex normal random variables. For the proof, which relies on the moment method, they computed the joint moments of those random variables and showed that 
 \begin{equation}\mathbb{E}\bigg(\prod_{j=1}^{m}\mathrm{tr}(X^{j})^{a_{j}}\,\mathrm{tr}(\overline{X}^{j})^{b_{j}}\bigg)=\delta_{a,b}\prod_{j=1}^{m}j^{a_{j}}a_{j}!,\end{equation}
\begin{equation}\mathbb{E}\bigg(\prod_{j=1}^{m}\mathrm{tr}(X^{j})^{a_{j}}\,\mathrm{tr}(\overline{X}^{j})^{b_{j}}\bigg)=\delta_{a,b}\prod_{j=1}^{m}j^{a_{j}}a_{j}!,\end{equation}
 for 
 $d\geq\sum_{j=1}^{m}(a_{j}+b_{j})j$
. The rate of convergence was later shown to be super-exponential by Johansson [Reference JohanssonJoh97].
$d\geq\sum_{j=1}^{m}(a_{j}+b_{j})j$
. The rate of convergence was later shown to be super-exponential by Johansson [Reference JohanssonJoh97].
 When 
 $w=x^{\ell}$
, (1.2) gives a formula for the moments of traces, and one can use Newton’s identities relating elementary symmetric polynomials and power sums, to deduce that
$w=x^{\ell}$
, (1.2) gives a formula for the moments of traces, and one can use Newton’s identities relating elementary symmetric polynomials and power sums, to deduce that 
 \[\mathbb{E}(|c_{m}(X^{\ell})|^{2})=\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w)|^{2})=\binom{\ell+m-1}{m},\]
\[\mathbb{E}(|c_{m}(X^{\ell})|^{2})=\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w)|^{2})=\binom{\ell+m-1}{m},\]
 for 
 $d\geq2m\ell$
 (see Appendix A). In [Reference RainsRai97, Reference RainsRai03], Rains partially extended (1.2) for small d and gave an explicit formula for the joint density of the eigenvalues of
$d\geq2m\ell$
 (see Appendix A). In [Reference RainsRai97, Reference RainsRai03], Rains partially extended (1.2) for small d and gave an explicit formula for the joint density of the eigenvalues of 
 $X^{\ell}$
 (see [Reference RainsRai03, Theorem 1.3]).
$X^{\ell}$
 (see [Reference RainsRai03, Theorem 1.3]).
 We now move to general words 
 $w\in F_{r}$
. The case
$w\in F_{r}$
. The case 
 $m=1$
, namely, the asymptotics as
$m=1$
, namely, the asymptotics as 
 $d\rightarrow\infty$
 of the distribution of the random variable
$d\rightarrow\infty$
 of the distribution of the random variable 
 $\mathrm{tr}(w(X_{1},\ldots,X_{r}))$
, was studied in the context of Voiculescu’s free probability (see, e.g., [Reference Voiculescu, Dykema and NicaVDN92, Reference Mingo and SpeicherMS17]). In particular, in [Reference VoiculescuVoi91, Reference RădulescuRăd06, Reference Mingo, ś niady and SpeicherMSS07] it was shown that, for a fixed
$\mathrm{tr}(w(X_{1},\ldots,X_{r}))$
, was studied in the context of Voiculescu’s free probability (see, e.g., [Reference Voiculescu, Dykema and NicaVDN92, Reference Mingo and SpeicherMS17]). In particular, in [Reference VoiculescuVoi91, Reference RădulescuRăd06, Reference Mingo, ś niady and SpeicherMSS07] it was shown that, for a fixed 
 $w\in F_{r}$
, the sequence of random variables
$w\in F_{r}$
, the sequence of random variables 
 $\mathrm{tr}(w(X_{1},\ldots,X_{r}))$
, for
$\mathrm{tr}(w(X_{1},\ldots,X_{r}))$
, for 
 $d=1,2,\ldots$
, converges in distribution, as
$d=1,2,\ldots$
, converges in distribution, as 
 $d\rightarrow\infty$
, to a complex normal random variable (with suitable normalization). As a direct consequence, for a fixed
$d\rightarrow\infty$
, to a complex normal random variable (with suitable normalization). As a direct consequence, for a fixed 
 $m\in\mathbb{N}$
, the random variables
$m\in\mathbb{N}$
, the random variables 
 $c_{m}(w(X_{1},\ldots,X_{r}))$
 converge, as
$c_{m}(w(X_{1},\ldots,X_{r}))$
 converge, as 
 $d\rightarrow\infty$
, to a certain explicit polynomial of Gaussian random variables. This is done in Appendix A, Corollary A.4, following [Reference Diaconis and GamburdDG06].
$d\rightarrow\infty$
, to a certain explicit polynomial of Gaussian random variables. This is done in Appendix A, Corollary A.4, following [Reference Diaconis and GamburdDG06].
 In [Reference Magee and PuderMP19], Magee and Puder have shown that 
 $\mathbb{E}(\mathrm{tr}(w(X_{1},\ldots,X_{r})))$
 coincides with a rational function of d, if d is sufficiently large, and bounded its degree in terms of the commutator length of w. They also found a geometric interpretation for the coefficients of the expansion of that rational function as a power series in
$\mathbb{E}(\mathrm{tr}(w(X_{1},\ldots,X_{r})))$
 coincides with a rational function of d, if d is sufficiently large, and bounded its degree in terms of the commutator length of w. They also found a geometric interpretation for the coefficients of the expansion of that rational function as a power series in 
 $d^{-1}$
, see [Reference Magee and PuderMP19, Corollaries 1.8 and 1.11]. See [Reference BrodskyBro24] for additional work in this direction.
$d^{-1}$
, see [Reference Magee and PuderMP19, Corollaries 1.8 and 1.11]. See [Reference BrodskyBro24] for additional work in this direction.
1.2 Ideas of proofs
With a few exceptions, the results stated in § 1.1 are asymptotic in d, but not uniform in both m and d. We try to explain some of the challenges in proving results that are uniform in m, while explaining the idea of the proof of Theorem 1.1.
Our main tool (which is also used in the papers [Reference RădulescuRăd06, Reference Mingo, ś niady and SpeicherMSS07, Reference Magee and PuderMP19] cited previously) to study integrals on unitary groups is the Weingarten calculus [Reference WeingartenWei78, Reference CollinsCol03, Reference Collins and ŚniadyCS06]. Roughly speaking, the Weingarten calculus utilizes the Schur–Weyl duality to express integrals on unitary groups as sums of so-called Weingarten functions over symmetric groups. In our case, in order to prove Theorem 1.1, we need to estimate the integral
 \begin{equation}\mathbb{E}(|c_{m}(w)|^{2})=\int_{\mathrm{U}_{d}^{r}}\bigg|\mathrm{tr}\bigg(\bigwedge^{\!m}w(X_{1},\ldots,X_{r})\bigg)\!\bigg|^{2}\,dX_{1},\ldots,dX_{r}.\end{equation}
\begin{equation}\mathbb{E}(|c_{m}(w)|^{2})=\int_{\mathrm{U}_{d}^{r}}\bigg|\mathrm{tr}\bigg(\bigwedge^{\!m}w(X_{1},\ldots,X_{r})\bigg)\!\bigg|^{2}\,dX_{1},\ldots,dX_{r}.\end{equation}
Using Weingarten calculus (Theorem 2.12), we express (1.3) as a finite sum
 \begin{equation}\sum_{(\pi_{1},\ldots,\pi_{2r})\in\prod_{i=1}^{2r}S_{m\ell_{i}}}F(\pi_{1},\ldots,\pi_{2r})\prod_{i=1}^{r}\mathrm{Wg}_{d}^{(i)}(\pi_{i}\pi_{i+r}^{-1}),\end{equation}
\begin{equation}\sum_{(\pi_{1},\ldots,\pi_{2r})\in\prod_{i=1}^{2r}S_{m\ell_{i}}}F(\pi_{1},\ldots,\pi_{2r})\prod_{i=1}^{r}\mathrm{Wg}_{d}^{(i)}(\pi_{i}\pi_{i+r}^{-1}),\end{equation}
 where 
 $\ell_{1},\ldots,\ell_{2r}\in\mathbb{N}$
 and
$\ell_{1},\ldots,\ell_{2r}\in\mathbb{N}$
 and 
 $F:\prod_{i=1}^{2r}S_{m\ell_{i}}\rightarrow\mathbb{Z}$
 are related to combinatorial properties of w, and each
$F:\prod_{i=1}^{2r}S_{m\ell_{i}}\rightarrow\mathbb{Z}$
 are related to combinatorial properties of w, and each 
 $\mathrm{Wg}_{d}^{(i)}:S_{m\ell_{i}}\rightarrow\mathbb{R}$
 is a Weingarten function (see Definition 2.10). There are two main difficulties when dealing with sums such as (1.4) in the region when m is unbounded.
$\mathrm{Wg}_{d}^{(i)}:S_{m\ell_{i}}\rightarrow\mathbb{R}$
 is a Weingarten function (see Definition 2.10). There are two main difficulties when dealing with sums such as (1.4) in the region when m is unbounded.
- 
(1) While the asymptotics of Weingarten functions  $\mathrm{Wg}_{d}:S_{m}\rightarrow\mathbb{R}$
 are well understood when $\mathrm{Wg}_{d}:S_{m}\rightarrow\mathbb{R}$
 are well understood when $d\gg m$
 (see [Reference CollinsCol03, Section 2.2] and [Reference Collins and MatsumotoCM17, Theorem 1.1]), much less is known in the regime where m is comparable with d. $d\gg m$
 (see [Reference CollinsCol03, Section 2.2] and [Reference Collins and MatsumotoCM17, Theorem 1.1]), much less is known in the regime where m is comparable with d.
- 
(2) Even if we have a good understanding of a single Weingarten function, the number of summands in (1.4) is large and it is not enough to bound each individual Weingarten function. 
 Luckily, there are plenty of cancellations in the sum (1.4). To understand these cancellations, we identify a symmetry of (1.4). More precisely, we find a group H acting on 
 $\prod_{i=1}^{2r}S_{m\ell_{i}}$
 such that F is equivariant with respect to H, and such that the contribution of any H-orbit to the sum (1.4) is a product of terms, each of which has the form
$\prod_{i=1}^{2r}S_{m\ell_{i}}$
 such that F is equivariant with respect to H, and such that the contribution of any H-orbit to the sum (1.4) is a product of terms, each of which has the form 
 \begin{equation}\frac{1}{m!^{2\ell_{i}}}\sum_{h,h'\in S_{m}^{\ell_{i}}}\mathrm{sgn}(hh')\,\mathrm{Wg}_{d}^{(i)}(h'\pi_{i}h\pi_{i+r}^{-1}),\end{equation}
\begin{equation}\frac{1}{m!^{2\ell_{i}}}\sum_{h,h'\in S_{m}^{\ell_{i}}}\mathrm{sgn}(hh')\,\mathrm{Wg}_{d}^{(i)}(h'\pi_{i}h\pi_{i+r}^{-1}),\end{equation}
 where 
 $\mathrm{sgn}(x)$
 is the sign of x and the sum is over the Young subgroup
$\mathrm{sgn}(x)$
 is the sign of x and the sum is over the Young subgroup 
 $S_{m}^{\ell_{i}}\subseteq S_{m\ell_{i}}$
, see Corollary 5.3.
$S_{m}^{\ell_{i}}\subseteq S_{m\ell_{i}}$
, see Corollary 5.3.
 Weingarten functions are class functions, so they are linear combinations of irreducible characters of 
 $S_{m\ell_{i}}$
. Explicitly, we have (see [Reference Collins and ŚniadyCS06, (13)])
$S_{m\ell_{i}}$
. Explicitly, we have (see [Reference Collins and ŚniadyCS06, (13)]) 
 \begin{equation}\mathrm{Wg}_{d}^{(i)}(\sigma)=\frac{1}{(m\ell_{i})!^{2}}\sum_{\lambda\vdash m\ell_{i},\ell(\lambda)\leq d}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\chi_{\lambda}(\sigma),\quad \sigma\in S_{m\ell_{i}},\end{equation}
\begin{equation}\mathrm{Wg}_{d}^{(i)}(\sigma)=\frac{1}{(m\ell_{i})!^{2}}\sum_{\lambda\vdash m\ell_{i},\ell(\lambda)\leq d}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\chi_{\lambda}(\sigma),\quad \sigma\in S_{m\ell_{i}},\end{equation}
 where each 
 $\lambda$
 is a partition of
$\lambda$
 is a partition of 
 $m\ell_{i}$
 with at most d parts and
$m\ell_{i}$
 with at most d parts and 
 $\chi_{\lambda}$
 and
$\chi_{\lambda}$
 and 
 $\rho_{\lambda}$
 are the corresponding irreducible characters of
$\rho_{\lambda}$
 are the corresponding irreducible characters of 
 $S_{m\ell_{i}}$
 and
$S_{m\ell_{i}}$
 and 
 $\mathrm{U}_{d}$
, respectively. The cancellations that we get in the sum (1.5) come from averaging irreducible characters of
$\mathrm{U}_{d}$
, respectively. The cancellations that we get in the sum (1.5) come from averaging irreducible characters of 
 $S_{m\ell_{i}}$
 over
$S_{m\ell_{i}}$
 over 
 $S_{m}^{\ell_{i}}$
-cosets. Here
$S_{m}^{\ell_{i}}$
-cosets. Here 
 $S_{m}^{\ell_{i}}$
 is a large subgroup of
$S_{m}^{\ell_{i}}$
 is a large subgroup of 
 $S_{m\ell_{i}}$
, so these cancellations will be significant as well. For example, all terms in (1.6) for which
$S_{m\ell_{i}}$
, so these cancellations will be significant as well. For example, all terms in (1.6) for which 
 $\lambda$
 has more than
$\lambda$
 has more than 
 $\ell_{i}$
 columns vanish. See Lemmas 2.7 and 2.8 for the precise bounds.
$\ell_{i}$
 columns vanish. See Lemmas 2.7 and 2.8 for the precise bounds.
 After we bound the average contribution of each H-orbit in the sum (1.4) by a function C(m,d,w), we bound (1.4) by 
 $|Z|\cdot C(m,d,w)$
 for some finite set Z. This becomes a counting problem, which we solve in § 6, see Proposition 6.1.
$|Z|\cdot C(m,d,w)$
 for some finite set Z. This becomes a counting problem, which we solve in § 6, see Proposition 6.1.
The proof of Theorem 1.1 occupies §§ 4, 5, 6 and 7. Since the combinatorics of general words is a bit complicated, we prove a simplified version of Theorem 1.3 for the special case of the Engel word [[x,y],y] in § 3. The proof for this special case contains the main ideas of the paper, while being easier to understand.
1.3 Further discussion and some open questions
The results of this paper fit in the larger framework of the study of word measures and their Fourier coefficients.
 Let G be a compact group, and let 
 $\mu_{G}$
 be the Haar probability measure on G. To each word
$\mu_{G}$
 be the Haar probability measure on G. To each word 
 $w(x_{1},\ldots,x_{r})\in F_{r}$
 we associate the corresponding word map
$w(x_{1},\ldots,x_{r})\in F_{r}$
 we associate the corresponding word map 
 $w_{G}:G^{r}\rightarrow G$
, defined by
$w_{G}:G^{r}\rightarrow G$
, defined by 
 $(g_{1},\ldots,g_{r})\mapsto w(g_{1},\ldots,g_{r})$
. The pushforward measure
$(g_{1},\ldots,g_{r})\mapsto w(g_{1},\ldots,g_{r})$
. The pushforward measure 
 $(w_{G})_{*}(\mu_{G}^{r})$
 is called the word measure
$(w_{G})_{*}(\mu_{G}^{r})$
 is called the word measure 
 $\tau_{w,G}$
 associated with w and G. Let
$\tau_{w,G}$
 associated with w and G. Let 
 $\mathrm{Irr}(G)$
 be the set of irreducible characters of G. The Fourier coefficient of
$\mathrm{Irr}(G)$
 be the set of irreducible characters of G. The Fourier coefficient of 
 $\tau_{w,G}$
 at
$\tau_{w,G}$
 at 
 $\rho\in\mathrm{Irr}(G)$
 is
$\rho\in\mathrm{Irr}(G)$
 is 
 \begin{equation}a_{w,G,\rho}:=\int_{G^{r}}\rho(w(x_{1},\ldots,x_{r}))\mu_{G}^{r}=\int_{G}\rho(y)\tau_{w,G}.\end{equation}
\begin{equation}a_{w,G,\rho}:=\int_{G^{r}}\rho(w(x_{1},\ldots,x_{r}))\mu_{G}^{r}=\int_{G}\rho(y)\tau_{w,G}.\end{equation}
 If 
 $w\neq1$
 and G is a compact semisimple Lie group, then by Borel’s theorem [Reference BorelBor83], the map
$w\neq1$
 and G is a compact semisimple Lie group, then by Borel’s theorem [Reference BorelBor83], the map 
 $w_{G}:G^{r}\rightarrow G$
 is a submersion outside a proper subvariety in
$w_{G}:G^{r}\rightarrow G$
 is a submersion outside a proper subvariety in 
 $G^{r}$
. It follows that
$G^{r}$
. It follows that 
 $\tau_{w,G}$
 is absolutely continuous with respect to
$\tau_{w,G}$
 is absolutely continuous with respect to 
 $\mu_{G}$
 and, therefore,
$\mu_{G}$
 and, therefore, 
 $\tau_{w,G}=f_{w,G}\cdot \mu_{G}$
, where
$\tau_{w,G}=f_{w,G}\cdot \mu_{G}$
, where 
 $f_{w,G}\in L^{1}(G)$
 is the Radon–Nikodym density. In this case,
$f_{w,G}\in L^{1}(G)$
 is the Radon–Nikodym density. In this case, 
 $f_{w,G}=\sum_{\rho\in\mathrm{Irr}(G)}\overline{a_{w,G,\rho}}\cdot\rho$
.
$f_{w,G}=\sum_{\rho\in\mathrm{Irr}(G)}\overline{a_{w,G,\rho}}\cdot\rho$
.
 In [Reference Larsen, Shalev and TiepLST19, Theorem 4], Larsen, Shalev, and Tiep proved uniform 
 $L^{\infty}$
-mixing time for convolutions of word measures on sufficiently large finite simple groups. From this, the following can be deduced.
$L^{\infty}$
-mixing time for convolutions of word measures on sufficiently large finite simple groups. From this, the following can be deduced.
Theorem 1.6. For every 
 $w\in F_{r}$
, there exists
$w\in F_{r}$
, there exists 
 $N(w)\in\mathbb{N}$
 such that if G is a finite simple group with at least N(w) elements, then
$N(w)\in\mathbb{N}$
 such that if G is a finite simple group with at least N(w) elements, then 
 \begin{equation}|a_{w,G,\rho}|\leq(\dim\rho)^{1-\epsilon(w)},\end{equation}
\begin{equation}|a_{w,G,\rho}|\leq(\dim\rho)^{1-\epsilon(w)},\end{equation}
 for 
 $\epsilon(w)=C\cdot\ell(w)^{-4}$
 and some absolute constant C.
$\epsilon(w)=C\cdot\ell(w)^{-4}$
 and some absolute constant C.
The proof of Theorem 1.6 is given at the end of § 7.
We believe that a similar statement should be true for compact semisimple Lie groups.
Conjecture 1.7. For every 
 $1\neq w\in F_{r}$
, there exists
$1\neq w\in F_{r}$
, there exists 
 $\epsilon(w)>0$
 such that, for every compact connected semisimple Lie group G and every
$\epsilon(w)>0$
 such that, for every compact connected semisimple Lie group G and every 
 $\rho\in\mathrm{Irr}(G)$
,
$\rho\in\mathrm{Irr}(G)$
, 
 \[|a_{w,G,\rho}|\leq(\dim\rho)^{1-\epsilon(w)}.\]
\[|a_{w,G,\rho}|\leq(\dim\rho)^{1-\epsilon(w)}.\]
 It is natural to estimate 
 $\epsilon(w)$
 in terms of the length
$\epsilon(w)$
 in terms of the length 
 $\ell(w)$
 of the word w. For simple groups of bounded rank, item (2) of Remark 1.2 (i.e. [Reference Elkasapy and ThomET15, Theorem 5.2]) shows that there are arbitrarily long words w for which
$\ell(w)$
 of the word w. For simple groups of bounded rank, item (2) of Remark 1.2 (i.e. [Reference Elkasapy and ThomET15, Theorem 5.2]) shows that there are arbitrarily long words w for which 
 $\epsilon(w)$
 cannot be larger than
$\epsilon(w)$
 cannot be larger than 
 $e^{-\sqrt{\ell(w)}}$
. However, we believe that better Fourier decay can be achieved for the high-rank case.
$e^{-\sqrt{\ell(w)}}$
. However, we believe that better Fourier decay can be achieved for the high-rank case.
 
Question 1.8. Can one take 
 $\epsilon(w)$
 to be a polynomial in
$\epsilon(w)$
 to be a polynomial in 
 $\ell(w)$
, if
$\ell(w)$
, if 
 $\mathrm{rk}(G)\gg_{\ell(w)}1$
?
$\mathrm{rk}(G)\gg_{\ell(w)}1$
?
 Theorem 1.1 gives evidence to Conjecture 1.7 for 
 $G=\mathrm{SU}_{d}$
 and the collection of fundamental representations
$G=\mathrm{SU}_{d}$
 and the collection of fundamental representations 
 $\big\{ \bigwedge^{\!m}\mathbb{C}^{d}\big\}_{m=1}^{d}$
. Indeed, for every
$\big\{ \bigwedge^{\!m}\mathbb{C}^{d}\big\}_{m=1}^{d}$
. Indeed, for every 
 $\rho\in\mathrm{Irr}(\mathrm{U}_{d})$
, since
$\rho\in\mathrm{Irr}(\mathrm{U}_{d})$
, since 
 $|\rho(\lambda A)|=|\rho(A)|$
 for
$|\rho(\lambda A)|=|\rho(A)|$
 for 
 $A\in\mathrm{SU}_{d}$
 and
$A\in\mathrm{SU}_{d}$
 and 
 $\lambda\in\mathrm{U}_{1}$
, and since
$\lambda\in\mathrm{U}_{1}$
, and since 
 $\mu_{\mathrm{U}_{d}}$
 is the pushforward of
$\mu_{\mathrm{U}_{d}}$
 is the pushforward of 
 $\mu_{\mathrm{U}_{1}}\times\mu_{\mathrm{SU}_{d}}$
 by the multiplication map
$\mu_{\mathrm{U}_{1}}\times\mu_{\mathrm{SU}_{d}}$
 by the multiplication map 
 $(\lambda,A)\mapsto\lambda A$
, we have
$(\lambda,A)\mapsto\lambda A$
, we have 
 \begin{align}|a_{w,\mathrm{SU}_{d},\rho}|^{2}&\leq\mathbb{E}_{X_{1},\ldots,X_{r}\in\mathrm{SU}_{d}}(|\rho(w(X_{1},\ldots,X_{r}))|^{2})\nonumber\\&=\mathbb{E}_{(\lambda_{1},X_{1}),\ldots,(\lambda_{r},X_{r})\in\mathrm{SU}_{d}\times\mathrm{U}_{1}}(|\rho(w(\lambda_{1},\ldots,\lambda_{r})w(X_{1},\ldots,X_{r}))|^{2})\nonumber\\&= \mathbb{E}_{(\lambda_{1},X_{1}),\ldots,(\lambda_{r},X_{r})\in\mathrm{SU}_{d}\times\mathrm{U}_{1}}(|\rho(w(\lambda_{1}X_{1},\ldots,\lambda_{r}X_{r}))|^{2})\nonumber\\&=\mathbb{E}_{\mathrm{U}_{d}}(|\rho(w(X_{1},\ldots,X_{r}))|^{2}).\end{align}
\begin{align}|a_{w,\mathrm{SU}_{d},\rho}|^{2}&\leq\mathbb{E}_{X_{1},\ldots,X_{r}\in\mathrm{SU}_{d}}(|\rho(w(X_{1},\ldots,X_{r}))|^{2})\nonumber\\&=\mathbb{E}_{(\lambda_{1},X_{1}),\ldots,(\lambda_{r},X_{r})\in\mathrm{SU}_{d}\times\mathrm{U}_{1}}(|\rho(w(\lambda_{1},\ldots,\lambda_{r})w(X_{1},\ldots,X_{r}))|^{2})\nonumber\\&= \mathbb{E}_{(\lambda_{1},X_{1}),\ldots,(\lambda_{r},X_{r})\in\mathrm{SU}_{d}\times\mathrm{U}_{1}}(|\rho(w(\lambda_{1}X_{1},\ldots,\lambda_{r}X_{r}))|^{2})\nonumber\\&=\mathbb{E}_{\mathrm{U}_{d}}(|\rho(w(X_{1},\ldots,X_{r}))|^{2}).\end{align}
 Theorem 1.4 deals with another family of irreducible representations 
 $\{ \mathrm{Sym}^{m}\mathbb{C}^{d}\}_{m=1}^{\lfloor d/(16\ell)^{\ell}\rfloor }$
, giving further evidence for Conjecture 1.7.
$\{ \mathrm{Sym}^{m}\mathbb{C}^{d}\}_{m=1}^{\lfloor d/(16\ell)^{\ell}\rfloor }$
, giving further evidence for Conjecture 1.7.
 Verifying Conjecture 1.7 will imply that, for every word w, the random walks induced by the collection of measures 
 $\{\tau_{w,G}\}_{G}$
, where G runs over all compact connected simple Lie groups, admit a uniform
$\{\tau_{w,G}\}_{G}$
, where G runs over all compact connected simple Lie groups, admit a uniform 
 $L^{\infty}$
-mixing time. Namely, using [Reference Guralnick, Larsen and ManackGLM12, Theorem 1], it will show the existence of
$L^{\infty}$
-mixing time. Namely, using [Reference Guralnick, Larsen and ManackGLM12, Theorem 1], it will show the existence of 
 $t(w)\in\mathbb{N}$
 such that
$t(w)\in\mathbb{N}$
 such that 
 \begin{equation}\bigg\Vert \frac{\tau_{w,G}^{*t(w)}}{\mu_{G}}-1\bigg\Vert_{\infty}<1/2,\end{equation}
\begin{equation}\bigg\Vert \frac{\tau_{w,G}^{*t(w)}}{\mu_{G}}-1\bigg\Vert_{\infty}<1/2,\end{equation}
 for every compact connected simple Lie group G. By the above discussion, t(w) grows at least exponentially with 
 $\sqrt{\ell(w)}$
 under no restriction on the rank. If the condition (1.10) is replaced by the condition that
$\sqrt{\ell(w)}$
 under no restriction on the rank. If the condition (1.10) is replaced by the condition that 
 $\tau_{w,G}^{*t(w)}$
 has bounded density, one might hope for polynomial bounds.
$\tau_{w,G}^{*t(w)}$
 has bounded density, one might hope for polynomial bounds.
 
Question 1.9. Let 
 $1\neq w\in F_{r}$
. Can one find
$1\neq w\in F_{r}$
. Can one find 
 $t(w)\in\mathbb{N}$
 such that for every compact connected semisimple Lie group G,
$t(w)\in\mathbb{N}$
 such that for every compact connected semisimple Lie group G, 
 $\tau_{w,G}^{*t(w)}$
 has bounded density with respect to
$\tau_{w,G}^{*t(w)}$
 has bounded density with respect to 
 $\mu_{G}$
? Can t(w) be chosen to have polynomial dependence on
$\mu_{G}$
? Can t(w) be chosen to have polynomial dependence on 
 $\ell(w)$
?
$\ell(w)$
?
 Question 1.9 can be seen as an analytic specialization of a geometric phenomenon. Let 
 $\varphi:X\rightarrow Y$
 be a polynomial map between smooth
$\varphi:X\rightarrow Y$
 be a polynomial map between smooth 
 $\mathbb{Q}$
-varieties. We say that
$\mathbb{Q}$
-varieties. We say that 
 $\varphi$
 is (FRS) if it is flat and its fibers all have rational singularities. In [Reference Aizenbud and AvniAA16, Theorem 3.4], Aizenbud and the first author showed that if
$\varphi$
 is (FRS) if it is flat and its fibers all have rational singularities. In [Reference Aizenbud and AvniAA16, Theorem 3.4], Aizenbud and the first author showed that if 
 $\varphi$
 is (FRS), then for every non-Archimedean local field F and every smooth, compactly supported measure
$\varphi$
 is (FRS), then for every non-Archimedean local field F and every smooth, compactly supported measure 
 $\mu$
 on X(F), the pushforward
$\mu$
 on X(F), the pushforward 
 $\varphi_{*}\mu$
 has bounded density. This result was extended in [Reference ReiserRei18] to the Archimedean case,
$\varphi_{*}\mu$
 has bounded density. This result was extended in [Reference ReiserRei18] to the Archimedean case, 
 $F=\mathbb{R}$
 or
$F=\mathbb{R}$
 or 
 $\mathbb{C}$
, and, moreover, if one runs over a large enough family of local fields, the condition of (FRS) is, in fact, necessary as well for the densities of pushforwards to be bounded (see [Reference Aizenbud and AvniAA16, Theorem 3.4] and [Reference Glazer, Hendel and SodinGHS24, Corollary 6.2]).
$\mathbb{C}$
, and, moreover, if one runs over a large enough family of local fields, the condition of (FRS) is, in fact, necessary as well for the densities of pushforwards to be bounded (see [Reference Aizenbud and AvniAA16, Theorem 3.4] and [Reference Glazer, Hendel and SodinGHS24, Corollary 6.2]).
To rephrase Question 1.9 in geometric term, we further need the following notion from [Reference Glazer and HendelGH19, Reference Glazer and HendelGH21].
Definition 1.10. [Reference Glazer and HendelGH19, Definition 1.1]. Let 
 $\varphi:X\rightarrow G$
 and
$\varphi:X\rightarrow G$
 and 
 $\psi:Y\rightarrow G$
 be morphisms from algebraic varieties X,Y to an algebraic group G. We define their convolution by
$\psi:Y\rightarrow G$
 be morphisms from algebraic varieties X,Y to an algebraic group G. We define their convolution by 
 \[\varphi*\psi:X\times Y\rightarrow G,\quad (x,y)\mapsto\varphi(x)\cdot\psi(y).\]
\[\varphi*\psi:X\times Y\rightarrow G,\quad (x,y)\mapsto\varphi(x)\cdot\psi(y).\]
 We denote by 
 $\varphi^{*k}:X^{k}\rightarrow G$
 the k-fold convolution of
$\varphi^{*k}:X^{k}\rightarrow G$
 the k-fold convolution of 
 $\varphi$
 with itself.
$\varphi$
 with itself.
Based on the above discussion, a positive answer to the following question will answer Question 1.9 positively.
 
Question 1.11. [Reference Glazer and HendelGH24, Question 1.15]. Can we find 
 $\alpha,C>0$
 such that, for every
$\alpha,C>0$
 such that, for every 
 $w\in F_{r}$
 of length
$w\in F_{r}$
 of length 
 $\ell$
 and every simple algebraic group G, the word map
$\ell$
 and every simple algebraic group G, the word map 
 $w_{G}^{*C\ell^{\alpha}}$
 is (FRS)?
$w_{G}^{*C\ell^{\alpha}}$
 is (FRS)?
 In [Reference Glazer and HendelGH19, Reference Glazer and HendelGH21], the second author and Hendel have shown that any dominant map 
 $\varphi:X\rightarrow G$
 from a smooth variety to a connected algebraic group becomes (FRS) after sufficiently many self-convolutions. Concrete bounds were given in [Reference Glazer, Hendel and SodinGHS24, Corollary 1.9]. Based on these results, we prove Conjecture 1.7 and answer Question 1.9 for the bounded rank case (see Proposition 7.2).
$\varphi:X\rightarrow G$
 from a smooth variety to a connected algebraic group becomes (FRS) after sufficiently many self-convolutions. Concrete bounds were given in [Reference Glazer, Hendel and SodinGHS24, Corollary 1.9]. Based on these results, we prove Conjecture 1.7 and answer Question 1.9 for the bounded rank case (see Proposition 7.2).
To conclude the discussion, we remark that a positive answer for Question 1.11 will answer Question 1.9 for compact semisimple p-adic groups as well. Significant progress was made in this direction in the work [Reference Glazer and HendelGH24], by the second author and Hendel, where singularities of word maps on semisimple Lie algebras and algebraic groups were studied.
1.4 Conventions and notation
- 
(1) We denote the set  $\{1,\ldots,N\}$
 by [N]. $\{1,\ldots,N\}$
 by [N].
- 
(2) For a finite set X, we denote the symmetric group on X by  $\mathrm{Sym}(X)$
 and the space of functions $\mathrm{Sym}(X)$
 and the space of functions $f:X\rightarrow\mathbb{C}$
 by $f:X\rightarrow\mathbb{C}$
 by $\mathbb{C}[X]$
. $\mathbb{C}[X]$
.
- 
(3) We write  $(-1)^{\sigma}$
 for the sign of a permutation $(-1)^{\sigma}$
 for the sign of a permutation $\sigma$
. $\sigma$
.
- 
(4) For a group G, a representation is a pair  $(\pi,V)$
, with $(\pi,V)$
, with $\pi:G\rightarrow\mathrm{GL}(V)$
 a homomorphism. We denote the character of $\pi:G\rightarrow\mathrm{GL}(V)$
 a homomorphism. We denote the character of $(\pi,V)$
 by $(\pi,V)$
 by $\chi_{\pi}$
 and denote its dual by $\chi_{\pi}$
 and denote its dual by $(\pi^{\vee},V^{\vee})$
. $(\pi^{\vee},V^{\vee})$
.
2. Preliminaries
2.1 Some facts in representation theory
 For a compact group G, we denote the set of irreducible complex characters of G by 
 $\mathrm{Irr}(G)$
. Given a subgroup
$\mathrm{Irr}(G)$
. Given a subgroup 
 $H\leq G$
 and a character
$H\leq G$
 and a character 
 $\chi\in\mathrm{Irr}(H)$
, we denote the induction of
$\chi\in\mathrm{Irr}(H)$
, we denote the induction of 
 $\chi$
 to G by
$\chi$
 to G by 
 $\mathrm{Ind}_{H}^{G}\chi$
. We normalize the Haar measure to be a probability measure and denote the expectation with respect to the Haar measure by
$\mathrm{Ind}_{H}^{G}\chi$
. We normalize the Haar measure to be a probability measure and denote the expectation with respect to the Haar measure by 
 $\mathbb{E}$
. The standard inner product on functions on G is
$\mathbb{E}$
. The standard inner product on functions on G is 
 $\langle f_{1},f_{2}\rangle_{G}=\mathbb{E}f_{1}\overline{f_{2}}$
.
$\langle f_{1},f_{2}\rangle_{G}=\mathbb{E}f_{1}\overline{f_{2}}$
.
2.1.1 Representation theory of the symmetric group
 Given 
 $m\in\mathbb{N}$
, a partition of m is a non-increasing sequence
$m\in\mathbb{N}$
, a partition of m is a non-increasing sequence 
 $\lambda=(\lambda_{1},\ldots,\lambda_{k})$
 of non-negative integers that sum to m. In this case, we write
$\lambda=(\lambda_{1},\ldots,\lambda_{k})$
 of non-negative integers that sum to m. In this case, we write 
 $\lambda\vdash m$
. Two partitions are equivalent if they differ only by a string of zeros at the end. A partition
$\lambda\vdash m$
. Two partitions are equivalent if they differ only by a string of zeros at the end. A partition 
 $\lambda=(\lambda_{1},\ldots,\lambda_{k})$
, with
$\lambda=(\lambda_{1},\ldots,\lambda_{k})$
, with 
 $\lambda_{k}>0$
, is graphically encoded by a Young diagram, which is a finite collection of boxes (or cells) arranged in k left-justified rows, where the jth row has
$\lambda_{k}>0$
, is graphically encoded by a Young diagram, which is a finite collection of boxes (or cells) arranged in k left-justified rows, where the jth row has 
 $\lambda_{j}$
 boxes. The length
$\lambda_{j}$
 boxes. The length 
 $\ell(\lambda)$
 of a partition
$\ell(\lambda)$
 of a partition 
 $\lambda\vdash m$
 is the number of non-zero parts
$\lambda\vdash m$
 is the number of non-zero parts 
 $\lambda_{i}$
 or, equivalently, the number of rows in the corresponding Young diagram.
$\lambda_{i}$
 or, equivalently, the number of rows in the corresponding Young diagram.
 The irreducible representations of 
 $S_{m}$
 are in bijection with partitions
$S_{m}$
 are in bijection with partitions 
 $\lambda\vdash m$
. We write
$\lambda\vdash m$
. We write 
 $\chi_{\lambda}\in\mathrm{Irr}(S_{m})$
 for the corresponding character. For each cell (i,j) in the Young diagram of
$\chi_{\lambda}\in\mathrm{Irr}(S_{m})$
 for the corresponding character. For each cell (i,j) in the Young diagram of 
 $\lambda$
, the hook length
$\lambda$
, the hook length 
 $h_{\lambda}(i,j)$
 is the number of cells (a,b) in the Young diagram of
$h_{\lambda}(i,j)$
 is the number of cells (a,b) in the Young diagram of 
 $\lambda$
 such that either
$\lambda$
 such that either 
 $a=i$
 and
$a=i$
 and 
 $b\geq j$
, or
$b\geq j$
, or 
 $a\geq i$
 and
$a\geq i$
 and 
 $b=j$
. The hook-length formula states that
$b=j$
. The hook-length formula states that 
 \begin{equation}\chi_{\lambda}(1)=\frac{m!}{\prod_{(i,j)\in\lambda}h_{\lambda}(i,j)}.\end{equation}
\begin{equation}\chi_{\lambda}(1)=\frac{m!}{\prod_{(i,j)\in\lambda}h_{\lambda}(i,j)}.\end{equation}
Definition 2.1.
- 
(1) Fix a Young diagram  $\lambda$
 and let $\lambda$
 and let $n\in\mathbb{N}$
. An n-expansion of $n\in\mathbb{N}$
. An n-expansion of $\lambda$
 is any Young diagram obtained by adding n boxes to $\lambda$
 is any Young diagram obtained by adding n boxes to $\lambda$
 in such a way that no two boxes are added in the same column. $\lambda$
 in such a way that no two boxes are added in the same column.
- 
(2) Given a partition  $\lambda=(\lambda_{1},\ldots,\lambda_{l_{1}})\vdash k$
 and a partition $\lambda=(\lambda_{1},\ldots,\lambda_{l_{1}})\vdash k$
 and a partition $\mu=(\mu_{1},\ldots,\mu_{l_{2}})\vdash l$
, a $\mu=(\mu_{1},\ldots,\mu_{l_{2}})\vdash l$
, a $\mu$
-expansion of $\mu$
-expansion of $\,\lambda$
 is defined to be a $\,\lambda$
 is defined to be a $\mu_{l_{2}}$
-expansion of a $\mu_{l_{2}}$
-expansion of a $\mu_{l_{2}-1}$
-expansion of a $\mu_{l_{2}-1}$
-expansion of a $\cdots$
 of a $\cdots$
 of a $\mu_{1}$
-expansion of the Young diagram of $\mu_{1}$
-expansion of the Young diagram of $\lambda$
. For a $\lambda$
. For a $\mu$
-expansion of $\mu$
-expansion of $\lambda$
, we label the boxes added in the $\lambda$
, we label the boxes added in the $\mu_{l_{j}}$
-expansion by the number j and order the boxes lexicographically by their position, first from top to bottom and then from right to left. We say that a $\mu_{l_{j}}$
-expansion by the number j and order the boxes lexicographically by their position, first from top to bottom and then from right to left. We say that a $\mu$
-expansion of $\mu$
-expansion of $\lambda$
 is strict if, for every $\lambda$
 is strict if, for every $p\in\{1,\ldots,l_{2}-1\}$
 and every box t, the number of boxes coming before t that are labeled p is greater than or equal to the number of boxes coming before t that are labeled $p\in\{1,\ldots,l_{2}-1\}$
 and every box t, the number of boxes coming before t that are labeled p is greater than or equal to the number of boxes coming before t that are labeled $(p+1)$
. $(p+1)$
.
Theorem 2.2 (Littlewood–Richardson rule [Reference MacdonaldMac95, I.9]). Let 
 $\lambda\vdash k$
 and
$\lambda\vdash k$
 and 
 $\mu\vdash m$
. Then,
$\mu\vdash m$
. Then, 
 \[\mathrm{Ind}_{\mathrm{S}_{k}\times S_{m}}^{S_{k+m}}(\chi_{\lambda}\otimes\chi_{\mu})=\bigoplus_{\nu\vdash k+m}N_{\lambda\mu\nu}\chi_{\nu},\]
\[\mathrm{Ind}_{\mathrm{S}_{k}\times S_{m}}^{S_{k+m}}(\chi_{\lambda}\otimes\chi_{\mu})=\bigoplus_{\nu\vdash k+m}N_{\lambda\mu\nu}\chi_{\nu},\]
 where 
 $N_{\lambda\mu\nu}$
 is the number of strict
$N_{\lambda\mu\nu}$
 is the number of strict 
 $\mu$
-expansions of
$\mu$
-expansions of 
 $\lambda$
 that are a Young diagram of the partition
$\lambda$
 that are a Young diagram of the partition 
 $\nu$
.
$\nu$
.
We need the following consequence of Theorem 2.2.
Lemma 2.3. Let 
 $l\in\mathbb{Z}_{\geq2}$
 and identify
$l\in\mathbb{Z}_{\geq2}$
 and identify 
 $S_{m}^{l}$
 with its image in the standard embedding
$S_{m}^{l}$
 with its image in the standard embedding 
 $S_{m}^{l}\hookrightarrow S_{ml}$
. Then, each
$S_{m}^{l}\hookrightarrow S_{ml}$
. Then, each 
 $\chi_{\nu}\in\mathrm{Irr}(S_{ml})$
 appearing in
$\chi_{\nu}\in\mathrm{Irr}(S_{ml})$
 appearing in 
 $\mathrm{Ind}_{\mathrm{S}_{m}^{l}}^{S_{ml}}(1)$
 (respectively,
$\mathrm{Ind}_{\mathrm{S}_{m}^{l}}^{S_{ml}}(1)$
 (respectively, 
 $\mathrm{Ind}_{\mathrm{S}_{m}^{l}}^{S_{ml}}(\mathrm{sgn})$
) corresponds to a partition
$\mathrm{Ind}_{\mathrm{S}_{m}^{l}}^{S_{ml}}(\mathrm{sgn})$
) corresponds to a partition 
 $\nu\vdash ml$
 with at most l rows (respectively, l columns).
$\nu\vdash ml$
 with at most l rows (respectively, l columns).
Proof. We prove the statement for the trivial representation 1 by induction on l. The proof for sgn is similar. The character 1 of 
 $S_{m}$
 corresponds to the partition
$S_{m}$
 corresponds to the partition 
 $\lambda$
 consisting of one row of length m. By the induction hypothesis, we may assume that
$\lambda$
 consisting of one row of length m. By the induction hypothesis, we may assume that 
 $\mathrm{Ind}_{\mathrm{S}_{m}^{j}}^{S_{mj}}(1)=\bigoplus_{\mu\vdash mj}m_{\mu}\chi_{\mu}$
, with
$\mathrm{Ind}_{\mathrm{S}_{m}^{j}}^{S_{mj}}(1)=\bigoplus_{\mu\vdash mj}m_{\mu}\chi_{\mu}$
, with 
 $m_{\mu}>0$
 only if
$m_{\mu}>0$
 only if 
 $\mu$
 has at most j rows, for all
$\mu$
 has at most j rows, for all 
 $j<l$
. Hence, we can write
$j<l$
. Hence, we can write 
 \begin{equation}\mathrm{Ind}_{\mathrm{S}_{m}^{l}}^{S_{ml}}(1)=\mathrm{Ind}_{\mathrm{S}_{m(l-1)}\times S_{m}}^{S_{ml}}\big(\mathrm{Ind}_{\mathrm{S}_{m}^{l-1}}^{\mathrm{S}_{m(l-1)}}(1)\otimes1\big)=\bigoplus_{\mu\vdash m(l-1)}m_{\mu}\,\mathrm{Ind}_{\mathrm{S}_{m(l-1)}\times S_{m}}^{S_{ml}}(\chi_{\mu}\otimes1).\end{equation}
\begin{equation}\mathrm{Ind}_{\mathrm{S}_{m}^{l}}^{S_{ml}}(1)=\mathrm{Ind}_{\mathrm{S}_{m(l-1)}\times S_{m}}^{S_{ml}}\big(\mathrm{Ind}_{\mathrm{S}_{m}^{l-1}}^{\mathrm{S}_{m(l-1)}}(1)\otimes1\big)=\bigoplus_{\mu\vdash m(l-1)}m_{\mu}\,\mathrm{Ind}_{\mathrm{S}_{m(l-1)}\times S_{m}}^{S_{ml}}(\chi_{\mu}\otimes1).\end{equation}
 By Theorem 2.2 and since a strict 
 $\lambda$
-expansion of
$\lambda$
-expansion of 
 $\mu$
 increases the number of rows by at most one, the lemma follows.
$\mu$
 increases the number of rows by at most one, the lemma follows.
2.1.2 Representation theory of the unitary group
 The irreducible representations of 
 $\mathrm{U}_{d}$
 can be identified with the irreducible rational representations of
$\mathrm{U}_{d}$
 can be identified with the irreducible rational representations of 
 $\mathrm{GL}_{d}(\mathbb{C})$
. More precisely, the restriction map
$\mathrm{GL}_{d}(\mathbb{C})$
. More precisely, the restriction map 
 $\rho\mapsto\rho|_{\mathrm{U}_{d}}$
 induces a bijection
$\rho\mapsto\rho|_{\mathrm{U}_{d}}$
 induces a bijection 
 $\mathrm{Irr}(\mathrm{GL}_{d}(\mathbb{C}))\rightarrow\mathrm{Irr}(\mathrm{U}_{d})$
. Moreover, the set
$\mathrm{Irr}(\mathrm{GL}_{d}(\mathbb{C}))\rightarrow\mathrm{Irr}(\mathrm{U}_{d})$
. Moreover, the set 
 $\mathrm{Irr}(\mathrm{U}_{d})$
 is in bijection with the set
$\mathrm{Irr}(\mathrm{U}_{d})$
 is in bijection with the set 
 $\Lambda$
 of dominant weights,
$\Lambda$
 of dominant weights, 
 \[\Lambda:=\{(\lambda_{1},\ldots,\lambda_{d}):\lambda_{1}\geq\cdots\geq\lambda_{d},\ \lambda_{i}\in\mathbb{Z}\}.\]
\[\Lambda:=\{(\lambda_{1},\ldots,\lambda_{d}):\lambda_{1}\geq\cdots\geq\lambda_{d},\ \lambda_{i}\in\mathbb{Z}\}.\]
 We denote the representation corresponding to 
 $\lambda\in\Lambda$
 by
$\lambda\in\Lambda$
 by 
 $(\rho_{\lambda},V_{\lambda})$
. The irreducible representations
$(\rho_{\lambda},V_{\lambda})$
. The irreducible representations 
 \[\mathbb{C}^{d},\bigwedge\nolimits^{\!2}\mathbb{C}^{d},\ldots,\bigwedge\nolimits^{\!d}\mathbb{C}^{d},\]
\[\mathbb{C}^{d},\bigwedge\nolimits^{\!2}\mathbb{C}^{d},\ldots,\bigwedge\nolimits^{\!d}\mathbb{C}^{d},\]
 are called the fundamental representations, and we have 
 $\bigwedge\nolimits^{\!m}\mathbb{C}^{d}\simeq V_{(1,\ldots,1,0,\ldots,0)}$
, with 1 appearing m times. In particular, the standard representation
$\bigwedge\nolimits^{\!m}\mathbb{C}^{d}\simeq V_{(1,\ldots,1,0,\ldots,0)}$
, with 1 appearing m times. In particular, the standard representation 
 $\mathbb{C}^{d}$
 is
$\mathbb{C}^{d}$
 is 
 $V_{(1,0,\ldots,0)}$
. Note that
$V_{(1,0,\ldots,0)}$
. Note that 
 $\bigwedge^{\!d}\mathbb{C}^{d}$
 is the determinant representation
$\bigwedge^{\!d}\mathbb{C}^{d}$
 is the determinant representation 
 $\chi_{\det}$
. We identify a weight
$\chi_{\det}$
. We identify a weight 
 $\lambda\in\Lambda$
 such that
$\lambda\in\Lambda$
 such that 
 $\lambda_{d}\geq0$
 with a partition
$\lambda_{d}\geq0$
 with a partition 
 $(\lambda_{1},\ldots,\lambda_{d})$
.
$(\lambda_{1},\ldots,\lambda_{d})$
.
Remark 2.4 [Reference Fulton and HarrisFH91, I.6, Exc. 6.4]. For each 
 $\lambda=(\lambda_{1},\ldots,\lambda_{d})\vdash m$
,
$\lambda=(\lambda_{1},\ldots,\lambda_{d})\vdash m$
, 
 \begin{equation}\rho_{\lambda}(1)=\frac{\chi_{\lambda}(1)\cdot\prod_{(i,j)\in\lambda}(d+j-i)}{m!},\end{equation}
\begin{equation}\rho_{\lambda}(1)=\frac{\chi_{\lambda}(1)\cdot\prod_{(i,j)\in\lambda}(d+j-i)}{m!},\end{equation}
 where (i,j) are the coordinates of the cells in the Young diagram with shape 
 $\lambda$
.
$\lambda$
.
 Given 
 $\lambda,\mu\in\Lambda$
, the irreducible subrepresentations of
$\lambda,\mu\in\Lambda$
, the irreducible subrepresentations of 
 $\rho_{\lambda}\otimes\rho_{\mu}$
 are determined by the Littlewood–Richardson rule as follows.
$\rho_{\lambda}\otimes\rho_{\mu}$
 are determined by the Littlewood–Richardson rule as follows.
Theorem 2.5 (Littlewood–Richardson rule; see, e.g., [Reference Fulton and HarrisFH91, I.6, Equation (6.7)]). Let 
 $\lambda,\mu\in\Lambda$
 and suppose that
$\lambda,\mu\in\Lambda$
 and suppose that 
 $\lambda_{d},\mu_{d}\geq0$
. Let
$\lambda_{d},\mu_{d}\geq0$
. Let 
 $N_{\lambda\mu\nu}$
 be the coefficients from Theorem 2.2. Then,
$N_{\lambda\mu\nu}$
 be the coefficients from Theorem 2.2. Then, 
 \[\rho_{\lambda}\otimes\rho_{\mu}=\bigoplus_{\nu:\nu_{d}\geq0}N_{\lambda\mu\nu}\rho_{\nu}.\]
\[\rho_{\lambda}\otimes\rho_{\mu}=\bigoplus_{\nu:\nu_{d}\geq0}N_{\lambda\mu\nu}\rho_{\nu}.\]
Remark 2.6. For 
 $\lambda,\mu\in\Lambda$
, set
$\lambda,\mu\in\Lambda$
, set 
 $\widetilde{\lambda}:=\lambda-(\lambda_{d},\ldots,\lambda_{d})$
 and
$\widetilde{\lambda}:=\lambda-(\lambda_{d},\ldots,\lambda_{d})$
 and 
 $\widetilde{\mu}:=\mu-(\mu_{d},\ldots,\mu_{d})$
. Then
$\widetilde{\mu}:=\mu-(\mu_{d},\ldots,\mu_{d})$
. Then 
 $\rho_{\lambda}=\chi_{\det}^{\lambda_{d}}\cdot\rho_{\widetilde{\lambda}}$
 and
$\rho_{\lambda}=\chi_{\det}^{\lambda_{d}}\cdot\rho_{\widetilde{\lambda}}$
 and 
 $\rho_{\mu}=\chi_{\det}^{\mu_{d}}\cdot\rho_{\widetilde{\mu}}$
, and hence by Theorem 2.5, one has
$\rho_{\mu}=\chi_{\det}^{\mu_{d}}\cdot\rho_{\widetilde{\mu}}$
, and hence by Theorem 2.5, one has 
 \begin{equation}\rho_{\lambda}\otimes\rho_{\mu}=\chi_{\det}^{\lambda_{d}+\mu_{d}}\rho_{\widetilde{\lambda}}\otimes\rho_{\widetilde{\mu}}=\chi_{\det}^{\lambda_{d}+\mu_{d}}\bigoplus_{\nu}N_{\widetilde{\lambda}\widetilde{\mu}\nu}\rho_{\nu}.\end{equation}
\begin{equation}\rho_{\lambda}\otimes\rho_{\mu}=\chi_{\det}^{\lambda_{d}+\mu_{d}}\rho_{\widetilde{\lambda}}\otimes\rho_{\widetilde{\mu}}=\chi_{\det}^{\lambda_{d}+\mu_{d}}\bigoplus_{\nu}N_{\widetilde{\lambda}\widetilde{\mu}\nu}\rho_{\nu}.\end{equation}
2.1.3 Averaging characters over cosets
Lemma 2.7. Let G be a finite group, let 
 $(\pi,V)$
 be an irreducible representation of G, let
$(\pi,V)$
 be an irreducible representation of G, let 
 $H\leq G$
 be a subgroup, and let
$H\leq G$
 be a subgroup, and let 
 $\lambda$
 be any one-dimensional character of H. Then, for every
$\lambda$
 be any one-dimensional character of H. Then, for every 
 $g\in G$
,
$g\in G$
, 
 \[\bigg|\frac{1}{|H|}\sum_{h\in H}\lambda^{-1}(h)\chi_{\pi}(gh)\bigg|\leq\langle \chi_{\pi}|_{H},\lambda\rangle_{H}.\]
\[\bigg|\frac{1}{|H|}\sum_{h\in H}\lambda^{-1}(h)\chi_{\pi}(gh)\bigg|\leq\langle \chi_{\pi}|_{H},\lambda\rangle_{H}.\]
 In particular, if 
 $\langle \chi_{\pi}|_{H},\lambda\rangle_{H}=0$
, then
$\langle \chi_{\pi}|_{H},\lambda\rangle_{H}=0$
, then 
 $\sum_{h\in H}\lambda^{-1}(h)\chi_{\pi}(gh)=0$
.
$\sum_{h\in H}\lambda^{-1}(h)\chi_{\pi}(gh)=0$
.
Proof. Write 
 $\pi|_{H}=\bigoplus_{i=1}^{\widetilde{N}}\pi_{i}$
 with each
$\pi|_{H}=\bigoplus_{i=1}^{\widetilde{N}}\pi_{i}$
 with each 
 $(\pi_{i},V_{i})$
 an irreducible representation of H. For each i and
$(\pi_{i},V_{i})$
 an irreducible representation of H. For each i and 
 $h'\in H$
,
$h'\in H$
, 
 \begin{align*}\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)\bigg)\pi_{i}(h') & =\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(hh')=\sum_{h\in H}\lambda^{-1}(hh'^{-1})\pi_{i}(h)\\& =\sum_{h\in H}\lambda^{-1}(h'^{-1}h)\pi_{i}(h)=\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h'h)\\&=\pi_{i}(h')\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)\bigg).\end{align*}
\begin{align*}\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)\bigg)\pi_{i}(h') & =\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(hh')=\sum_{h\in H}\lambda^{-1}(hh'^{-1})\pi_{i}(h)\\& =\sum_{h\in H}\lambda^{-1}(h'^{-1}h)\pi_{i}(h)=\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h'h)\\&=\pi_{i}(h')\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)\bigg).\end{align*}
 By Schur’s lemma, 
 $\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)$
 is a scalar matrix
$\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)$
 is a scalar matrix 
 $\alpha\cdot I_{V_{i}}$
, for some
$\alpha\cdot I_{V_{i}}$
, for some 
 $\alpha\in\mathbb{C}$
. Hence,
$\alpha\in\mathbb{C}$
. Hence, 
 \begin{equation}\alpha\cdot\chi_{\pi_{i}}(1)=\mathrm{tr}\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)\bigg)=\sum_{h}\lambda^{-1}(h)\chi_{\pi_{i}}(h)=\begin{cases}|H| & \text{if }\chi_{\pi_{i}}=\lambda,\\0 & \text{otherwise.}\end{cases}\end{equation}
\begin{equation}\alpha\cdot\chi_{\pi_{i}}(1)=\mathrm{tr}\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi_{i}(h)\bigg)=\sum_{h}\lambda^{-1}(h)\chi_{\pi_{i}}(h)=\begin{cases}|H| & \text{if }\chi_{\pi_{i}}=\lambda,\\0 & \text{otherwise.}\end{cases}\end{equation}
 Let 
 $L:=\{ v\in V:\pi(h)v=\lambda(h)\cdot v,\ \forall h\in H\} $
 be the subspace of
$L:=\{ v\in V:\pi(h)v=\lambda(h)\cdot v,\ \forall h\in H\} $
 be the subspace of 
 $(H,\lambda)$
-equivariant vectors in V and let
$(H,\lambda)$
-equivariant vectors in V and let 
 $L^{\bot}$
 be an H-invariant subspace of V with
$L^{\bot}$
 be an H-invariant subspace of V with 
 $V=L\oplus L^{\bot}$
. By (2.5), the map
$V=L\oplus L^{\bot}$
. By (2.5), the map 
 $A:=\sum_{h\in H}\lambda^{-1}(h)\pi(h)\in\mathrm{End}(V)$
 satisfies
$A:=\sum_{h\in H}\lambda^{-1}(h)\pi(h)\in\mathrm{End}(V)$
 satisfies 
 $A|_{L^{\bot}}=0$
 and
$A|_{L^{\bot}}=0$
 and 
 $A|_{L}=|H|\cdot I_{L}$
. Take an orthonormal basis
$A|_{L}=|H|\cdot I_{L}$
. Take an orthonormal basis 
 $v_{1},\ldots,v_{N}$
 for V with
$v_{1},\ldots,v_{N}$
 for V with 
 $L=\langle v_{1},\ldots,v_{M}\rangle$
,
$L=\langle v_{1},\ldots,v_{M}\rangle$
, 
 $L^{\bot}=\langle v_{M+1},\ldots,v_{N}\rangle$
. Then,
$L^{\bot}=\langle v_{M+1},\ldots,v_{N}\rangle$
. Then, 
 \[\bigg|\!\sum_{h\in H}\lambda^{-1}(h)\chi_{\pi}(gh)\!\bigg|=\bigg|\!\sum_{i=1}^{N}\bigg\langle\pi(g)\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi(h)\bigg)v_{i},v_{i}\bigg\rangle\!\bigg|=|H|\bigg|\!\sum_{i=1}^{M}\langle\pi(g)v_{i},v_{i}\rangle\!\bigg|\leq M|H|,\]
\[\bigg|\!\sum_{h\in H}\lambda^{-1}(h)\chi_{\pi}(gh)\!\bigg|=\bigg|\!\sum_{i=1}^{N}\bigg\langle\pi(g)\bigg(\sum_{h\in H}\lambda^{-1}(h)\pi(h)\bigg)v_{i},v_{i}\bigg\rangle\!\bigg|=|H|\bigg|\!\sum_{i=1}^{M}\langle\pi(g)v_{i},v_{i}\rangle\!\bigg|\leq M|H|,\]
and the lemma follows.
The following lemma gives a different estimate on the average of a character over a coset, and this estimate is sharper when the double coset HgH is large. We will not need these alternative estimates, but we thought it could be useful to state them.
Lemma 2.8. Let G be a finite group, and let 
 $H\leq G$
 be a subgroup. Then, for each
$H\leq G$
 be a subgroup. Then, for each 
 $\chi\in\mathrm{Irr}(G)$
 and each
$\chi\in\mathrm{Irr}(G)$
 and each 
 $g\in G$
,
$g\in G$
, 
 \[\bigg|\frac{1}{|H|}\sum_{h\in H}\chi(hg)\bigg|\leq\frac{\langle\chi,1\rangle_{H}^{1/2}\cdot|G|^{1/2}}{|HgH|^{1/2}\chi(1)^{1/2}}.\]
\[\bigg|\frac{1}{|H|}\sum_{h\in H}\chi(hg)\bigg|\leq\frac{\langle\chi,1\rangle_{H}^{1/2}\cdot|G|^{1/2}}{|HgH|^{1/2}\chi(1)^{1/2}}.\]
Proof. Let G be a finite group. For each 
 $\chi\in\mathrm{Irr}(G)$
, we denote by
$\chi\in\mathrm{Irr}(G)$
, we denote by 
 $(\pi_{\chi},V_{\chi})$
 the representation corresponding to
$(\pi_{\chi},V_{\chi})$
 the representation corresponding to 
 $\chi$
. The non-commutative Fourier transform (see, e.g., [Reference ApplebaumApp14, Section 2.3]) is the map
$\chi$
. The non-commutative Fourier transform (see, e.g., [Reference ApplebaumApp14, Section 2.3]) is the map 
 ${\mathcal F}:\mathbb{C}[G]\rightarrow\bigoplus_{\chi\in\mathrm{Irr}(G)}\mathrm{End}(V_{\chi})$
 defined by
${\mathcal F}:\mathbb{C}[G]\rightarrow\bigoplus_{\chi\in\mathrm{Irr}(G)}\mathrm{End}(V_{\chi})$
 defined by 
 $f\mapsto\widehat{f}:=(\widehat{f}(\chi))_{\chi\in\mathrm{Irr}(G)}$
, where
$f\mapsto\widehat{f}:=(\widehat{f}(\chi))_{\chi\in\mathrm{Irr}(G)}$
, where 
 $\widehat{f}(\chi)=(\frac{1}{|G|})\sum_{g'\in G}f(g')\pi_{\chi}(g'^{-1})$
. We denote by
$\widehat{f}(\chi)=(\frac{1}{|G|})\sum_{g'\in G}f(g')\pi_{\chi}(g'^{-1})$
. We denote by 
 $\Vert f\Vert_{2}:=\big((\frac{1}{|G|})\sum_{g'\in G}|f(g')|^{2}\big)^{{1}/{2}}$
. Similarly, for a collection of endomorphisms
$\Vert f\Vert_{2}:=\big((\frac{1}{|G|})\sum_{g'\in G}|f(g')|^{2}\big)^{{1}/{2}}$
. Similarly, for a collection of endomorphisms 
 $(A_{\chi})_{\chi\in\mathrm{Irr}(G)}\in\bigoplus_{\chi\in\mathrm{Irr}(G)}\,\mathrm{End}(V_{\chi})$
, with
$(A_{\chi})_{\chi\in\mathrm{Irr}(G)}\in\bigoplus_{\chi\in\mathrm{Irr}(G)}\,\mathrm{End}(V_{\chi})$
, with 
 $A_{\chi}\in\mathrm{End}(V_{\chi})$
, we define
$A_{\chi}\in\mathrm{End}(V_{\chi})$
, we define 
 \[\Vert\! (A_{\chi})_{\chi\in\mathrm{Irr}(G)}\!\Vert_{2}:=\bigg(\sum_{\chi\in\mathrm{Irr}(G)}\chi(1)\cdot\Vert A_{\chi}\Vert_{\mathrm{HS}}^{2}\bigg)^{\!\frac{1}{2}},\]
\[\Vert\! (A_{\chi})_{\chi\in\mathrm{Irr}(G)}\!\Vert_{2}:=\bigg(\sum_{\chi\in\mathrm{Irr}(G)}\chi(1)\cdot\Vert A_{\chi}\Vert_{\mathrm{HS}}^{2}\bigg)^{\!\frac{1}{2}},\]
 where 
 $\Vert A_{\chi}\Vert_{\mathrm{HS}}:=\mathrm{tr}(A_{\chi}\cdot A_{\chi}^{*})^{{1}/{2}}$
 is the Hilbert–Schmidt norm on
$\Vert A_{\chi}\Vert_{\mathrm{HS}}:=\mathrm{tr}(A_{\chi}\cdot A_{\chi}^{*})^{{1}/{2}}$
 is the Hilbert–Schmidt norm on 
 $\mathrm{End}(V_{\chi})$
. The Plancherel theorem (see, e.g., [Reference ApplebaumApp14, Theorem 2.3.1(2)]), states that
$\mathrm{End}(V_{\chi})$
. The Plancherel theorem (see, e.g., [Reference ApplebaumApp14, Theorem 2.3.1(2)]), states that 
 \begin{equation}\Vert f\Vert_{2}=\Vert \widehat{f}\Vert_{2}.\end{equation}
\begin{equation}\Vert f\Vert_{2}=\Vert \widehat{f}\Vert_{2}.\end{equation}
 Let 
 $\psi_{HgH}:=(\frac{1}{|HgH|})1_{HgH}$
. For each
$\psi_{HgH}:=(\frac{1}{|HgH|})1_{HgH}$
. For each 
 $\chi\in\mathrm{Irr}(G)$
, one has
$\chi\in\mathrm{Irr}(G)$
, one has 
 \[\widehat{\psi_{HgH}}(\chi)=\frac{1}{|G|}\sum_{g'\in G}\psi_{HgH}(g')\pi_{\chi}(g'^{-1})=\frac{1}{|HgH||G|}\sum_{g'\in HgH}\pi_{\chi}(g'^{-1}).\]
\[\widehat{\psi_{HgH}}(\chi)=\frac{1}{|G|}\sum_{g'\in G}\psi_{HgH}(g')\pi_{\chi}(g'^{-1})=\frac{1}{|HgH||G|}\sum_{g'\in HgH}\pi_{\chi}(g'^{-1}).\]
 The square of the 
 $L^{2}$
-norm of
$L^{2}$
-norm of 
 $\psi_{HgH}$
 is given by
$\psi_{HgH}$
 is given by 
 \begin{equation}\Vert \psi_{HgH}\Vert_{2}^{2}=\frac{1}{|G|}\sum_{g'\in G}(\psi_{HgH}(g'))^{2}=\frac{1}{|G|}\sum_{g'\in HgH}\frac{1}{|HgH|^{2}}=\frac{1}{|HgH||G|}.\end{equation}
\begin{equation}\Vert \psi_{HgH}\Vert_{2}^{2}=\frac{1}{|G|}\sum_{g'\in G}(\psi_{HgH}(g'))^{2}=\frac{1}{|G|}\sum_{g'\in HgH}\frac{1}{|HgH|^{2}}=\frac{1}{|HgH||G|}.\end{equation}
 Let 
 $v_{1},\ldots,v_{M}$
 be an orthonormal basis of
$v_{1},\ldots,v_{M}$
 be an orthonormal basis of 
 $V_{\chi}^{H}:=\{ v\in V_{\chi}:\pi_{\chi}(h)\cdot v=v,\ \forall h\in H\}$
 with respect to some G-invariant inner product
$V_{\chi}^{H}:=\{ v\in V_{\chi}:\pi_{\chi}(h)\cdot v=v,\ \forall h\in H\}$
 with respect to some G-invariant inner product 
 $\langle\,,\,\rangle$
 on
$\langle\,,\,\rangle$
 on 
 $V_{\chi}$
, with
$V_{\chi}$
, with 
 $M=\langle\chi,1\rangle_{H}$
. Let
$M=\langle\chi,1\rangle_{H}$
. Let 
 $\big(V_{\chi}^{H}\big)^{\perp}$
 be the orthogonal complement to
$\big(V_{\chi}^{H}\big)^{\perp}$
 be the orthogonal complement to 
 $V_{\chi}^{H}$
 in
$V_{\chi}^{H}$
 in 
 $V_{\chi}$
. In the proof of Lemma 2.7, in the case that
$V_{\chi}$
. In the proof of Lemma 2.7, in the case that 
 $\lambda=1$
, we have seen that
$\lambda=1$
, we have seen that 
 \begin{equation}\sum_{h\in H}\pi_{\chi}(h)\cdot v=\begin{cases}0 & \text{if }v\in(V_{\chi}^{H})^{\perp},\\|H|\cdot v & \text{if }v\in V_{\chi}^{H}.\end{cases}\end{equation}
\begin{equation}\sum_{h\in H}\pi_{\chi}(h)\cdot v=\begin{cases}0 & \text{if }v\in(V_{\chi}^{H})^{\perp},\\|H|\cdot v & \text{if }v\in V_{\chi}^{H}.\end{cases}\end{equation}
In particular, we have
 \begin{align*}&\bigg\langle\! \sum_{g'\in HgH}\pi_{\chi}(g'^{-1})\cdot v,v\bigg\rangle\\&\quad =\frac{|HgH|}{|H|^{2}}\bigg\langle\! \sum_{h',h\in H}\pi_{\chi}(h'g^{-1}h)\cdot v,v\bigg\rangle=\frac{|HgH|}{|H|^{2}}\bigg\langle\! \bigg(\sum_{h'\in H}\pi_{\chi}(h')\bigg)\cdot\bigg(\sum_{h\in H}\pi_{\chi}(g^{-1}h)\cdot v\bigg),v\bigg\rangle\\&\quad =\frac{|HgH|}{|H|^{2}}\bigg\langle\! \sum_{h\in H}\pi_{\chi}(g^{-1}h)\cdot v,\sum_{h'\in H}\pi_{\chi}(h')v\bigg\rangle =\begin{cases}0 & \text{if }v\in(V_{\chi}^{H})^{\perp}\\|HgH|\langle\pi_{\chi}(g^{-1})\cdot v,v\rangle & \text{if }v\in V_{\chi}^{H}.\end{cases}\end{align*}
\begin{align*}&\bigg\langle\! \sum_{g'\in HgH}\pi_{\chi}(g'^{-1})\cdot v,v\bigg\rangle\\&\quad =\frac{|HgH|}{|H|^{2}}\bigg\langle\! \sum_{h',h\in H}\pi_{\chi}(h'g^{-1}h)\cdot v,v\bigg\rangle=\frac{|HgH|}{|H|^{2}}\bigg\langle\! \bigg(\sum_{h'\in H}\pi_{\chi}(h')\bigg)\cdot\bigg(\sum_{h\in H}\pi_{\chi}(g^{-1}h)\cdot v\bigg),v\bigg\rangle\\&\quad =\frac{|HgH|}{|H|^{2}}\bigg\langle\! \sum_{h\in H}\pi_{\chi}(g^{-1}h)\cdot v,\sum_{h'\in H}\pi_{\chi}(h')v\bigg\rangle =\begin{cases}0 & \text{if }v\in(V_{\chi}^{H})^{\perp}\\|HgH|\langle\pi_{\chi}(g^{-1})\cdot v,v\rangle & \text{if }v\in V_{\chi}^{H}.\end{cases}\end{align*}
Hence,
 \begin{align}\Vert \widehat{\psi_{HgH}}\Vert_{2}^{2}&=\sum_{\chi\in\mathrm{Irr}(G)}\chi(1)\bigg\Vert\frac{1}{|HgH||G|}\sum_{g'\in HgH}\pi_{\chi}(g'^{-1})\bigg\Vert_{\mathrm{HS}}^{2}\nonumber\\&=\sum_{\chi\in\mathrm{Irr}(G)}\frac{\chi(1)}{|G|^{2}}\sum_{i,j=1}^{M}|\!\langle\pi_{\chi}(g^{-1}).v_{i},v_{j}\rangle\!|^{2}.\end{align}
\begin{align}\Vert \widehat{\psi_{HgH}}\Vert_{2}^{2}&=\sum_{\chi\in\mathrm{Irr}(G)}\chi(1)\bigg\Vert\frac{1}{|HgH||G|}\sum_{g'\in HgH}\pi_{\chi}(g'^{-1})\bigg\Vert_{\mathrm{HS}}^{2}\nonumber\\&=\sum_{\chi\in\mathrm{Irr}(G)}\frac{\chi(1)}{|G|^{2}}\sum_{i,j=1}^{M}|\!\langle\pi_{\chi}(g^{-1}).v_{i},v_{j}\rangle\!|^{2}.\end{align}
By (2.6), (2.7) is equal to (2.9), hence,
 \begin{align*}\bigg|\frac{1}{|H|}\sum_{h\in H}\chi(hg^{-1})\!\bigg|^{2} &=\bigg|\!\sum_{i=1}^{M}\langle\pi_{\chi}(g^{-1})\cdot v_{i},v_{i}\rangle\!\bigg|^{2}\leq M\sum_{i=1}^{M}|\!\langle\pi_{\chi}(g^{-1})\cdot v_{i},v_{i}\!\rangle|^{2}\\& \leq M\sum_{i,j=1}^{M}|\!\langle\pi_{\chi}(g^{-1})\cdot v_{i},v_{j}\rangle\!|^{2}\leq\frac{M|G|}{\chi(1)|HgH|},\end{align*}
\begin{align*}\bigg|\frac{1}{|H|}\sum_{h\in H}\chi(hg^{-1})\!\bigg|^{2} &=\bigg|\!\sum_{i=1}^{M}\langle\pi_{\chi}(g^{-1})\cdot v_{i},v_{i}\rangle\!\bigg|^{2}\leq M\sum_{i=1}^{M}|\!\langle\pi_{\chi}(g^{-1})\cdot v_{i},v_{i}\!\rangle|^{2}\\& \leq M\sum_{i,j=1}^{M}|\!\langle\pi_{\chi}(g^{-1})\cdot v_{i},v_{j}\rangle\!|^{2}\leq\frac{M|G|}{\chi(1)|HgH|},\end{align*}
where the first equality follows from (2.8), and the first inequality follows from Cauchy–Schwarz inequality.
2.2 Weingarten calculus
 In §§ 2.1.1 and 2.1.2 we stated that each partition 
 $\lambda\vdash m$
 with
$\lambda\vdash m$
 with 
 $\ell(\lambda)\leq d$
 induces two different representations,
$\ell(\lambda)\leq d$
 induces two different representations, 
 $\rho_{\lambda}\in\mathrm{Irr}(\mathrm{U}_{d})$
 and
$\rho_{\lambda}\in\mathrm{Irr}(\mathrm{U}_{d})$
 and 
 $\chi_{\lambda}\in\mathrm{Irr}(S_{m})$
. There is a deeper connection between
$\chi_{\lambda}\in\mathrm{Irr}(S_{m})$
. There is a deeper connection between 
 $\rho_{\lambda}$
 and
$\rho_{\lambda}$
 and 
 $\chi_{\lambda}$
 coming from the Schur–Weyl duality: the space
$\chi_{\lambda}$
 coming from the Schur–Weyl duality: the space 
 $(\mathbb{C}^{d})^{\otimes m}$
 carries a natural action of
$(\mathbb{C}^{d})^{\otimes m}$
 carries a natural action of 
 $\mathrm{U}_{d}\times S_{m}$
, where
$\mathrm{U}_{d}\times S_{m}$
, where 
 $A\in\mathrm{U}_{d}$
 acts diagonally
$A\in\mathrm{U}_{d}$
 acts diagonally 
 $A\cdot(v_{1}\otimes\cdots\otimes v_{m})=Av_{1}\otimes\cdots\otimes Av_{m}$
, and
$A\cdot(v_{1}\otimes\cdots\otimes v_{m})=Av_{1}\otimes\cdots\otimes Av_{m}$
, and 
 $\sigma\in S_{m}$
 acts by
$\sigma\in S_{m}$
 acts by 
 $\sigma\cdot (v_{1}\otimes\cdots\otimes v_{m})=v_{\sigma(1)}\otimes\cdots\otimes v_{\sigma(m)}$
. The Schur–Weyl duality can be phrased as follows.
$\sigma\cdot (v_{1}\otimes\cdots\otimes v_{m})=v_{\sigma(1)}\otimes\cdots\otimes v_{\sigma(m)}$
. The Schur–Weyl duality can be phrased as follows.
Theorem 2.9 (Schur–Weyl duality [Reference WeylWey39]). The space 
 $(\mathbb{C}^{d})^{\otimes m}$
 is a multiplicity-free representation of
$(\mathbb{C}^{d})^{\otimes m}$
 is a multiplicity-free representation of 
 $\mathrm{U}_{d}\times S_{m}$
. The decomposition of
$\mathrm{U}_{d}\times S_{m}$
. The decomposition of 
 $(\mathbb{C}^{d})^{\otimes m}$
 into irreducible components is given by
$(\mathbb{C}^{d})^{\otimes m}$
 into irreducible components is given by 
 \begin{equation}(\mathbb{C}^{d})^{\otimes m}=\bigoplus_{\lambda\vdash m,\ell(\lambda)\leq d}\rho_{\lambda}\otimes\chi_{\lambda}.\end{equation}
\begin{equation}(\mathbb{C}^{d})^{\otimes m}=\bigoplus_{\lambda\vdash m,\ell(\lambda)\leq d}\rho_{\lambda}\otimes\chi_{\lambda}.\end{equation}
 There are two special functions on 
 $S_{m}$
 which come from (2.10). First, writing
$S_{m}$
 which come from (2.10). First, writing 
 $\ell(\sigma)$
 for the number of disjoint cycles in
$\ell(\sigma)$
 for the number of disjoint cycles in 
 $\sigma\in S_{m}$
, the character of
$\sigma\in S_{m}$
, the character of 
 $(\mathbb{C}^{d})^{\otimes m}$
 as a representation of
$(\mathbb{C}^{d})^{\otimes m}$
 as a representation of 
 $S_{m}$
 is the function
$S_{m}$
 is the function 
 $\sigma\mapsto d^{\ell(\sigma)}$
.
$\sigma\mapsto d^{\ell(\sigma)}$
.
 Recall we have an isomorphism of algebras 
 $\mathbb{C}[S_{m}]\simeq\bigoplus_{\lambda\vdash m}\mathrm{End}(V_{\chi_{\lambda}})$
, where the multiplication in
$\mathbb{C}[S_{m}]\simeq\bigoplus_{\lambda\vdash m}\mathrm{End}(V_{\chi_{\lambda}})$
, where the multiplication in 
 $\mathbb{C}[S_{m}]$
 is the convolution operation
$\mathbb{C}[S_{m}]$
 is the convolution operation 
 $f_{1}*f_{2}(y):=\sum_{x\in S_{m}}f(x)g(x^{-1}y)$
. We denote by
$f_{1}*f_{2}(y):=\sum_{x\in S_{m}}f(x)g(x^{-1}y)$
. We denote by 
 $\mathbb{C}_{d}[S_{m}]$
 the subalgebra corresponding to
$\mathbb{C}_{d}[S_{m}]$
 the subalgebra corresponding to 
 $\bigoplus_{\lambda\vdash m,\ell(\lambda)\leq d}\mathrm{End}(V_{\chi_{\lambda}})$
.
$\bigoplus_{\lambda\vdash m,\ell(\lambda)\leq d}\mathrm{End}(V_{\chi_{\lambda}})$
.
Definition 2.10 [Reference Collins and ŚniadyCS06, Proposition 2.3]. Let 
 $d\in\mathbb{N}$
. The Weingarten function
$d\in\mathbb{N}$
. The Weingarten function 
 $\mathrm{Wg}_{d}:S_{m}\rightarrow\mathbb{C}$
 is the inverse of the function
$\mathrm{Wg}_{d}:S_{m}\rightarrow\mathbb{C}$
 is the inverse of the function 
 $d^{\ell(\sigma)}$
 in the ring
$d^{\ell(\sigma)}$
 in the ring 
 $\mathbb{C}_{d}[S_{m}]$
. It has the following Fourier expansion:
$\mathbb{C}_{d}[S_{m}]$
. It has the following Fourier expansion: 
 \begin{equation}\mathrm{Wg}_{d}(\sigma)=\frac{1}{m!^{2}}\sum_{\lambda\vdash m,\ell(\lambda)\leq d}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\chi_{\lambda}(\sigma).\end{equation}
\begin{equation}\mathrm{Wg}_{d}(\sigma)=\frac{1}{m!^{2}}\sum_{\lambda\vdash m,\ell(\lambda)\leq d}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\chi_{\lambda}(\sigma).\end{equation}
Remark 2.11. Since in this paper we only consider 
 $\mathrm{Wg}_{d'}(\sigma)$
 for
$\mathrm{Wg}_{d'}(\sigma)$
 for 
 $d'=d$
, we write Wg instead of
$d'=d$
, we write Wg instead of 
 $\mathrm{Wg}_{d}$
.
$\mathrm{Wg}_{d}$
.
The Weingarten calculus, developed in [Reference WeingartenWei78, Reference CollinsCol03, Reference Collins and ŚniadyCS06], utilizes the Schur–Weyl duality to express integrals on unitary groups as finite sums of Weingarten functions on symmetric groups. One formulation is the following theorem by Collins and Śniady.
Theorem 2.12 [Reference Collins and ŚniadyCS06, Corollary 2.4]. Let 
 $(i_{1},\ldots,i_{m})$
,
$(i_{1},\ldots,i_{m})$
, 
 $(j_{1},\ldots,j_{m})$
,
$(j_{1},\ldots,j_{m})$
, 
 $(i'_{1},\ldots,i'_{m})$
, and
$(i'_{1},\ldots,i'_{m})$
, and 
 $(j'_{1},\ldots,j'_{m})$
 be tuples of integers in [d]. Then,
$(j'_{1},\ldots,j'_{m})$
 be tuples of integers in [d]. Then, 
 \begin{align}& \mathbb{E}_{X\in\mathrm{U}_{d}}\big(X_{i_{1},j_{1}}\cdots X_{i_{m},j_{m}}\cdot\overline{X_{i'_{1},j'_{1}}\cdots X_{i'_{m},j'_{m}}}\big)\nonumber\\&\quad = \sum_{\sigma,\tau\in S_{m}}\delta_{i_{1},i'_{\sigma(1)}}\cdots\delta_{i_{m},i'_{\sigma(m)}}\delta_{j_{1},j'_{\tau(1)}}\cdots\delta_{j_{m},j'_{\tau(m)}}\cdot\mathrm{Wg}_{d}(\sigma^{-1}\tau).\end{align}
\begin{align}& \mathbb{E}_{X\in\mathrm{U}_{d}}\big(X_{i_{1},j_{1}}\cdots X_{i_{m},j_{m}}\cdot\overline{X_{i'_{1},j'_{1}}\cdots X_{i'_{m},j'_{m}}}\big)\nonumber\\&\quad = \sum_{\sigma,\tau\in S_{m}}\delta_{i_{1},i'_{\sigma(1)}}\cdots\delta_{i_{m},i'_{\sigma(m)}}\delta_{j_{1},j'_{\tau(1)}}\cdots\delta_{j_{m},j'_{\tau(m)}}\cdot\mathrm{Wg}_{d}(\sigma^{-1}\tau).\end{align}
We will use a coordinate-free version of Theorem 2.12 which we proceed to state.
Definition 2.13. Let 
 $\Omega$
 be a set.
$\Omega$
 be a set.
- 
(1) A symmetric partition  $\Phi$
 of $\Phi$
 of $\Omega$
 is a partition $\Omega$
 is a partition $\Omega=\bigsqcup_{i=1}^{r}A_{i}\sqcup\bigsqcup_{i=1}^{r}B_{i}$
, where $\Omega=\bigsqcup_{i=1}^{r}A_{i}\sqcup\bigsqcup_{i=1}^{r}B_{i}$
, where $|A_{i}|=|B_{i}|$
. $|A_{i}|=|B_{i}|$
.
- 
(2) Given a symmetric partition  $\Phi=(A_{1},\ldots,A_{r},B_{1},\ldots,B_{r})$
, let $\Phi=(A_{1},\ldots,A_{r},B_{1},\ldots,B_{r})$
, let \[S_{\Phi}=\{ \Sigma\in\mathrm{Sym}(\Omega):\Sigma(A_{i})=B_{i},\Sigma(B_{i})=A_{i}\} .\] \[S_{\Phi}=\{ \Sigma\in\mathrm{Sym}(\Omega):\Sigma(A_{i})=B_{i},\Sigma(B_{i})=A_{i}\} .\]
- 
(3) If  $\Sigma\in S_{\Phi}$
, then $\Sigma\in S_{\Phi}$
, then $\Sigma^{2}(A_{i})=A_{i}$
 and we define $\Sigma^{2}(A_{i})=A_{i}$
 and we define $\widetilde{\mathrm{Wg}}(\Sigma^{2})=\prod_{i=1}^{r}\mathrm{Wg}(\Sigma^{2}|_{A_{i}})$
. $\widetilde{\mathrm{Wg}}(\Sigma^{2})=\prod_{i=1}^{r}\mathrm{Wg}(\Sigma^{2}|_{A_{i}})$
.
Proposition 2.14. Let 
 $\Phi=(A,B)$
 be a symmetric partition of
$\Phi=(A,B)$
 be a symmetric partition of 
 $\Omega$
 and let
$\Omega$
 and let 
 $F,H:\Omega\rightarrow[d]$
. Then
$F,H:\Omega\rightarrow[d]$
. Then 
 \begin{align*}\mathbb{E}_{X\in\mathrm{U}_{d}}\bigg(\prod_{x\in A}X_{F(x),H(x)}\prod_{y\in B}X_{F(y),H(y)}^{-1}\bigg)&=\mathbb{E}\bigg(\prod_{x\in A}X_{F(x),H(x)}\prod_{y\in B}\overline{X_{H(y),F(y)}}\bigg)\\&=\sum_{\Sigma\in S_{\Phi}:\,H=F\circ\Sigma}\widetilde{\mathrm{Wg}}(\Sigma^{2}).\end{align*}
\begin{align*}\mathbb{E}_{X\in\mathrm{U}_{d}}\bigg(\prod_{x\in A}X_{F(x),H(x)}\prod_{y\in B}X_{F(y),H(y)}^{-1}\bigg)&=\mathbb{E}\bigg(\prod_{x\in A}X_{F(x),H(x)}\prod_{y\in B}\overline{X_{H(y),F(y)}}\bigg)\\&=\sum_{\Sigma\in S_{\Phi}:\,H=F\circ\Sigma}\widetilde{\mathrm{Wg}}(\Sigma^{2}).\end{align*}
Proof. Identify 
 $A\cong\{ 1,\ldots,m\} $
 and
$A\cong\{ 1,\ldots,m\} $
 and 
 $B\cong\{ -1,\ldots,-m\}$
 and let
$B\cong\{ -1,\ldots,-m\}$
 and let 
 $\overrightarrow{\!i},\overrightarrow{\!j},\overrightarrow{\!i}',\overrightarrow{\!j}'\in[d]^{m}$
 be
$\overrightarrow{\!i},\overrightarrow{\!j},\overrightarrow{\!i}',\overrightarrow{\!j}'\in[d]^{m}$
 be 
 \[i_{k}=F(k),\quad j_{k}=H(k),\quad i'_{k}=H(-k),\quad j'_{k}=F(-k).\]
\[i_{k}=F(k),\quad j_{k}=H(k),\quad i'_{k}=H(-k),\quad j'_{k}=F(-k).\]
Then, by Theorem 2.12,
 \begin{align*}&\mathbb{E}_{X\in\mathrm{\mathrm{U}}_{d}}\bigg(\prod_{x\in A}X_{F(x),H(x)}\prod_{y\in B}X_{F(y),H(y)}^{-1}\bigg) =\mathbb{E}_{X\in\mathrm{\mathrm{U}}_{d}}\Big(X_{i_{1},j_{1}}\cdots X_{i_{m},j_{m}}\overline{X_{i'_{1},j'_{1}}\cdots X_{i'_{m},j'_{m}}}\Big)\\&\quad =\sum_{\sigma,\tau\in S_{m}}\delta_{i_{1},i'_{\sigma(1)}}\cdots\delta_{i_{m},i'_{\sigma(m)}}\cdot\delta_{j_{1},j'_{\tau(1)}}\cdots\delta_{j_{m},j'_{\tau(m)}}\cdot\mathrm{Wg}(\sigma^{-1}\tau).\end{align*}
\begin{align*}&\mathbb{E}_{X\in\mathrm{\mathrm{U}}_{d}}\bigg(\prod_{x\in A}X_{F(x),H(x)}\prod_{y\in B}X_{F(y),H(y)}^{-1}\bigg) =\mathbb{E}_{X\in\mathrm{\mathrm{U}}_{d}}\Big(X_{i_{1},j_{1}}\cdots X_{i_{m},j_{m}}\overline{X_{i'_{1},j'_{1}}\cdots X_{i'_{m},j'_{m}}}\Big)\\&\quad =\sum_{\sigma,\tau\in S_{m}}\delta_{i_{1},i'_{\sigma(1)}}\cdots\delta_{i_{m},i'_{\sigma(m)}}\cdot\delta_{j_{1},j'_{\tau(1)}}\cdots\delta_{j_{m},j'_{\tau(m)}}\cdot\mathrm{Wg}(\sigma^{-1}\tau).\end{align*}
 For 
 $\sigma,\tau\in S_{m}$
, let
$\sigma,\tau\in S_{m}$
, let 
 $\Sigma_{(\sigma,\tau)}\in\mathrm{Sym}(A\sqcup B)\cong\mathrm{Sym}(\{-m,\ldots,-1,1,\ldots,m\} )$
 be the permutation
$\Sigma_{(\sigma,\tau)}\in\mathrm{Sym}(A\sqcup B)\cong\mathrm{Sym}(\{-m,\ldots,-1,1,\ldots,m\} )$
 be the permutation 
 \[\Sigma_{(\sigma,\tau)}(x)=\begin{cases}-\tau(x) & x\in\{ 1,\ldots,m\}, \\\sigma^{-1}(-x) & x\in\{ -1,\ldots,-m\} .\end{cases}\]
\[\Sigma_{(\sigma,\tau)}(x)=\begin{cases}-\tau(x) & x\in\{ 1,\ldots,m\}, \\\sigma^{-1}(-x) & x\in\{ -1,\ldots,-m\} .\end{cases}\]
 The map 
 $(\sigma,\tau)\mapsto\Sigma_{(\sigma,\tau)}$
 is a bijection
$(\sigma,\tau)\mapsto\Sigma_{(\sigma,\tau)}$
 is a bijection 
 $S_{m}^{2}\cong S_{\Phi}$
 and the condition
$S_{m}^{2}\cong S_{\Phi}$
 and the condition 
 $\delta_{i_{1},i'_{\sigma(1)}}\cdots\delta_{i_{m},i'_{\sigma(m)}}\cdot\delta_{j_{1},j'_{\tau(1)}}\cdots\delta_{j_{m},j'_{\tau(m)}}=1$
 is equivalent to
$\delta_{i_{1},i'_{\sigma(1)}}\cdots\delta_{i_{m},i'_{\sigma(m)}}\cdot\delta_{j_{1},j'_{\tau(1)}}\cdots\delta_{j_{m},j'_{\tau(m)}}=1$
 is equivalent to 
 $H=F\circ\Sigma_{(\sigma,\tau)}$
. Finally, the permutation
$H=F\circ\Sigma_{(\sigma,\tau)}$
. Finally, the permutation 
 $(\Sigma_{(\sigma,\tau)})^{2}$
 acts on A as
$(\Sigma_{(\sigma,\tau)})^{2}$
 acts on A as 
 $\sigma^{-1}\tau$
, and the result follows.
$\sigma^{-1}\tau$
, and the result follows.
Corollary 2.15. Let 
 $\Phi=(A_{1},\ldots,A_{r},B_{1},\ldots,B_{r})$
 be a symmetric partition of
$\Phi=(A_{1},\ldots,A_{r},B_{1},\ldots,B_{r})$
 be a symmetric partition of 
 $\Omega$
 and let
$\Omega$
 and let 
 $F,H:\Omega\rightarrow[d]$
. Then
$F,H:\Omega\rightarrow[d]$
. Then 
 \[\mathbb{E}\bigg(\prod_{i=1}^{r}\bigg(\prod_{x\in A_{i}}(X_{i})_{F(x),H(x)}\prod_{y\in B_{i}}(X_{i}^{-1})_{F(y),H(y)}\bigg)\bigg)=\sum_{\Sigma\in S_{\Phi}:\,H=F\circ\Sigma}\widetilde{\mathrm{Wg}}(\Sigma^{2}).\]
\[\mathbb{E}\bigg(\prod_{i=1}^{r}\bigg(\prod_{x\in A_{i}}(X_{i})_{F(x),H(x)}\prod_{y\in B_{i}}(X_{i}^{-1})_{F(y),H(y)}\bigg)\bigg)=\sum_{\Sigma\in S_{\Phi}:\,H=F\circ\Sigma}\widetilde{\mathrm{Wg}}(\Sigma^{2}).\]
3. The Engel word as a model case
‘Those who run to long words are mainly the unskillful and tasteless; they confuse pomposity with dignity, flaccidity with ease, and bulk with force.’ [Reference FowlerFow65, p. 342]
In this section we prove the following simplified version of Theorem 1.3 for the Engel word. We chose the Engel word since it is short enough to make the proof easier to digest, while at same time complicated enough so that the proof contains most of the key ideas in the paper.
Theorem 3.1. Let X,Y be independent random variables with respect to the normalized Haar measure on 
 $\mathrm{U}_{d}$
. For every
$\mathrm{U}_{d}$
. For every 
 $d\geq2m$
, one has
$d\geq2m$
, one has 
 \[\mathbb{E}(c_{m}([[X,Y],Y]))<2^{17m}.\]
\[\mathbb{E}(c_{m}([[X,Y],Y]))<2^{17m}.\]
 Let 
 $w=[[x,y],y]=xyx^{-1}yxy^{-1}x^{-1}y^{-1}$
 be the Engel word. We would like to compute
$w=[[x,y],y]=xyx^{-1}yxy^{-1}x^{-1}y^{-1}$
 be the Engel word. We would like to compute 
 $\mathbb{E}\big(\mathrm{tr}\bigwedge\nolimits^{\!m}w(X,Y)\big)$
. Denote
$\mathbb{E}\big(\mathrm{tr}\bigwedge\nolimits^{\!m}w(X,Y)\big)$
. Denote 
 ${\mathcal I}_{m,d}:=\{a_{1}<\cdots<a_{m}:a_{i}\in[d]\}$
, and note that
${\mathcal I}_{m,d}:=\{a_{1}<\cdots<a_{m}:a_{i}\in[d]\}$
, and note that 
 \begin{equation}\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)=\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}w(X,Y)_{a_{1}a_{\pi(1)}}\cdots w(X,Y)_{a_{m}a_{\pi(m)}}.\end{equation}
\begin{equation}\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)=\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}w(X,Y)_{a_{1}a_{\pi(1)}}\cdots w(X,Y)_{a_{m}a_{\pi(m)}}.\end{equation}
We have
 \begin{align}w(X,Y)_{a_{i}a_{\pi(i)}} &=\sum_{b_{i},c_{i},d_{i},A_{i},B_{i},C_{i},D_{i}\in[d]}X_{a_{i},D_{i}}Y_{D_{i},c_{i}}X_{c_{i},A_{i}}^{-1}Y_{A_{i},b_{i}}X_{b_{i},C_{i}}Y_{C_{i},d_{i}}^{-1}X_{d_{i},B_{i}}^{-1}Y_{B_{i},a_{\pi(i)}}^{-1}\nonumber\\& =\sum_{b_{i},c_{i},d_{i},A_{i},B_{i},C_{i},D_{i}\in[d]}X_{a_{i},D_{i}}X_{b_{i},C_{i}}\overline{X_{A_{i},c_{i}}X_{B_{i},d_{i}}}Y_{A_{i},b_{i}}Y_{D_{i},c_{i}}\overline{Y_{a_{\pi(i)},B_{i}}Y_{d_{i},C_{i}}}.\end{align}
\begin{align}w(X,Y)_{a_{i}a_{\pi(i)}} &=\sum_{b_{i},c_{i},d_{i},A_{i},B_{i},C_{i},D_{i}\in[d]}X_{a_{i},D_{i}}Y_{D_{i},c_{i}}X_{c_{i},A_{i}}^{-1}Y_{A_{i},b_{i}}X_{b_{i},C_{i}}Y_{C_{i},d_{i}}^{-1}X_{d_{i},B_{i}}^{-1}Y_{B_{i},a_{\pi(i)}}^{-1}\nonumber\\& =\sum_{b_{i},c_{i},d_{i},A_{i},B_{i},C_{i},D_{i}\in[d]}X_{a_{i},D_{i}}X_{b_{i},C_{i}}\overline{X_{A_{i},c_{i}}X_{B_{i},d_{i}}}Y_{A_{i},b_{i}}Y_{D_{i},c_{i}}\overline{Y_{a_{\pi(i)},B_{i}}Y_{d_{i},C_{i}}}.\end{align}
 The group 
 $S_{m}$
 acts on
$S_{m}$
 acts on 
 $[d]^{m}$
 by
$[d]^{m}$
 by 
 $\sigma(\overrightarrow{\!v})_{i}=\overrightarrow{\!v}_{\sigma^{-1}(i)}$
 for any
$\sigma(\overrightarrow{\!v})_{i}=\overrightarrow{\!v}_{\sigma^{-1}(i)}$
 for any 
 $\sigma\in S_{m}$
 and
$\sigma\in S_{m}$
 and 
 $\overrightarrow{\!v}\in[d]^{m}$
. Similarly, given
$\overrightarrow{\!v}\in[d]^{m}$
. Similarly, given 
 $\overrightarrow{\!v},\overrightarrow{w}\in[d]^{m}$
 and
$\overrightarrow{\!v},\overrightarrow{w}\in[d]^{m}$
 and 
 $\tau\in S_{2m}$
, we denote by
$\tau\in S_{2m}$
, we denote by 
 $(\overrightarrow{\!v},\overrightarrow{w})$
 the element in
$(\overrightarrow{\!v},\overrightarrow{w})$
 the element in 
 $[d]^{2m}$
 given by
$[d]^{2m}$
 given by 
 $(\overrightarrow{\!v},\overrightarrow{w})_{i}=\small{\begin{cases}\overrightarrow{\!v}_{i} & \text{if }i\leq m\\\overrightarrow{w}_{i-m} & \text{if }m<i\leq2m\end{cases}}$
, and denote by
$(\overrightarrow{\!v},\overrightarrow{w})_{i}=\small{\begin{cases}\overrightarrow{\!v}_{i} & \text{if }i\leq m\\\overrightarrow{w}_{i-m} & \text{if }m<i\leq2m\end{cases}}$
, and denote by 
 $\tau(\overrightarrow{\!v},\overrightarrow{w})_{i}=(\overrightarrow{\!v},\overrightarrow{w})_{\tau^{-1}(i)}$
. In particular, writing
$\tau(\overrightarrow{\!v},\overrightarrow{w})_{i}=(\overrightarrow{\!v},\overrightarrow{w})_{\tau^{-1}(i)}$
. In particular, writing 
 $X_{\overrightarrow{\!v},\overrightarrow{\!u}}:=\prod_{i=1}^{m}X_{v_{i},u_{i}}$
 for
$X_{\overrightarrow{\!v},\overrightarrow{\!u}}:=\prod_{i=1}^{m}X_{v_{i},u_{i}}$
 for 
 $\overrightarrow{\!v},\overrightarrow{\!u}\in[d]^{m}$
, we have
$\overrightarrow{\!v},\overrightarrow{\!u}\in[d]^{m}$
, we have 
 \begin{align}\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)&=\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\overrightarrow{\!b},\ldots,\overrightarrow{\!{\kern-.5pt}D}\in[d]^{m}}\sum_{\pi\in S_{m}}(-1)^{\pi}\big(X_{\overrightarrow{\!a},\overrightarrow{\!{\kern-.5pt}D}}X_{\overrightarrow{\!b},\overrightarrow{\!{\kern.5pt}C}}\overline{X_{\overrightarrow{\!A},\overrightarrow{\!c}}}\overline{X_{\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern-.5pt}d}}}\big)\nonumber\\&\quad \times \big(Y_{\overrightarrow{\!A},\overrightarrow{\!b}}Y_{\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!c}}\overline{Y_{\pi^{-1}(\overrightarrow{\!a}),\overrightarrow{\!{\kern-1pt}B}}Y_{\overrightarrow{\!{\kern-.5pt}d},\overrightarrow{\!{\kern.5pt}C}}}\big).\end{align}
\begin{align}\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)&=\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\overrightarrow{\!b},\ldots,\overrightarrow{\!{\kern-.5pt}D}\in[d]^{m}}\sum_{\pi\in S_{m}}(-1)^{\pi}\big(X_{\overrightarrow{\!a},\overrightarrow{\!{\kern-.5pt}D}}X_{\overrightarrow{\!b},\overrightarrow{\!{\kern.5pt}C}}\overline{X_{\overrightarrow{\!A},\overrightarrow{\!c}}}\overline{X_{\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern-.5pt}d}}}\big)\nonumber\\&\quad \times \big(Y_{\overrightarrow{\!A},\overrightarrow{\!b}}Y_{\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!c}}\overline{Y_{\pi^{-1}(\overrightarrow{\!a}),\overrightarrow{\!{\kern-1pt}B}}Y_{\overrightarrow{\!{\kern-.5pt}d},\overrightarrow{\!{\kern.5pt}C}}}\big).\end{align}
We now rewrite the expected value of (3.3) using Weingarten calculus. For this, define
 \[S(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D}):=\bigg\{(\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in S_{2m}^{4}:\begin{array}{c}(\overrightarrow{\!A},\overrightarrow{\!{\kern-1pt}B})=\sigma_{1}(\overrightarrow{\!a},\overrightarrow{\!b}),\,\,\,(\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d})=\tau_{1}(\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!{\kern.5pt}C})\\(\overrightarrow{\!a},\overrightarrow{\!{\kern-.5pt}d})=\sigma_{2}(\overrightarrow{\!A},\overrightarrow{\!{\kern-.5pt}D}),\,\,\,(\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern.5pt}C})=\tau_{2}(\overrightarrow{\!b},\overrightarrow{\!c})\end{array}\bigg\}\]
\[S(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D}):=\bigg\{(\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in S_{2m}^{4}:\begin{array}{c}(\overrightarrow{\!A},\overrightarrow{\!{\kern-1pt}B})=\sigma_{1}(\overrightarrow{\!a},\overrightarrow{\!b}),\,\,\,(\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d})=\tau_{1}(\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!{\kern.5pt}C})\\(\overrightarrow{\!a},\overrightarrow{\!{\kern-.5pt}d})=\sigma_{2}(\overrightarrow{\!A},\overrightarrow{\!{\kern-.5pt}D}),\,\,\,(\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern.5pt}C})=\tau_{2}(\overrightarrow{\!b},\overrightarrow{\!c})\end{array}\bigg\}\]
and
 \begin{equation}Z:=\{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in{\mathcal I}_{m,d}\times[d]^{7m}\times S_{2m}^{4}:(\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in S(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D})\} .\end{equation}
\begin{equation}Z:=\{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in{\mathcal I}_{m,d}\times[d]^{7m}\times S_{2m}^{4}:(\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in S(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D})\} .\end{equation}
Lemma 3.2 We have
 \begin{equation}\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)=\sum_{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z}\sum_{\pi\in S_{m}}(-1)^{\pi}\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2}).\end{equation}
\begin{equation}\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)=\sum_{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z}\sum_{\pi\in S_{m}}(-1)^{\pi}\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2}).\end{equation}
Proof. Using Weingarten calculus, i.e. Theorem 2.12, and (3.3),
 \begin{align}&\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)\nonumber\\&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\overrightarrow{\!b},\ldots,\overrightarrow{\!{\kern-.5pt}D}\in[d]^{m}}\sum_{\pi\in S_{m}}(-1)^{\pi}\sum_{\sigma_{1},\widetilde{\sigma}_{2},\tau_{1},\tau_{2}\in S_{2m}}\delta_{(\overrightarrow{\!a},\overrightarrow{\!b}),\sigma_{1}^{-1}(\overrightarrow{\!A},\overrightarrow{\!{\kern-1pt}B})}\cdot\delta_{(\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!{\kern.5pt}C}),\tau_{1}^{-1}(\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d})}\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\nonumber\\&\qquad \cdot\delta_{(\overrightarrow{\!A},\overrightarrow{\!{\kern-.5pt}D}),\widetilde{\sigma}_{2}^{-1}(\pi^{-1}(\overrightarrow{\!a}),\overrightarrow{\!{\kern-.5pt}d})}\cdot\delta_{(\overrightarrow{\!b},\overrightarrow{\!c}),\tau_{2}^{-1}(\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern.5pt}C})}\mathrm{Wg}(\widetilde{\sigma}_{2}^{-1}\tau_{2}).\end{align}
\begin{align}&\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)\nonumber\\&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\overrightarrow{\!b},\ldots,\overrightarrow{\!{\kern-.5pt}D}\in[d]^{m}}\sum_{\pi\in S_{m}}(-1)^{\pi}\sum_{\sigma_{1},\widetilde{\sigma}_{2},\tau_{1},\tau_{2}\in S_{2m}}\delta_{(\overrightarrow{\!a},\overrightarrow{\!b}),\sigma_{1}^{-1}(\overrightarrow{\!A},\overrightarrow{\!{\kern-1pt}B})}\cdot\delta_{(\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!{\kern.5pt}C}),\tau_{1}^{-1}(\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d})}\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\nonumber\\&\qquad \cdot\delta_{(\overrightarrow{\!A},\overrightarrow{\!{\kern-.5pt}D}),\widetilde{\sigma}_{2}^{-1}(\pi^{-1}(\overrightarrow{\!a}),\overrightarrow{\!{\kern-.5pt}d})}\cdot\delta_{(\overrightarrow{\!b},\overrightarrow{\!c}),\tau_{2}^{-1}(\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern.5pt}C})}\mathrm{Wg}(\widetilde{\sigma}_{2}^{-1}\tau_{2}).\end{align}
 Applying the change of coordinate 
 $\sigma_{2}:=(\mathrm{\pi}\times\mathrm{Id})\circ\widetilde{\sigma}_{2}$
, and observing that
$\sigma_{2}:=(\mathrm{\pi}\times\mathrm{Id})\circ\widetilde{\sigma}_{2}$
, and observing that 
 $\widetilde{\sigma}_{2}^{-1}(\pi^{-1}(\overrightarrow{\!a}),\overrightarrow{\!{\kern-.5pt}d})=\sigma_{2}^{-1}(\overrightarrow{\!a},\overrightarrow{\!{\kern-.5pt}d})$
, (3.6) becomes
$\widetilde{\sigma}_{2}^{-1}(\pi^{-1}(\overrightarrow{\!a}),\overrightarrow{\!{\kern-.5pt}d})=\sigma_{2}^{-1}(\overrightarrow{\!a},\overrightarrow{\!{\kern-.5pt}d})$
, (3.6) becomes 
 \[\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)=\sum_{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z}\sum_{\pi\in S_{m}}(-1)^{\pi}\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\cdot\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2}).\]
\[\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)=\sum_{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z}\sum_{\pi\in S_{m}}(-1)^{\pi}\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\cdot\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2}).\]
 In order to bound (3.5), we consider a natural action of 
 $S_{m}^{7}$
 on Z, and find a suitable change of coordinates such that the average of the product of the Weingarten functions in (3.5) over any
$S_{m}^{7}$
 on Z, and find a suitable change of coordinates such that the average of the product of the Weingarten functions in (3.5) over any 
 $S_{m}^{7}$
-orbit is equal to a product of averages of individual Weingarten functions over cosets (see (3.8)). We then use Lemma 2.7 to estimate the contribution in (3.5) of each
$S_{m}^{7}$
-orbit is equal to a product of averages of individual Weingarten functions over cosets (see (3.8)). We then use Lemma 2.7 to estimate the contribution in (3.5) of each 
 $S_{m}^{7}$
-orbit. To conclude the estimates of (3.5), we will further provide estimates for
$S_{m}^{7}$
-orbit. To conclude the estimates of (3.5), we will further provide estimates for 
 $|Z|$
.
$|Z|$
.
 We first describe the action of 
 $S_{m}^{7}$
. The element
$S_{m}^{7}$
. The element 
 $(\pi_{b},\pi_{c},\ldots,\pi_{D})\in S_{m}^{7}$
 acts on
$(\pi_{b},\pi_{c},\ldots,\pi_{D})\in S_{m}^{7}$
 acts on 
 $(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D})$
 by
$(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D})$
 by 
 $(\overrightarrow{\!a},\pi_{b}(\overrightarrow{\!b}),\pi_{c}(\overrightarrow{\!c}),\ldots,\pi_{D}(\overrightarrow{\!{\kern-.5pt}D}))$
 and it acts on
$(\overrightarrow{\!a},\pi_{b}(\overrightarrow{\!b}),\pi_{c}(\overrightarrow{\!c}),\ldots,\pi_{D}(\overrightarrow{\!{\kern-.5pt}D}))$
 and it acts on 
 $(\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})$
 by
$(\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})$
 by 
 \[\sigma_{1}\mapsto(\pi_{A}\times\pi_{B})\circ\sigma_{1}\circ(\mathrm{Id}\times\pi_{b}^{-1}),\]
\[\sigma_{1}\mapsto(\pi_{A}\times\pi_{B})\circ\sigma_{1}\circ(\mathrm{Id}\times\pi_{b}^{-1}),\]
 \[\tau_{1}\mapsto(\pi_{c}\times\pi_{d})\circ\tau_{1}\circ(\pi_{D}^{-1}\times\pi_{C}^{-1}),\]
\[\tau_{1}\mapsto(\pi_{c}\times\pi_{d})\circ\tau_{1}\circ(\pi_{D}^{-1}\times\pi_{C}^{-1}),\]
 \[\sigma_{2}\mapsto(\mathrm{Id}\times\pi_{d})\circ\sigma_{2}\circ(\pi_{A}^{-1}\times\pi_{D}^{-1}),\]
\[\sigma_{2}\mapsto(\mathrm{Id}\times\pi_{d})\circ\sigma_{2}\circ(\pi_{A}^{-1}\times\pi_{D}^{-1}),\]
 \[\tau_{2}\mapsto(\pi_{B}\times\pi_{C})\circ\tau_{2}\circ(\pi_{b}^{-1}\times\pi_{c}^{-1}).\]
\[\tau_{2}\mapsto(\pi_{B}\times\pi_{C})\circ\tau_{2}\circ(\pi_{b}^{-1}\times\pi_{c}^{-1}).\]
 This gives rise to an action of 
 $S_{m}^{7}$
 on Z. The action on the input of the Weingarten functions becomes
$S_{m}^{7}$
 on Z. The action on the input of the Weingarten functions becomes 
 \begin{equation}\mathrm{Wg}((\pi_{D}^{-1}\times\pi_{C}^{-1}\pi_{b})\sigma_{1}^{-1}(\pi_{A}^{-1}\pi_{c}\times\pi_{B}^{-1}\pi_{d})\tau_{1})\text{and }\mathrm{Wg}(\pi_{b}^{-1}\pi_{A}\times\pi_{c}^{-1}\pi_{D})\sigma_{2}^{-1}(\pi\pi_{B}\times\pi_{d}^{-1}\pi_{C})\tau_{2}),\end{equation}
\begin{equation}\mathrm{Wg}((\pi_{D}^{-1}\times\pi_{C}^{-1}\pi_{b})\sigma_{1}^{-1}(\pi_{A}^{-1}\pi_{c}\times\pi_{B}^{-1}\pi_{d})\tau_{1})\text{and }\mathrm{Wg}(\pi_{b}^{-1}\pi_{A}\times\pi_{c}^{-1}\pi_{D})\sigma_{2}^{-1}(\pi\pi_{B}\times\pi_{d}^{-1}\pi_{C})\tau_{2}),\end{equation}
 where we used the conjugacy invariance of Wg to move permutations from right to left. Consider the bijection 
 $\psi:S_{m}^{8}\rightarrow S_{m}^{8}$
, defined by
$\psi:S_{m}^{8}\rightarrow S_{m}^{8}$
, defined by 
 $(x_{1},\ldots,x_{8})\mapsto(x_{1},x_{1}x_{2},\ldots,x_{1}x_{2},\ldots, x_{8})$
. Under the change of coordinates
$(x_{1},\ldots,x_{8})\mapsto(x_{1},x_{1}x_{2},\ldots,x_{1}x_{2},\ldots, x_{8})$
. Under the change of coordinates 
 $(\theta_{D},\theta_{c},\theta_{A},\theta_{b},\theta_{C},\theta_{d},\theta_{B},\theta):=\psi^{-1}(\pi_{D},\pi_{c},\pi_{A},\pi_{b},\pi_{C},\pi_{d},\pi_{B},\pi^{-1})$
, the summation of (3.5) over an
$(\theta_{D},\theta_{c},\theta_{A},\theta_{b},\theta_{C},\theta_{d},\theta_{B},\theta):=\psi^{-1}(\pi_{D},\pi_{c},\pi_{A},\pi_{b},\pi_{C},\pi_{d},\pi_{B},\pi^{-1})$
, the summation of (3.5) over an 
 $S_{m}^{7}$
-orbit splits into a product of two separate sums:
$S_{m}^{7}$
-orbit splits into a product of two separate sums: 
 \begin{align}& \sum_{(\pi_{D},\ldots,\pi)\in S_{m}^{8}}(-1)^{\pi}\mathrm{Wg}((\pi_{D}^{-1}\times\pi_{C}^{-1}\pi_{b})\sigma_{1}^{-1}(\pi_{A}^{-1}\pi_{c}\times\pi_{B}^{-1}\pi_{d})\tau_{1})\nonumber\\&\qquad \times \mathrm{Wg}(\pi_{b}^{-1}\pi_{A}\times\pi_{c}^{-1}\pi_{D})\sigma_{2}^{-1}(\mathrm{\pi}\pi_{B}\times\pi_{d}^{-1}\pi_{C})\tau_{2})\nonumber\\&\quad = \sum_{(\theta_{D},\ldots,\theta)\in S_{m}^{8}}(-1)^{\theta_{D}\cdots\theta}\mathrm{Wg}((\theta_{D}^{-1}\times\theta_{C}^{-1})\sigma_{1}^{-1}(\theta_{A}^{-1}\times\theta_{B}^{-1})\tau_{1})\mathrm{Wg}(\theta_{b}^{-1}\times\theta_{c}^{-1})\sigma_{2}^{-1}(\mathrm{\theta}^{-1}\times\theta_{d}^{-1})\tau_{2})\nonumber\\&\quad = \sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\mathrm{Wg}(\eta_{1}\sigma_{1}^{-1}\eta'_{1}\tau_{1})\sum_{\eta_{2},\eta'_{2}\in S_{m}^{2}}(-1)^{\eta_{2}\eta'_{2}}\mathrm{Wg}(\eta_{2}\sigma_{2}^{-1}\eta'_{2}\tau_{2}).\end{align}
\begin{align}& \sum_{(\pi_{D},\ldots,\pi)\in S_{m}^{8}}(-1)^{\pi}\mathrm{Wg}((\pi_{D}^{-1}\times\pi_{C}^{-1}\pi_{b})\sigma_{1}^{-1}(\pi_{A}^{-1}\pi_{c}\times\pi_{B}^{-1}\pi_{d})\tau_{1})\nonumber\\&\qquad \times \mathrm{Wg}(\pi_{b}^{-1}\pi_{A}\times\pi_{c}^{-1}\pi_{D})\sigma_{2}^{-1}(\mathrm{\pi}\pi_{B}\times\pi_{d}^{-1}\pi_{C})\tau_{2})\nonumber\\&\quad = \sum_{(\theta_{D},\ldots,\theta)\in S_{m}^{8}}(-1)^{\theta_{D}\cdots\theta}\mathrm{Wg}((\theta_{D}^{-1}\times\theta_{C}^{-1})\sigma_{1}^{-1}(\theta_{A}^{-1}\times\theta_{B}^{-1})\tau_{1})\mathrm{Wg}(\theta_{b}^{-1}\times\theta_{c}^{-1})\sigma_{2}^{-1}(\mathrm{\theta}^{-1}\times\theta_{d}^{-1})\tau_{2})\nonumber\\&\quad = \sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\mathrm{Wg}(\eta_{1}\sigma_{1}^{-1}\eta'_{1}\tau_{1})\sum_{\eta_{2},\eta'_{2}\in S_{m}^{2}}(-1)^{\eta_{2}\eta'_{2}}\mathrm{Wg}(\eta_{2}\sigma_{2}^{-1}\eta'_{2}\tau_{2}).\end{align}
 We can now use the Fourier expansion of Wg (2.11) and the estimates in § 2.1.3 to bound the contribution of an 
 $S_{m}^{7}$
-orbit in Z to (3.5).
$S_{m}^{7}$
-orbit in Z to (3.5).
Proposition 3.3. Let 
 $\widetilde{v}:=(\widetilde{\overrightarrow{\!a}},\ldots,\,\widetilde{\overrightarrow{\!{\kern-.5pt}D}},\widetilde{\sigma}_{1},\widetilde{\sigma}_{2},\widetilde{\tau}_{1},\widetilde{\tau}_{2})\in Z$
 and let
$\widetilde{v}:=(\widetilde{\overrightarrow{\!a}},\ldots,\,\widetilde{\overrightarrow{\!{\kern-.5pt}D}},\widetilde{\sigma}_{1},\widetilde{\sigma}_{2},\widetilde{\tau}_{1},\widetilde{\tau}_{2})\in Z$
 and let 
 ${\mathcal O}_{\widetilde{v}}:=S_{m}^{7}\widetilde{v}$
 be its
${\mathcal O}_{\widetilde{v}}:=S_{m}^{7}\widetilde{v}$
 be its 
 $S_{m}^{7}$
-orbit. Then,
$S_{m}^{7}$
-orbit. Then, 
 \begin{equation}\bigg|\frac{1}{|{\mathcal{O}}_{\widetilde{v}}|}\sum_{(\overrightarrow{\!a},\ldots,\tau_{2})\in{\mathcal{O}}_{\widetilde{v}}}\sum_{\pi\in S_{m}}(-1)^{\pi}\,\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\,\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2})\bigg|\leq\frac{1}{(2m)!^{2}m!^{3}}\binom{d}{2m}^{\!\!-2}.\end{equation}
\begin{equation}\bigg|\frac{1}{|{\mathcal{O}}_{\widetilde{v}}|}\sum_{(\overrightarrow{\!a},\ldots,\tau_{2})\in{\mathcal{O}}_{\widetilde{v}}}\sum_{\pi\in S_{m}}(-1)^{\pi}\,\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\,\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2})\bigg|\leq\frac{1}{(2m)!^{2}m!^{3}}\binom{d}{2m}^{\!\!-2}.\end{equation}
Proof. By the orbit-stabilizer theorem, the left-hand side of (3.9) is the same as summing over all 
 $(\pi_{D},\ldots,\pi_{B})\in S_{m}^{7}$
 and dividing by
$(\pi_{D},\ldots,\pi_{B})\in S_{m}^{7}$
 and dividing by 
 $m!^{7}$
. By (3.8), the left-hand side of (3.9) is equal to
$m!^{7}$
. By (3.8), the left-hand side of (3.9) is equal to 
 \[\frac{1}{m!^{7}}\bigg|\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\,\mathrm{Wg}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\sum_{\eta_{2},\eta'_{2}\in S_{m}^{2}}(-1)^{\eta_{2}\eta'_{2}}\,\mathrm{Wg}(\eta_{2}\widetilde{\sigma}_{2}^{-1}\eta'_{2}\widetilde{\tau}_{2})\bigg|.\]
\[\frac{1}{m!^{7}}\bigg|\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\,\mathrm{Wg}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\sum_{\eta_{2},\eta'_{2}\in S_{m}^{2}}(-1)^{\eta_{2}\eta'_{2}}\,\mathrm{Wg}(\eta_{2}\widetilde{\sigma}_{2}^{-1}\eta'_{2}\widetilde{\tau}_{2})\bigg|.\]
 Note that 
 $(S_{2m},S_{m}\times S_{m})$
 is a sgn-twisted Gelfand pair, that is, the representation
$(S_{2m},S_{m}\times S_{m})$
 is a sgn-twisted Gelfand pair, that is, the representation 
 $\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\,\mathrm{sgn}$
 is multiplicity-free. By Frobenius reciprocity, each irreducible subrepresentation
$\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\,\mathrm{sgn}$
 is multiplicity-free. By Frobenius reciprocity, each irreducible subrepresentation 
 $(V_{\lambda},\pi_{\lambda})$
 of
$(V_{\lambda},\pi_{\lambda})$
 of 
 $\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\mathrm{sgn}$
 has a unique
$\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\mathrm{sgn}$
 has a unique 
 $(S_{m}^{2},\mathrm{sgn})$
-invariant unit vector, so
$(S_{m}^{2},\mathrm{sgn})$
-invariant unit vector, so 
 $\langle \chi_{\lambda}|_{S_{m}^{2}},\mathrm{sgn}\rangle_{S_{m}^{2}}=1$
. By Lemma 2.7, for each
$\langle \chi_{\lambda}|_{S_{m}^{2}},\mathrm{sgn}\rangle_{S_{m}^{2}}=1$
. By Lemma 2.7, for each 
 $\sigma\in S_{2m}$
, we have
$\sigma\in S_{2m}$
, we have 
 \begin{equation}\bigg|\!\sum_{h\in S_{m}^{2}}(-1)^{h}\chi_{\lambda}(h\sigma)\!\bigg|\leq m!^{2}\langle \chi_{\lambda}|_{S_{m}^{2}},\mathrm{sgn}\rangle_{S_{m}^{2}}=\begin{cases}m!^{2} & \text{if }\pi_{\lambda}\hookrightarrow\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\,\mathrm{sgn},\\0 & \text{otherwise}.\end{cases}\end{equation}
\begin{equation}\bigg|\!\sum_{h\in S_{m}^{2}}(-1)^{h}\chi_{\lambda}(h\sigma)\!\bigg|\leq m!^{2}\langle \chi_{\lambda}|_{S_{m}^{2}},\mathrm{sgn}\rangle_{S_{m}^{2}}=\begin{cases}m!^{2} & \text{if }\pi_{\lambda}\hookrightarrow\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\,\mathrm{sgn},\\0 & \text{otherwise}.\end{cases}\end{equation}
 By Lemma 2.3 it follows that 
 $\pi_{\lambda}\hookrightarrow\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\mathrm{sgn}$
 if and only if the Young diagram of
$\pi_{\lambda}\hookrightarrow\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\mathrm{sgn}$
 if and only if the Young diagram of 
 $\lambda\vdash2m$
 has at most two columns. Combining with (3.10), we have
$\lambda\vdash2m$
 has at most two columns. Combining with (3.10), we have 
 \begin{align}& \bigg|\!\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|=\bigg|\!\sum_{\eta_{1}'\in S_{m}^{2}}(-1)^{\eta'_{1}}\sum_{\eta_{1}\in S_{m}^{2}}(-1)^{\eta_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|\nonumber\\&\quad \leq \sum_{\eta_{1}'\in S_{m}^{2}}\bigg|\!\sum_{\eta_{1}\in S_{m}^{2}}(-1)^{\eta_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|\leq\begin{cases}m!^{4} & \text{if }\lambda\vdash2m,\ \lambda\text{ has } \leq2\text{ columns},\\0 & \text{otherwise}.\end{cases}\end{align}
\begin{align}& \bigg|\!\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|=\bigg|\!\sum_{\eta_{1}'\in S_{m}^{2}}(-1)^{\eta'_{1}}\sum_{\eta_{1}\in S_{m}^{2}}(-1)^{\eta_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|\nonumber\\&\quad \leq \sum_{\eta_{1}'\in S_{m}^{2}}\bigg|\!\sum_{\eta_{1}\in S_{m}^{2}}(-1)^{\eta_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|\leq\begin{cases}m!^{4} & \text{if }\lambda\vdash2m,\ \lambda\text{ has } \leq2\text{ columns},\\0 & \text{otherwise}.\end{cases}\end{align}
 By (2.11), (3.11), (2.3), and by our assumption that 
 $d\geq2m$
, we have
$d\geq2m$
, we have 
 \begin{align*}&\bigg|\!\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\mathrm{Wg}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|=\frac{1}{(2m)!^{2}}\,\bigg|\!\sum_{\lambda\vdash2m}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|\\&\quad \leq \frac{m!^{4}}{(2m)!^{2}}\sum_{\lambda\vdash2m,\,\lambda\text{ has}\leq2\text{ columns}}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}=\frac{m!^{4}}{(2m)!}\sum_{\lambda\vdash2m,\,\lambda\text{has }\leq2\text{ columns}}\frac{\chi_{\lambda}(1)}{\prod_{(i,j)\in\lambda}(d+j-i)}\\&\quad \leq \frac{m!^{4}}{(2m)!}\cdot\frac{1}{d\cdot\cdots\cdot(d-2m+1)}\sum_{\lambda\vdash2m,\,\lambda\text{ has }\leq2\text{columns}}\chi_{\lambda}(1)=\frac{m!^{4}}{(2m)!}\frac{\dim\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\,\mathrm{sgn}}{d\cdot\cdots\cdot(d-2m+1)}\\&\quad =\frac{m!^{2}}{(2m)!}\binom{d}{2m}^{\!\!-1}.\end{align*}
\begin{align*}&\bigg|\!\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\mathrm{Wg}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|=\frac{1}{(2m)!^{2}}\,\bigg|\!\sum_{\lambda\vdash2m}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\sum_{\eta_{1},\eta'_{1}\in S_{m}^{2}}(-1)^{\eta_{1}\eta'_{1}}\chi_{\lambda}(\eta_{1}\widetilde{\sigma}_{1}^{-1}\eta'_{1}\widetilde{\tau}_{1})\bigg|\\&\quad \leq \frac{m!^{4}}{(2m)!^{2}}\sum_{\lambda\vdash2m,\,\lambda\text{ has}\leq2\text{ columns}}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}=\frac{m!^{4}}{(2m)!}\sum_{\lambda\vdash2m,\,\lambda\text{has }\leq2\text{ columns}}\frac{\chi_{\lambda}(1)}{\prod_{(i,j)\in\lambda}(d+j-i)}\\&\quad \leq \frac{m!^{4}}{(2m)!}\cdot\frac{1}{d\cdot\cdots\cdot(d-2m+1)}\sum_{\lambda\vdash2m,\,\lambda\text{ has }\leq2\text{columns}}\chi_{\lambda}(1)=\frac{m!^{4}}{(2m)!}\frac{\dim\mathrm{Ind}_{S_{m}^{2}}^{S_{2m}}\,\mathrm{sgn}}{d\cdot\cdots\cdot(d-2m+1)}\\&\quad =\frac{m!^{2}}{(2m)!}\binom{d}{2m}^{\!\!-1}.\end{align*}
This concludes the proposition.
We now turn to the last ingredient in the proof of Theorem 3.1.
Definition 3.4. Let 
 $f:S\rightarrow[d]$
 be a function on a set S. We define the shape
$f:S\rightarrow[d]$
 be a function on a set S. We define the shape 
 $\nu_{f}:[d]\rightarrow\mathbb{N}$
 of f as
$\nu_{f}:[d]\rightarrow\mathbb{N}$
 of f as 
 \[\nu_{f}=(\nu_{f,1},\ldots,\nu_{f,d}):=(|f^{-1}(1)|,\ldots,|f^{-1}(d)|),\]
\[\nu_{f}=(\nu_{f,1},\ldots,\nu_{f,d}):=(|f^{-1}(1)|,\ldots,|f^{-1}(d)|),\]
 and denote 
 $\nu_{f}!:=\prod_{u=1}^{d}\nu_{f,u}$
.
$\nu_{f}!:=\prod_{u=1}^{d}\nu_{f,u}$
.
Proposition 3.5. Let Z be as in (3.4). Then,
 \[|Z|\leq m!^{7}\binom{2m}{m}^{\!\!4}\binom{d}{m}\binom{d+m-1}{m}^{\!\!3}.\]
\[|Z|\leq m!^{7}\binom{2m}{m}^{\!\!4}\binom{d}{m}\binom{d+m-1}{m}^{\!\!3}.\]
Proof. We need to count all the possible tuples 
 $(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})$
 in Z. Suppose we have already fixed
$(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})$
 in Z. Suppose we have already fixed 
 $\overrightarrow{\!a}$
 and the shapes
$\overrightarrow{\!a}$
 and the shapes 
 of
 of 
 $\overrightarrow{\!b},\overrightarrow{\!c}$
 and
$\overrightarrow{\!b},\overrightarrow{\!c}$
 and 
 $\overrightarrow{\!{\kern-.5pt}d}$
, where
$\overrightarrow{\!{\kern-.5pt}d}$
, where 
 $\overrightarrow{\!b},\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 are considered as a functions
$\overrightarrow{\!b},\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 are considered as a functions 
 $[m]\rightarrow[d]$
. Given these data, we have the following.
$[m]\rightarrow[d]$
. Given these data, we have the following.
- 
(1) There are  $\frac{m!^{3}}{\nu_{\overrightarrow{\!b}}!\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!}$
 options for $\frac{m!^{3}}{\nu_{\overrightarrow{\!b}}!\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!}$
 options for $\overrightarrow{\!b},\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 with the above shapes. $\overrightarrow{\!b},\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 with the above shapes.
- 
(2) There are  $(2m)!^{2}$
 options for $(2m)!^{2}$
 options for $\sigma_{2}$
 and $\sigma_{2}$
 and $\tau_{2}$
. $\tau_{2}$
.
- 
(3) There are at most  $\binom{2m}{m}^{2}$
 options for choosing $\binom{2m}{m}^{2}$
 options for choosing $\tau_{1}^{-1}([m])$
 and $\tau_{1}^{-1}([m])$
 and $\sigma_{1}([m])$
, as subsets of [2m]. Note that we count both valid and invalid options. $\sigma_{1}([m])$
, as subsets of [2m]. Note that we count both valid and invalid options.
- 
(4) After fixing the subsets  $\tau_{1}^{-1}([m])$
 and $\tau_{1}^{-1}([m])$
 and $\sigma_{1}([m])$
, there are at most $\sigma_{1}([m])$
, there are at most $\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!$
 options for $\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!$
 options for $\tau_{1}$
 and $\tau_{1}$
 and $\nu_{\overrightarrow{\!b}}!$
 options for $\nu_{\overrightarrow{\!b}}!$
 options for $\sigma_{1}$
. $\sigma_{1}$
.
 Summarizing the above items, we get there are at most 
 $(\frac{m!^{3}(2m)!^{2}\nu_{\overrightarrow{\!b}}!\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!}{\nu_{\overrightarrow{\!b}}!\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!})\binom{2m}{m}^{\!2}=m!^{7}\binom{2m}{m}^{\!4}$
 options for
$(\frac{m!^{3}(2m)!^{2}\nu_{\overrightarrow{\!b}}!\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!}{\nu_{\overrightarrow{\!b}}!\nu_{\overrightarrow{\!c}}!\nu_{\overrightarrow{\!{\kern-.5pt}d}}!})\binom{2m}{m}^{\!2}=m!^{7}\binom{2m}{m}^{\!4}$
 options for 
 $(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z$
 with the initial data
$(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z$
 with the initial data 
 . Note that there are
. Note that there are 
 $\binom{d}{m}$
 possible options for
$\binom{d}{m}$
 possible options for 
 $\overrightarrow{\!a}$
, and
$\overrightarrow{\!a}$
, and 
 $\binom{d+m-1}{m}^{\!3}$
 options for
$\binom{d+m-1}{m}^{\!3}$
 options for  . This gives the desired upper bound.
. This gives the desired upper bound.
We can now finish the proof of Theorem 3.1.
Proof of Theorem 3.1. Note that for every 
 $k\geq1$
 and
$k\geq1$
 and 
 $n\geq k$
 we have
$n\geq k$
 we have 
 \begin{equation}\bigg(\frac{n}{k}\bigg)^{\!\!k}\leq\prod_{j=0}^{k-1}\bigg(\frac{n-j}{k-j}\bigg)=\binom{n}{k}\leq\frac{n^{k}}{k!}\leq\bigg(\frac{n}{k}\bigg)^{\!\!k}e^{k},\end{equation}
\begin{equation}\bigg(\frac{n}{k}\bigg)^{\!\!k}\leq\prod_{j=0}^{k-1}\bigg(\frac{n-j}{k-j}\bigg)=\binom{n}{k}\leq\frac{n^{k}}{k!}\leq\bigg(\frac{n}{k}\bigg)^{\!\!k}e^{k},\end{equation}
where the rightmost inequality follows from Stirling’s approximation. By Lemma 3.2 and by Propositions 3.3 and 3.5,
 \begin{align}\Big|\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)\!\Big|& =|Z|\cdot\bigg|\frac{1}{|Z|}\sum_{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z}\sum_{\pi\in S_{m}}(-1)^{\pi}\,\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\cdot\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2})\bigg|\nonumber \\&\leq|Z|\cdot\frac{1}{(2m)!^{2}m!^{3}}\binom{d}{2m}^{\!\!-2}\leq\binom{2m}{m}^{\!\!2}\binom{d}{m}\binom{d+m}{m}^{\!\!3}\binom{d}{2m}^{\!\!-2}.\end{align}
\begin{align}\Big|\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)\!\Big|& =|Z|\cdot\bigg|\frac{1}{|Z|}\sum_{(\overrightarrow{\!a},\ldots,\overrightarrow{\!{\kern-.5pt}D},\sigma_{1},\sigma_{2},\tau_{1},\tau_{2})\in Z}\sum_{\pi\in S_{m}}(-1)^{\pi}\,\mathrm{Wg}(\sigma_{1}^{-1}\tau_{1})\cdot\mathrm{Wg}(\sigma_{2}^{-1}(\mathrm{\pi}\times\mathrm{Id})\tau_{2})\bigg|\nonumber \\&\leq|Z|\cdot\frac{1}{(2m)!^{2}m!^{3}}\binom{d}{2m}^{\!\!-2}\leq\binom{2m}{m}^{\!\!2}\binom{d}{m}\binom{d+m}{m}^{\!\!3}\binom{d}{2m}^{\!\!-2}.\end{align}
 By (3.13) and (3.12), by the inequality 
 $\binom{2m}{m}\leq2^{2m}$
, and by our assumption that
$\binom{2m}{m}\leq2^{2m}$
, and by our assumption that 
 $d\geq2m$
,
$d\geq2m$
, 
 \[\Big|\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)\!\Big|\leq\frac{2^{4m}e^{4m}\big(\frac{d}{m}\big)^{m}\big(\frac{d+m}{m}\big)^{3m}}{\big(\frac{d}{2m}\big)^{4m}}\leq\frac{2^{7m}e^{4m}\big(\frac{d}{m}\big)^{4m}}{\big(\frac{d}{2m}\big)^{4m}}\leq2^{11m}e^{4m}\leq2^{17m}.\]
\[\Big|\mathbb{E}\Big(\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X,Y)\Big)\!\Big|\leq\frac{2^{4m}e^{4m}\big(\frac{d}{m}\big)^{m}\big(\frac{d+m}{m}\big)^{3m}}{\big(\frac{d}{2m}\big)^{4m}}\leq\frac{2^{7m}e^{4m}\big(\frac{d}{m}\big)^{4m}}{\big(\frac{d}{2m}\big)^{4m}}\leq2^{11m}e^{4m}\leq2^{17m}.\]
Remark 3.6. The current proof of Proposition 3.5 depends on the special structure of the Engel word. One can give a slightly more complicated argument, which can be easily generalized for every word w (this is done in § 6). Here are the main ideas of this alternative argument.
We encode the expression
 \begin{equation}X_{a,D}Y_{D,c}X_{c,A}^{-1}Y_{A,b}X_{b,C}Y_{C,d}^{-1}X_{d,B}^{-1}Y_{B,a}^{-1}\end{equation}
\begin{equation}X_{a,D}Y_{D,c}X_{c,A}^{-1}Y_{A,b}X_{b,C}Y_{C,d}^{-1}X_{d,B}^{-1}Y_{B,a}^{-1}\end{equation}
 from (3.2), graphically, by the 
 $4\times4$
 matrix
$4\times4$
 matrix 
 \begin{equation}\left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}\cdot & C & D & \cdot\\c & \cdot & \cdot & b\\d & \cdot & \cdot & a\\\cdot & B & A & \cdot\end{array}\right),\end{equation}
\begin{equation}\left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}\cdot & C & D & \cdot\\c & \cdot & \cdot & b\\d & \cdot & \cdot & a\\\cdot & B & A & \cdot\end{array}\right),\end{equation}
 which is constructed as follows. The rows and columns are indexed by 
 $x,y,x^{-1},y^{-1}$
. We order the rows by
$x,y,x^{-1},y^{-1}$
. We order the rows by 
 $x<y<y^{-1}<x^{-1}$
 and order the columns by
$x<y<y^{-1}<x^{-1}$
 and order the columns by 
 $x^{-1}<y^{-1}<y<x$
. To find the
$x^{-1}<y^{-1}<y<x$
. To find the 
 $(x,y^{-1})$
-entry of this matrix (i.e. the (1,2)-entry), we look for the subword
$(x,y^{-1})$
-entry of this matrix (i.e. the (1,2)-entry), we look for the subword 
 $XY^{-1}$
 in (3.14) and record the letter of the common index, which is C. All other entries are determined in similar fashion. Note that we do not have elements in the main diagonal since w is cyclically reduced.
$XY^{-1}$
 in (3.14) and record the letter of the common index, which is C. All other entries are determined in similar fashion. Note that we do not have elements in the main diagonal since w is cyclically reduced.
 We denote 
 $\eta_{1}=\tau_{1}$
,
$\eta_{1}=\tau_{1}$
, 
 $\eta_{2}=\tau_{2}$
,
$\eta_{2}=\tau_{2}$
, 
 $\eta_{3}=\sigma_{2}^{-1}$
 and
$\eta_{3}=\sigma_{2}^{-1}$
 and 
 $\eta_{4}=\sigma_{1}^{-1}$
. Note that
$\eta_{4}=\sigma_{1}^{-1}$
. Note that 
 $\eta_{i}$
 sends the ith row of (3.15) into a permuted copy of its ith column. The alternative counting argument in Proposition 3.5 goes as follows. We fix the upper triangular part, i.e.
$\eta_{i}$
 sends the ith row of (3.15) into a permuted copy of its ith column. The alternative counting argument in Proposition 3.5 goes as follows. We fix the upper triangular part, i.e. 
 $\overrightarrow{\!{\kern.5pt}C},\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!a},\overrightarrow{\!b}$
 (instead of
$\overrightarrow{\!{\kern.5pt}C},\overrightarrow{\!{\kern-.5pt}D},\overrightarrow{\!a},\overrightarrow{\!b}$
 (instead of 
 $\overrightarrow{\!a},\overrightarrow{\!b},\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 in the proof above). We then choose
$\overrightarrow{\!a},\overrightarrow{\!b},\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 in the proof above). We then choose 
 $\eta_{1}$
 (with
$\eta_{1}$
 (with 
 $2m!$
 options), which gives us
$2m!$
 options), which gives us 
 $\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 and, in particular, reveals the second row. Next, we choose all possible
$\overrightarrow{\!c},\overrightarrow{\!{\kern-.5pt}d}$
 and, in particular, reveals the second row. Next, we choose all possible 
 $\eta_{2}:(\overrightarrow{\!b},\overrightarrow{\!c})\rightarrow(\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern.5pt}C})$
, taking into consideration the fact that
$\eta_{2}:(\overrightarrow{\!b},\overrightarrow{\!c})\rightarrow(\overrightarrow{\!{\kern-1pt}B},\overrightarrow{\!{\kern.5pt}C})$
, taking into consideration the fact that 
 $\overrightarrow{\!{\kern.5pt}C}$
 is already known. We then proceed to the next row and guess
$\overrightarrow{\!{\kern.5pt}C}$
 is already known. We then proceed to the next row and guess 
 $\eta_{3}$
, taking into consideration that we already know
$\eta_{3}$
, taking into consideration that we already know 
 $\overrightarrow{\!{\kern-.5pt}D}$
. At this point, the vectors
$\overrightarrow{\!{\kern-.5pt}D}$
. At this point, the vectors 
 $\overrightarrow{\!a},\overrightarrow{\!b},\ldots,\overrightarrow{\!{\kern.5pt}C},\overrightarrow{\!{\kern-.5pt}D}$
 and the permutations
$\overrightarrow{\!a},\overrightarrow{\!b},\ldots,\overrightarrow{\!{\kern.5pt}C},\overrightarrow{\!{\kern-.5pt}D}$
 and the permutations 
 $\eta_{1},\eta_{2},\eta_{3}$
 are known, and the number of options for
$\eta_{1},\eta_{2},\eta_{3}$
 are known, and the number of options for 
 $\eta_{4}$
 is determined by the shapes of
$\eta_{4}$
 is determined by the shapes of 
 $\overrightarrow{\!a},\overrightarrow{\!b}$
. This argument will be generalized in § 6 for arbitrary words, where, instead of a
$\overrightarrow{\!a},\overrightarrow{\!b}$
. This argument will be generalized in § 6 for arbitrary words, where, instead of a 
 $4\times4$
 matrix, we will have a
$4\times4$
 matrix, we will have a 
 $2r\times2r$
 matrix and, each time we choose
$2r\times2r$
 matrix and, each time we choose 
 $\eta_{1},\ldots,\eta_{k}$
, the
$\eta_{1},\ldots,\eta_{k}$
, the 
 $k+1$
th row is revealed, allowing us to proceed by induction.
$k+1$
th row is revealed, allowing us to proceed by induction.
4 Rewriting Theorem 1.3 using Weingarten calculus
 In this section, we rewrite the expression 
 $\mathbb{E}\big(|\mathrm{tr}\bigwedge\nolimits^{\!m}w(X_{1},\ldots,X_{r})|^{2}\big)$
 of Theorem 1.3 as a finite sum of Weingarten functions.
$\mathbb{E}\big(|\mathrm{tr}\bigwedge\nolimits^{\!m}w(X_{1},\ldots,X_{r})|^{2}\big)$
 of Theorem 1.3 as a finite sum of Weingarten functions.
 Let 
 $\ell,m,d,w$
 be as in Theorem 1.3. We may assume that w is cyclically reduced, i.e. it does not contain a subword of the form
$\ell,m,d,w$
 be as in Theorem 1.3. We may assume that w is cyclically reduced, i.e. it does not contain a subword of the form 
 $x_{j}x_{j}^{-1}$
 and the first and last letters of w are not inverse of each other. For
$x_{j}x_{j}^{-1}$
 and the first and last letters of w are not inverse of each other. For 
 $u\in[\ell]$
, let
$u\in[\ell]$
, let 
 \[w(u)=\begin{cases}a & \text{if the } u\text{th letter of } w \text{ is } x_{a},\\-a & \text{if the } u\text{th letter of } w \text{ is } x_{a}^{-1}.\end{cases}\]
\[w(u)=\begin{cases}a & \text{if the } u\text{th letter of } w \text{ is } x_{a},\\-a & \text{if the } u\text{th letter of } w \text{ is } x_{a}^{-1}.\end{cases}\]
 If we denote 
 $x_{-a}=x_{a}^{-1}$
, then
$x_{-a}=x_{a}^{-1}$
, then 
 $w=\prod_{u}x_{w(u)}$
. We write
$w=\prod_{u}x_{w(u)}$
. We write 
 $w^{-1}$
 for the inverse word,
$w^{-1}$
 for the inverse word, 
 \begin{equation}w^{-1}:=x_{-w(\ell)}x_{-w(\ell-1)}\cdots x_{-1}.\end{equation}
\begin{equation}w^{-1}:=x_{-w(\ell)}x_{-w(\ell-1)}\cdots x_{-1}.\end{equation}
We start by noting that
 \begin{align}&\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\nonumber\\&\quad =\mathbb{E}\Big(\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\cdot\overline{\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)}\Big)\nonumber\\&\quad =\mathbb{E}\Big(\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\cdot\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w^{-1}(X_{1},\ldots,X_{r})\Big)\Big).\end{align}
\begin{align}&\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\nonumber\\&\quad =\mathbb{E}\Big(\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\cdot\overline{\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)}\Big)\nonumber\\&\quad =\mathbb{E}\Big(\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\cdot\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w^{-1}(X_{1},\ldots,X_{r})\Big)\Big).\end{align}
 Define 
 $\widetilde{T}\in\mathrm{Sym}([\ell]\times[m])$
 by
$\widetilde{T}\in\mathrm{Sym}([\ell]\times[m])$
 by 
 \begin{equation}\widetilde{T}(u,k)=\begin{cases}(u+1,k) & u\neq\ell,\\(1,k) & u=\ell.\end{cases}\end{equation}
\begin{equation}\widetilde{T}(u,k)=\begin{cases}(u+1,k) & u\neq\ell,\\(1,k) & u=\ell.\end{cases}\end{equation}
 Recall that 
 ${\mathcal I}_{m,d}=\{a_{1}<\cdots<a_{m}:a_{i}\in[d]\}$
. We have
${\mathcal I}_{m,d}=\{a_{1}<\cdots<a_{m}:a_{i}\in[d]\}$
. We have 
 \begin{align}&\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\nonumber\\&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}\prod_{k=1}^{m}w(X_{1},\ldots,X_{r})_{a_{k},a_{\pi(k)}}\nonumber\\&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}\prod_{k=1}^{m}\sum_{\substack{f_{k}:[\ell+1]\rightarrow[d]\\f_{k}(1)=a_{k},f_{k}(\ell+1)=a_{\pi(k)} }}\prod_{u=1}^{\ell}(X_{w(u)})_{f_{k}(u),f_{k}(u+1)}\nonumber\\[4pt]&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}\sum_{\substack{f:[\ell+1]\times[m]\rightarrow[d]\\f(1,k')=a_{k'},f(\ell+1,k')=a_{\pi(k')},\forall k'}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{f(u,k),f(u+1,k)}\nonumber\\[4pt]&\quad =\sum_{\pi\in S_{m}}(-1)^{\pi}\sum_{\substack{f:[\ell+1]\times[m]\rightarrow[d]\\f(\ell+1,k')=f(1,\pi(k')),\forall k'\\ f(1,-)\text{ increasing}}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{f(u,k),f(u+1,k)}\nonumber\\[4pt]&\quad =\sum_{\pi\in\mathrm{Sym}(\{ \ell\}\times[m])}(-1)^{\pi}\sum_{\substack{F:[\ell]\times[m]\rightarrow[d]\\ F(1,-)\text{increasing}}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{F(u,k),F(\widetilde{T}\pi(u,k))},\end{align}
\begin{align}&\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\nonumber\\&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}\prod_{k=1}^{m}w(X_{1},\ldots,X_{r})_{a_{k},a_{\pi(k)}}\nonumber\\&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}\prod_{k=1}^{m}\sum_{\substack{f_{k}:[\ell+1]\rightarrow[d]\\f_{k}(1)=a_{k},f_{k}(\ell+1)=a_{\pi(k)} }}\prod_{u=1}^{\ell}(X_{w(u)})_{f_{k}(u),f_{k}(u+1)}\nonumber\\[4pt]&\quad =\sum_{\overrightarrow{\!a}\in{\mathcal I}_{m,d}}\sum_{\pi\in S_{m}}(-1)^{\pi}\sum_{\substack{f:[\ell+1]\times[m]\rightarrow[d]\\f(1,k')=a_{k'},f(\ell+1,k')=a_{\pi(k')},\forall k'}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{f(u,k),f(u+1,k)}\nonumber\\[4pt]&\quad =\sum_{\pi\in S_{m}}(-1)^{\pi}\sum_{\substack{f:[\ell+1]\times[m]\rightarrow[d]\\f(\ell+1,k')=f(1,\pi(k')),\forall k'\\ f(1,-)\text{ increasing}}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{f(u,k),f(u+1,k)}\nonumber\\[4pt]&\quad =\sum_{\pi\in\mathrm{Sym}(\{ \ell\}\times[m])}(-1)^{\pi}\sum_{\substack{F:[\ell]\times[m]\rightarrow[d]\\ F(1,-)\text{increasing}}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{F(u,k),F(\widetilde{T}\pi(u,k))},\end{align}
 where in the last equality we use the natural embedding 
 $\mathrm{Sym}(\{ \ell\}\times[m])\hookrightarrow\mathrm{Sym}([\ell]\times[m])$
 obtained by acting trivially on
$\mathrm{Sym}(\{ \ell\}\times[m])\hookrightarrow\mathrm{Sym}([\ell]\times[m])$
 obtained by acting trivially on 
 $[\ell-1]\times[m]$
. Applying this to
$[\ell-1]\times[m]$
. Applying this to 
 $w^{-1}$
, we get
$w^{-1}$
, we get 
 \begin{align}&\overline{\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})}\nonumber\\[4pt]&\quad =\sum_{\pi'\in\mathrm{Sym}(\{\ell\} \times[m])}(-1)^{\pi'}\sum_{\substack{F':[\ell]\times[m]\rightarrow[d]\\F'(1,-)\text{ increasing}}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w^{-1}(u)})_{F'(u,k),F'(\widetilde{T}\pi'(u,k))}.\end{align}
\begin{align}&\overline{\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})}\nonumber\\[4pt]&\quad =\sum_{\pi'\in\mathrm{Sym}(\{\ell\} \times[m])}(-1)^{\pi'}\sum_{\substack{F':[\ell]\times[m]\rightarrow[d]\\F'(1,-)\text{ increasing}}}\prod_{(u,k)\in[\ell]\times[m]}(X_{w^{-1}(u)})_{F'(u,k),F'(\widetilde{T}\pi'(u,k))}.\end{align}
 Set 
 $\Omega=[2]\times[\ell]\times[m]$
,
$\Omega=[2]\times[\ell]\times[m]$
, 
 $\Omega_{s,u}=\{ s\} \times\{ u\} \times[m]$
, and for
$\Omega_{s,u}=\{ s\} \times\{ u\} \times[m]$
, and for 
 $\gamma\in\Omega$
, define
$\gamma\in\Omega$
, define 
 \[\widetilde{w}(\gamma)=\begin{cases}w(u) & \gamma=(1,u,k),\\w^{-1}(u) & \gamma=(2,u,k).\end{cases}\]
\[\widetilde{w}(\gamma)=\begin{cases}w(u) & \gamma=(1,u,k),\\w^{-1}(u) & \gamma=(2,u,k).\end{cases}\]
 Define 
 $T\in\mathrm{Sym}(\Omega)$
 by
$T\in\mathrm{Sym}(\Omega)$
 by 
 \begin{equation}T(s,u,k):=(s,\widetilde{T}(u,k)).\end{equation}
\begin{equation}T(s,u,k):=(s,\widetilde{T}(u,k)).\end{equation}
By combining (4.4) and (4.5), we get
 \begin{equation}\Big|\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big|^{2}=\sum_{(\pi,\pi')\in\prod_{s=1}^{2}\mathrm{Sym}(\Omega_{s,\ell})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\\F(1,1,-)\ \text{increasing}\\ F(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\gamma),F(T\pi\pi'(\gamma))}.\end{equation}
\begin{equation}\Big|\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big|^{2}=\sum_{(\pi,\pi')\in\prod_{s=1}^{2}\mathrm{Sym}(\Omega_{s,\ell})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\\F(1,1,-)\ \text{increasing}\\ F(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\gamma),F(T\pi\pi'(\gamma))}.\end{equation}
 The map 
 $\pi\mapsto T\pi T^{-1}$
 is an isomorphism
$\pi\mapsto T\pi T^{-1}$
 is an isomorphism 
 $\mathrm{Sym}(\Omega_{s,\ell})\overset{\simeq}{\rightarrow}\mathrm{Sym}(\Omega_{s,1})$
, for
$\mathrm{Sym}(\Omega_{s,\ell})\overset{\simeq}{\rightarrow}\mathrm{Sym}(\Omega_{s,1})$
, for 
 $s\in[2]$
. Hence,
$s\in[2]$
. Hence, 
 \begin{align}&\Big|\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big|^{2}\nonumber\\&\quad =\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\nonumber\\F(1,1,-)\ \text{increasing}\\ F(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\gamma),F(\pi\pi'T(\gamma))}\nonumber\\&\quad =\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\\F(1,1,-)\ \text{increasing}\\ F(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\gamma),F((\pi\pi')^{-1}T(\gamma))}\nonumber\\&\quad =\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\\F\circ\pi(1,1,-)\ \text{increasing}\\ F\circ\pi'(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\pi\pi'\gamma),F(T(\gamma))},\end{align}
\begin{align}&\Big|\mathrm{tr}\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big|^{2}\nonumber\\&\quad =\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\nonumber\\F(1,1,-)\ \text{increasing}\\ F(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\gamma),F(\pi\pi'T(\gamma))}\nonumber\\&\quad =\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\\F(1,1,-)\ \text{increasing}\\ F(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\gamma),F((\pi\pi')^{-1}T(\gamma))}\nonumber\\&\quad =\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}(-1)^{\pi\pi'}\sum_{\substack{F:\Omega\rightarrow[d]\\F\circ\pi(1,1,-)\ \text{increasing}\\ F\circ\pi'(2,1,-)\ \text{increasing}}}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\pi\pi'\gamma),F(T(\gamma))},\end{align}
 where, in the last equality, we replaced F by 
 $F\circ\big(\pi'\pi\big)^{-1}$
.
$F\circ\big(\pi'\pi\big)^{-1}$
.
 Let 
 $\Phi=(A_{1},\ldots,A_{r},B_{1},\ldots,B_{r})$
 be the partition given by
$\Phi=(A_{1},\ldots,A_{r},B_{1},\ldots,B_{r})$
 be the partition given by 
 \begin{equation}A_{i}=\{ (s,u,k)\mid\widetilde{w}(s,u,k)=i\} \quad B_{i}=\{ (s,u,k)\mid\widetilde{w}(s,u,k)=-i\} .\end{equation}
\begin{equation}A_{i}=\{ (s,u,k)\mid\widetilde{w}(s,u,k)=i\} \quad B_{i}=\{ (s,u,k)\mid\widetilde{w}(s,u,k)=-i\} .\end{equation}
 For each 
 $(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})$
, set
$(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})$
, set 
 \begin{equation}Z_{\pi,\pi'}:=\left\{ (F,\Sigma):\substack{F:\Omega\rightarrow[d],\Sigma\in S_{\Phi}\\F\circ\pi(1,1,-),F\circ\pi'(2,1,-)\text{ increasing}\\F\circ T=F\circ\pi\pi'\circ\Sigma} \right\} .\end{equation}
\begin{equation}Z_{\pi,\pi'}:=\left\{ (F,\Sigma):\substack{F:\Omega\rightarrow[d],\Sigma\in S_{\Phi}\\F\circ\pi(1,1,-),F\circ\pi'(2,1,-)\text{ increasing}\\F\circ T=F\circ\pi\pi'\circ\Sigma} \right\} .\end{equation}
 The sets 
 $Z_{\pi,\pi'}$
 are disjoint. We use the notation
$Z_{\pi,\pi'}$
 are disjoint. We use the notation 
 \begin{equation}Z:=\bigcup_{\pi,\pi'}Z_{\pi,\pi'}.\end{equation}
\begin{equation}Z:=\bigcup_{\pi,\pi'}Z_{\pi,\pi'}.\end{equation}
Remark 4.1. Note that we have a map 
 $Z\rightarrow\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})$
 sending
$Z\rightarrow\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})$
 sending 
 $(F,\Sigma)$
 to the unique pair
$(F,\Sigma)$
 to the unique pair 
 $(\pi_{F},\pi'_{F})$
 such that
$(\pi_{F},\pi'_{F})$
 such that 
 $(F,\Sigma)\in Z_{\pi_{F},\pi'_{F}}$
.
$(F,\Sigma)\in Z_{\pi_{F},\pi'_{F}}$
.
Rewriting (4.8) using Weingarten calculus (Corollary 2.15), we have the following result.
Proposition 4.2. Let 
 $w\in F_{r}$
 be a cyclically reduced word. Then,
$w\in F_{r}$
 be a cyclically reduced word. Then, 
 \begin{equation}\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)=\sum_{(F,\Sigma)\in Z}(-1)^{\pi_{F}\pi'_{F}}\widetilde{\mathrm{Wg}}(\Sigma^{2}).\end{equation}
\begin{equation}\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)=\sum_{(F,\Sigma)\in Z}(-1)^{\pi_{F}\pi'_{F}}\widetilde{\mathrm{Wg}}(\Sigma^{2}).\end{equation}
5. Estimating the contribution of a single orbit in Z
 In this section we introduce an action of 
 $H:=\prod_{(s,u)\in[2]\times[\ell]}\mathrm{Sym}(\Omega_{s,u})$
 on Z, and estimate (4.12) restricted to each H-orbit. The action can be described as follows.
$H:=\prod_{(s,u)\in[2]\times[\ell]}\mathrm{Sym}(\Omega_{s,u})$
 on Z, and estimate (4.12) restricted to each H-orbit. The action can be described as follows.
 For every 
 $(s,u)\in[2]\times[\ell]$
, the group
$(s,u)\in[2]\times[\ell]$
, the group 
 $\mathrm{Sym}(\Omega_{s,u})$
 acts on Z in the following way: if
$\mathrm{Sym}(\Omega_{s,u})$
 acts on Z in the following way: if 
 $u\neq1$
, the action of
$u\neq1$
, the action of 
 $\pi_{s,u}\in\mathrm{Sym}(\Omega_{s,u})$
 is
$\pi_{s,u}\in\mathrm{Sym}(\Omega_{s,u})$
 is 
 \begin{equation}\pi_{s,u}\cdot(F,\Sigma)=(F\circ\pi_{s,u}^{-1},\pi_{s,u}\circ\Sigma\circ T^{-1}\pi_{s,u}^{-1}T).\end{equation}
\begin{equation}\pi_{s,u}\cdot(F,\Sigma)=(F\circ\pi_{s,u}^{-1},\pi_{s,u}\circ\Sigma\circ T^{-1}\pi_{s,u}^{-1}T).\end{equation}
 If 
 $s\in[2]$
 and
$s\in[2]$
 and 
 $\pi_{s,1}\in\mathrm{Sym}(\Omega_{s,1})$
, then
$\pi_{s,1}\in\mathrm{Sym}(\Omega_{s,1})$
, then 
 \begin{equation}\pi_{s,1}\cdot (F,\Sigma)=(F\circ\pi_{s,1}^{-1},\Sigma\circ T^{-1}\pi_{s,1}^{-1}T).\end{equation}
\begin{equation}\pi_{s,1}\cdot (F,\Sigma)=(F\circ\pi_{s,1}^{-1},\Sigma\circ T^{-1}\pi_{s,1}^{-1}T).\end{equation}
 The above group actions commute, which gives rise to an action of H. Note that 
 $(\pi_{1,1},\pi_{2,1})\cdot Z_{\pi,\pi'}=Z_{\pi_{1,1}\pi,\pi_{2,1}\pi'}$
. If
$(\pi_{1,1},\pi_{2,1})\cdot Z_{\pi,\pi'}=Z_{\pi_{1,1}\pi,\pi_{2,1}\pi'}$
. If 
 $u\neq1$
, then
$u\neq1$
, then 
 $\pi_{s,u}\cdot(Z_{\pi,\pi'})=Z_{\pi,\pi'}$
.
$\pi_{s,u}\cdot(Z_{\pi,\pi'})=Z_{\pi,\pi'}$
.
Definition 5.1. For each 
 $u,v\in[\ell]$
, we define
$u,v\in[\ell]$
, we define 
 $*:\mathrm{Sym}(\Omega_{s,u})\times\mathrm{Sym}(\Omega_{s,v})\rightarrow\mathrm{Sym}(\Omega_{s,v})$
 by
$*:\mathrm{Sym}(\Omega_{s,u})\times\mathrm{Sym}(\Omega_{s,v})\rightarrow\mathrm{Sym}(\Omega_{s,v})$
 by 
 \begin{equation}\pi_{s,u}*\pi_{s,v}:=T^{v-u}\pi_{s,u}T^{u-v}\pi_{s,v}.\end{equation}
\begin{equation}\pi_{s,u}*\pi_{s,v}:=T^{v-u}\pi_{s,u}T^{u-v}\pi_{s,v}.\end{equation}
 Note that 
 $*$
 is associative.
$*$
 is associative.
 Let 
 $h:=\prod_{(s,u)}\pi_{s,u}\in H$
 and denote
$h:=\prod_{(s,u)}\pi_{s,u}\in H$
 and denote 
 $\overline{h}:=\prod_{(s,u)\neq(1,1),(2,1)}\pi_{s,u}$
. Then
$\overline{h}:=\prod_{(s,u)\neq(1,1),(2,1)}\pi_{s,u}$
. Then 
 $h\cdot\Sigma=\overline{h}\circ\Sigma\circ T^{-1}h^{-1}T$
. Since
$h\cdot\Sigma=\overline{h}\circ\Sigma\circ T^{-1}h^{-1}T$
. Since 
 $\widetilde{\mathrm{Wg}}$
 is invariant under conjugation in H,
$\widetilde{\mathrm{Wg}}$
 is invariant under conjugation in H, 
 \[\widetilde{\mathrm{Wg}}((h\cdot(\Sigma))^{2})=\widetilde{\mathrm{Wg}}(\Psi_{h}\circ\Sigma\circ\Psi_{h}\circ\Sigma),\]
\[\widetilde{\mathrm{Wg}}((h\cdot(\Sigma))^{2})=\widetilde{\mathrm{Wg}}(\Psi_{h}\circ\Sigma\circ\Psi_{h}\circ\Sigma),\]
 where 
 $\Psi_{h}=T^{-1}h^{-1}T\overline{h}\in H$
. On each
$\Psi_{h}=T^{-1}h^{-1}T\overline{h}\in H$
. On each 
 $\Omega_{s,u}$
,
$\Omega_{s,u}$
, 
 $\Psi_{h}$
 has the following form.
$\Psi_{h}$
 has the following form.
Lemma 5.2. We have
 \[\Psi_{h}|_{\Omega_{s,u}}=\begin{cases}T^{-1}\pi_{s,2}^{-1}T & \text{if }u=1,\\\pi_{s,u+1}^{-1}*\pi_{s,u} & \text{if }u\neq1,\ell,\\\pi_{s,1}^{-1}*\pi_{s,\ell} & \text{if }u=\ell.\end{cases}\]
\[\Psi_{h}|_{\Omega_{s,u}}=\begin{cases}T^{-1}\pi_{s,2}^{-1}T & \text{if }u=1,\\\pi_{s,u+1}^{-1}*\pi_{s,u} & \text{if }u\neq1,\ell,\\\pi_{s,1}^{-1}*\pi_{s,\ell} & \text{if }u=\ell.\end{cases}\]
Corollary 5.3. Let 
 $(\widehat{F},\widehat{\Sigma})$
 be a representative of an H-orbit
$(\widehat{F},\widehat{\Sigma})$
 be a representative of an H-orbit 
 ${\mathcal O}_{(\widehat{F},\widehat{\Sigma})}$
, with
${\mathcal O}_{(\widehat{F},\widehat{\Sigma})}$
, with 
 $(\pi_{\widehat{F}},\pi'_{\widehat{F}})=(\mathrm{Id},\mathrm{Id})$
. Then,
$(\pi_{\widehat{F}},\pi'_{\widehat{F}})=(\mathrm{Id},\mathrm{Id})$
. Then, 
 \begin{align}&\frac{1}{\big|{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}\big|}\sum_{(F,\Sigma)\in{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}}(-1)^{\pi_{F}\pi'_{F}}\,\widetilde{\mathrm{Wg}}(\Sigma^{2})\nonumber\\&\quad =\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\sum_{\substack{h_{i}\in\prod_{\widetilde{w}=i}\mathrm{Sym}(\Omega_{s,u})\\h'_{i}\in\prod_{\widetilde{w}=-i}\mathrm{Sym}(\Omega_{s,u}) }}(-1)^{h_{i}h'_{i}}\,\mathrm{Wg}(h_{i}\widehat{\Sigma}|_{B_{i}}h'_{i}\widehat{\Sigma}|_{A_{i}}).\end{align}
\begin{align}&\frac{1}{\big|{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}\big|}\sum_{(F,\Sigma)\in{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}}(-1)^{\pi_{F}\pi'_{F}}\,\widetilde{\mathrm{Wg}}(\Sigma^{2})\nonumber\\&\quad =\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\sum_{\substack{h_{i}\in\prod_{\widetilde{w}=i}\mathrm{Sym}(\Omega_{s,u})\\h'_{i}\in\prod_{\widetilde{w}=-i}\mathrm{Sym}(\Omega_{s,u}) }}(-1)^{h_{i}h'_{i}}\,\mathrm{Wg}(h_{i}\widehat{\Sigma}|_{B_{i}}h'_{i}\widehat{\Sigma}|_{A_{i}}).\end{align}
Proof. For each 
 $h=\prod_{(s,u)}\pi_{s,u}\in H$
, write
$h=\prod_{(s,u)}\pi_{s,u}\in H$
, write 
 $\nu(h):=(-1)^{\pi_{1,1}\pi_{2,1}}$
. Consider the bijection
$\nu(h):=(-1)^{\pi_{1,1}\pi_{2,1}}$
. Consider the bijection 
 $\psi:H\rightarrow H$
,
$\psi:H\rightarrow H$
, 
 $\psi\big(\prod_{(s,u)}\pi_{s,u}\big)=\prod_{(s,u)}\theta_{s,u}$
, where, for
$\psi\big(\prod_{(s,u)}\pi_{s,u}\big)=\prod_{(s,u)}\theta_{s,u}$
, where, for 
 $s=1,2$
,
$s=1,2$
, 
 \[(\theta_{s,2},\ldots,\theta_{s,\ell},\theta_{s,1})=(\pi_{s,2},\pi_{s,2}*\pi_{s,3},\ldots,\pi_{s,2}*\cdots*\pi_{s,\ell},\pi_{s,2}*\cdots*\pi_{s,\ell}*\pi_{s,1}),\]
\[(\theta_{s,2},\ldots,\theta_{s,\ell},\theta_{s,1})=(\pi_{s,2},\pi_{s,2}*\pi_{s,3},\ldots,\pi_{s,2}*\cdots*\pi_{s,\ell},\pi_{s,2}*\cdots*\pi_{s,\ell}*\pi_{s,1}),\]
 and observe that 
 $\nu(\psi(h))=(-1)^{h}=(-1)^{T^{-1}h^{-1}T}$
. Further note that
$\nu(\psi(h))=(-1)^{h}=(-1)^{T^{-1}h^{-1}T}$
. Further note that 
 \[(\pi_{s,2}*\cdots*\pi_{s,u+1})^{-1}*\pi_{s,2}*\cdots*\pi_{s,u}=T^{-1}\pi_{s,u+1}^{-1}T,\]
\[(\pi_{s,2}*\cdots*\pi_{s,u+1})^{-1}*\pi_{s,2}*\cdots*\pi_{s,u}=T^{-1}\pi_{s,u+1}^{-1}T,\]
 and hence 
 $\Psi_{\psi(h)}=\prod_{(s,u)}T^{-1}\pi_{s,u}^{-1}T$
. Changing variables using
$\Psi_{\psi(h)}=\prod_{(s,u)}T^{-1}\pi_{s,u}^{-1}T$
. Changing variables using 
 $\psi$
, the left-hand side of (5.4) is
$\psi$
, the left-hand side of (5.4) is 
 \begin{align*}\frac{1}{|H|}\!\sum_{h\in H}\!\nu(h)\widetilde{\mathrm{Wg}}(\Psi_{h}\circ\widehat{\Sigma}\circ\Psi_{h}\circ\widehat{\Sigma})& =\frac{1}{m!^{2\ell}}\!\sum_{h\in H}\!\nu(\psi(h))\widetilde{\mathrm{Wg}} \bigg(\prod_{(s,u)}\! T^{-1}\pi_{s,u}^{-1}T\circ\widehat{\Sigma}\circ\!\prod_{(s,u)}\! T^{-1}\pi_{s,u}^{-1}T\circ\widehat{\Sigma}\bigg)\\& =\frac{1}{m!^{2\ell}}\sum_{h\in H}(-1)^{h}\widetilde{\mathrm{Wg}}\bigg(\prod_{(s,u)}\pi_{s,u}\circ\widehat{\Sigma}\circ\prod_{(s,u)}\pi_{s,u}\circ\widehat{\Sigma}\bigg)\\& =\frac{1}{m!^{2\ell}}\sum_{h\in H}(-1)^{h}\prod_{i=1}^{r}\mathrm{Wg}\bigg(\prod_{(s,u):\widetilde{w}=i}\pi_{s,u}\widehat{\Sigma}|_{B_{i}}\!\prod_{(s,u):\widetilde{w}=-i}\!\pi_{s,u}\widehat{\Sigma}|_{A_{i}}\bigg),\end{align*}
\begin{align*}\frac{1}{|H|}\!\sum_{h\in H}\!\nu(h)\widetilde{\mathrm{Wg}}(\Psi_{h}\circ\widehat{\Sigma}\circ\Psi_{h}\circ\widehat{\Sigma})& =\frac{1}{m!^{2\ell}}\!\sum_{h\in H}\!\nu(\psi(h))\widetilde{\mathrm{Wg}} \bigg(\prod_{(s,u)}\! T^{-1}\pi_{s,u}^{-1}T\circ\widehat{\Sigma}\circ\!\prod_{(s,u)}\! T^{-1}\pi_{s,u}^{-1}T\circ\widehat{\Sigma}\bigg)\\& =\frac{1}{m!^{2\ell}}\sum_{h\in H}(-1)^{h}\widetilde{\mathrm{Wg}}\bigg(\prod_{(s,u)}\pi_{s,u}\circ\widehat{\Sigma}\circ\prod_{(s,u)}\pi_{s,u}\circ\widehat{\Sigma}\bigg)\\& =\frac{1}{m!^{2\ell}}\sum_{h\in H}(-1)^{h}\prod_{i=1}^{r}\mathrm{Wg}\bigg(\prod_{(s,u):\widetilde{w}=i}\pi_{s,u}\widehat{\Sigma}|_{B_{i}}\!\prod_{(s,u):\widetilde{w}=-i}\!\pi_{s,u}\widehat{\Sigma}|_{A_{i}}\bigg),\end{align*}
 where, in each line above, 
 $h=\prod_{(s,u)}\pi_{s,u}$
.
$h=\prod_{(s,u)}\pi_{s,u}$
.
Corollary 5.4. Set 
 $\ell_{i}:=\frac{|A_{i}|}{m}$
 for each
$\ell_{i}:=\frac{|A_{i}|}{m}$
 for each 
 $i\in[r]$
. Then the following holds:
$i\in[r]$
. Then the following holds: 
 \begin{equation}\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\frac{|Z|}{m!^{\ell}}\prod_{i=1}^{r}\frac{1}{d\cdots(d-m\ell_{i}+1)}.\end{equation}
\begin{equation}\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\frac{|Z|}{m!^{\ell}}\prod_{i=1}^{r}\frac{1}{d\cdots(d-m\ell_{i}+1)}.\end{equation}
Proof. By Proposition 4.2, Corollary 5.3, (2.11), Lemma 2.7, and by (2.3),
 \begin{align}\!\!\!\mathbb{E}\Big(\Big|\mathrm{tr}\!\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\!\Big)\!\Big|^{2}\Big)&= \sum_{(\widehat{F},\widehat{\Sigma})\in Z}(-1)^{\pi_{\widehat{F}}\pi'_{\widehat{F}}}\widetilde{\mathrm{Wg}}(\widehat{\Sigma}^{2})\nonumber\\&= \sum_{(\widehat{F},\widehat{\Sigma})\in Z}\frac{1}{\big|{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}{\kern-1pt}\big|}\sum_{(F,\Sigma)\in{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}}(-1)^{\pi_{F}\pi'_{F}}\,\widetilde{\mathrm{Wg}}(\Sigma^{2})\nonumber\\&\leq \sum_{(\widehat{F},\widehat{\Sigma})\in Z}\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\left|\sum_{\substack{h_{i}\in\prod_{\widetilde{w}=i}\mathrm{Sym}(\Omega_{s,u})\\h'_{i}\in\prod_{\widetilde{w}=-i}\mathrm{Sym}(\Omega_{s,u}) }}(-1)^{h_{i}h'_{i}}\mathrm{Wg}(h_{i}\widehat{\Sigma}|_{B_{i}}h'_{i}\widehat{\Sigma}|_{A_{i}})\right|\nonumber\\&\leq \sum_{(\widehat{F},\widehat{\Sigma})\in Z}\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\frac{m!^{2\ell_{i}}}{(m\ell_{i})!^{2}}\sum_{\lambda\vdash m\ell_{i}:\chi_{\lambda}\subseteq\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})}\!\langle\chi_{\lambda},\mathrm{sgn}\rangle_{\mathrm{S}_{m}^{\ell_{i}}}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\nonumber\\&=\frac{|Z|}{m!^{\ell}}\prod_{i=1}^{r}\frac{m!^{\ell_{i}}}{(m\ell_{i})!}\sum_{\lambda\vdash m\ell_{i}:\chi_{\lambda}\subseteq\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})}\frac{\langle\chi_{\lambda},\mathrm{sgn}\rangle_{\mathrm{S}_{m}^{\ell_{i}}}\chi_{\lambda}(1)}{\prod_{(a,b)\in\lambda}(d+b-a)}.\end{align}
\begin{align}\!\!\!\mathbb{E}\Big(\Big|\mathrm{tr}\!\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\!\Big)\!\Big|^{2}\Big)&= \sum_{(\widehat{F},\widehat{\Sigma})\in Z}(-1)^{\pi_{\widehat{F}}\pi'_{\widehat{F}}}\widetilde{\mathrm{Wg}}(\widehat{\Sigma}^{2})\nonumber\\&= \sum_{(\widehat{F},\widehat{\Sigma})\in Z}\frac{1}{\big|{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}{\kern-1pt}\big|}\sum_{(F,\Sigma)\in{\mathcal{O}}_{(\widehat{F},\widehat{\Sigma})}}(-1)^{\pi_{F}\pi'_{F}}\,\widetilde{\mathrm{Wg}}(\Sigma^{2})\nonumber\\&\leq \sum_{(\widehat{F},\widehat{\Sigma})\in Z}\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\left|\sum_{\substack{h_{i}\in\prod_{\widetilde{w}=i}\mathrm{Sym}(\Omega_{s,u})\\h'_{i}\in\prod_{\widetilde{w}=-i}\mathrm{Sym}(\Omega_{s,u}) }}(-1)^{h_{i}h'_{i}}\mathrm{Wg}(h_{i}\widehat{\Sigma}|_{B_{i}}h'_{i}\widehat{\Sigma}|_{A_{i}})\right|\nonumber\\&\leq \sum_{(\widehat{F},\widehat{\Sigma})\in Z}\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\frac{m!^{2\ell_{i}}}{(m\ell_{i})!^{2}}\sum_{\lambda\vdash m\ell_{i}:\chi_{\lambda}\subseteq\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})}\!\langle\chi_{\lambda},\mathrm{sgn}\rangle_{\mathrm{S}_{m}^{\ell_{i}}}\frac{\chi_{\lambda}(1)^{2}}{\rho_{\lambda}(1)}\nonumber\\&=\frac{|Z|}{m!^{\ell}}\prod_{i=1}^{r}\frac{m!^{\ell_{i}}}{(m\ell_{i})!}\sum_{\lambda\vdash m\ell_{i}:\chi_{\lambda}\subseteq\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})}\frac{\langle\chi_{\lambda},\mathrm{sgn}\rangle_{\mathrm{S}_{m}^{\ell_{i}}}\chi_{\lambda}(1)}{\prod_{(a,b)\in\lambda}(d+b-a)}.\end{align}
 Note that the irreducible characters 
 $\chi_{\lambda}$
 in
$\chi_{\lambda}$
 in 
 $\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})$
 correspond to Young diagrams
$\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})$
 correspond to Young diagrams 
 $\lambda\vdash m\ell_{i}$
 with at most
$\lambda\vdash m\ell_{i}$
 with at most 
 $\ell_{i}$
 columns. If the columns of
$\ell_{i}$
 columns. If the columns of 
 $\lambda$
 are of lengths
$\lambda$
 are of lengths 
 $j_{1}\geq\cdots\geq j_{\ell_{i}}$
, then
$j_{1}\geq\cdots\geq j_{\ell_{i}}$
, then 
 \begin{align}\prod_{(a,b)\in\lambda}(d+b-a)&\geq d\cdots(d-j_{1}+1)\cdot d\cdots(d-j_{2}+1)\cdot \ldots\cdot d\cdots(d-j_{\ell_{i}}+1)\nonumber\\&\geq d\cdots(d-m\ell_{i}+1).\end{align}
\begin{align}\prod_{(a,b)\in\lambda}(d+b-a)&\geq d\cdots(d-j_{1}+1)\cdot d\cdots(d-j_{2}+1)\cdot \ldots\cdot d\cdots(d-j_{\ell_{i}}+1)\nonumber\\&\geq d\cdots(d-m\ell_{i}+1).\end{align}
6. Estimates on 
 $|Z|$
$|Z|$
 In this section we give upper bounds on 
 $|Z|$
, defined in (4.11). We first set some notation. For each
$|Z|$
, defined in (4.11). We first set some notation. For each 
 $0\neq i,j\in[-r,r]$
, set
$0\neq i,j\in[-r,r]$
, set 
 \[R_{i}:=\{ \gamma\in\Omega:\widetilde{w}\circ T^{-1}(\gamma)=i\} =\begin{cases}T(A_{i}) & i>0,\\T(B_{-i}) & i<0,\end{cases}\]
\[R_{i}:=\{ \gamma\in\Omega:\widetilde{w}\circ T^{-1}(\gamma)=i\} =\begin{cases}T(A_{i}) & i>0,\\T(B_{-i}) & i<0,\end{cases}\]
 \[C_{j}:=\{ \gamma\in\Omega:\widetilde{w}(\gamma)=-j\} =\begin{cases}B_{j} & j>0,\\A_{-j} & j<0,\end{cases}\]
\[C_{j}:=\{ \gamma\in\Omega:\widetilde{w}(\gamma)=-j\} =\begin{cases}B_{j} & j>0,\\A_{-j} & j<0,\end{cases}\]
 \[V_{ij}:=\{ \gamma\in\Omega:\widetilde{w}\circ T^{-1}(\gamma)=i,\widetilde{w}(\gamma)=-j\}=R_{i}\cap C_{j}.\]
\[V_{ij}:=\{ \gamma\in\Omega:\widetilde{w}\circ T^{-1}(\gamma)=i,\widetilde{w}(\gamma)=-j\}=R_{i}\cap C_{j}.\]
 Following Remark 3.6, it is helpful to picture a 
 $2r\times2r$
 matrix, whose (i,j)th entry is the set
$2r\times2r$
 matrix, whose (i,j)th entry is the set 
 $V_{ij}$
, with
$V_{ij}$
, with 
 $R_{-r},\ldots,R_{r}$
 correspond to rows, and
$R_{-r},\ldots,R_{r}$
 correspond to rows, and 
 $C_{-r},\ldots,C_{r}$
 correspond to columns. Denote
$C_{-r},\ldots,C_{r}$
 correspond to columns. Denote 
 \begin{equation}\ell_{i,j}:=\frac{|V_{ij}|}{m}\quad \text{and}\quad\ell_{i}:=\frac{|R_{i}|}{m}=\frac{|C_{i}|}{m}.\end{equation}
\begin{equation}\ell_{i,j}:=\frac{|V_{ij}|}{m}\quad \text{and}\quad\ell_{i}:=\frac{|R_{i}|}{m}=\frac{|C_{i}|}{m}.\end{equation}
 Observe that 
 $\ell_{i}=\ell_{-i}$
,
$\ell_{i}=\ell_{-i}$
, 
 $\ell_{i,j}=\ell_{j,i}$
 and note that
$\ell_{i,j}=\ell_{j,i}$
 and note that 
 $\ell_{i}=\frac{|A_{i}|}{m}$
 if
$\ell_{i}=\frac{|A_{i}|}{m}$
 if 
 $i>0$
, so that (6.1) extends the definition of
$i>0$
, so that (6.1) extends the definition of 
 $\ell_{i}$
 in Corollary 5.4. For each
$\ell_{i}$
 in Corollary 5.4. For each 
 $0\neq j\in[-r,r]$
 set
$0\neq j\in[-r,r]$
 set 
 \[C_{j}^{+}:=\bigcup_{i<j}V_{ij},\quad C_{+}:=\bigcup_{j}C_{j}^{+}.\]
\[C_{j}^{+}:=\bigcup_{i<j}V_{ij},\quad C_{+}:=\bigcup_{j}C_{j}^{+}.\]
 For each 
 $i\in[r]$
 and each
$i\in[r]$
 and each 
 $\Sigma\in S_{\Phi}$
, denote
$\Sigma\in S_{\Phi}$
, denote 
 $\eta_{i}:=T\circ(\Sigma^{-1})|_{B_{i}}$
 and
$\eta_{i}:=T\circ(\Sigma^{-1})|_{B_{i}}$
 and 
 $\eta_{-i}:=T\circ(\Sigma^{-1})|_{A_{i}}$
. Note that
$\eta_{-i}:=T\circ(\Sigma^{-1})|_{A_{i}}$
. Note that 
 $\eta_{i}(C_{i})=R_{i}$
 for all i. Define the following sets:
$\eta_{i}(C_{i})=R_{i}$
 for all i. Define the following sets: 
 \begin{equation}W':=\{ (F:\Omega\rightarrow[d],\Sigma\in S_{\Phi}):F\circ T=F\circ\Sigma\} ,\end{equation}
\begin{equation}W':=\{ (F:\Omega\rightarrow[d],\Sigma\in S_{\Phi}):F\circ T=F\circ\Sigma\} ,\end{equation}
and
 \[W:=\{ (F,\Sigma)\in W':F(s,1,-)\text{ is one-to-one } \forall s\in[2]\} .\]
\[W:=\{ (F,\Sigma)\in W':F(s,1,-)\text{ is one-to-one } \forall s\in[2]\} .\]
Proposition 6.1. We have
 \[|Z|=|W|\leq|W'|\leq\binom{d+m\ell}{m\ell}(m\ell)!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\]
\[|Z|=|W|\leq|W'|\leq\binom{d+m\ell}{m\ell}(m\ell)!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\]
Proof. The map 
 $(F,\Sigma)\mapsto(F\circ\pi_{F}\pi'_{F},\Sigma\circ T{}^{-1}\pi_{F}\pi'_{F}T)$
 is a bijection between Z and W, giving the first equality. Clearly,
$(F,\Sigma)\mapsto(F\circ\pi_{F}\pi'_{F},\Sigma\circ T{}^{-1}\pi_{F}\pi'_{F}T)$
 is a bijection between Z and W, giving the first equality. Clearly, 
 $|W|\leq|W'|$
.
$|W|\leq|W'|$
.
 In order to prove the last inequality, we use the map 
 $\Phi_{+}:W'\rightarrow\{f:C_{+}\rightarrow[d]\}$
, sending
$\Phi_{+}:W'\rightarrow\{f:C_{+}\rightarrow[d]\}$
, sending 
 $(F,\Sigma)\in W'$
 to
$(F,\Sigma)\in W'$
 to 
 $F|_{C_{+}}$
. We estimate
$F|_{C_{+}}$
. We estimate 
 $|W'|$
 by analyzing the fibers of
$|W'|$
 by analyzing the fibers of 
 $\Phi_{+}$
. Let
$\Phi_{+}$
. Let 
 $f\in\Phi_{+}(W')$
 and suppose it has a shape
$f\in\Phi_{+}(W')$
 and suppose it has a shape 
 $\nu_{+}$
 (see Definition 3.4). We write
$\nu_{+}$
 (see Definition 3.4). We write 
 $\nu_{j,+}$
 for the shapes of
$\nu_{j,+}$
 for the shapes of 
 $f|_{C_{j}^{+}}$
. We reveal
$f|_{C_{j}^{+}}$
. We reveal 
 $(F,\Sigma)\in\Phi_{+}^{-1}(f)$
 row by row, starting with the
$(F,\Sigma)\in\Phi_{+}^{-1}(f)$
 row by row, starting with the 
 $-r$
th row
$-r$
th row 
 $R_{-r}$
 and making sure that, in each step,
$R_{-r}$
 and making sure that, in each step, 
 $F\circ T|_{T^{-1}(R_{k})}=F\circ\Sigma|_{T^{-1}(R_{k})}$
, or, equivalently,
$F\circ T|_{T^{-1}(R_{k})}=F\circ\Sigma|_{T^{-1}(R_{k})}$
, or, equivalently, 
 $F|_{R_{k}}=F\circ\eta_{k}^{-1}|_{R_{k}}$
.
$F|_{R_{k}}=F\circ\eta_{k}^{-1}|_{R_{k}}$
.
- 
1. There are at most  $(m\ell_{-r})!$
 options for $(m\ell_{-r})!$
 options for $\eta_{-r}$
. Note that $\eta_{-r}$
. Note that $R_{-r\subseteq C_+}$
 and, hence, by (6.2), the choice of $R_{-r\subseteq C_+}$
 and, hence, by (6.2), the choice of $\eta_{-r}$
 determines $\eta_{-r}$
 determines $F|_{C_{-r}}$
. At this point, $F|_{C_{-r}}$
. At this point, $F|_{R_{-r+1}}$
 is determined as well. $F|_{R_{-r+1}}$
 is determined as well.
- 
2. Note that  $C_{-r+1}^{+}=V_{-r,-r+1}$
. There are at most: $C_{-r+1}^{+}=V_{-r,-r+1}$
. There are at most:- 
(a)  $\binom{m\ell_{-r+1}}{m\ell_{-r,-r+1}}$
 options for the sets $\binom{m\ell_{-r+1}}{m\ell_{-r,-r+1}}$
 options for the sets $\eta_{-r+1}(C_{-r+1}^{+})$
 and $\eta_{-r+1}(C_{-r+1}^{+})$
 and $\eta_{-r+1}(C_{-r+1}\backslash C_{-r+1}^{+})$
; $\eta_{-r+1}(C_{-r+1}\backslash C_{-r+1}^{+})$
;
- 
(b)  $(m(\ell_{-r+1}-\ell_{-r,-r+1}))!$
 options for $(m(\ell_{-r+1}-\ell_{-r,-r+1}))!$
 options for $\eta_{-r+1}|_{C_{-r+1}\backslash C_{-r+1}^{+}}:C_{-r+1}\backslash C_{-r+1}^{+}\rightarrow\eta_{-r+1}(C_{-r+1}\backslash C_{-r+1}^{+})$
; $\eta_{-r+1}|_{C_{-r+1}\backslash C_{-r+1}^{+}}:C_{-r+1}\backslash C_{-r+1}^{+}\rightarrow\eta_{-r+1}(C_{-r+1}\backslash C_{-r+1}^{+})$
;
- 
(c)  $(\nu_{-r+1,+})!$
 options for $(\nu_{-r+1,+})!$
 options for $\eta_{-r+1}:C_{-r+1}^{+}\rightarrow\eta_{-r+1}(C_{-r+1}^{+})$
. $\eta_{-r+1}:C_{-r+1}^{+}\rightarrow\eta_{-r+1}(C_{-r+1}^{+})$
.
 
- 
- 
3. More generally, assume, by induction, that we have fixed  $(\eta_{i})_{i<k}$
, and, thus, we have already determined $(\eta_{i})_{i<k}$
, and, thus, we have already determined $F|_{R_{i}}$
 for $F|_{R_{i}}$
 for $i\leq k$
, $i\leq k$
, $F|_{C_{i}}$
 for $F|_{C_{i}}$
 for $i<k$
, and $i<k$
, and $F|_{C_{+}}$
. Then there are at most: $F|_{C_{+}}$
. Then there are at most:
- 
(a)  $\binom{m\ell_{k}}{\sum_{i<k}m\ell_{i,k}}$
 options for the sets $\binom{m\ell_{k}}{\sum_{i<k}m\ell_{i,k}}$
 options for the sets $\eta_{k}(C_{k}^{+})$
 and $\eta_{k}(C_{k}^{+})$
 and $\eta_{k}(C_{k}\backslash C_{k}^{+})$
; $\eta_{k}(C_{k}\backslash C_{k}^{+})$
;
- 
(b)  $\big(m(\ell_{k}-\sum_{i<k}\ell_{i,k})\big)!$
 options for $\big(m(\ell_{k}-\sum_{i<k}\ell_{i,k})\big)!$
 options for $\eta_{k}|_{C_{k}\backslash C_{k}^{+}}:C_{k}\backslash C_{k}^{+}\rightarrow\eta_{k}(C_{k}\backslash C_{k}^{+})$
; $\eta_{k}|_{C_{k}\backslash C_{k}^{+}}:C_{k}\backslash C_{k}^{+}\rightarrow\eta_{k}(C_{k}\backslash C_{k}^{+})$
;
- 
(c)  $(\nu_{k,+})!$
 options for $(\nu_{k,+})!$
 options for $\eta_{k}|_{C_{k}^{+}}:C_{k}^{+}\rightarrow\eta_{k}(C_{k}^{+})$
. $\eta_{k}|_{C_{k}^{+}}:C_{k}^{+}\rightarrow\eta_{k}(C_{k}^{+})$
.
 After choosing 
 $\eta_{-r},\ldots,\eta_{r}$
, we have determined F. Furthermore, since
$\eta_{-r},\ldots,\eta_{r}$
, we have determined F. Furthermore, since 
 $\sum_{0\neq k=-r}^{r}\nu_{k,+}=\nu_{+}$
, we have
$\sum_{0\neq k=-r}^{r}\nu_{k,+}=\nu_{+}$
, we have 
 $\prod_{0\neq k=-r}^{r}(\nu_{k,+})!\leq\nu_{+}!$
. Hence,
$\prod_{0\neq k=-r}^{r}(\nu_{k,+})!\leq\nu_{+}!$
. Hence, 
 \begin{align}|\Phi_{+}^{-1}(f)| & \leq\prod_{0\neq k=-r}^{r}\bigg(\!\binom{m\ell_{k}}{\sum_{i<k}m\ell_{i,k}}\bigg(m\bigg(\ell_{k}-\sum_{i<k}\ell_{i,k}\bigg)\!\bigg)!(\nu_{k,+})!\bigg)\nonumber\\& =\prod_{0\neq k=-r}^{r}(\nu_{k,+})!\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\leq\nu_{+}!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\end{align}
\begin{align}|\Phi_{+}^{-1}(f)| & \leq\prod_{0\neq k=-r}^{r}\bigg(\!\binom{m\ell_{k}}{\sum_{i<k}m\ell_{i,k}}\bigg(m\bigg(\ell_{k}-\sum_{i<k}\ell_{i,k}\bigg)\!\bigg)!(\nu_{k,+})!\bigg)\nonumber\\& =\prod_{0\neq k=-r}^{r}(\nu_{k,+})!\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\leq\nu_{+}!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\end{align}
 Since 
 $|C_{+}|=m\ell$
, we have
$|C_{+}|=m\ell$
, we have 
 \begin{equation}|\{ f\in\Phi_{+}(W'):f\text{ is of shape }\nu_{+}\} |\leq\frac{(m\ell)!}{\nu_{+}!},\end{equation}
\begin{equation}|\{ f\in\Phi_{+}(W'):f\text{ is of shape }\nu_{+}\} |\leq\frac{(m\ell)!}{\nu_{+}!},\end{equation}
 and there are at most 
 $\binom{d+m\ell}{m\ell}$
 possible shapes
$\binom{d+m\ell}{m\ell}$
 possible shapes 
 $\nu_{+}$
. Combining (6.3) and (6.4) we conclude
$\nu_{+}$
. Combining (6.3) and (6.4) we conclude 
 \begin{align*}|W'| & \leq\sum_{\nu_{+}}|\{ f\in\Phi_{+}(W'):f\text{ is of shape}\nu_{+}\} |\cdot|\Phi_{+}^{-1}(f)|\\& \leq\sum_{\nu_{+}}\frac{(m\ell)!}{\nu_{+}!}\cdot\nu_{+}!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\leq\binom{d+m\ell}{m\ell}(m\ell)!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\quad\quad\end{align*}
\begin{align*}|W'| & \leq\sum_{\nu_{+}}|\{ f\in\Phi_{+}(W'):f\text{ is of shape}\nu_{+}\} |\cdot|\Phi_{+}^{-1}(f)|\\& \leq\sum_{\nu_{+}}\frac{(m\ell)!}{\nu_{+}!}\cdot\nu_{+}!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\leq\binom{d+m\ell}{m\ell}(m\ell)!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\quad\quad\end{align*}
7. Proof of Theorems 1.1 and 1.3
In this section we use the results of §§ 4, 5 and 6 to prove Theorems 1.1 and 1.3. We end the section with the proof of Theorem 1.6.
 
Proof of Theorem 1.3. Assume that 
 $d=am$
 for
$d=am$
 for 
 $a\geq\ell\geq2$
. By (3.12), we have
$a\geq\ell\geq2$
. By (3.12), we have 
 \begin{equation}\binom{d}{m\ell}=\binom{am}{m\ell}\geq\frac{a^{m\ell}}{\ell^{m\ell}},\end{equation}
\begin{equation}\binom{d}{m\ell}=\binom{am}{m\ell}\geq\frac{a^{m\ell}}{\ell^{m\ell}},\end{equation}
 \begin{equation}\binom{d+m\ell}{m\ell}\leq\bigg(\frac{a+\ell}{\ell}\bigg)^{\!\!m\ell}e^{m\ell}\leq\frac{a^{m\ell}(2e)^{m\ell}}{\ell^{m\ell}}.\end{equation}
\begin{equation}\binom{d+m\ell}{m\ell}\leq\bigg(\frac{a+\ell}{\ell}\bigg)^{\!\!m\ell}e^{m\ell}\leq\frac{a^{m\ell}(2e)^{m\ell}}{\ell^{m\ell}}.\end{equation}
 We remind the reader the definition of 
 $\ell_{i}$
 and
$\ell_{i}$
 and 
 $\ell_{i,j}$
 in (6.1). Concretely, for each
$\ell_{i,j}$
 in (6.1). Concretely, for each 
 $0\neq i,j\in[-r,r]$
,
$0\neq i,j\in[-r,r]$
, 
 $\ell_{i}$
 is the combined number of appearances of the letter
$\ell_{i}$
 is the combined number of appearances of the letter 
 $x_{i}$
 (with the convention that
$x_{i}$
 (with the convention that 
 $x_{-i}=x_{i}^{-1}$
) in w and
$x_{-i}=x_{i}^{-1}$
) in w and 
 $w^{-1}$
, and
$w^{-1}$
, and 
 $\ell_{i,j}$
 is the combined number of appearances of the string ‘
$\ell_{i,j}$
 is the combined number of appearances of the string ‘
 $x_{i}x_{j}^{-1}$
’ in w and in
$x_{i}x_{j}^{-1}$
’ in w and in 
 $w^{-1}$
. In particular, we have
$w^{-1}$
. In particular, we have 
 $\sum_{i=1}^{r}\ell_{i}=\ell$
,
$\sum_{i=1}^{r}\ell_{i}=\ell$
, 
 $\ell_{i,i}=0$
 and
$\ell_{i,i}=0$
 and 
 $\sum_{0\neq i\in[-r,r]}\ell_{i,k}=\ell_{k}$
 and, therefore,
$\sum_{0\neq i\in[-r,r]}\ell_{i,k}=\ell_{k}$
 and, therefore, 
 \begin{equation}\begin{aligned}&\prod_{i=1}^{r}d\cdots(d-m\ell_{i}+1)\geq d\cdots(d-m\ell+1)\\ \text{and}\quad &\frac{(m\ell_{k})!}{\big(\sum_{i\lt k}m\ell_{i,k}\big)!\big(\sum_{i>k}m\ell_{i,k}\big)!}=\binom{m\ell_{k}}{\sum_{i>k} m\ell_{i,k}} \leq2^{m\ell_{k}}.\end{aligned}\end{equation}
\begin{equation}\begin{aligned}&\prod_{i=1}^{r}d\cdots(d-m\ell_{i}+1)\geq d\cdots(d-m\ell+1)\\ \text{and}\quad &\frac{(m\ell_{k})!}{\big(\sum_{i\lt k}m\ell_{i,k}\big)!\big(\sum_{i>k}m\ell_{i,k}\big)!}=\binom{m\ell_{k}}{\sum_{i>k} m\ell_{i,k}} \leq2^{m\ell_{k}}.\end{aligned}\end{equation}
By Corollary 5.4, Proposition 6.1 and by (7.3), (7.1) and (7.2), we obtain
 \begin{align*}&\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\frac{|Z|}{m!^{\ell}}\prod_{i=1}^{r}\frac{1}{d\cdots(d-m\ell_{i}+1)}\\&\quad \leq \binom{d+m\ell}{m\ell}\cdot\frac{(m\ell)!}{\prod_{i=1}^{r}d\cdots(d-m\ell_{i}+1)}\cdot\frac{1}{m!^{\ell}}\cdot\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\\&\ \leq \binom{d+m\ell}{m\ell}\cdot\frac{(m\ell)!}{d\cdots(d-m\ell+1)}\cdot\frac{\prod_{0\neq k=-r}^{r}\big(\sum_{i>k}m\ell_{i,k}\big)!}{m!^{\ell}}\cdot\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!\big(\sum_{i>k}m\ell_{i,k}\big)!}\\[5pt]&\ \leq \binom{d+m\ell}{m\ell}\cdot\binom{d}{m\ell}^{-1}\cdot\frac{(m\ell)!}{m!^{\ell}}\cdot\prod_{0\neq k=-r}^{r}2^{m\ell_{k}}\leq(2e)^{m\ell}\ell^{m\ell}\cdot2^{2m\ell}\leq(8e\ell)^{m\ell}\leq(22\ell)^{m\ell}.\end{align*}
\begin{align*}&\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\frac{|Z|}{m!^{\ell}}\prod_{i=1}^{r}\frac{1}{d\cdots(d-m\ell_{i}+1)}\\&\quad \leq \binom{d+m\ell}{m\ell}\cdot\frac{(m\ell)!}{\prod_{i=1}^{r}d\cdots(d-m\ell_{i}+1)}\cdot\frac{1}{m!^{\ell}}\cdot\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\\&\ \leq \binom{d+m\ell}{m\ell}\cdot\frac{(m\ell)!}{d\cdots(d-m\ell+1)}\cdot\frac{\prod_{0\neq k=-r}^{r}\big(\sum_{i>k}m\ell_{i,k}\big)!}{m!^{\ell}}\cdot\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!\big(\sum_{i>k}m\ell_{i,k}\big)!}\\[5pt]&\ \leq \binom{d+m\ell}{m\ell}\cdot\binom{d}{m\ell}^{-1}\cdot\frac{(m\ell)!}{m!^{\ell}}\cdot\prod_{0\neq k=-r}^{r}2^{m\ell_{k}}\leq(2e)^{m\ell}\ell^{m\ell}\cdot2^{2m\ell}\leq(8e\ell)^{m\ell}\leq(22\ell)^{m\ell}.\end{align*}
 Finally, note that if 
 $d\geq(22\ell)^{\ell}m$
, then
$d\geq(22\ell)^{\ell}m$
, then 
 $(22\ell)^{m\ell}\leq(\frac{d}{m})^{m}\leq\binom{d}{m}$
.
$(22\ell)^{m\ell}\leq(\frac{d}{m})^{m}\leq\binom{d}{m}$
.
We now turn to the proof of Theorem 1.1. We first deal with the case when the rank is bounded (and prove Conjecture 1.7 in this case) and then prove Theorem 1.1 in the unbounded case.
Definition 7.1. Given 
 $w_{1}\in F_{r_{1}}$
 and
$w_{1}\in F_{r_{1}}$
 and 
 $w_{2}\in F_{r_{2}}$
, we denote by
$w_{2}\in F_{r_{2}}$
, we denote by 
 $w_{1}*w_{2}\in F_{r_{1}+r_{2}}$
 their concatenation. For example, if
$w_{1}*w_{2}\in F_{r_{1}+r_{2}}$
 their concatenation. For example, if 
 $w=[x,y]$
, then
$w=[x,y]$
, then 
 $w*w=[x,y]\cdot[z,w]$
.
$w*w=[x,y]\cdot[z,w]$
.
 We remind the reader that for a compact group G, and a word 
 $w\in F_{r}$
, we denote by
$w\in F_{r}$
, we denote by 
 $\tau_{w,G}:=(w_{G})_{*}(\mu_{G}^{r})$
 the word measure associated to w and G, and the Fourier coefficient of
$\tau_{w,G}:=(w_{G})_{*}(\mu_{G}^{r})$
 the word measure associated to w and G, and the Fourier coefficient of 
 $\tau_{w,G}$
 at
$\tau_{w,G}$
 at 
 $\rho\in\mathrm{Irr}(G)$
 is
$\rho\in\mathrm{Irr}(G)$
 is 
 $a_{w,G,\rho}:=\int_{G^{r}}\rho(w(x_{1},\ldots,x_{r}))\mu_{G}^{r}=\int_{G}\rho(y)\tau_{w,G}$
. If G is a compact connected semisimple Lie group, by [Reference BorelBor83], the map
$a_{w,G,\rho}:=\int_{G^{r}}\rho(w(x_{1},\ldots,x_{r}))\mu_{G}^{r}=\int_{G}\rho(y)\tau_{w,G}$
. If G is a compact connected semisimple Lie group, by [Reference BorelBor83], the map 
 $w_{G}:G^{r}\rightarrow G$
 is a submersion outside a proper subvariety in
$w_{G}:G^{r}\rightarrow G$
 is a submersion outside a proper subvariety in 
 $G^{r}$
. It follows that in this case, or e.g. when G is a finite group,
$G^{r}$
. It follows that in this case, or e.g. when G is a finite group, 
 $\tau_{w,G}$
 is absolutely continuous with respect to
$\tau_{w,G}$
 is absolutely continuous with respect to 
 $\mu_{G}$
, and we can write
$\mu_{G}$
, and we can write 
 $\tau_{w,G}=f_{w,G}\cdot \mu_{G}$
, with
$\tau_{w,G}=f_{w,G}\cdot \mu_{G}$
, with 
 $f_{w,G}\in L^{1}(G)$
. Since
$f_{w,G}\in L^{1}(G)$
. Since 
 $\tau_{w,G}$
 is conjugate invariant,
$\tau_{w,G}$
 is conjugate invariant, 
 $f_{w,G}$
 is a class function, and it can be written as a linear combination of characters
$f_{w,G}$
 is a class function, and it can be written as a linear combination of characters 
 $f_{w,G}=\sum_{\rho\in\mathrm{Irr}(G)}\overline{a_{w,G,\rho}}\cdot\rho$
.
$f_{w,G}=\sum_{\rho\in\mathrm{Irr}(G)}\overline{a_{w,G,\rho}}\cdot\rho$
.
 By Definition 7.1, we see that 
 $\tau_{w_{1}*w_{2},G}=\tau_{w_{1},G}*\tau_{w_{2},G}$
 for every
$\tau_{w_{1}*w_{2},G}=\tau_{w_{1},G}*\tau_{w_{2},G}$
 for every 
 $w_{1}\in F_{r_{1}}$
 and
$w_{1}\in F_{r_{1}}$
 and 
 $w_{2}\in F_{r_{2}}$
. Since
$w_{2}\in F_{r_{2}}$
. Since 
 $\rho_{1}*\rho_{2}=\frac{\delta_{\rho_{1},\rho_{2}}}{\rho_{1}(1)}\cdot\rho_{1}$
 for every
$\rho_{1}*\rho_{2}=\frac{\delta_{\rho_{1},\rho_{2}}}{\rho_{1}(1)}\cdot\rho_{1}$
 for every 
 $\rho_{1},\rho_{2}\in\mathrm{Irr}(G)$
, we have
$\rho_{1},\rho_{2}\in\mathrm{Irr}(G)$
, we have 
 \begin{equation}a_{w_{1}*w_{2},G,\rho}=\int_{G}\rho(g)\tau_{w_{1}*w_{2},G}(g)=\int_{G}\rho(g)\tau_{w_{1},G}*\tau_{w_{2},G}(g)=\frac{a_{w_{1},G,\rho}\cdot a_{w_{2},G,\rho}}{\rho(1)}.\end{equation}
\begin{equation}a_{w_{1}*w_{2},G,\rho}=\int_{G}\rho(g)\tau_{w_{1}*w_{2},G}(g)=\int_{G}\rho(g)\tau_{w_{1},G}*\tau_{w_{2},G}(g)=\frac{a_{w_{1},G,\rho}\cdot a_{w_{2},G,\rho}}{\rho(1)}.\end{equation}
Proposition 7.2. For every 
 $1\neq w\in F_{r}$
 and
$1\neq w\in F_{r}$
 and 
 $d\in\mathbb{N}$
, there exists
$d\in\mathbb{N}$
, there exists 
 $\epsilon(d,w)>0$
 such that:
$\epsilon(d,w)>0$
 such that:
- 
(1) for every compact connected semisimple Lie group G of rank d and every  $\rho\in\mathrm{Irr}(G)$
, we have $\rho\in\mathrm{Irr}(G)$
, we have $|a_{w,G,\rho}|\leq\rho(1)^{1-\epsilon(d,w)}$
; $|a_{w,G,\rho}|\leq\rho(1)^{1-\epsilon(d,w)}$
;
- 
(2) in particular, for every  $1\leq m\leq d$
, $1\leq m\leq d$
, \[\mathbb{E}_{\mathrm{U}_{d}}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)=\mathbb{E}_{\mathrm{SU}_{d}}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\binom{d}{m}^{\!\!2(1-\epsilon(d,w))}.\] \[\mathbb{E}_{\mathrm{U}_{d}}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)=\mathbb{E}_{\mathrm{SU}_{d}}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\binom{d}{m}^{\!\!2(1-\epsilon(d,w))}.\]
Proof. We first prove item (1). Fix 
 $w\in F_{r}$
 and a compact connected semisimple Lie group G. Let
$w\in F_{r}$
 and a compact connected semisimple Lie group G. Let 
 $\tau_{w,G}=f_{w,G}\mu_{G}$
 be the word measure. By (7.4), and since
$\tau_{w,G}=f_{w,G}\mu_{G}$
 be the word measure. By (7.4), and since 
 $a_{w^{-1},G,\rho}=\overline{a_{w,G,\rho}}$
 for each
$a_{w^{-1},G,\rho}=\overline{a_{w,G,\rho}}$
 for each 
 $\rho\in\mathrm{Irr}(G)$
, we have
$\rho\in\mathrm{Irr}(G)$
, we have 
 \begin{equation}a_{w*w^{-1},G,\rho}=\frac{|a_{w,G,\rho}|^{2}}{\rho(1)}.\end{equation}
\begin{equation}a_{w*w^{-1},G,\rho}=\frac{|a_{w,G,\rho}|^{2}}{\rho(1)}.\end{equation}
 Replacing w by 
 $w*w^{-1}$
, we may assume that all Fourier coefficients
$w*w^{-1}$
, we may assume that all Fourier coefficients 
 $a_{w,G,\rho}$
 are in
$a_{w,G,\rho}$
 are in 
 $\mathbb{R}_{\geq0}$
.
$\mathbb{R}_{\geq0}$
.
 It follows from [Reference Glazer, Hendel and SodinGHS24, Theorem 1.1] that 
 $f_{w,G}\in L^{1+\epsilon'}(G)$
 for some
$f_{w,G}\in L^{1+\epsilon'}(G)$
 for some 
 $\epsilon'=\epsilon'(G,w)>0$
. By Young’s convolution inequality, it follows that
$\epsilon'=\epsilon'(G,w)>0$
. By Young’s convolution inequality, it follows that 
 $f_{w,G}^{*t}\in L^{\infty}(G)$
 for all
$f_{w,G}^{*t}\in L^{\infty}(G)$
 for all 
 $t\geq t_{0}(G,w):=\lceil\frac{1+\epsilon'(G,w)}{\epsilon'(G,w)}\rceil $
 (see, e.g., [Reference Glazer, Hendel and SodinGHS24, Section 1.1, end of p.3]). In particular, by (7.4), we deduce that
$t\geq t_{0}(G,w):=\lceil\frac{1+\epsilon'(G,w)}{\epsilon'(G,w)}\rceil $
 (see, e.g., [Reference Glazer, Hendel and SodinGHS24, Section 1.1, end of p.3]). In particular, by (7.4), we deduce that 
 \[f_{w,G}^{*t_{0}}(1)=\sum_{\rho\in\mathrm{Irr}(G)}\rho(1)^{2-t_{0}}a_{w,G,\rho}^{t_{0}}<\infty.\]
\[f_{w,G}^{*t_{0}}(1)=\sum_{\rho\in\mathrm{Irr}(G)}\rho(1)^{2-t_{0}}a_{w,G,\rho}^{t_{0}}<\infty.\]
 Since 
 $a_{w,G,\rho}\geq0$
, we deduce that
$a_{w,G,\rho}\geq0$
, we deduce that 
 $a_{w,G,\rho}<\rho(1)^{1-\frac{2}{t_{0}(G,w)}}$
 for all but finitely many
$a_{w,G,\rho}<\rho(1)^{1-\frac{2}{t_{0}(G,w)}}$
 for all but finitely many 
 $\rho\in\mathrm{Irr}(G)$
. To deal with the remaining finitely many (non-trivial) representations of G, we simply use the bound
$\rho\in\mathrm{Irr}(G)$
. To deal with the remaining finitely many (non-trivial) representations of G, we simply use the bound 
 $a_{w,G,\rho}<\rho(1)$
, which follows e.g. by the Itô–Kawada equidistribution theorem [Reference Kawada and ItôKI40] (see also [Reference ApplebaumApp14, Theorem 4.6.3]), since
$a_{w,G,\rho}<\rho(1)$
, which follows e.g. by the Itô–Kawada equidistribution theorem [Reference Kawada and ItôKI40] (see also [Reference ApplebaumApp14, Theorem 4.6.3]), since 
 $\mathrm{Supp}(\tau_{w,G})$
 generates G. Since there are only finitely many compact semisimple connected Lie groups of rank d, this implies item (1).
$\mathrm{Supp}(\tau_{w,G})$
 generates G. Since there are only finitely many compact semisimple connected Lie groups of rank d, this implies item (1).
 Note that the character 
 $\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}$
 of the representation
$\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}$
 of the representation 
 $\bigwedge\nolimits^{\!m}\mathbb{C}^{d}\otimes\big(\bigwedge\nolimits^{\!m}\mathbb{C}^{d}\big)^{\vee}$
 of
$\bigwedge\nolimits^{\!m}\mathbb{C}^{d}\otimes\big(\bigwedge\nolimits^{\!m}\mathbb{C}^{d}\big)^{\vee}$
 of 
 $\mathrm{SU}_{d}$
 is given by
$\mathrm{SU}_{d}$
 is given by 
 $\big|\mathrm{tr}\big(\bigwedge\nolimits^{\!m}(A)\big)\big|^{2}$
. Since
$\big|\mathrm{tr}\big(\bigwedge\nolimits^{\!m}(A)\big)\big|^{2}$
. Since 
 $\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}$
 is a sum of irreducible characters, by applying the Itô–Kawada equidistribution theorem to each irreducible character, for each
$\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}$
 is a sum of irreducible characters, by applying the Itô–Kawada equidistribution theorem to each irreducible character, for each 
 $1\leq m\leq d$
, we have
$1\leq m\leq d$
, we have 
 \begin{align*}\mathbb{E}_{\mathrm{SU}_{d}}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!m}(w(X_{1},\ldots,X_{r}))\Big)\!\Big|^{2}\Big)&=\mathbb{E}_{\mathrm{SU}_{d}}(\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}(w(X_{1},\ldots,X_{r})))\\&<\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}(1)=\binom{d}{m}^{\!\!2}.\end{align*}
\begin{align*}\mathbb{E}_{\mathrm{SU}_{d}}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!m}(w(X_{1},\ldots,X_{r}))\Big)\!\Big|^{2}\Big)&=\mathbb{E}_{\mathrm{SU}_{d}}(\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}(w(X_{1},\ldots,X_{r})))\\&<\rho_{(1^{m})}\otimes\rho_{(1^{m})}^{\vee}(1)=\binom{d}{m}^{\!\!2}.\end{align*}
Since there are only finitely many such m’s, this implies item (2).
Theorem 1.1 now follows from Proposition 7.2 and the following theorem.
Theorem 7.3. For every 
 $\ell\in\mathbb{N}$
, there exist
$\ell\in\mathbb{N}$
, there exist 
 $\epsilon(\ell),C(\ell)>0$
 such that, for every
$\epsilon(\ell),C(\ell)>0$
 such that, for every 
 $d\geq C(\ell)$
, every
$d\geq C(\ell)$
, every 
 $1\leq m\leq d$
, and every word
$1\leq m\leq d$
, and every word 
 $w\in F_{r}$
 of length
$w\in F_{r}$
 of length 
 $\ell$
, one has
$\ell$
, one has 
 \[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\binom{d}{m}^{\!\!2(1-\epsilon(\ell))}.\]
\[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\leq\binom{d}{m}^{\!\!2(1-\epsilon(\ell))}.\]
In order to prove Theorem 7.3, we need the following technical lemma.
Lemma 7.4. Let 
 $H(x)=-x\log(x)-(1-x)\log(1-x)$
 be the binary entropy function. Then:
$H(x)=-x\log(x)-(1-x)\log(1-x)$
 be the binary entropy function. Then:
- 
(1) for every  $d\in\mathbb{N}$
 and every $d\in\mathbb{N}$
 and every $0<x<1$
 such that $0<x<1$
 such that $dx\in\mathbb{N}$
, we have $dx\in\mathbb{N}$
, we have $({2^{dH(x)}}/{\sqrt{8dx(1-x)}})\leq\binom{d}{xd}\leq({2^{dH(x)}}/{\sqrt{\pi dx(1-x)}})\leq2^{dH(x)}$
; $({2^{dH(x)}}/{\sqrt{8dx(1-x)}})\leq\binom{d}{xd}\leq({2^{dH(x)}}/{\sqrt{\pi dx(1-x)}})\leq2^{dH(x)}$
;
- 
(2) let  $0<\delta\leq\frac{1}{2}$
, then for every $0<\delta\leq\frac{1}{2}$
, then for every $b\in[\delta,\frac{1}{2}]$
, $b\in[\delta,\frac{1}{2}]$
, $a\in[\delta,b]$
, and $a\in[\delta,b]$
, and $d>({1}/{\delta^{4}})$
 such that bd,ad,d are integers, one has $d>({1}/{\delta^{4}})$
 such that bd,ad,d are integers, one has \[\binom{d}{(b-a)d}\leq\binom{d}{bd}^{\!\!1-\delta^{2}}.\] \[\binom{d}{(b-a)d}\leq\binom{d}{bd}^{\!\!1-\delta^{2}}.\]
Proof. Item (1) follows, e.g., from [Reference Cover and ThomasCT06, Lemma 17.5.1]. The Taylor series of H(x) around 
 $1/2$
 is
$1/2$
 is 
 \begin{equation}H(x)=1-\frac{1}{2\ln2}\sum_{n=1}^{\infty}\frac{(1-2x)^{2n}}{n(2n-1)}.\end{equation}
\begin{equation}H(x)=1-\frac{1}{2\ln2}\sum_{n=1}^{\infty}\frac{(1-2x)^{2n}}{n(2n-1)}.\end{equation}
 Since 
 $H'(x)=\log(\frac{1-x}{x})$
, H(x) is monotone increasing in
$H'(x)=\log(\frac{1-x}{x})$
, H(x) is monotone increasing in 
 $(0,1/2)$
 and, therefore,
$(0,1/2)$
 and, therefore, 
 \begin{align*}H(b)-H(b-a) & \geq H(b)-H(b-\delta)=\frac{1}{2\ln2}\bigg(\sum_{n=1}^{\infty}\frac{(1-2b+2\delta)^{2n}-(1-2b)^{2n}}{n(2n-1)}\bigg)\\&\geq\frac{1}{2\ln2}((1-2b+2\delta)^{2}-(1-2b)^{2})=\frac{1}{2\ln2}(4\delta^{2}+4\delta(1-2b))\geq2\delta^{2}.\end{align*}
\begin{align*}H(b)-H(b-a) & \geq H(b)-H(b-\delta)=\frac{1}{2\ln2}\bigg(\sum_{n=1}^{\infty}\frac{(1-2b+2\delta)^{2n}-(1-2b)^{2n}}{n(2n-1)}\bigg)\\&\geq\frac{1}{2\ln2}((1-2b+2\delta)^{2}-(1-2b)^{2})=\frac{1}{2\ln2}(4\delta^{2}+4\delta(1-2b))\geq2\delta^{2}.\end{align*}
 Since 
 $d>\frac{1}{\delta^{4}}\geq16$
, we have
$d>\frac{1}{\delta^{4}}\geq16$
, we have 
 $\frac{\log(d)}{d}\leq\frac{1}{\sqrt{d}}\leq\delta^{2}$
. Combining with item (1), we have
$\frac{\log(d)}{d}\leq\frac{1}{\sqrt{d}}\leq\delta^{2}$
. Combining with item (1), we have 
 \begin{align*}\hphantom{000000000}\binom{d}{(b-a)d} &\leq2^{dH(b-a)}\leq\sqrt{8db(1-b)}2^{d(H(b-a)-H(b))}\binom{d}{bd}\leq2^{-2d\delta^{2}+\log(d)}\binom{d}{bd}\\&\leq2^{-d\delta^{2}}\binom{d}{bd}=(2^{-dH(b)})^{\frac{\delta^{2}}{H(b)}}\binom{d}{bd}\leq\binom{d}{bd}^{\!\!1-\frac{\delta^{2}}{H(b)}}\leq\binom{d}{bd}^{\!\!1-\delta^{2}}.\hphantom{000000000}\end{align*}
\begin{align*}\hphantom{000000000}\binom{d}{(b-a)d} &\leq2^{dH(b-a)}\leq\sqrt{8db(1-b)}2^{d(H(b-a)-H(b))}\binom{d}{bd}\leq2^{-2d\delta^{2}+\log(d)}\binom{d}{bd}\\&\leq2^{-d\delta^{2}}\binom{d}{bd}=(2^{-dH(b)})^{\frac{\delta^{2}}{H(b)}}\binom{d}{bd}\leq\binom{d}{bd}^{\!\!1-\frac{\delta^{2}}{H(b)}}\leq\binom{d}{bd}^{\!\!1-\delta^{2}}.\hphantom{000000000}\end{align*}
 
Proof of Theorem 7.3. Since 
 $\bigwedge\nolimits^{\!m}V\simeq\big(\bigwedge\nolimits^{\!d-m}V\big)^{\vee}\otimes\chi_{\mathrm{det}}$
, we may assume that
$\bigwedge\nolimits^{\!m}V\simeq\big(\bigwedge\nolimits^{\!d-m}V\big)^{\vee}\otimes\chi_{\mathrm{det}}$
, we may assume that 
 $2m\leq d$
. Let
$2m\leq d$
. Let 
 $\delta(\ell):=(25\ell)^{-\ell}$
, let
$\delta(\ell):=(25\ell)^{-\ell}$
, let 
 $C(\ell)=\delta(\ell)^{-7}$
, and suppose that
$C(\ell)=\delta(\ell)^{-7}$
, and suppose that 
 $d\geq C(\ell)$
. By Theorem 1.3, we may assume that
$d\geq C(\ell)$
. By Theorem 1.3, we may assume that 
 $d\leq\delta(\ell)^{-1}m$
, and, in particular,
$d\leq\delta(\ell)^{-1}m$
, and, in particular, 
 $m\geq\delta(\ell)^{-6}$
. As in the proof of Proposition 7.2, by replacing w by
$m\geq\delta(\ell)^{-6}$
. As in the proof of Proposition 7.2, by replacing w by 
 $w*w^{-1}$
, we may assume that
$w*w^{-1}$
, we may assume that 
 $a_{w,\mathrm{U}_{d},\rho}\in\mathbb{R}_{\geq0}$
 for all
$a_{w,\mathrm{U}_{d},\rho}\in\mathbb{R}_{\geq0}$
 for all 
 $\rho\in\mathrm{Irr}(\mathrm{U}_{d})$
. By Theorem 2.5, we have for all
$\rho\in\mathrm{Irr}(\mathrm{U}_{d})$
. By Theorem 2.5, we have for all 
 $c\leq \frac{d}{2}$
:
$c\leq \frac{d}{2}$
: 
 \[\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!c}V^{\vee}\simeq\Big(\!\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!d-c}V\Big)\otimes\chi_{\mathrm{det}}^{-1}\simeq\bigoplus_{j=0}^{c}V_{\lambda_{(j)}},\]
\[\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!c}V^{\vee}\simeq\Big(\!\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!d-c}V\Big)\otimes\chi_{\mathrm{det}}^{-1}\simeq\bigoplus_{j=0}^{c}V_{\lambda_{(j)}},\]
 where 
 $\lambda_{(j)}=(1,\ldots,1,0,\ldots,0,-1,\ldots,-1)$
, with
$\lambda_{(j)}=(1,\ldots,1,0,\ldots,0,-1,\ldots,-1)$
, with 
 $-1$
 and 1 appearing j times. Moreover,
$-1$
 and 1 appearing j times. Moreover, 
 $V_{\lambda_{(c)}}$
 is the largest irreducible subrepresentation of
$V_{\lambda_{(c)}}$
 is the largest irreducible subrepresentation of 
 $\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!c}V^{\vee}$
, and we have
$\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!c}V^{\vee}$
, and we have 
 $\rho_{\lambda_{(c)}}(1)\geq (\frac{1}{c+1})\binom{d}{c}^{\!2}\geq\binom{d}{c}^{\!3/2}$
. By Theorem 1.3, and since all
$\rho_{\lambda_{(c)}}(1)\geq (\frac{1}{c+1})\binom{d}{c}^{\!2}\geq\binom{d}{c}^{\!3/2}$
. By Theorem 1.3, and since all 
 $a_{w,\mathrm{U}_{d},\rho}$
 are non-negative, if
$a_{w,\mathrm{U}_{d},\rho}$
 are non-negative, if 
 $c\leq\lceil\delta(\ell)d\rceil\leq(22\ell)^{-\ell}d$
, then
$c\leq\lceil\delta(\ell)d\rceil\leq(22\ell)^{-\ell}d$
, then 
 \[\mathbb{E}\big(\rho_{\lambda_{(c)}}\circ w\big)\leq\sum_{j=0}^{c}\mathbb{E}\big(\rho_{\lambda_{(j)}}\circ w\big)=\mathbb{E}\big(\rho_{\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!c}V^{\vee}}\circ w\big)=\mathbb{E}\big(\big|\big(\rho_{\bigwedge\nolimits^{\!c}V}\circ w\big)\big|^{2}\big)\leq\binom{d}{c}\leq\rho_{\lambda_{(c)}}(1)^{2/3}.\]
\[\mathbb{E}\big(\rho_{\lambda_{(c)}}\circ w\big)\leq\sum_{j=0}^{c}\mathbb{E}\big(\rho_{\lambda_{(j)}}\circ w\big)=\mathbb{E}\big(\rho_{\bigwedge\nolimits^{\!c}V\otimes\bigwedge\nolimits^{\!c}V^{\vee}}\circ w\big)=\mathbb{E}\big(\big|\big(\rho_{\bigwedge\nolimits^{\!c}V}\circ w\big)\big|^{2}\big)\leq\binom{d}{c}\leq\rho_{\lambda_{(c)}}(1)^{2/3}.\]
 Applying the last inequality for 
 $w^{*9}$
, recalling that
$w^{*9}$
, recalling that 
 $a_{w^{*t},\mathrm{U}_{d},\rho}=\frac{a_{w,\mathrm{U}_{d},\rho}^{t}}{\rho(1)^{t-1}}$
 for all
$a_{w^{*t},\mathrm{U}_{d},\rho}=\frac{a_{w,\mathrm{U}_{d},\rho}^{t}}{\rho(1)^{t-1}}$
 for all 
 $\rho\in\mathrm{Irr}(\mathrm{U}_{d})$
, we get
$\rho\in\mathrm{Irr}(\mathrm{U}_{d})$
, we get 
 \begin{align}\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)&=\sum_{j=0}^{\lceil\delta(\ell)d\rceil}\mathbb{E}\big(\rho_{\lambda_{(j)}}\circ w^{*9}\big)\leq\sum_{j=0}^{\lceil\delta(\ell)d\rceil}\rho_{\lambda_{(j)}}(1)^{-2}\nonumber\\&\leq\sum_{j=1}^{\infty}\frac{1}{j^{2}}<2.\end{align}
\begin{align}\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)&=\sum_{j=0}^{\lceil\delta(\ell)d\rceil}\mathbb{E}\big(\rho_{\lambda_{(j)}}\circ w^{*9}\big)\leq\sum_{j=0}^{\lceil\delta(\ell)d\rceil}\rho_{\lambda_{(j)}}(1)^{-2}\nonumber\\&\leq\sum_{j=1}^{\infty}\frac{1}{j^{2}}<2.\end{align}
 Note that, for each 
 $\delta(\ell)d\leq m\leq\frac{d}{2}$
,
$\delta(\ell)d\leq m\leq\frac{d}{2}$
, 
 $\bigwedge\nolimits^{\!m}V$
 is a subrepresentation of
$\bigwedge\nolimits^{\!m}V$
 is a subrepresentation of 
 $\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}V\otimes\bigwedge\nolimits^{\!m-\lceil\delta(\ell)d\rceil}V$
, so
$\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}V\otimes\bigwedge\nolimits^{\!m-\lceil\delta(\ell)d\rceil}V$
, so 
 \[\bigwedge\nolimits^{\!\!m}V\otimes\Big(\!\bigwedge\nolimits^{\!\!m}V\!\Big)^{\vee}\!\hookrightarrow\!\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}V\otimes\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}V\!\Big)^{\vee}\Big)\otimes\Big(\!\bigwedge\nolimits^{\!\!m-\lceil\delta(\ell)d\rceil}V\otimes\Big(\!\bigwedge\nolimits^{\!\!m-\lceil\delta(\ell)d\rceil}V\!\Big)^{\vee}\Big).\]
\[\bigwedge\nolimits^{\!\!m}V\otimes\Big(\!\bigwedge\nolimits^{\!\!m}V\!\Big)^{\vee}\!\hookrightarrow\!\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}V\otimes\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}V\!\Big)^{\vee}\Big)\otimes\Big(\!\bigwedge\nolimits^{\!\!m-\lceil\delta(\ell)d\rceil}V\otimes\Big(\!\bigwedge\nolimits^{\!\!m-\lceil\delta(\ell)d\rceil}V\!\Big)^{\vee}\Big).\]
 Finally, by the positivity of the Fourier coefficients of w, by (7.7), by Lemma 7.4 (note that 
 $m\geq\lceil\delta(\ell)d\rceil$
) and by (3.12) (note that
$m\geq\lceil\delta(\ell)d\rceil$
) and by (3.12) (note that 
 $\delta(\ell)^{2}m\geq1$
),
$\delta(\ell)^{2}m\geq1$
), 
 \begin{align*}&\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\\&\quad \leq\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m-\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\\&\quad \leq\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\cdot\binom{d}{m-\lceil\delta(\ell)d\rceil}^{\!\!2}\\&\quad \leq2\binom{d}{m-\lceil\delta(\ell)d\rceil}^{\!\!2}\leq\frac{d}{m}\binom{d}{m}^{\!\!2-2\delta(\ell)^{2}}\leq\binom{d}{m}^{\!\!2-\delta(\ell)^{2}}.\end{align*}
\begin{align*}&\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\\&\quad \leq\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m-\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\\&\quad \leq\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\lceil\delta(\ell)d\rceil}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\cdot\binom{d}{m-\lceil\delta(\ell)d\rceil}^{\!\!2}\\&\quad \leq2\binom{d}{m-\lceil\delta(\ell)d\rceil}^{\!\!2}\leq\frac{d}{m}\binom{d}{m}^{\!\!2-2\delta(\ell)^{2}}\leq\binom{d}{m}^{\!\!2-\delta(\ell)^{2}}.\end{align*}
 By (3.12), 
 $m+1\leq2^{2\sqrt{m}}\leq\binom{d}{m}^{\!2/\sqrt{m}}$
 for each
$m+1\leq2^{2\sqrt{m}}\leq\binom{d}{m}^{\!2/\sqrt{m}}$
 for each 
 $m\leq\frac{d}{2}$
. Hence,
$m\leq\frac{d}{2}$
. Hence, 
 \begin{equation}\rho_{\lambda_{(m)}}(1)\geq\frac{1}{m+1}\binom{d}{m}^{\!\!2}\geq\binom{d}{m}^{\!\!2(1-\frac{1}{\sqrt{m}})}\geq\binom{d}{m}^{\!\!2-2\delta(\ell)^{3}}.\end{equation}
\begin{equation}\rho_{\lambda_{(m)}}(1)\geq\frac{1}{m+1}\binom{d}{m}^{\!\!2}\geq\binom{d}{m}^{\!\!2(1-\frac{1}{\sqrt{m}})}\geq\binom{d}{m}^{\!\!2-2\delta(\ell)^{3}}.\end{equation}
Consequently, we get
 \begin{align*}\big(\mathbb{E}\big(\rho_{\lambda_{(m)}}\circ w\big)\big)^{9} &=\mathbb{E}\big(\rho_{\lambda_{(m)}}\circ w^{*9}\big)\rho_{\lambda_{(m)}}(1)^{8}\leq\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\rho_{\lambda_{(m)}}(1)^{8}\\&\leq\binom{d}{m}^{\!\!2-\delta(\ell)^{2}}\rho_{\lambda_{(m)}}(1)^{8}\leq\rho_{\lambda_{(m)}}(1)^{9-\frac{\delta(\ell)^{2}}{4}},\end{align*}
\begin{align*}\big(\mathbb{E}\big(\rho_{\lambda_{(m)}}\circ w\big)\big)^{9} &=\mathbb{E}\big(\rho_{\lambda_{(m)}}\circ w^{*9}\big)\rho_{\lambda_{(m)}}(1)^{8}\leq\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w^{*9}(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)\rho_{\lambda_{(m)}}(1)^{8}\\&\leq\binom{d}{m}^{\!\!2-\delta(\ell)^{2}}\rho_{\lambda_{(m)}}(1)^{8}\leq\rho_{\lambda_{(m)}}(1)^{9-\frac{\delta(\ell)^{2}}{4}},\end{align*}
 and, thus, 
 $\mathbb{E}\big(\rho_{\lambda_{(m)}}\circ w\big)\leq\rho_{\lambda_{(m)}}(1)^{1-\frac{\delta(\ell)^{2}}{36}}$
. Taking
$\mathbb{E}\big(\rho_{\lambda_{(m)}}\circ w\big)\leq\rho_{\lambda_{(m)}}(1)^{1-\frac{\delta(\ell)^{2}}{36}}$
. Taking 
 $\epsilon(\ell):=\frac{\delta(\ell)^{2}}{72}$
, and using
$\epsilon(\ell):=\frac{\delta(\ell)^{2}}{72}$
, and using 
 $m+1\leq\binom{d}{m}^{\!2\delta(\ell)^{3}}$
, we get
$m+1\leq\binom{d}{m}^{\!2\delta(\ell)^{3}}$
, we get 
 \[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)=\sum_{j=0}^{m}\mathbb{E}\big(\rho_{\lambda_{(j)}}\circ w\big)\leq(m+1)\binom{d}{m}^{\!\!2-4\epsilon(\ell)}\leq\binom{d}{m}^{\!\!2(1-\epsilon(\ell))}.\]
\[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w(X_{1},\ldots,X_{r})\Big)\!\Big|^{2}\Big)=\sum_{j=0}^{m}\mathbb{E}\big(\rho_{\lambda_{(j)}}\circ w\big)\leq(m+1)\binom{d}{m}^{\!\!2-4\epsilon(\ell)}\leq\binom{d}{m}^{\!\!2(1-\epsilon(\ell))}.\]
We end the section with a proof of Theorem 1.6.
 
Proof of Theorem 1.6. Let 
 $w\in F_{r}$
. Denote
$w\in F_{r}$
. Denote 
 $\widetilde{w}:=w*w^{-1}$
. Recall that
$\widetilde{w}:=w*w^{-1}$
. Recall that 
 $\tau_{\widetilde{w},G}=f_{\widetilde{w},G}\mu_{G}$
 and note that for every
$\tau_{\widetilde{w},G}=f_{\widetilde{w},G}\mu_{G}$
 and note that for every 
 $t\in\mathbb{N}$
,
$t\in\mathbb{N}$
, 
 \[ f_{\widetilde{w}^{*t},G}=f_{\widetilde{w},G}^{*t}.\]
\[ f_{\widetilde{w}^{*t},G}=f_{\widetilde{w},G}^{*t}.\]
 Applying [Reference Larsen, Shalev and TiepLST19, Theorem 4], there are 
 $C',M(w)\in\mathbb{N}$
 such that, for
$C',M(w)\in\mathbb{N}$
 such that, for 
 $N(w):=C'\ell(w)^{4}$
 and for every finite simple group G of size
$N(w):=C'\ell(w)^{4}$
 and for every finite simple group G of size 
 $>M(w)$
, one has
$>M(w)$
, one has 
 \[\bigg|\!\sum_{1\neq\rho\in\mathrm{Irr}(G)}\frac{a_{\widetilde{w},G,\rho}^{N(w)}}{\rho(1)^{N(w)-1}}\rho(1)\bigg|=\bigg|\!\sum_{1\neq\rho\in\mathrm{Irr}(G)}a_{\widetilde{w}^{*N(w)},G,\rho}\rho(1)\bigg|=|f_{\widetilde{w}^{*N(w)},G}(1)-1|=|f_{\widetilde{w},G}^{*N(w)}(1)-1|<1,\]
\[\bigg|\!\sum_{1\neq\rho\in\mathrm{Irr}(G)}\frac{a_{\widetilde{w},G,\rho}^{N(w)}}{\rho(1)^{N(w)-1}}\rho(1)\bigg|=\bigg|\!\sum_{1\neq\rho\in\mathrm{Irr}(G)}a_{\widetilde{w}^{*N(w)},G,\rho}\rho(1)\bigg|=|f_{\widetilde{w}^{*N(w)},G}(1)-1|=|f_{\widetilde{w},G}^{*N(w)}(1)-1|<1,\]
 where the first equality follows from (7.4). Since 
 $a_{\widetilde{w},G,\rho}=\frac{|a_{w,G,\rho}|^{2}}{\rho(1)}\geq0$
, we deduce that for each
$a_{\widetilde{w},G,\rho}=\frac{|a_{w,G,\rho}|^{2}}{\rho(1)}\geq0$
, we deduce that for each 
 $1\neq\rho\in\mathrm{Irr}(G)$
$1\neq\rho\in\mathrm{Irr}(G)$
 
 \[\frac{|a_{w,G,\rho}|^{2N(w)}}{\rho(1)^{2N(w)-2}}=\frac{|a_{w,G,\rho}|^{2N(w)}}{\rho(1)^{2N(w)-1}}\rho(1)=\frac{a_{\widetilde{w},G,\rho}^{N(w)}}{\rho(1)^{N(w)-1}}\rho(1)<1,\]
\[\frac{|a_{w,G,\rho}|^{2N(w)}}{\rho(1)^{2N(w)-2}}=\frac{|a_{w,G,\rho}|^{2N(w)}}{\rho(1)^{2N(w)-1}}\rho(1)=\frac{a_{\widetilde{w},G,\rho}^{N(w)}}{\rho(1)^{N(w)-1}}\rho(1)<1,\]
 from which the theorem follows for 
 $\epsilon=\frac{1}{N(w)}=\frac{1}{C'\ell(w)^{4}}$
.
$\epsilon=\frac{1}{N(w)}=\frac{1}{C'\ell(w)^{4}}$
.
8. Fourier coefficients of symmetric powers
 In this section, we prove Theorem 1.4. Denote 
 ${\mathcal J}_{m,d}=\{c_{1}\leq\cdots\leq c_{m}:c_{i}\in[d]\}$
. We first claim that, for each
${\mathcal J}_{m,d}=\{c_{1}\leq\cdots\leq c_{m}:c_{i}\in[d]\}$
. We first claim that, for each 
 $A\in\mathrm{End}(\mathbb{C}^{d})$
 and
$A\in\mathrm{End}(\mathbb{C}^{d})$
 and 
 $m\geq1$
,
$m\geq1$
, 
 \[\mathrm{tr}(\mathrm{Sym}^{m}A)=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}A_{a_{1}a_{\pi(1)}}\cdots A_{a_{m}a_{\pi(m)}}.\]
\[\mathrm{tr}(\mathrm{Sym}^{m}A)=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}A_{a_{1}a_{\pi(1)}}\cdots A_{a_{m}a_{\pi(m)}}.\]
 Indeed, for each 
 $\overrightarrow{\!c}\in{\mathcal J}_{m,d}$
, let
$\overrightarrow{\!c}\in{\mathcal J}_{m,d}$
, let 
 $\nu_{\overrightarrow{\!c}}$
 be the shape of
$\nu_{\overrightarrow{\!c}}$
 be the shape of 
 $\overrightarrow{\!c}$
 (see Definition 3.4) and set
$\overrightarrow{\!c}$
 (see Definition 3.4) and set 
 \[v_{\overrightarrow{\!c}}:=\sqrt{\frac{1}{m!\cdot\nu_{\overrightarrow{\!c}}!}}\sum_{\pi\in S_{m}}e_{c_{\pi(1)}}\otimes\cdots\otimes e_{c_{\pi(m)}}.\]
\[v_{\overrightarrow{\!c}}:=\sqrt{\frac{1}{m!\cdot\nu_{\overrightarrow{\!c}}!}}\sum_{\pi\in S_{m}}e_{c_{\pi(1)}}\otimes\cdots\otimes e_{c_{\pi(m)}}.\]
 Then 
 $\{ v_{\overrightarrow{\!c}}\}_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}$
 is an orthonormal basis for
$\{ v_{\overrightarrow{\!c}}\}_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}$
 is an orthonormal basis for 
 $\mathrm{Sym}^{m}(\mathbb{C}^{d})$
. Given
$\mathrm{Sym}^{m}(\mathbb{C}^{d})$
. Given 
 $A\in\mathrm{End}(\mathbb{C}^{d})$
, we have
$A\in\mathrm{End}(\mathbb{C}^{d})$
, we have 
 \begin{align*}\mathrm{tr}(\mathrm{Sym}^{m}A) & =\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\langle A\cdot v_{\overrightarrow{\!c}},v_{\overrightarrow{\!c}}\rangle\\&=\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\frac{1}{m!\cdot\nu_{\overrightarrow{\!c}}!}\sum_{\pi,\pi'\in S_{m}}\big\langle Ae_{c_{\pi(1)}}\otimes\cdots\otimes Ae_{c_{\pi(m)}},e_{c_{\pi'(1)}}\otimes\cdots\otimes e_{c_{\pi'(m)}}\big\rangle\\& =\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\frac{1}{\nu_{\overrightarrow{\!c}}!}\sum_{\pi\in S_{m}}\big\langle Ae_{c_{1}}\otimes\cdots\otimes Ae_{c_{m}},e_{c_{\pi(1)}}\otimes\cdots\otimes e_{c_{\pi(m)}}\big\rangle\\& =\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\frac{1}{\nu_{\overrightarrow{\!c}}!}\sum_{\pi\in S_{m}}A_{c_{1}c_{\pi(1)}}\cdots A_{c_{m}c_{\pi(m)}}=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}A_{a_{1}a_{\pi(1)}}\cdots A_{a_{m}a_{\pi(m)}},\end{align*}
\begin{align*}\mathrm{tr}(\mathrm{Sym}^{m}A) & =\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\langle A\cdot v_{\overrightarrow{\!c}},v_{\overrightarrow{\!c}}\rangle\\&=\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\frac{1}{m!\cdot\nu_{\overrightarrow{\!c}}!}\sum_{\pi,\pi'\in S_{m}}\big\langle Ae_{c_{\pi(1)}}\otimes\cdots\otimes Ae_{c_{\pi(m)}},e_{c_{\pi'(1)}}\otimes\cdots\otimes e_{c_{\pi'(m)}}\big\rangle\\& =\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\frac{1}{\nu_{\overrightarrow{\!c}}!}\sum_{\pi\in S_{m}}\big\langle Ae_{c_{1}}\otimes\cdots\otimes Ae_{c_{m}},e_{c_{\pi(1)}}\otimes\cdots\otimes e_{c_{\pi(m)}}\big\rangle\\& =\sum_{\overrightarrow{\!c}\in{\mathcal J}_{m,d}}\frac{1}{\nu_{\overrightarrow{\!c}}!}\sum_{\pi\in S_{m}}A_{c_{1}c_{\pi(1)}}\cdots A_{c_{m}c_{\pi(m)}}=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}A_{a_{1}a_{\pi(1)}}\cdots A_{a_{m}a_{\pi(m)}},\end{align*}
 where the last equality follows since 
 $\sum_{\pi\in S_{m}}A_{c_{1}c_{\pi(1)}}\cdots A_{c_{m}c_{\pi(m)}}$
 is invariant under permuting
$\sum_{\pi\in S_{m}}A_{c_{1}c_{\pi(1)}}\cdots A_{c_{m}c_{\pi(m)}}$
 is invariant under permuting 
 $c_{1},\ldots,c_{m}$
, and since there are
$c_{1},\ldots,c_{m}$
, and since there are 
 $\frac{m!}{\nu_{\overrightarrow{\!c}}!}$
 vectors
$\frac{m!}{\nu_{\overrightarrow{\!c}}!}$
 vectors 
 $\overrightarrow{\!a}\in[d]^{m}$
 of a shape
$\overrightarrow{\!a}\in[d]^{m}$
 of a shape 
 $\nu_{\overrightarrow{\!c}}$
. In particular, for any word w,
$\nu_{\overrightarrow{\!c}}$
. In particular, for any word w, 
 \begin{equation}\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}w(X_{1},\ldots,X_{r})_{a_{1}a_{\pi(1)}}\cdots w(X_{1},\ldots,X_{r})_{a_{m}a_{\pi(m)}}.\end{equation}
\begin{equation}\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}w(X_{1},\ldots,X_{r})_{a_{1}a_{\pi(1)}}\cdots w(X_{1},\ldots,X_{r})_{a_{m}a_{\pi(m)}}.\end{equation}
Proposition 8.1. Let 
 $w\in F_{r}$
 be a cyclically reduced word. With
$w\in F_{r}$
 be a cyclically reduced word. With 
 $\Phi,T,\Omega,\Omega_{s,u}$
 as in § 4, we have
$\Phi,T,\Omega,\Omega_{s,u}$
 as in § 4, we have 
 \begin{equation}\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})=\frac{1}{m!^{2}}\sum_{(\pi,\pi',F,\Sigma)\in\widetilde{Z}}\widetilde{\mathrm{Wg}}(\Sigma^{2}),\end{equation}
\begin{equation}\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})=\frac{1}{m!^{2}}\sum_{(\pi,\pi',F,\Sigma)\in\widetilde{Z}}\widetilde{\mathrm{Wg}}(\Sigma^{2}),\end{equation}
where
 \[\widetilde{Z}:=\bigg\{ (\pi,\pi',F,\Sigma):\substack{F:\Omega\rightarrow[d],\Sigma\in S_{\Phi}\\ \pi,\pi'\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})\\ F\circ T=F\circ\pi\pi'\circ\Sigma } \bigg\} .\]
\[\widetilde{Z}:=\bigg\{ (\pi,\pi',F,\Sigma):\substack{F:\Omega\rightarrow[d],\Sigma\in S_{\Phi}\\ \pi,\pi'\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})\\ F\circ T=F\circ\pi\pi'\circ\Sigma } \bigg\} .\]
Proof. Similarly to (4.4), we have
 \begin{align*}\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r})) &=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}\sum_{\substack{f:[\ell+1]\times[m]\rightarrow[d]\\f(1,k)=a_{k},f(\ell+1,k)=a_{\pi(k)} }}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{f(u,k),f(u+1,k)}\\& =\sum_{\pi\in\mathrm{Sym}(\{ \ell\}\times[m])}\sum_{F:[\ell]\times[m]\rightarrow[d]}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{F(u,k),F(\widetilde{T}\pi(u,k))}.\end{align*}
\begin{align*}\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r})) &=\frac{1}{m!}\sum_{\overrightarrow{\!a}\in[d]^{m}}\sum_{\pi\in S_{m}}\sum_{\substack{f:[\ell+1]\times[m]\rightarrow[d]\\f(1,k)=a_{k},f(\ell+1,k)=a_{\pi(k)} }}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{f(u,k),f(u+1,k)}\\& =\sum_{\pi\in\mathrm{Sym}(\{ \ell\}\times[m])}\sum_{F:[\ell]\times[m]\rightarrow[d]}\prod_{(u,k)\in[\ell]\times[m]}(X_{w(u)})_{F(u,k),F(\widetilde{T}\pi(u,k))}.\end{align*}
Consequently, as in (4.8), we have
 \[\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})=\frac{1}{m!^{2}}\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}\sum_{F:\Omega\rightarrow[d]}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\pi\pi'\gamma),F(T(\gamma))}.\]
\[\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2})=\frac{1}{m!^{2}}\sum_{(\pi,\pi')\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})}\sum_{F:\Omega\rightarrow[d]}\prod_{\gamma\in\Omega}(X_{\widetilde{w}(\gamma)})_{F(\pi\pi'\gamma),F(T(\gamma))}.\]
The proposition now follows from Corollary 2.15.
 We next define an action of 
 $H:=\prod_{(s,u)\in[2]\times[\ell]}\mathrm{Sym}(\Omega_{s,u})$
 on
$H:=\prod_{(s,u)\in[2]\times[\ell]}\mathrm{Sym}(\Omega_{s,u})$
 on 
 $\widetilde{Z}$
 in the same way as in § 5. For
$\widetilde{Z}$
 in the same way as in § 5. For 
 $(s,u)\in[2]\times([\ell]\backslash\{1\})$
 and
$(s,u)\in[2]\times([\ell]\backslash\{1\})$
 and 
 $\pi_{s,u}\in\mathrm{Sym}(\Omega_{s,u})$
,
$\pi_{s,u}\in\mathrm{Sym}(\Omega_{s,u})$
, 
 \[\pi_{s,u}\cdot(\pi,\pi',F,\Sigma):=(\pi,\pi',F\circ\pi_{s,u}^{-1},\pi_{s,u}\circ\Sigma\circ T^{-1}\pi_{s,u}^{-1}T),\]
\[\pi_{s,u}\cdot(\pi,\pi',F,\Sigma):=(\pi,\pi',F\circ\pi_{s,u}^{-1},\pi_{s,u}\circ\Sigma\circ T^{-1}\pi_{s,u}^{-1}T),\]
 and if 
 $(\pi_{1,1},\pi_{2,1})\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})$
,
$(\pi_{1,1},\pi_{2,1})\in\mathrm{Sym}(\Omega_{1,1})\times\mathrm{Sym}(\Omega_{2,1})$
, 
 \[(\pi_{1,1},\pi_{2,1})\cdot(\pi,\pi',F,\Sigma):=(\pi_{1,1}\pi,\pi_{2,1}\pi',F\circ\pi_{1,1}^{-1}\pi_{2,1}^{-1},\Sigma\circ T^{-1}\pi_{1,1}^{-1}\pi_{2,1}^{-1}T).\]
\[(\pi_{1,1},\pi_{2,1})\cdot(\pi,\pi',F,\Sigma):=(\pi_{1,1}\pi,\pi_{2,1}\pi',F\circ\pi_{1,1}^{-1}\pi_{2,1}^{-1},\Sigma\circ T^{-1}\pi_{1,1}^{-1}\pi_{2,1}^{-1}T).\]
 
Proof of Theorem 1.4. The proof is similar to the proof of Theorem 1.3. The only difference is that now, summing over the H-orbit kills all representations that do not appear in 
 $\mathrm{Ind}_{S_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(1)$
, rather than the representations not in
$\mathrm{Ind}_{S_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(1)$
, rather than the representations not in 
 $\mathrm{Ind}_{S_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})$
. By Lemma 2.3, the irreducible subrepresentations
$\mathrm{Ind}_{S_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(\mathrm{sgn})$
. By Lemma 2.3, the irreducible subrepresentations 
 $\chi_{\lambda}$
 of
$\chi_{\lambda}$
 of 
 $\mathrm{Ind}_{S_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(1)$
 correspond to partitions
$\mathrm{Ind}_{S_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(1)$
 correspond to partitions 
 $\lambda=(\lambda_{1},\ldots,\lambda_{\ell_{i}})$
 with at most
$\lambda=(\lambda_{1},\ldots,\lambda_{\ell_{i}})$
 with at most 
 $\ell_{i}$
 rows, and, therefore,
$\ell_{i}$
 rows, and, therefore, 
 $\prod_{(a,b)\in\lambda}(d+b-a)\geq(d-\ell)^{m\ell_{i}}$
. As in Corollary 5.3 and (5.6), the average of
$\prod_{(a,b)\in\lambda}(d+b-a)\geq(d-\ell)^{m\ell_{i}}$
. As in Corollary 5.3 and (5.6), the average of 
 $\widetilde{\mathrm{Wg}}(\Sigma^{2})$
 over an H-orbit
$\widetilde{\mathrm{Wg}}(\Sigma^{2})$
 over an H-orbit 
 $H\cdot(\widehat{\pi},\widehat{\pi'},\widehat{F},\widehat{\Sigma})$
 is bounded by
$H\cdot(\widehat{\pi},\widehat{\pi'},\widehat{F},\widehat{\Sigma})$
 is bounded by 
 \begin{align}&\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\left|\sum_{\substack{h_{i}\in\prod_{\widetilde{w}=i}\mathrm{Sym}(\Omega_{s,u})\\h'_{i}\in\prod_{\widetilde{w}=-i}\mathrm{Sym}(\Omega_{s,u})}}\mathrm{Wg}\big(h_{i}\widehat{\Sigma}|_{B_{i}}h'_{i}\widehat{\Sigma}|_{A_{i}}\big)\right|\nonumber\\&\quad \leq \frac{1}{m!^{\ell}}\prod_{i=1}^{r}\frac{m!^{\ell_{i}}}{(m\ell_{i})!}\sum_{\lambda\vdash m\ell_{i}:\chi_{\lambda}\subseteq\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(1)}\frac{\chi_{\lambda}(1)\langle\chi_{\lambda},1\rangle_{\mathrm{S}_{m}^{\ell_{i}}}}{\prod_{(a,b)\in\lambda}(d+b-a)}\leq\frac{1}{m!^{\ell}}\frac{1}{(d-\ell)^{m\ell}}.\end{align}
\begin{align}&\frac{1}{m!^{2\ell}}\prod_{i=1}^{r}\left|\sum_{\substack{h_{i}\in\prod_{\widetilde{w}=i}\mathrm{Sym}(\Omega_{s,u})\\h'_{i}\in\prod_{\widetilde{w}=-i}\mathrm{Sym}(\Omega_{s,u})}}\mathrm{Wg}\big(h_{i}\widehat{\Sigma}|_{B_{i}}h'_{i}\widehat{\Sigma}|_{A_{i}}\big)\right|\nonumber\\&\quad \leq \frac{1}{m!^{\ell}}\prod_{i=1}^{r}\frac{m!^{\ell_{i}}}{(m\ell_{i})!}\sum_{\lambda\vdash m\ell_{i}:\chi_{\lambda}\subseteq\mathrm{Ind}_{\mathrm{S}_{m}^{\ell_{i}}}^{S_{m\ell_{i}}}(1)}\frac{\chi_{\lambda}(1)\langle\chi_{\lambda},1\rangle_{\mathrm{S}_{m}^{\ell_{i}}}}{\prod_{(a,b)\in\lambda}(d+b-a)}\leq\frac{1}{m!^{\ell}}\frac{1}{(d-\ell)^{m\ell}}.\end{align}
 Denote 
 $\widetilde{Z}_{\pi,\pi'}:=\{ (F,\Sigma):(\pi,\pi',F,\Sigma)\in\widetilde{Z}\}$
. Since
$\widetilde{Z}_{\pi,\pi'}:=\{ (F,\Sigma):(\pi,\pi',F,\Sigma)\in\widetilde{Z}\}$
. Since 
 $\widetilde{Z}_{\mathrm{Id},\mathrm{Id}}=W'$
, Proposition 6.1 implies that
$\widetilde{Z}_{\mathrm{Id},\mathrm{Id}}=W'$
, Proposition 6.1 implies that 
 \begin{equation}|\widetilde{Z}|=m!^{2}|\widetilde{Z}_{\mathrm{Id},\mathrm{Id}}|=m!^{2}|W'|\leq m!^{2}\binom{d+m\ell}{m\ell}(m\ell)!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\end{equation}
\begin{equation}|\widetilde{Z}|=m!^{2}|\widetilde{Z}_{\mathrm{Id},\mathrm{Id}}|=m!^{2}|W'|\leq m!^{2}\binom{d+m\ell}{m\ell}(m\ell)!\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}.\end{equation}
 As in the proof of Theorem 1.3, if 
 $d\geq m\ell$
, then
$d\geq m\ell$
, then 
 \begin{align*}\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2}) &=\frac{1}{m!^{2}}\sum_{(\pi,\pi',F,\Sigma)\in\widetilde{Z}}\widetilde{\mathrm{Wg}}(\Sigma^{2})\leq|\widetilde{Z}|\frac{1}{m!^{\ell+2}}\frac{1}{(d-\ell)^{m\ell}}\\& \leq\frac{(d+m\ell)\cdots(d+1)}{(d-\ell)^{m\ell}m!^{\ell}}\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\\& \leq4^{m\ell}\ell^{m\ell}\prod_{0\neq k=-r}^{r}\binom{m\ell_{k}}{m\ell_{k}/2}4^{m\ell}\ell^{m\ell}2^{2m\ell}=(16\ell)^{m\ell}.\qquad \qquad \end{align*}
\begin{align*}\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w(X_{1},\ldots,X_{r}))|^{2}) &=\frac{1}{m!^{2}}\sum_{(\pi,\pi',F,\Sigma)\in\widetilde{Z}}\widetilde{\mathrm{Wg}}(\Sigma^{2})\leq|\widetilde{Z}|\frac{1}{m!^{\ell+2}}\frac{1}{(d-\ell)^{m\ell}}\\& \leq\frac{(d+m\ell)\cdots(d+1)}{(d-\ell)^{m\ell}m!^{\ell}}\prod_{0\neq k=-r}^{r}\frac{(m\ell_{k})!}{\big(\sum_{i<k}m\ell_{i,k}\big)!}\\& \leq4^{m\ell}\ell^{m\ell}\prod_{0\neq k=-r}^{r}\binom{m\ell_{k}}{m\ell_{k}/2}4^{m\ell}\ell^{m\ell}2^{2m\ell}=(16\ell)^{m\ell}.\qquad \qquad \end{align*}
Appendix A. Fourier coefficients of the power word and a Diaconis–Shahshahani-type result
In this appendix, we formulate two results. The first is a computation of the Fourier coefficients of the power word 
 $w=x^{l}$
 for representations
$w=x^{l}$
 for representations 
 $\rho_{\lambda}\in\mathrm{Irr}\big(\mathrm{U}_{d}\big)$
, where
$\rho_{\lambda}\in\mathrm{Irr}\big(\mathrm{U}_{d}\big)$
, where 
 $\widetilde{\lambda}$
 (see Remark 2.6) has at most
$\widetilde{\lambda}$
 (see Remark 2.6) has at most 
 $\frac{d}{2l}$
 boxes. The second is a Diaconis–Shahshahani-type result for the mth coefficient of the characteristic polynomial of a word w in random unitary matrices. Both statements are consequences of known results.
$\frac{d}{2l}$
 boxes. The second is a Diaconis–Shahshahani-type result for the mth coefficient of the characteristic polynomial of a word w in random unitary matrices. Both statements are consequences of known results.
Proposition A.1. Let 
 $w=x^{l}$
 be the lth power word. Then, for every
$w=x^{l}$
 be the lth power word. Then, for every 
 $m\in\mathbb{N}$
 and every
$m\in\mathbb{N}$
 and every 
 $d\geq2ml$
:
$d\geq2ml$
:
- 
(1) we have for all \[\mathbb{E}(|\rho_{\lambda}\circ w|^{2})=\frac{1}{m!}\sum_{\sigma\in S_{m}}l^{\ell(\sigma)}|\chi_{\lambda}(\sigma)|^{2},\] \[\mathbb{E}(|\rho_{\lambda}\circ w|^{2})=\frac{1}{m!}\sum_{\sigma\in S_{m}}l^{\ell(\sigma)}|\chi_{\lambda}(\sigma)|^{2},\] $\lambda\vdash m$
; in particular, $\lambda\vdash m$
; in particular, $\mathbb{E}(|\rho_{\lambda}\circ w|^{2})\leq l^{m}$
; $\mathbb{E}(|\rho_{\lambda}\circ w|^{2})\leq l^{m}$
;
- 
(2) we have  \[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w\Big)\!\Big|^{2}\Big)=\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w)|^{2})=\binom{l+m-1}{m}.\] \[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w\Big)\!\Big|^{2}\Big)=\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w)|^{2})=\binom{l+m-1}{m}.\]
Proof. For every matrix 
 $A\in\mathrm{U}_{d}$
 and every
$A\in\mathrm{U}_{d}$
 and every 
 $\mu\vdash m$
, set
$\mu\vdash m$
, set 
 \begin{equation}\mathrm{tr}_{\mu}(A):=\prod_{j=1}^{m}\mathrm{tr}(A^{j})^{a_{j}},\end{equation}
\begin{equation}\mathrm{tr}_{\mu}(A):=\prod_{j=1}^{m}\mathrm{tr}(A^{j})^{a_{j}},\end{equation}
 where 
 $\mu=(1^{a_{1}}\cdots m^{a_{m}})$
 is the partition
$\mu=(1^{a_{1}}\cdots m^{a_{m}})$
 is the partition 
 $m=\underset{a_{1}\text{times}}{\underbrace{(1+\cdots+1)}}+\cdots+\underset{a_{m}\text{times}}{\underbrace{(m+\cdots+m)}}$
. The functions
$m=\underset{a_{1}\text{times}}{\underbrace{(1+\cdots+1)}}+\cdots+\underset{a_{m}\text{times}}{\underbrace{(m+\cdots+m)}}$
. The functions 
 $\mathrm{tr}_{\mu}$
 correspond to the power-sum symmetric functions
$\mathrm{tr}_{\mu}$
 correspond to the power-sum symmetric functions 
 $p_{\mu}$
. Given
$p_{\mu}$
. Given 
 $\lambda\vdash m$
, the character
$\lambda\vdash m$
, the character 
 $\rho_{\lambda}(A)$
 is a Schur polynomial in the eigenvalues of A, and, hence, it can be expressed in terms of
$\rho_{\lambda}(A)$
 is a Schur polynomial in the eigenvalues of A, and, hence, it can be expressed in terms of 
 $\mathrm{tr}_{\mu}(A)$
 via the following formula (see, e.g., [Reference MacdonaldMac95, I.7, p. 114]),
$\mathrm{tr}_{\mu}(A)$
 via the following formula (see, e.g., [Reference MacdonaldMac95, I.7, p. 114]), 
 \begin{equation}\rho_{\lambda}(A)=\sum_{\mu\vdash m}\frac{\chi_{\lambda}(\mu)}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}\cdot\mathrm{tr}_{\mu}(A),\end{equation}
\begin{equation}\rho_{\lambda}(A)=\sum_{\mu\vdash m}\frac{\chi_{\lambda}(\mu)}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}\cdot\mathrm{tr}_{\mu}(A),\end{equation}
 where 
 $\chi_{\lambda}(\mu)$
 is the value of the character
$\chi_{\lambda}(\mu)$
 is the value of the character 
 $\chi_{\lambda}\in\mathrm{Irr}(S_{m})$
 on the elements with cycle type
$\chi_{\lambda}\in\mathrm{Irr}(S_{m})$
 on the elements with cycle type 
 $\mu$
. In addition, by (1.2), for every pair of partitions
$\mu$
. In addition, by (1.2), for every pair of partitions 
 $\mu=(1^{a_{1}}\cdots m^{a_{m}})$
 and
$\mu=(1^{a_{1}}\cdots m^{a_{m}})$
 and 
 $\mu'=(1^{b_{1}}\cdots m^{b_{m}})$
 of m, we have
$\mu'=(1^{b_{1}}\cdots m^{b_{m}})$
 of m, we have 
 \begin{equation}\mathbb{E}\big(\mathrm{tr}_{\mu}(X^{l})\mathrm{tr}_{\mu'}(\overline{X}^{l})\big)=\mathbb{E}\bigg(\prod_{j=1}^{m}\mathrm{tr}(X^{jl})^{a_{j}}\mathrm{tr}(\overline{X}^{jl})^{b_{j}}\bigg)=\delta_{\mu,\mu'}\prod_{j=1}^{m}(jl)^{a_{j}}a_{j}!.\end{equation}
\begin{equation}\mathbb{E}\big(\mathrm{tr}_{\mu}(X^{l})\mathrm{tr}_{\mu'}(\overline{X}^{l})\big)=\mathbb{E}\bigg(\prod_{j=1}^{m}\mathrm{tr}(X^{jl})^{a_{j}}\mathrm{tr}(\overline{X}^{jl})^{b_{j}}\bigg)=\delta_{\mu,\mu'}\prod_{j=1}^{m}(jl)^{a_{j}}a_{j}!.\end{equation}
 Combining (A.2) and (A.3), and using the fact that the number of permutations 
 $\sigma\in S_{m}$
 of cycle type
$\sigma\in S_{m}$
 of cycle type 
 $\mu=(1^{a_{1}}\cdots m^{a_{m}})$
 is
$\mu=(1^{a_{1}}\cdots m^{a_{m}})$
 is 
 $\frac{m!}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}$
, we obtain
$\frac{m!}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}$
, we obtain 
 \begin{align}\mathbb{E}(|\rho_{\lambda}(X^{l})|^{2}) & =\sum_{\mu\vdash m}|\chi_{\lambda}(\mu)|^{2}\frac{\mathbb{E}(|\mathrm{tr}_{\mu}(X^{l})|^{2})}{\big(\prod_{j=1}^{m}a_{j}!j^{a_{j}}\big)^{2}}=\sum_{\mu\vdash m}\frac{l^{\ell(\mu)}|\chi_{\lambda}(\mu)|^{2}}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}\nonumber\\[8pt]& =\frac{1}{m!}\sum_{\mu\vdash m}\frac{m!}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}l^{\ell(\mu)}|\chi_{\lambda}(\mu)|^{2}=\frac{1}{m!}\sum_{\sigma\in S_{m}}l^{\ell(\sigma)}|\chi_{\lambda}(\sigma)|^{2}.\end{align}
\begin{align}\mathbb{E}(|\rho_{\lambda}(X^{l})|^{2}) & =\sum_{\mu\vdash m}|\chi_{\lambda}(\mu)|^{2}\frac{\mathbb{E}(|\mathrm{tr}_{\mu}(X^{l})|^{2})}{\big(\prod_{j=1}^{m}a_{j}!j^{a_{j}}\big)^{2}}=\sum_{\mu\vdash m}\frac{l^{\ell(\mu)}|\chi_{\lambda}(\mu)|^{2}}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}\nonumber\\[8pt]& =\frac{1}{m!}\sum_{\mu\vdash m}\frac{m!}{\prod_{j=1}^{m}a_{j}!j^{a_{j}}}l^{\ell(\mu)}|\chi_{\lambda}(\mu)|^{2}=\frac{1}{m!}\sum_{\sigma\in S_{m}}l^{\ell(\sigma)}|\chi_{\lambda}(\sigma)|^{2}.\end{align}
 The second claim of item (1) follows from Schur orthogonality and the inequality 
 $l^{\ell(\sigma)}\leq l^{m}$
.
$l^{\ell(\sigma)}\leq l^{m}$
.
 For item (2), note that 
 $\mathrm{tr}\big(\!\bigwedge\nolimits^{\!m}w\big)=\rho_{(1^{m})}\circ w$
 and
$\mathrm{tr}\big(\!\bigwedge\nolimits^{\!m}w\big)=\rho_{(1^{m})}\circ w$
 and 
 $\mathrm{tr}(\mathrm{Sym}^{m}w)=\rho_{(m^{1})}\circ w$
. The corresponding characters of
$\mathrm{tr}(\mathrm{Sym}^{m}w)=\rho_{(m^{1})}\circ w$
. The corresponding characters of 
 $S_{m}$
 are the sign and the trivial characters. Thus, (A.4) becomes
$S_{m}$
 are the sign and the trivial characters. Thus, (A.4) becomes 
 \[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w\Big)\!\Big|^{2}\Big)=\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w)|^{2})=\mathbb{E}_{S_{m}}(l^{\ell(\sigma)})=\frac{1}{m!}\sum_{k=1}^{m}\left[\begin{array}{c}m\\ k\end{array}\right]l^{k}=\binom{l+m-1}{m},\]
\[\mathbb{E}\Big(\Big|\mathrm{tr}\Big(\!\bigwedge\nolimits^{\!\!m}w\Big)\!\Big|^{2}\Big)=\mathbb{E}(|\mathrm{tr}(\mathrm{Sym}^{m}w)|^{2})=\mathbb{E}_{S_{m}}(l^{\ell(\sigma)})=\frac{1}{m!}\sum_{k=1}^{m}\left[\begin{array}{c}m\\ k\end{array}\right]l^{k}=\binom{l+m-1}{m},\]
 where 
 $\left[\begin{smallmatrix} m\\ k\end{smallmatrix}\right]$
 is the number of permutations of m elements with exactly k disjoint cycles, also known as the unsigned Stirling number of the first kind. The last equality follows, for example, from [Reference Graham, Knuth and PatashnikGKP94, Equation (6.11)]. This concludes item (2).
$\left[\begin{smallmatrix} m\\ k\end{smallmatrix}\right]$
 is the number of permutations of m elements with exactly k disjoint cycles, also known as the unsigned Stirling number of the first kind. The last equality follows, for example, from [Reference Graham, Knuth and PatashnikGKP94, Equation (6.11)]. This concludes item (2).
We next prove a Diaconis–Shahshahani-type result. We first recall the following proposition, which is a consequence of [Reference Mingo, ś niady and SpeicherMSS07, Theorem 2] and [R, Theorem 4.1] (see also [Reference Magee and PuderMP19, Corollary 1.13]).
Proposition A.2. Let 
 $w\in F_{r}$
, and let
$w\in F_{r}$
, and let 
 $\mu=(1^{a_{1}}\cdots m^{a_{m}})$
,
$\mu=(1^{a_{1}}\cdots m^{a_{m}})$
, 
 $\mu'=(1^{b_{1}}\cdots m^{b_{m}})$
 be partitions of m. Let
$\mu'=(1^{b_{1}}\cdots m^{b_{m}})$
 be partitions of m. Let 
 $p(w)\in\mathbb{N}$
 be such that
$p(w)\in\mathbb{N}$
 be such that 
 $w=u^{p(w)}$
 with
$w=u^{p(w)}$
 with 
 $u\in F_{r}$
 a non-power. Then,
$u\in F_{r}$
 a non-power. Then, 
 \begin{equation}\lim_{d\rightarrow\infty} \mathbb{E}_{\mathrm{U}_{d}}(\mathrm{tr}_{\mu}(w)\mathrm{tr}_{\mu'}(w^{-1}))=\lim_{d\rightarrow\infty} \mathbb{E}_{\mathrm{U}_{d}}\bigg(\prod_{j=1}^{m}\mathrm{tr}(w^{j})^{a_{j}}\mathrm{tr}(w^{-j})^{b_{j}}\bigg)=\delta_{\mu,\mu'}\prod_{j=1}^{m}a_{j}!(jp(w))^{a_{j}}.\end{equation}
\begin{equation}\lim_{d\rightarrow\infty} \mathbb{E}_{\mathrm{U}_{d}}(\mathrm{tr}_{\mu}(w)\mathrm{tr}_{\mu'}(w^{-1}))=\lim_{d\rightarrow\infty} \mathbb{E}_{\mathrm{U}_{d}}\bigg(\prod_{j=1}^{m}\mathrm{tr}(w^{j})^{a_{j}}\mathrm{tr}(w^{-j})^{b_{j}}\bigg)=\delta_{\mu,\mu'}\prod_{j=1}^{m}a_{j}!(jp(w))^{a_{j}}.\end{equation}
Since the joint moments of 
 $\mathrm{tr}(w^{1}),\ldots,\mathrm{tr}(w^{m})$
 converge, as
$\mathrm{tr}(w^{1}),\ldots,\mathrm{tr}(w^{m})$
 converge, as 
 $d\rightarrow\infty$
, to the joint moments of independent complex normal random variables, an application of the moment method (as was done in [Reference Diaconis and ShahshahaniDS94] for
$d\rightarrow\infty$
, to the joint moments of independent complex normal random variables, an application of the moment method (as was done in [Reference Diaconis and ShahshahaniDS94] for 
 $w=x$
, and later in [Reference RădulescuRăd06, Reference Mingo, ś niady and SpeicherMSS07] for a general word) implies the following.
$w=x$
, and later in [Reference RădulescuRăd06, Reference Mingo, ś niady and SpeicherMSS07] for a general word) implies the following.
Corollary A.3 (see [Reference RădulescuRăd06, Theorem 4.1] and [Reference Mingo, ś niady and SpeicherMSS07, Theorem 2]). The random variables 
 $\mathrm{tr}(w^{1}),\ldots,\mathrm{tr}(w^{m})$
 converge in distribution to
$\mathrm{tr}(w^{1}),\ldots,\mathrm{tr}(w^{m})$
 converge in distribution to 
 $\sqrt{p(w)}Z_{1},\ldots,\sqrt{mp(w)}Z_{m}$
, as
$\sqrt{p(w)}Z_{1},\ldots,\sqrt{mp(w)}Z_{m}$
, as 
 $d\rightarrow\infty$
, where
$d\rightarrow\infty$
, where 
 $Z_{1},\ldots,Z_{m}$
 are independent complex normal variables.
$Z_{1},\ldots,Z_{m}$
 are independent complex normal variables.
In [Reference Diaconis and GamburdDG06], Diaconis and Gamburd combined Corollary A.3 for 
 $w=x$
 (namely [Reference Diaconis and ShahshahaniDS94]), together with Newton’s identities relating elementary and power-sum symmetric functions to give a formula for the limit behavior of the random variables
$w=x$
 (namely [Reference Diaconis and ShahshahaniDS94]), together with Newton’s identities relating elementary and power-sum symmetric functions to give a formula for the limit behavior of the random variables 
 $\mathrm{tr}\bigwedge\nolimits^{\!m}X$
 with X is a random unitary matrix in
$\mathrm{tr}\bigwedge\nolimits^{\!m}X$
 with X is a random unitary matrix in 
 $\mathrm{U}_{d}$
. Repeating the argument for a general word w yields the following description of
$\mathrm{U}_{d}$
. Repeating the argument for a general word w yields the following description of 
 $\underset{d\rightarrow\infty}{\lim}\mathrm{tr}_{\mathrm{U}_{d}}\bigwedge\nolimits^{\!m}w$
.
$\underset{d\rightarrow\infty}{\lim}\mathrm{tr}_{\mathrm{U}_{d}}\bigwedge\nolimits^{\!m}w$
.
Corollary A.4 (cf. [Reference Diaconis and GamburdDG06, Proposition 4]). Let 
 $w\in F_{r}$
 be a word and let
$w\in F_{r}$
 be a word and let 
 $m\in\mathbb{N}$
. Then the sequence of random variables
$m\in\mathbb{N}$
. Then the sequence of random variables 
 $\mathrm{tr}_{\mathrm{U}_{d}}\bigwedge\nolimits^{\!m}w$
 converges in distribution, as
$\mathrm{tr}_{\mathrm{U}_{d}}\bigwedge\nolimits^{\!m}w$
 converges in distribution, as 
 $d\rightarrow\infty$
, to the polynomial in the normal variables
$d\rightarrow\infty$
, to the polynomial in the normal variables 
 $Z_{1},\ldots,Z_{m}$
 given by
$Z_{1},\ldots,Z_{m}$
 given by 
 \[\frac{1}{m!}\det\left(\begin{array}{c@{\hskip8pt}c@{\hskip8pt}c@{\hskip8pt}c@{\hskip8pt}c}\sqrt{p(w)}Z_{1} & 1 & 0 & \ldots & 0\\\sqrt{2p(w)}Z_{2} & \sqrt{p(w)}Z_{1} & 2 & \ldots & 0\\\vdots & \vdots & \vdots & \ddots & \vdots\\\sqrt{(m-1)p(w)}Z_{m-1} & \sqrt{(m-2)p(w)}Z_{m-2} & \sqrt{(m-3)p(w)}Z_{m-3} & \ldots & (m-1)\\\sqrt{mp(w)}Z_{m} & \sqrt{(m-1)p(w)}Z_{m-1} & \sqrt{(m-2)p(w)}Z_{m-2} & \ldots & \sqrt{p(w)}Z_{1}\end{array}\right).\]
\[\frac{1}{m!}\det\left(\begin{array}{c@{\hskip8pt}c@{\hskip8pt}c@{\hskip8pt}c@{\hskip8pt}c}\sqrt{p(w)}Z_{1} & 1 & 0 & \ldots & 0\\\sqrt{2p(w)}Z_{2} & \sqrt{p(w)}Z_{1} & 2 & \ldots & 0\\\vdots & \vdots & \vdots & \ddots & \vdots\\\sqrt{(m-1)p(w)}Z_{m-1} & \sqrt{(m-2)p(w)}Z_{m-2} & \sqrt{(m-3)p(w)}Z_{m-3} & \ldots & (m-1)\\\sqrt{mp(w)}Z_{m} & \sqrt{(m-1)p(w)}Z_{m-1} & \sqrt{(m-2)p(w)}Z_{m-2} & \ldots & \sqrt{p(w)}Z_{1}\end{array}\right).\]
Example A.5. Let 
 $m=3$
. Then for every Borel set
$m=3$
. Then for every Borel set 
 $A\subseteq\mathbb{C}$
,
$A\subseteq\mathbb{C}$
, 
 \[\lim_{d\rightarrow\infty} \mathbb{P}\bigg(\mathrm{tr}_{\mathrm{U}_{d}}\bigwedge\nolimits^{\!3}w(X_{1},\ldots,X_{r})\in A\bigg)=\mathbb{P}(f(Z_{1},Z_{2},Z_{3})\in A),\]
\[\lim_{d\rightarrow\infty} \mathbb{P}\bigg(\mathrm{tr}_{\mathrm{U}_{d}}\bigwedge\nolimits^{\!3}w(X_{1},\ldots,X_{r})\in A\bigg)=\mathbb{P}(f(Z_{1},Z_{2},Z_{3})\in A),\]
 where 
 $Z_{1},Z_{2},Z_{3}$
 are independent and identically distributed normal variables, and
$Z_{1},Z_{2},Z_{3}$
 are independent and identically distributed normal variables, and 
 \[f(Z_{1},Z_{2},Z_{3})=\frac{p(w)^{3/2}}{6}Z_{1}^{3}-\frac{p(w)}{\sqrt{2}}Z_{1}Z_{2}+\frac{p(w)^{1/2}}{\sqrt{3}}Z_{3}.\]
\[f(Z_{1},Z_{2},Z_{3})=\frac{p(w)^{3/2}}{6}Z_{1}^{3}-\frac{p(w)}{\sqrt{2}}Z_{1}Z_{2}+\frac{p(w)^{1/2}}{\sqrt{3}}Z_{3}.\]
Acknowledgements
We thank Rami Aizenbud, Yotam Hendel, Michael Larsen, Michael Magee, Doron Puder, Yotam Shomroni, Ofer Zeitouni and Steve Zelditch for useful conversations. We thank the referees for their useful comments and for improving the readability of the paper.
Conflicts of interest
None.
Financial support
NA was supported by NSF grant DMS–1902041, IG was supported by AMS–Simons travel grant, and both of us were supported by BSF grant 2018201.
Journal information
Compositio Mathematica is owned by the Foundation Compositio Mathematica and published by the London Mathematical Society in partnership with Cambridge University Press. All surplus income from the publication of Compositio Mathematica is returned to mathematics and higher education through the charitable activities of the Foundation, the London Mathematical Society and Cambridge University Press.
 
 










 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
