1 Introduction
Mean-field quantum spin systems are ubiquitous in effective descriptions of a variety of phenomena. A popular example is the family of Lipkin–Meshkov–Glick Hamiltonians, which were originally conceived in [Reference Lipkin, Meshkov and Glick34, Reference Meshkov, Glick and Lipkin36, Reference Glick, Lipkin and Meshkov21] to explain shape transitions in nuclei but also feature in descriptions of bosons in a double well and quantum-spin tunnelling in molecular magnets [Reference Cirac, Lewenstein, Mølmer and Zoller15, Reference Bartolomé, Luis and Fernández5]. This family includes the quantum Curie–Weiss Hamiltonian, whose simplicity continues to draw the attention of many communities [Reference Brout, Müller and Thomas11, Reference Ribeiro, Vidal and Mosseri46, Reference Chayes, Crawford, Ioffe and Levit14, Reference Landsman, Moretti and van de Ven31, Reference van de Ven, Groenenboom, Reuvers and Landsman52, Reference Björnberg, Fröhlich and Ueltschi9, Reference Bulekov12]. In particular, in [Reference Bapst and Semerjian4] such models were used to test conjectures related to quantum annealing for which information about the spectral gap is crucial.
Most mean-field spin Hamiltonians in the literature are defined in terms of a noncommuting self-adjoint polynomial of the three components of the total spin-vector $ \mathbf {S} = \sum _{n=1}^N \mathbf {S}(n) $ . For a system of $ N $ interacting qubits, the Hilbert space on which these operators act is the tensor product $ \mathcal {H}_N = \bigotimes _{n=1}^N \mathbb {C}^2 $ . The vectors stand for the natural lift of the spin vectors $ \mathbf {s} = (s_x, s_y, s_y ) $ to the $ n $ -th component of the tensor product. On each copy of $ \mathbb {C}^2 $ , the spin vector coincides with the three generators of $ SU(2) $ :
By definition, a noncommuting self-adjoint polynomial $ \textrm {P}\Big (\tfrac {2}{N} \mathbf {S} \Big ) $ is a finite linear combination of products of the three rescaled components of the total spin-vector in which each product is averaged (corresponding to Weyl ordering) under the three components so that it becomes a self-adjoint operator. This renders the associated mean-field Hamiltonian
self-adjoint on $ \mathcal {H}_N $ . The dependence on the particle number is twofold and quintessential for the mean-field nature. The scaling of the spin ensures that the operator is norm-bounded by one, $ \left \| \tfrac {2}{N} S_{\xi } \right \| \leq 1 $ for all $ \xi = x,y, z $ . Moreover, the prefactor $ N $ forces the energy $ H $ to be extensive.
For example, the Lipkin–Meshkov–Glick model is given by $ P( \mathbf {m}) = -\alpha m_y^2 - \beta m_z^2 -\gamma m_x $ with $ \alpha ,\beta , \gamma \in \mathbb {R} $ . The special case $ \alpha = 0 $ , $ \beta = 1 $ corresponds to the quantum Curie–Weiss model with $ \gamma $ playing the role of the transversal, external magnetic field. In [Reference Bapst and Semerjian4], the p-spin generalization of the Curie–Weiss model has been considered, for which $ P( \mathbf {m}) = - \beta m_z^p -\gamma m_x $ with $ p \in \mathbb {N} $ . In these examples, the monomials do not involve mixed products, which makes the Weyl-ordering obsolete. To illustrate this issue, consider $ P( \mathbf {m}) = m_x m_y $ , which then results in $ \textrm {P}\big (2 \mathbf {S} /N \big ) =2 \big ( \mathbf {S}_x \mathbf {S}_y + \mathbf {S}_y \mathbf {S}_x \big ) / N^2$ .
Since $ H $ in Equation (1.1) is a function of the total spin $ \mathbf {S} $ , it is block diagonal with respect to the decomposition of the tensor-product Hilbert space according to the irreducible representations of the total spin corresponding to the eigenspaces of $ \mathbf {S}^2 $ with eigenvalues $ J(J+1) $ , that is,
The total spin $ J $ of $ N $ qubits can take any value from $ N/2 $ down in integers to either $ 1/2 $ if $ N $ is odd or $ 0 $ if $ N $ is even. The degeneracy of the representation of spin $ J $ in this decomposition is $ M_{N,J} $ [Reference Mihailov39]. On each block $ (J, \alpha ) $ , the Hamiltonian (1.1) then acts as the given polynomial of the generators of the irreducible representation of $ SU(2) $ on $ \mathbb {C}^{2J+1} $ .
The analysis of such systems in the limit of large spin quantum number $ J$ is known [Reference Lieb33, Reference Hepp and Lieb27, Reference Michoel and Nachtergaele37, Reference Michoel and Nachtergaele38, Reference Biskup, Chayes and Starr7] to be facilitated by Bloch coherent states on the Hilbert space $ \mathbb {C}^{2J+1} $ . They are parametrized by an angle $\Omega =(\theta , \varphi ) $ on the unit sphere $ S^2 $ with $ 0\leq \theta \leq \pi $ , $ 0 \leq \varphi \leq 2 \pi $ . In bra-ket-notation, which we will use in this paper, the Bloch-coherent states are given by
The reference vector $ \big | J \rangle \in \mathbb {C}^{2J+1} $ is the normalized eigenstate of the $ z $ -component of the spin corresponding to (maximal) eigenvalue $ J $ on the Hilbert space $ \mathbb {C}^{2J+1} $ . The operators $ S_{\pm } = S_x \pm i S_y $ are the spin raising and lowering operators of the irreducible representation of $ SU(2) $ on $ \mathbb {C}^{2J+1} $ .
Bloch coherent states have many remarkable features. First and foremost, they form an overcomplete set of vectors as expressed through the resolution of unity on $ \mathbb {C}^{2J+1} $ :
Every linear operator $ G $ on $ \mathbb {C}^{2J+1} $ is associated with a lower and upper symbol. The lower symbol is $ G(\Omega ,J) := \langle \Omega , J \big | G \big | \Omega , J \rangle $ , and the upper symbol is characterized through the property
The choice of $ g $ is not unique. For example, through an explicit expression [Reference Kutzner30], one sees that there is always an arbitrarily often differentiable choice, $ g(\cdot , J) \in C^{\infty }(S^2) $ . More properties of coherent states are collected in Appendix A; see also [Reference Arecchi, Courtens, Gilmore and Thomas3, Reference Perelomov42, Reference Gazeau20, Reference Combescure and Robert16].
1.1 Semiclassics for the free energy
The lower and upper symbol feature prominently in Berezin and Lieb’s semiclassical bounds [Reference Berezin6, Reference Lieb33, Reference Simon47] on the partition function associated with a self-adjoint Hamiltonian $ G $ on $ \mathbb {C}^{2J+1} $ :
In the semiclassical limit of large spin quantum number $ J $ , these bounds are known to asymptotically coincide [Reference Lieb33, Reference Simon47, Reference Duffield17]. In the same spirit, for any polynomial of the spin operator as in Equation (1.1) restricted to $ \mathbb {C}^{2J+1} $ both the upper and lower symbols agree to leading order in $ N $ with the corresponding classical polynomial function on the unit ball $ B_1 $ , which parametrises the Hilbert space (1.2) semiclassically. Using spherical coordinates $ \mathbf {e}(\Omega ) = \left (\sin \theta \cos \varphi , \sin \theta \sin \varphi , \cos \theta \right ) \in S^2 $ , one has
for the operator norm $ \| \cdot \| $ on $ \mathbb {C}^{2J+1} $ . We use the Landau $ \mathcal {O} $ -notation, that is, the error on the right is bounded by $ C N^{-1} $ with a constant $ C $ which only depends on the coefficients of the polynomial. This statement is a quantitative version of Duffield’s theorem [Reference Duffield17]. Since it is hard to locate general quantitative error estimates, we include a proof as Proposition A.4 in the appendix. As is recalled in Proposition A.3, the lower symbol then shares the same classical asymptotics
The Berezin–Lieb inequalities (1.6) immediately imply that the free energy of the Hamiltonian (1.1) is determined by minimizing a variational functional involving the classical energy $ \textrm {P} $ on the unit ball and the (shifted) binary entropy
A straightforward, rigorous saddle-point evaluation, which we spell out in the proof of Proposition A.7 – a generalisation to mean-field models with regular symbols – yields the pressure as a min-max variational principle on $ B_1 $ .
Proposition 1.1. For a mean-field Hamiltonian $ H = N \ \textrm {P}\Big (\tfrac {2}{N} \mathbf {S} \Big ) $ with a noncommuting self-adjoint polynomial $ \textrm {P} $ , the pressure for any $ \beta> 0 $ is given by
As a special case with constant field $ P(\mathbf {m}) = -\lambda m_z $ , $ \lambda \geq 0 $ , one obtains the Legendre relation
By inverting this Legendre transform, $ I(r) = \min _{\lambda \geq 0 } \left [ \ln 2 \cosh \left ( \beta \lambda \right ) - \beta \lambda r\right ] $ , one may rewrite Equation (1.8) in the slightly more familiar form
Investigations of the free energy have been of great interests over many decades [Reference Brout, Müller and Thomas11, Reference Lieb33, Reference Fannes, Spohn and Verbeure18, Reference Cegła, Lewis and Raggio13, Reference Raggio and Werner44, Reference Duffield17, Reference Chayes, Crawford, Ioffe and Levit14, Reference van de Ven51]. They are, however, not the main focus of this paper. We therefore conclude this topic with only one brief comment on the literature.
Among the numerous results, it is worth mentioning that alternatively to the sketched approach via the Berezin–Lieb inequalities, the formula (1.8) may be derived exploiting the exchange symmetry of the mean-field Hamiltonian using a version of the quantum de Finetti theorem. This road was taken by Fannes, Spohn and Verbeure [Reference Fannes, Spohn and Verbeure18]. Their result essentially covers all Hamiltonians $ H = \sum _{p=1}^m A^{(p)} $ on $ \mathcal {H}_N $ with exchange-symmetric $ p $ -spin interactions $ A^{(p)} $ and yields
For a mean-field system of $ N $ spin- $1/2 $ , the supremum is over states which are parametrized by the Bloch sphere, , and whose entropy is $ E(\varrho ) = - {\operatorname {Tr}\,} \varrho \ln \varrho = I(r) $ . As an aside, we note that this powerful de Finetti approach generalizes from spin- $1/2 $ to spin- $ s $ [Reference Fannes, Spohn and Verbeure18], and then covers the results on the free energy obtained in [Reference Björnberg8, Reference Björnberg, Rosengren and Ryan10]. For example, for the exchange Hamiltonian on for which ${\operatorname {Tr}\,} T \varrho \otimes \varrho = \sum _{i,j=1}^d \lambda _i \lambda _j {\operatorname {Tr}\,} T |u_i \rangle \langle u_i | \otimes |u_j \rangle \langle u_j | = \sum _{i=1}^d \lambda _i^2 $ , one immediately gets $p(\beta ) = \sup _{\mathbf {\lambda } \in \Delta ^{d}} \sum _{i=1}^d(- \lambda _i \ln \lambda _i -\beta \lambda _i^2 )$ , where $ \mathbf {\lambda } = (\lambda _1,\dots , \lambda _d) \in \Delta ^{d} $ is the vector of eigenvalues of the state $ \varrho $ on .
1.2 Spectral gap from semiclassics plus fluctuations
The main result of this paper is a simple quasi-classical explanation and formulae for the low-energy part of the spectrum of a self-adjoint mean-field operator $ H $ as in Equation (1.1) in the limit $ N \to \infty $ . We will denote by $ E_0(H) \leq E_1(H) \leq E_2(H) \leq \dots $ the ordered sequence of its eigenvalues counted with multiplicities. In particular, the existence and leading asymptotic value of the spectral gap
can be read of from the location of the minimum $ \mathbf {m}_0 $ of the polynomial $ P: \mathbb {R}^3 \to \mathbb {R} $ restricted to the unit ball $ B_1 $ . In case the minimum is unique and located on the surface $ S^2 $ , the operator (1.1) generically has a spectral gap. To leading order in $ N $ the value of this gap is in fact completely determined by the coefficients of the quadratic polynomial which is uniquely associated with $ P $ . In view of the notorious difficulty of determining the spectral gap in quantum lattice systems [Reference Haldane24, Reference Affleck and Lieb2, Reference Affleck1, Reference Koma, Nachtergaele and Starr29], this simplicity might be somewhat surprising.
Broadly speaking, our results are in accordance with the general belief of fluctuation theory that the second-order approximation to $ P $ , which involves the gradient $ \nabla P( \mathbf {m}_0 ) $ and the Hessian $ D_P( \mathbf {m}_0 )= \left ( \partial _j \partial _k P( \mathbf {m}_0 ) \right )_{j,k=1}^3 $ yields the description of the low-energy spectra. Related statements have been proven in the context of mean-field Bose systems (see, e.g., [Reference Grech and Seiringer22, Reference Lewin, Nam, Serfaty and Solovej32]). For the precise formulation of such a result for quantum spin systems and in order to point out a subtlety caused by the geometry, we need some basic geometric facts on functions on $ B_1 $ .
If $ \mathbf {m}_0 \in S^2 $ is a minimum of $ P $ on $ B_1 $ , the gradient either vanishes or points towards the center of the ball, $ \nabla P( \mathbf {m}_0 ) = - | \nabla P( \mathbf {m}_0 ) | \ \mathbf {m}_0 $ . The quadratic approximation of the polynomial is then given by $ D_P( \mathbf {m}_0 ) $ projected on the directions perpendicular to $ \mathbf {m}_0 $ . In terms of the normalized directional vector
which is understood as a linear projection map on $ \mathbb {R}^3 $ . Introducing a local chart $ \Phi : \mathbb {R}^2 \to T_{\mathbf {m}_0} S^2 $ , the linear map on $ \operatorname {\mathrm {ran}} Q_{\perp } \equiv T_{\mathbf {m}_0} S^2 $ given by
is then the quadratic approximation to $ P \circ \Phi $ at $ \mathbf {m}_0 $ . The shift of the Hessian in Cartesian coordinates by the norm of the gradient $|\nabla P( \mathbf {m}_0 ) |$ is thus an effect of the constraint due to the spherical geometry.
Theorem 1.2. Let $ H $ be a self-adjoint operator on $ \mathcal {H}_N $ of the form (1.1) with a noncommuting self-adjoint polynomial $ P: \mathbb {R}^3 \to \mathbb {R} $ . Suppose that the minimum of $ P $ restricted to the unit ball $ B_1 $ is unique and located at a point $ \mathbf {m}_0 \in S^2 $ on the unit sphere. Then,
is the spectral gap above the unique ground state in case the right-hand side is strictly positive.
This theorem is a special case of Theorem 2.3, which deals with mean-field spin Hamiltonians with more general regular symbols than just polynomials. Theorem 2.3 also has the corresponding description of the leading asymptotics of the entire low-energy spectrum in terms of fluctuation operators in directions $ \operatorname {\mathrm {ran}} Q_{\perp } $ . By Theorem 2.1, these more detailed results also apply to the polynomial case. To the best of our knowledge, all these results are entirely new in the mathematics literature. Previously, the spectrum $ \operatorname {\mathrm {spec}} H $ of mean-field operators as in Equation (1.1) has been identified only on order $ N $ . In [Reference van de Ven51, Cor. 2.2] it is shown that
where $ \operatorname {\mathrm {ran}} P $ is the range of the function $ P: B_1 \to \mathbb {R} $ .
To demonstrate the applicability of Theorem 1.2, we consider the quantum Curie–Weiss Hamiltonian $H = -\frac {4}{N} S_x^2 - 2 \gamma S_z$ , which corresponds to the choice $P(\mathbf {m}) = -m_x^2-\gamma m_z$ . The gradient and Hessian in Cartesian coordinates are given by
In the paramagnetic phase $\gamma> 2$ , $ P $ has only one minimum on $ B_1 $ at $\mathbf {m}_0 = (0, 0, 1)^T$ . The eigenvalues of the orthogonal part $ D_P^{\perp }(\mathbf {m}_0) $ of the Hessian are $\omega _1 = -2 + \gamma $ , $ \omega _2 = \gamma $ and $|\nabla P( \mathbf {m}_0 )| = \gamma $ . Theorem 1.2 then yields:
The gap closes like a square root close to the critical point $\gamma =2$ . This is the end point (at $ \beta = \infty $ ) of the critical line which separates the ferromagnetic phase of the quantum Curie–Weiss model at low temperatures from the paramagnetic phase at high temperature or large transversal field (cf. [Reference Bapst and Semerjian4]).
In the ferromagnetic phase $|\gamma | < 2$ , minima of $ P $ are found at $ \mathbf {m}_0^{\pm } =( \pm \sqrt {1-\gamma ^2/4}, 0 , \gamma /2)^T$ . The eigenvalues of the orthogonal part $ D_P^{\perp }(\mathbf {m}_0^{\pm }) $ of the Hessian are $\omega _1 = 2 ,\omega _2 = 2 (1-\gamma ^2/4) $ and $|\nabla P( \mathbf {m}_0^{\pm } )| =2$ . If we ignore the degeneracy of these two minima for the moment and pretend that only the positive solution $ \mathbf {m}_0^+ $ exists, this leads to the expression $ 4 \sqrt {1-\gamma ^2/4} $ in the right side of Equation (1.10). As will be explained below Theorem 2.5, the gap vanishes in this phase due to the degeneracy of the two minima. What is calculated here is in fact the gap between the second excited state and the ground state.
2 Low-energy spectra for operators with regular symbols
Our result on the gap and the low-energy spectrum applies to a more general class of mean-field quantum spin Hamiltonians than just noncommutative self-adjoint polynomials of the total spin. Next, we describe this class, which involves operators defined via their upper symbols. Besides the quest for generality, the semiclassical methods used in the proof of our results for Hamiltonians as in Equation (1.1) already naturally brings up such operators with more general symbols.
2.1 Assumptions
A0 We assume that $ H $ is block diagonal with respect to the orthogonal decomposition (1.2) of $ \mathcal {H}_N $ ,
with self-adjoint blocks $ H_{J,\alpha } $ acting on a copy of . Moreover, there is a twice continuously differentiable symbol $ h: B_1 \to \mathbb {R} $ such that all blocks are uniformly approximable in operator norm on $ \mathbb {C}^{2J+1} $ to order one as $ N \to \infty $ :
where the maximum runs over $ \alpha \in \{ 1, \dots , M_{N,J} \} $ and $ J \in \{ \frac {N}{2} - \lfloor \frac {N}{2}\rfloor , \dots , N/2\} $ .
For our semiclassical analysis, we introduce subspaces associated with a fixed block $(J,\alpha ) $ and any direction defined by $ \mathbf {0} \neq \mathbf {m}_0 \in B_1 $ :
The associated orthogonal projection on $ \mathbb {C}^{2J+1}$ will be denoted by $ P_{J}^{K}(\mathbf {m}_0) $ . We will work with quadratic approximations at $ \mathbf {m}_0 $ defined in terms of the matrix-valued self-adjoint second-order Taylor polynomial associated with $ h $ and the total spin $ \mathbf {S} $ on $ \mathbb {C}^{2J+1} $ :
The operators $( \mathbf {S} - N \mathbf {m}_0/2)/\sqrt { J_N(\mathbf {m}_0 )} $ with are the fluctuation operators (cf. [Reference Michoel and Nachtergaele37, Reference Michoel and Nachtergaele38]) with respect to the coherent state $ | \Omega _0 , J_N(\mathbf {m}_0 ) \rangle $ , where $ \Omega _0 $ is the spherical angle of $\mathbf {e}_{\mathbf {m}_0}$ . We assume the approximability of $ H_{J,\alpha } $ with $ J $ close to $ J_N(\mathbf {m}_0 ) $ either solely for the set of minima $ \mathcal {M} \subset B_1 $ or globally at every point.
A1 We assume that there is a continuously differentiable $ \kappa : B_1 \to \mathbb {R} $ and diverging sequences $ K_N $ , $ \overline {K}_N \in \mathbb {N} $ such that for all minima $ \mathbf {m}_0 \in \mathcal {M} $ :
where Landau’s $ o(1) $ stands for a null sequence as $ N \to \infty $ .
A1’ We assume that there is a continuously differentiable $ \kappa : B_1 \to \mathbb {R} $ and a diverging sequence $ K_N\in \mathbb {N} $ such that
Before moving on to the main results, let us add a few comments.
-
1. The reason for including an offset function $ \kappa $ in the quadratic approximations (2.4) and (2.5) is that the symbol $ h $ is only assumed to approximate $ H $ up to order one, cf. Equation (2.2). However, our main results address the spectrum exactly to this order.
-
2. In case the minimum $ \mathbf {m}_0 \in \mathcal {M} $ is in the interior of the ball, $ | \mathbf {m}_0 | < 1 $ , then $ \nabla h( \mathbf {m}_0) = \mathbf {0} $ . The second term in the quadratic approximation (2.3) is hence absent. In case $ | \mathbf {m}_0 | = 1 $ , the gradient either vanishes or points to the center of the ball, $ \nabla h( \mathbf {m}_0) = - | \nabla h( \mathbf {m}_0) | \ \mathbf {e}_{\mathbf {m}_0} $ . The second term in Equation (2.3) then equals $2 | \nabla h( \mathbf {m}_0) | ( \mathbf {e}_{\mathbf {m}_0} \cdot \mathbf {S} - N/2) $ .
-
3. If A1 or A1’ hold for diverging sequences $ K_N, \overline {K}_N $ , then they also hold for any such sequences, which are upper bounded by $ K_N, \overline {K}_N $ .
The projections corresponding to the subspaces $ \mathcal {H}^K_{J}( \mathbf {m}_0) $ are chosen such that
To estimate the norm of the spin operator $ Q_{\perp } \mathbf {S} $ projected to orthogonal directions, perpendicular to $ \mathbf {m}_0 $ , it is convenient to introduce a coordinate system. When focusing on a patch around one point, we may always assume without loss of generality that $ \mathbf {m}_0 = (0,0,|\mathbf {m}_0|)^T $ . This can be accomplished by the unitary rotation in Equation (1.3), for which $ U(\Omega _0)^* \ \mathbf {e}_{\mathbf {m}_0} \cdot \mathbf {S} \ U(\Omega _0) = S_z $ if $ \Omega _0$ denotes the spherical coordinates of $ \mathbf {e}_{\mathbf {m}_0} $ . In this coordinate system, the spin operators in the two orthogonal directions are given by $ S_x $ and $ S_y $ , and the range of the projections $ P_{J,\alpha }^{K}(\mathbf {m}_0) $ are spanned by the canonical orthonormal eigenbasis of $ S_z $ on $ \mathbb {C}^{2J+1} $ , that is, $ S_z | J- k \rangle = (J- k) | J- k \rangle $ for all $ k \in \{0, 1, \dots , 2J\} $ . We recall that both $ S_x $ and $ S_y $ are tridiagonal matrices in terms of this basis
Therefore, for any $ d \in \mathbb {N} $ there is some $ C_d < \infty $ such that for both $ \xi \in \{ x,y \} $ :
This renders evident that on $ \mathcal {H}^K_{J}( \mathbf {m}_0)$ the scaled operators $ N ( 2 S_{\xi }/N)^d $ are negligible if $ d \geq 3 $ and $ K = o(N^{1/3} ) $ .
The above arguments also show that our assumptions are tailored to apply to Equation (1.1) with an arbitrary noncommuting self-adjoint polynomial $ P $ .
Theorem 2.1. For $ H = N \textrm {P}\big (\tfrac {2}{N} \mathbf {S} \big ) $ on $ \mathcal {H}_N $ with any noncommuting self-adjoint polynomial $ P $ , all assumptions A0, A1, A1’ are satisfied with $ h = P $ and $ \overline {K}_N = o(N^{2/3})$ , $ K_N = o(N^{1/3}) $ .
Proof. Thanks to Equation (1.7), an approximate symbol of the Hamiltonian is indeed the polynomial, $ h = P $ .
Let $ \mathbf {m}_0 \in B_1 $ be arbitrary, not necessarily a minimum of $ P $ . Without loss of generality, we may assume $ \mathbf {m}_0 = (0,0,|\mathbf {m}_0|)^T $ and that the Hamiltonian is of the form $ H = N \textrm {P}\left ( s_N(1) , s_N(2) , s_N(3)\right ) $ with
The polynomial $ \textrm {P} $ is a linear combination of monomial products of a fixed order $ d \in \mathbb {N} $ . Due to noncommutativity of matrix multiplication such monomials include all products of the form
with an arbitrary choice of ordered indices $\mathbf {i} =( i_1, \dots , i_d) \in \{ 1,2,3\}^d $ . Up to order $ d \leq 2 $ and due to the assumed self-adjointness of the noncommutative product, they coincide with the terms in the definition (2.3) of $ Q( \mathbf {m}_0) $ .
Each of the above matrices $ \Pi _N(\mathbf {i}) $ is block diagonal on $ \mathcal {H}_N $ and, when restricted to a copy of $ \mathbb {C}^{2J+1} $ , at most $2d+1$ -diagonal. We may therefore estimate similarly as in Equation (2.8) for all $ J \leq N/2 $ and any choice of indices $ \mathbf {i} \in \{ 1,2,3\}^d$ :
with a constant $ C_d < \infty $ , which changes from line to line and which is independent of $ J $ . For any $ d \geq 3 $ , the right side vanishes if $ K_N = o(N^{1/3}) $ and $ |J-J_N(\mathbf {m}_0) | = o(N^{2/3})$ . Since
any monomial of order $ d \geq 3 $ indeed does not contribute in Equation (2.4).
Using similar estimates and under some more restrictive assumptions on $ K_N $ and $ \overline {K}_N $ , one may replace $ Q( \mathbf {m}_0) $ in assumption (2.4) by the second-order polynomial
which only involves the projection $ Q_{\perp } $ of the Hessian. This means that the fluctuation operator in the radial direction of $ \mathbf {m}_0 $ is negligible.
Lemma 2.2. If $ K_N = o(N^{1/3}) $ and $ \overline {K}_N^2 K_N = o(N) $ , then
Proof. Using the same coordinate system and notation as in the proof of Theorem 2.1, the difference $ Q( \mathbf {m}_0) - \widehat Q( \mathbf {m}_0) $ is a linear combination of the five monomials of the form $ N \Pi _N(\mathbf {i}) $ with $ \mathbf {i} =(i_1,i_2) \in \{ (1,3), (3,1), (2,3), (3,2), (3,3) \} $ .
By Equation (2.6), we have $ \left \| N \Pi _N(3,3) P_{J}^{K_N}(\mathbf {m}_0) \right \| \leq 8 \ ( | J - J_N(\mathbf {m}_0 )|^2 + K_N^2 ) / N = o(1) $ . In all other cases, we estimate similarly as in Equation (2.9). For example,
which by Equation (2.10) leads to $ \left \| N \Pi _N(3,1) P_{J,\alpha }^{K_N}(\mathbf {m}_0) \right \| \leq o(1) $ . The remaining terms are estimated similarly.
2.2 The case of a unique minimum
The following is our main result for mean-field Hamiltonians whose approximate symbol has a unique minimum, which is at the surface of the unit ball.
Theorem 2.3. Assuming A0 and A1 and that the symbol $ h $ has a unique global minimum at $ \mathbf {m}_0 \in S^2 $ , where $ \nabla h(\mathbf {m}_0) \neq \mathbf {0} $ and $ \det D_h^{\perp }(\mathbf {m}_0)> 0 $ :
-
1. The ground state $\psi _0$ of $ H $ on $ \mathcal {H}_N $ is unique (up to phase) and contained in the subspace with maximal total spin $J= N/2$ . In terms of the eigenstates $| N/2 - m \rangle $ of $ \mathbf {m}_0 \cdot \mathbf {S} $ in that subspace, we have for any $ m \in \mathbb {N}_0 $ :
(2.13) $$ \begin{align} \langle J-m | \psi_0 \rangle = \omega^{1/4} \sqrt{\frac{2}{(\omega +1) n!}} \left(\sqrt{\frac{\omega-1}{2(\omega +1)}}\right)^n H_n(0) + o(1) , \end{align} $$where $H_n$ denotes the n-th Hermite polynomial, and the ground state energy is given by(2.14) $$ \begin{align} E_0(H) = N h( \mathbf{m}_0) + \kappa( \mathbf{m}_0) - | \nabla h(\mathbf{m}_0) | + \sqrt{\det D_h^{\perp}(\mathbf{m}_0)} +o(1). \end{align} $$ -
2. For any energy below $ E_0(H) + \Delta $ with $ \Delta> 0 $ fixed but arbitrary, the eigenvalues of $ H $ stem from blocks $ (J,\alpha ) $ with $ k = N/2 - J \in \mathbb {N}_0 $ fixed. When counted with multiplicity, these low-energy eigenvalues of $ H_{N/2-k,\alpha } $ for $ k \in \mathbb {N}_0 $ asymptotically coincide up to $ o(1) $ with the points in
(2.15) $$ \begin{align} N h( \mathbf{m}_0) + \kappa( \mathbf{m}_0) + (2k-1)\ | \nabla h(\mathbf{m}_0) | + (2m+1) \sqrt{\det D_h^{\perp}(\mathbf{m}_0)} , \end{align} $$with $ m \in \mathbb {N}_0 $ . The spectral gap of $ H $ is(2.16) $$ \begin{align} \operatorname{\mathrm{gap}} H = 2 \min \left\{ |\nabla h( \mathbf{m}_0 ) |, \sqrt{\det D_h^{\perp}(\mathbf{m}_0)} \right\}+o(1). \end{align} $$
The proof of Theorem 2.3 is the topic of Section 3.2. As can be inferred from there, the error estimates hiding in the $ o(1) $ -terms are made up from two sources: first, the accuracy of the assumed quadratic approximation (2.4), and second, the cumulative subsequent error estimates. Moreover, an asymptotic form of all eigenfunctions of $ H_{J,\alpha } $ on the outer blocks and not only the ground state (2.13) is found there. Although the ground state is simple and the same applies to the oscillator spectrum (2.15) of $ H_{J,\alpha } $ for fixed $ (J,\alpha ) $ , the multiplicity $ M_{N,J} $ in Equation (2.1) will cause the eigenvalues of $ H $ to occur approximately to order $ o(1) $ with these multiplicities.
Before moving on to the general case, let us put Theorem 2.3 in the context of available results.
-
1. The spectra of quadratic mean-field Hamiltonians such as the Lipkin–Meshkov–Glick model can be described exactly through Bethe–Ansatz-type equations [Reference Turbiner50, Reference Pan and Draayer41, Reference Ortiz, Somma, Dukelsky and Rombouts40]. Since the latter are in the most general case hard to solve, much attention has been given to finding approximate semiclassical solutions [Reference Ribeiro, Vidal and Mosseri46, Reference Bulekov12]. In view of this, it is worth emphasizing that the above theorem is (through Theorem 2.1) applicable to all $ H $ of polynomial form (1.1). Their low-energy spectra are proven to agree with that of the associated quadratic term. The latter turns out to produce the harmonic oscillator spectrum (2.15), in which the frequency is determined by the Hessian in the spherical geometry. The spectrum of such general mean-field Hamiltonians has so far only been determined to a coarser order $ N $ in [Reference van de Ven51] (cf.Equation (1.11)) and not on the fine scale $ o(1) $ .
-
2. Expressions for the spectral gap of certain polynomial mean-field quantum-spin Hamiltonians have been derived on the level of rigor of theoretical physics in [Reference Ribeiro, Vidal and Mosseri46, Reference Bapst and Semerjian4]. These works assume that the low-energy spectrum of the relevant blocks $ H_{J,\alpha } $ are equally spaced and argue that the gap is proportional to its inverse density of states at the ground state. A posteriori and thanks to the proven equal spacing of the low-energy eigenvalues of $ H_{N/2 - k ,\alpha } $ , this is correct in case the minimum in Equation (2.16) is attained at $ \big (\det D_h^{\perp }(\mathbf {m}_0)\big )^{1/2}$ . It hence works in case of the quantum Curie–Weiss model in the paramagnetic phase. However, in case the minimum in Equation (2.16) is found at $ |\nabla h( \mathbf {m}_0 )| $ and hence stems from the secondmost outer shell, this strategy fails.
Theorem 2.3 should be contrasted with the case that the minimum of the symbol is found strictly inside the unit ball. In this case, the spectral gap vanishes since the following theorem shows that all blocks $ (J,\alpha ) $ with $ | J - J_N(\mathbf {m}_0) | \leq o(\sqrt {N }) $ have the same ground-state energy up to an $ o(1) $ -error.
Theorem 2.4. Assuming A0 and A1’and that the symbol $ h \in C^3 $ has a unique global minimum at $ \mathbf {m}_0 \in B_1 $ with $0 < | \mathbf {m}_0| < 1$ and $ D_h( \mathbf {m}_0)> 0 $ . Then:
-
1. Any ground-state eigenvector of $ H $ on $ \mathcal {H}_N $ is contained in a subspace with total spin $ J $ with $ {| J - J_N(\mathbf {m}_0) | \leq \mathcal {O}(\sqrt {N })}$ and
(2.17) $$ \begin{align} E_0(H) = E_0(H_{J,\alpha}) = N h(\mathbf{m}_0) + \kappa( \mathbf{m}_0) + | \mathbf{m}_0| \sqrt{\det D_h^{\perp}(\mathbf{m}_0)} + o(1). \end{align} $$ -
2. For any $ J $ with $ | J - J_N(\mathbf {m}_0) | \leq o(\sqrt {N }) $ , the ground-state energy $ E_0(H_{J,\alpha }) $ is still given by Equation (2.17).
The proof largely builds on the techniques of the proof of Theorem 2.3 and is found in Section 3.3. The techniques allow in fact to determine the whole low-energy spectrum of every block $ H_{J,\alpha } $ with $ | J - J_N(\mathbf {m}_0) | \leq o(\sqrt {N }) $ .
2.3 The case of a finite number of minima
Our last main result concerns the case of symbols with finitely many minima on the surface of the quantum sphere.
Theorem 2.5. Assuming A0 and A1 and that the symbol $ h $ has $ L $ global minima at $ \{ \mathbf {m}_1, \dots , \mathbf {m}_L \} \subset S^2 $ , where at each minimum $ \nabla h(\mathbf {m}_l) \neq \mathbf {0} $ and $ \det D_h^{\perp }(\mathbf {m}_l)> 0 $ :
-
1. Any ground-state eigenvector of $ H $ on $ \mathcal {H}_N $ is contained in the subspace with maximal total spin $J= N/2$ . The ground-state energy is
$$ \begin{align*}E_0(H) = \min_{l \in \{1,\dots,L\}} \left[ h( \mathbf{m}_l) + \kappa( \mathbf{m}_l) - | \nabla h(\mathbf{m}_l) | + \sqrt{\det D_h^{\perp}(\mathbf{m}_l)} \right] +o(1). \end{align*} $$ -
2. For any energy below $ E_0(H) + \Delta $ with $ \Delta> 0 $ fixed but arbitrary, the eigenvalues of $ H $ stem from blocks $ (J,\alpha ) $ with $ k = N/2 - J \in \mathbb {N}_0 $ fixed but arbitrary. When counted with multiplicity, these low-energy eigenvalues of $ H_{N/2-k,\alpha } $ for $ k \in \mathbb {N}_0 $ asymptotically coincide up to $ o(1) $ with the points in
(2.18) $$ \begin{align} N h( \mathbf{m}_l) + \kappa( \mathbf{m}_l) + (2k-1)\ | \nabla h(\mathbf{m}_l) | + (2m+1) \sqrt{\det D_h^{\perp}(\mathbf{m}_l)} , \end{align} $$with $ m \in \mathbb {N}_0 $ and $ l \in \{1,\dots , L \} $ .
The proof, which in comparison to Theorem 2.3 poses the additional difficulty of controlling the tunnelling between minima, is found in Section 3.4.
The theorem allows for degeneracies in the spectrum already at the level of the ground state. The quantum Curie–Weiss model $P(\mathbf {m}) = -m_x^2-\gamma m_z$ in the ferromagnetic phase $ |\gamma | < 2 $ is an example. The gradient’s norm $ |\nabla P(\mathbf {m}_0^{\pm })| = 2 $ and $ \det D_P^{\perp }(\mathbf {m}_0^{\pm }) = 4- \gamma ^2 $ agree for the two minima $ \mathbf {m}_0^{\pm } \in S^2 $ . Therefore, all the low-energy eigenvalues described by Equation (2.18) are approximately doubly degenerate. In this situation, the formula (1.10) yields the gap of the nearly $ L $ -fold degenerate ground state ( $L=2$ for Curie–Weiss) to the next energy levels. Our proof enables to show that the level splitting due to tunnelling through a macroscopic barrier from one minimum to the other is smaller than any polynomial in $ N^{-1} $ . As demonstrated numerically in [Reference van de Ven, Groenenboom, Reuvers and Landsman52], the quantum Curie–Weiss’s ferromagnetic phase exhibits the ‘flea on the elephant phenomenon’ [Reference Jona-Lasinio, Martinelli and Scoppola28], that is, the sensitivity of the ground-state function to perturbations. It might be interesting to combine the techniques in this paper with [Reference Helffer and Sjostrand26, Reference Simon48] for a proof of this.
Let us conclude the main part of the paper with a general outlook. Having derived precise low-energy asymptotics of eigenvalues in terms of quadratic approximations and using techniques as in [Reference Manai and Warzel35, Sec. 5], our techniques should extend to derive the fluctuations of the free energy not only at $ \beta = \infty $ but also for finite temperature.
It would also be interesting to investigate the dynamical properties of the mean-field Hamiltonians. Coherent wavepackets evolve semiclassically [Reference Fröhlich, Knowles and Lenzmann19]. The latter work also connects to the question of whether the description of the low-energy spectra in the present paper are helpful in the analysis of models which become semiclassical only in a Kac-type scaling limit (cf. [Reference Presutti43]).
3 Semiclassical analysis
This section is dedicated to the proofs of Theorems 2.3-2.5. The proofs combine projection techniques in a Schur-complement analysis, which in the present paper constitutes of the following simple principle.
Proposition 3.1. Let $ A $ be a bounded self-adjoint operator on a Hilbert space $ \mathcal {H} $ , and $ E $ and $ F $ orthogonal projections with . Assume that $ a < \inf \operatorname {\mathrm {spec}} F A F $ , and let $ R(a) = ( F AF - a F)^{-1} $ stand for the block inverse on $ F\mathcal {H} $ . Then
In particular, the eigenvalues $ \alpha _0(A) \leq \alpha _1(A) \leq \dots $ of $ A $ (counted with multiplicities) and the respective eigenvalues of $ EAE $ satisfy
provided $ \alpha _j(A) < a < \operatorname {\mathrm {spec}} FAF$ .
The proof is elementary. Extensions of this statement can be found in [Reference Sjöstrand and Zworski49, Reference Zhang53, Reference Zworski54, Reference Gustafson and Sigal23].
Through suitably defined projections, our proof of Theorems 2.3–2.5 focuses the spectral analysis of the mean-field Hamiltonian on subspaces associated with patches around the classical minima of its symbol. More precisely, we proceed by a two-step localization procedure. To illustrate the main idea, let us focus on the proof of Theorem 2.3. For a first coarse localization and at a fixed value of the total spin $ J $ , we restrict attention to the subspaces $ \mathcal {H}^{K_N}_{J}( \mathbf {m}_0) $ with $ K_N $ increasing with $ N $ and $ \mathbf {m}_0 $ the unique minimum of the symbol. On these increasing nested subspaces, we construct a sequence of approximate Hamiltonians, which result from an explicit limit operator of the fluctuation operators. In a second localization, the Schur-complement analysis of Proposition 3.1 is then applied with $ E $ the subspace spanned by an arbitrary, but finite, number of eigenspaces of the approximate limit operator, which turns out to coincide with a harmonic oscillator. The off-diagonal terms featuring in Proposition 3.1 are estimated with the help of exponential decay estimates on the explicit eigenfunctions of the limit operator.
In case of Theorem 2.5, we proceed similarly. An additional challenge is to control the tunneling between patches of different minima. This is done here using the exponential decay of coherent-state inner products, which we derive in Appendix A.1. Let us stress that our method does not rely on sharp semiclassical tunneling estimates in terms of the Agmon metric, which are available, for example, for related problems of multiwell tunneling for Schrödinger operators [Reference Helffer and Sjostrand26, Reference Simon48].
To the best of our knowledge, the above proof strategy is new in the present context of a spectral analysis of mean-field quantum spin systems. In a broader context. the Schur-complement methods or more generally, Gushin’s method, have been employed before in semiclassical analysis, for example, for stability analysis and, under the name Feshbach–Schur method, in perturbation theory in quantum mechanics; cf. [Reference Sjöstrand and Zworski49, Reference Zworski54, Reference Gustafson and Sigal23].
We start with a description of the limit operators. The proof of Theorems 2.3-2.5 follow in the subsequent sections.
3.1 Limit of fluctuation operators
We start our proof by introducing two operators $L_x$ and $L_y$ on the Hilbert space $\ell ^2({\mathbb {N}}_0)$ , which turn out to be unitarily equivalent to the position and momentum operator on $L^2({\mathbb {R}})$ . This enables us to determine the spectrum and eigenfunctions of the operator
which is equivalent to a harmonic oscillator with frequency $\omega $ . In the following subsections, we show that the Hamiltonians described in Section 2.1 indeed converge in a sense to be specified locally to an operator of the form D. This is the key to determine their low-energy spectra explicitly.
To set the stage, we define $L_x$ and $L_y$ via their matrix elements in terms of the canonical orthonormal bases in $ \ell ^2(\mathbb {N}_0) $ :
They give rise to essentially self-adjoint operators on the dense domain
Moreover, by an elementary calculation
that is, $L_x$ and $L_y$ satisfy the canonical commutation relations. The following proposition is essentially a consequence of this observation.
Proposition 3.2. Let $\omega \geq 1$ . Then, $D = \omega ^2 L_x^2+ L_y^2$ is a positive, essentially self-adjoint operator on the domain $ c_{00} \subset \ell ^2(\mathbb {N}_0)$ with spectrum
Every point in the spectrum is a nondegenerate eigenvalue. The ground-state $\psi _0$ is
The k-th excited state is given by $\psi _k = (a^{\dagger })^k \psi _0/\sqrt {k!} $ with the raising operator . In particular, $ \langle n | \psi _k\rangle = 0 $ unless $ k-n $ is even in which case, we have the exponential decay estimate
The value $H_n(0)$ is explicit:
In case $ \omega = 1 $ , the eigenbasis turns out to agree with the canonical orthonormal basis, $ | \psi _k \rangle = | k \rangle $ for all $ k \in \mathbb {N}_0 $ . The exponential decay estimates (3.5) and (3.6) will play a crucial role in subsequent approximation results.
Proof of Proposition 3.2.
The commutation relation (3.2) implies that there is a unitary $U \colon \ell ^2({\mathbb {N}}_0) \to L^2({\mathbb {R}})$ such that $ U L_x U^{*} = \hat {x} $ and $ U L_y U^{*} = \hat {p} $ , where $\hat {x}$ and $\hat {p}$ denote the position and momentum operator. In particular, $U D U^{*}$ is the standard harmonic oscillator with frequency $\omega $ , which yields the basic assertion on $\operatorname {\mathrm {spec}} D $ . The eigenfunctions of $ U D U^{*} $ are known to be given by
This unitary equivalence also proves the ladder operator representation for the excited states (cf. [Reference Hall25]). Since $U| n \rangle = | \varphi _n^{1} \rangle $ , we seek for a representation of $\varphi _m^{\omega }$ and, in particular $ m = 0 $ in terms of the $\omega = 1$ eigenfunctions:
Using a change of variables and subsequently the multiplication theorem for Hermite polynomials,
we arrive after some elementary algebra at
The above sum is only nonzero if $ n - k $ is even. In particular, due to the indicator function requiring $ l = n/2 $ , for $ k = 0 $ only one term in the sum survives yielding Equation (3.4) with $ H_n(0) $ replaced by Equation (3.7).
For a proof of the exponential decay estimates (3.5), we start from Equation (3.8) with the triangle inequality. Since $ \frac {2\sqrt {\omega }}{\omega +1} \leq 1 $ , this term can be upper bounded by one. Furthermore, since also $ \frac {\omega -1}{2(\omega +1)} \leq 1 $ , the we may lower bound $ 2l +\tfrac {k-n}{2} \geq \frac {|k-n|}{2} $ since $ l\geq 0 $ in case $ k\geq n $ and $ l\geq (n-k)/2 $ in case $ n\geq k $ . Therefore, in case $ k \geq n $ it remains to estimate
The last ratio is then estimated with the help of standard Stirling bounds,
together with the elementary bound $ \exp \left ( - (k-n) \ln \left ( 1 - \frac {n}{k}\right ) \right ) \leq 2^n $ valid for all $ k \geq 2n $ . This yields Equation (3.5) after some algebra. In case $ n \geq k $ , we proceed similarly. The only difference is that the sum in Equation (3.8) starts from $\frac {|k-n|}{2}$ .
The connection of $ D $ to our models will be through its approximations defined on $ \mathbb {C}^{2J+1} $ :
In this subsection, $ |\mathbf {m}_0| \in (0,1] $ is treated as a scaling parameter. When restricting the commutation relation, $ \big [ L^{(N)}_x , L^{(N)}_y \big ] = i S_z / J_N $ , to the subspace
with $ |J-J_N| \leq \overline {K}_N $ , the commutator asymptotically agrees as $ N \to \infty $ with the canonical one as long as both $ \overline {K}_N $ and $ K_N $ grow only suitably slow with $ N $ . Hence, $ D_N $ when restricted to $ \mathcal {H}^{K_N}_{J} $ is expected to be equal to the harmonic oscillator $ D $ in the limit $ N \to \infty $ . This observation is at the heart of many works on quantum spin systems in the large $ J $ limit [Reference Hepp and Lieb27, Reference Michoel and Nachtergaele37, Reference Michoel and Nachtergaele38]. To turn it into a mathematical argument in the present context, we note that through the identification $ | J-k \rangle \equiv | k \rangle $ of their canonical orthonormal basis, the Hilbert spaces $ \mathcal {H}^{K_N}_{J} \subset \mathbb {C}^{2J+1} $ are all canonically embedded into $ \ell ^2(\mathbb {N}_0 ) $ . This embedding will be denoted by
stands for its corresponding projection. Thus,
when restricted to $ \mathcal {H}^{K_N}_{J} $ is unitarily equivalent to
when restricted to
with $ P_{K_N} $ denoting the corresponding orthogonal projection in $ \ell ^2(\mathbb {N}_0) $ .
Lemma 3.3.
-
1. For any choice of sequences $ K_N = o(N^{1/2}) $ and $ K_N \overline {K}_N^3 = o(N^2) $ ,
(3.10) $$ \begin{align} & \max_{|J-J_N|\leq \overline{K}_N} \left\| \left(\overline D_{J,N} - D_N \right) P_J^{K_N^-} \right\| = o(1) , \end{align} $$where and $ P_J^{K_N^-} $ denotes the orthogonal projection onto $ \mathcal {H}^{K_N^-}_{J} \subset \mathcal {H}^{K_N}_{J}$ . -
2. For any sequence $ K_N \to \infty $ , the operators $ \overline D_N = P_{K_N} D P_{K_N} $ converge as $ N \to \infty $ in strong-resolvent sense to $ D $ . Moreover, we also have the pointwise convergence of eigenvalues:
(3.11) $$ \begin{align} E_k\left( \overline D_N \right) = E_k\left( D \right) + o(K_{N}^{-\infty}) \end{align} $$for any $ k \in \mathbb {N}_0 $ . If $ E_K $ and $ E_K^{(N)} $ denote the spectral projection onto the $ K \in \mathbb {N} $ lowest eigenvalues of $ D $ and $ \overline D_N $ on $ P_{K_N} \ell ^2(\mathbb {N}_0) $ , then(3.12) $$ \begin{align} \left\| \left( E_K^{(N)} - E_K\right) E_K^{(N)} \right\| = o(K_{N}^{-\infty}), \end{align} $$where $ o(K_{N}^{-\infty }) $ denotes a sequence, which when multiplied by an arbitrary but fixed power of $ K_N $ goes to zero.
Proof. 1. For any $ k, k^{\prime } \leq K_N{\kern-1.2pt}, $ the matrix-elements of the difference in the canonical basis of $I_J^{(N)} \mathcal {H}^{K_N}_{J} $ are given by
Each of the two terms terms in the right side are explicit thanks to EQuations (3.1) and (2.7). If $ k^{\prime } \leq K_N-2 $ , their difference can be expressed in terms of
with $ m, m^{\prime } \leq K_N $ and analogously for $ y $ instead of $ x $ , for which the expression differs only by an overall complex phase. By an explicit calculation, the modulus of the last term is upper bounded according to
for both $ \xi \in \{ x, y\} $ . Since also
we conclude that for some constant $ C< \infty $ and all $ k\leq K_N $ , $ k^{\prime } \leq K_N-2 $ :
By an analogous estimate as in Equation (2.10), we thus arrive at the claimed norm estimates in Equation (3.10).
2. Since $ c_{00} $ is a common core for $ D $ and $\overline D_N = P_{K_N} D P_{K_N} $ , the claimed strong-resolvent convergence is immediate from from the fact that the matrix elements of $ D $ vanish unless their difference is smaller than two (cf. [Reference Reed and Simon45]). To boost this strong convergence to convergence of eigenvalues, we use the block approximation
with $ Q_{K_N} = 1 - P_{K_N} $ . For $ E_K = \sum _{k=0}^{K-1} |\psi _k\rangle \langle \psi _k| $ , we have
The error estimate as $ N \to \infty $ follows from the exponential decay estimate (3.5).
The matrix elements of $\langle n | D-\widehat D_N | m \rangle $ are nonvanishing only for $ | n-K_N|, |m-K_N| \leq 2 $ . Moreover, $ | \langle n | D- \widehat D_N | m \rangle | \leq \mathcal {O}(K_N) $ such that again by the exponential decay estimate (3.5):
Denoting by $ F_K = 1 - E_K $ , this implies that $ \left \| E_K (D- \widehat D_N) E_K \right \| = o(K_N^{-\infty }) $ and $ \left \| F_K \widehat D_N E_K \right \| = o(K_N^{-\infty }) $ . Since also
the Schur-complement method of Proposition 3.1 proves that the eigenvalues of $\widehat D_N$ strictly below $ E_{K}(D) \left ( 1 - o(1) \right ) $ aymptotically coincide with those of $ D $ up to an error of order $ o(K_N^{-\infty }) $ .
A similar estimate also shows that $ Q_{K_N} D Q_{K_N} \geq E_{K}(D) Q_{K_N} \left ( 1 - o(1) \right ) $ for any $ K\in \mathbb {N}$ so that the low-energy spectrum of $\widehat D_N$ entirely coincides with that of its first block $ \overline D_N $ . This finishes the proof of Equation (3.11).
The assertion (3.12) then follows from Equation (3.15), the discrete nature of the spectrum of $ D $ and a standard perturbation theory bound based on the representation of the spectral projection as a contour integral involving resolvents (cf. [Reference Reed and Simon45]).
3.2 Proof of Theorem 2.3
For a proof of Theorem 2.3, we analyse all the blocks $ H_{J,\alpha } $ in the decomposition (2.1) of the Hamiltonian separately. The main contribution to the lowest energies of $ H $ will come from the largest $ J $ and the vicinity of the minimum. Projection techniques will help to focus on this patch.
To facilitate notation, by a unitary rotation we subsequently assume without loss of generality that the minimum of $ h $ is at $ \mathbf {m}_0 = (0,0, 1)^T $ and that the projected Hessian $ Q_{\perp } D_h(\mathbf {m}_0) Q_{\perp } $ as a matrix on $ \operatorname {\mathrm {ran}} Q_{\perp } $ has its eigenvectors aligned with the $ x $ - and $ y $ -direction with $ \omega _x $ and $ \omega _y $ the corresponding eigenvalues. This entails that
cf. Equation (2.11). We will also assume without loss of generality that the positive eigenvalues of $ D_h^{\perp }(\mathbf {m}_0) $ are ordered according to $ 0 < \omega _y^{\perp } = \omega _y + |\nabla h(\mathbf {m}_0) | \leq \omega _x + |\nabla h(\mathbf {m}_0) | = \omega _x^{\perp } $ , and we note that
Moreover, we will use throughout the whole proof $ K_N = \overline {K}_N = o(N^{1/3}) $ diverging as $ N \to \infty $ .
3.2.1 Limit operator for $ J \geq N/2 - K_N $
By assumption (2.4) and Lemma 2.2, on the increasing subspaces $ \mathcal {H}^{K_N}_{J}( \mathbf {m}_0) $ with $ J \geq N/2 - K_N $ , the shifted quadratic polynomial approximates $ H_{J,\alpha } $ uniformly in norm to order $ o(1) $ . Next, we show that this quadratic polynomial is well approximated by
with $ D = \omega ^2 L_x^2+ L_y^2 $ on $ \ell ^2(\mathbb {N}_0) $ and $ \omega $ from Equation (3.17).
Lemma 3.4. In the situation of Theorem 2.3, if $ K_N = o(N^{1/3}) $ :
Proof. We use assumption (2.4) and Lemma 2.2 to establish the claim (3.18) with $ H_J^{(N)} $ replaced by . In order to simplify the term in Equation (3.16) proportional to $ | \nabla h( \mathbf {m}_0)| $ , we rewrite
On $ \mathcal {H}^{K_N}_{J}( \mathbf {m}_0) $ with $ J \geq N/2 - K_N $ , the norm of the last term is bounded by $ (4 K_N)^2/2N = o(1) $ . To that order, we can therefore replace $ H_J^{(N)} $ by
Lemma 3.3 then yields
which completes the proof.
3.2.2 Truncations
To control the blocks $ H_{J,\alpha } $ for values of $ J $ other than the ones of the previous paragraphs, we use the following lemma.
Lemma 3.5. In the situation of Theorem 2.3, there are constants $ c, C \in (0,\infty ) $ such that for all $ (J,\alpha ) $
Proof. From assumption (2.2), we infer that for some $ C < \infty $ :
Since $ \mathbf {m}_0 \in S^2 $ was assumed to be the unique minimum of $ h: B_1 \to \mathbb {R} $ and $ |\nabla h( \mathbf {m}_0 ) |> 0 $ , there is some $ c> 0 $ such that for all $ \mathbf {m} \in B_1 $ :
Plugging this estimate into the above operator inequality and using the fact that by Lemma A.5
with some constant $ C $ , which does not depend on $ N, J $ , we arrive at the claimed matrix inequality.
3.2.3 Finishing the proof
With the above preparations, we are ready to spell out the proof of Theorem 2.3. Aside from exploiting the block decomposition (2.1), the argument essentially relies on a two-step approximation procedure and a Schur-complement analysis in the main blocks corresponding to $ J \geq N/2 - K_N $ .
Proof of Theorem 2.3.
We investigate the spectrum of each block $ H_{J,\alpha } $ of the Hamiltonian in Equation (2.1) separately and distinguish cases.
In case $ J \leq N/2 - K_N $ , we use Lemma 3.5 to conclude that for all $ \alpha $
Since $ K_N \to \infty $ as $ N \to \infty $ , these blocks clearly do not contribute to the asserted low-energy spectrum below $ E_0(H) + \Delta $ with arbitrary $ \Delta> 0 $ .
In case $ J \geq N/2 - K_N $ , we consider the approximating Hamiltonian $H_J^{(N)} $ on $ \mathcal {H}^{K_N}_{J}( \mathbf {m}_0) $ from Lemma 3.4 and define its projection
Note that this operator is unitarily equivalent to
We fix $ K \in \mathbb {N} $ arbitrary and let $ E_K^{(N)} $ stand for the orthogonal projection onto the $ K $ -dimensional subspace of $ \mathcal {H}^{K_N^-}_{J}( \mathbf {m}_0) $ spanned by eigenvectors of the $ K $ lowest eigenvalues of $ \widetilde H_J^{(N)} $ . Its orthogonal complement in will be denoted by . Using Lemma 3.4, we conclude
The term in the right side equals
and its convergence follows from Equation (3.12) and (3.15).
For the Schur-complement principle in Proposition 3.1, it thus remains to control the lowest eigenvalue of the block $F_K^{(N)} H_{J,\alpha } F_K^{(N)} $ . For this task, we again employ Lemma 3.5, which yields the matrix inequality
where $ P^M_J $ is the orthogonal projection in $ \mathbb {C}^{2J+1} $ spanned by the eigenvectors $ | J \rangle , | J-1 \rangle , \dots | J - M \rangle $ of $ S_z $ corresponding to the highest $ M+1 $ eigenvalues. We establish a lower bound on the last term in Equation (3.26) with the help of an upper bound on
By Lemma 3.3 as $ N \to \infty $ and for any sequence $ J \geq N/2 - K_N $ , the matrices $P_{K_N^-} D P_{K_N^-} $ converge in strong-resolvent sense to $ D $ . This implies the weak convergence of the corresponding spectral projections. In particular, for each fixed $ m \in \mathbb {N}_0 $ and any sequence $ J \geq N/2 - K_N $ :
where $ (\psi _k)_{k\in \mathbb {N}_0} $ denotes the orthonormal eigenbasis of $ D $ . The latter are estimated in Proposition 3.2. Thanks to the exponential decay estimate (3.6), the right side is exponentially small in $ K - M $ for all $ m \in \{ 0, 1, \dots , M \} $ provided $ K \geq 2 M $ . Hence, if we choose $ M $ large enough and subsequently $ K $ suitably larger, the prefactor in the last term in Equation (3.26) exceeds any constant. The block $ F_K^{(N)} H_{J,\alpha } F_K^{(N)} $ then does not contribute to the low-energy spectrum of $ H_{J,\alpha } $ .
By Equations (3.24)–(3.28) and Proposition 3.1 the low-energy spectrum of $ H_{J,\alpha } $ hence asymptotically agrees with the lowest eigenvalues of $\widetilde H_J^{(N)} $ . In turn, they agree with those of the operator in Equation (3.23). By the convergence (3.11), the eigenvalues of the last term in Equation (3.23) asymptotically agree with those of $\omega _y^{\perp } D $ , which are given by the oscillator values (3.3). For $ k = N/2 - J \in \mathbb {N}_0 $ fixed, this yields the expression (2.15) for the eigenvalues including their multiplicities.
The minimal eigenvalue $ E_0(H) $ asymptotically corresponds to the choice $ k = m = 0 $ , which establishes Equation (2.14). The corresponding weak convergence of the eigenvector (2.13) is a consequence of the Schur-complement analysis, Lemma 3.3 and the explicit expression (3.4) for the ground-state of $ D $ .
Finally, the expression (2.16) for the spectral gap is immediate from Equation (2.15) since either the first excited corresponds to $ k= 0 $ and $ m = 1 $ or vice versa.
3.3 Proof of Theorem 2.4
In case the unique minimum is found at at $ \mathbf {m}_0 \in B_1 $ with $0 < | \mathbf {m}_0| < 1$ , the low-energy spectrum of $ H $ stems again from blocks $ H_{J,\alpha } $ with $ J $ -values in the vicinity of $ J_N( \mathbf {m}_0) = N |\mathbf {m}_0|/ 2 $ . This is evident from the following lower bound, which substitutes Lemma 3.5 in the present case.
Lemma 3.6. In the situation of Theorem 2.4, there are constants $ c, C \in (0,\infty ) $ such that for all $ (J,\alpha ) $ and all $ \mathbf {m}_1 \in B_1 $ :
Proof. We use that $ \mathbf {m}_0 \in B_1$ is the unique minimum of $ h: B_1 \to \mathbb {R} $ at which $ D_h( \mathbf {m}_0)> 0 $ . Hence, there is some $ c> 0 $ such that
The last inequality was obtained using Cauchy–Schwarz and holds for all $ \mathbf {m} \in B_1 $ . Plugging the estimate into Equation (3.20) yields the claim using the same arguments as in the proof of Lemma 3.5.
Proof of Theorem 2.4.
A Rayleigh–Ritz bound with the variational state $ | \Omega _0 , J_N(\mathbf {m}_0) \rangle $ , with $ \Omega _0 $ the spherical angle of $ \mathbf {e}_{\mathbf {m}_0 }$ , yields the upper bound
by Proposition A.3. Combining this with Lemma 3.6 with $ \mathbf {m}_1 = \mathbf {m}_0 $ , we immediately conclude that the blocks $ H_{J,\alpha } $ corresponding to $ | J - J_N( \mathbf {m}_0) | \geq C \sqrt {N } $ with $ C> 0 $ suitably large, but fixed, do not contribute to the spectrum near the ground state.
It thus remains to analyse the blocks corresponding to $ | J - J_N( \mathbf {m}_0) | \leq C \sqrt {N } $ . For any $ J $ in this regime, we associate a radius $ r_{J,N} := 2J/N $ , which by construction satisfies $ \left | r_{J,N} - |\mathbf {m}_0| \right | \leq C N^{-1/2} $ . The value of $ h $ at the point on this sphere in direction of $ \mathbf {m}_0 $ is controlled by a second-order Taylor estimate using $ | \nabla h(\mathbf {m}_0) | = 0 $ :
where here and in the following the constant $ C $ changes from line to line, but remains independent of $ J $ and $ N $ . Hence, the minimum $\mathbf {m}_{J,N} := \arg \min \{ h(\mathbf {m}) \, | \, |\mathbf {m} | = r_{J,N} \} $ on the sphere of radius $ r_{J,N} $ has distance to $ \mathbf {m}_0 $ at most
since otherwise the first inequality in (3.30) together with the above estimate of $ h\left ( r_{J,N} \mathbf {e}_{\mathbf {m}_0} \right ) $ would yield a contradiction to the minimality of $ h(\mathbf {m}_{J,N} ) $ . Using Taylor estimates for $ h \in C^3 $ , this also implies
Having singled out $ \mathbf {m}_{J,N} $ , we now use assumption A1 to approximate $ H_{J,\alpha } $ in the $ \mathcal {H}_{J}^{K_N}(\mathbf {m}_{J,N} ) $ , where we pick $ K_N = o(N^{1/6}) $ diverging to infinity. This enables us to use Lemma 2.2 (with $ \overline {K}_N = 1 $ ) and the second-order polynomial $ \widehat Q(\mathbf {m}_{J,N}) $ in this approximation. The above estimate on the gradient implies that the first-order term is negligible:
Changing coordinates by a unitary rotation such that $ \mathbf {m}_{J,N} = (0,0,r_{J,N}) $ , we thus conclude that on $ \mathcal {H}_{J}^{K_N}( \mathbf {m}_{J,N} ) $ we may approximate $ H_{J,\alpha } $ in terms of
where $\omega _x^{(N)} , \omega _y^{(N)} $ are the two eigenvalues of $ D_h^{\perp }(\mathbf {m}_{J,N} ) $ , that is, the Hessian projected onto the directions perpendicular to $ \mathbf {m}_{J,N} $ . Since $ D_h( \mathbf {m}_0 )> 0 $ , these eigenvalues are uniformly bounded away from zero for all $ | J - J_N( \mathbf {m}_0) | \leq C \sqrt {N } $ by Equation (3.33). Moreover, they uniformly converge to the eigenvalues $ 0 < \omega _y \leq \omega _x$ of $ D_h^{\perp }(\mathbf {m}_{0} ) $ as $ N \to \infty $ . In fact, from Equation (3.33) we have the estimates $ | \omega _{\xi }^{(N)} - \omega _{\xi } | \leq C N^{-1/4} $ for both $ \xi \in \{x,y\} $ . Since also $\left \| S_{\xi }^2 P_{J}^{K_N}(\mathbf {m}_{J,N}) \right \| \leq C N K_N $ (cf. Equation (2.8)), we may thus use
to show the following substitute for Lemma 3.4:
We then proceed with the Schur-complement analysis as in the proof of Theorem 2.3 with only one modification. To control the last block, we use Lemma 3.6 with $ \mathbf {m}_1 = \mathbf {m}_{J,N} $ . For this choice, the quadratic difference term in its right side is bounded by a constant thanks to Equation (3.32).
This proves that $ E_0(H_{J,\alpha }) = N h(\mathbf {m}_{J,N}) + \kappa ( \mathbf {m}_0) + | \mathbf {m}_0| \sqrt {\det D_h^{\perp }(\mathbf {m}_0)} + o(1) $ for all blocks $ (J,\alpha ) $ with $ | J - J_N( \mathbf {m}_0) | \leq C \sqrt {N } $ . Clearly for $ J = J_N( \mathbf {m}_0) $ , we have $ \mathbf {m}_{J,N} = \mathbf {m}_0 $ , which concludes the proof of Equation (2.17). The assertion concerning the regime $ | J - J_N( \mathbf {m}_0) | \leq o(\sqrt {N}) $ follows by a Taylor estimate as in Equation (3.31).
3.4 Proof of Theorem 2.5
Compared to the case of one minimum, the case of several minima $ \mathcal {M} = \{ \mathbf {m}_1, \dots , \mathbf {m}_L \} \subset S^2 $ of $ h $ poses the additional problem to separate the patches around them. In the next subsection, we use semiclassical analysis based on refined projection techniques to tackle this issue and to focus on the subspaces, from which the low-energy spectrum arises. The part going beyond this semiclassics largely parallels the proof in the previous section.
3.4.1 Semiclassics for subspace decompositions
In each of the subspaces $ \mathcal {H}_J^{K}(\mathbf {m}_l) \subset \mathbb {C}^{2J+1} $ , which are associated to the minimizing directions $ \mathbf {m}_l $ , we may choose the canonical orthonormal basis consisting of normalized eigenvectors of $ \mathbf {m}_l \cdot \mathbf {S} $ , that is,
and hence $ \mathcal {H}_J^{K}(\mathbf {m}_l) = \operatorname {span}\left \{ | J- k; \mathbf {m}_l \rangle \ | \ k \in \{ 0, 1 , \dots , K \} \right \} $ .
Lemma 3.7. Let $ \mathcal {M} := \{ \mathbf {m}_1, \dots , \mathbf {m}_L \} \subset S^2 $ be a finite set and $ J \in \mathbb {N} /2 $ and $ K \in \mathbb {N} $ fixed. Then the spectrum of the Gram matrix
is contained in the set $ [1-R_J^K , 1+ R_J^K ] $ with
with $ \sphericalangle ( \mathbf {m}_l, \mathbf {m}_{l^{\prime }}) $ the spherical angle between the two points.
Proof. From Lemma A.2, we infer that for all $ k,k^{\prime } \leq K $ :
Since $ G $ is a Hermitian block matrix with $ L \times L $ blocks of size $ K+1 $ and the diagonal blocks all equal to the unit matrix, the assertion follows straightforwardly from Gershgorin’s circle theorem.
The last lemma ensures that for $ J \geq N/2 - K_N $ and $ K_N = o(N /\ln N ) $ :
The union of the individual basis vectors of $ \mathcal {H}_J^{K_N}(\mathbf {m}_l) $ for any finite number of directions hence still form a set of linearly independent vectors and thus a basis of joint subspace
In this situation, we may construct an orthonormal basis using the square root of the inverse, $G^{-1/2} $ , of the Gram matrix defined as in Equation (3.34):
Clearly, $\langle (k^{\prime },l^{\prime }) \big | (k,l) \rangle = \delta _{l, l^{\prime }} \delta _{k, k^{\prime }} $ . The spectral projections onto $ \mathcal {H}_J^{K_N}(\mathbf {m}_l) $ and its cousins after orthogonalization,
are then norm-close to each other.
Theorem 3.8. For $ N/2 \geq J \geq N/2 - K_N $ and $ K_N = o(N/ \ln N ) $ ,
-
1. $ \displaystyle \big \| | (k,l) \rangle - | J-k; \mathbf {m}_{l} \rangle \big \| = o(N^{-\infty }) $ for all $ (k,l) \in \{0,\dots , K_N\} \times \{1,\dots , L\} $ .
-
2. $\displaystyle \max _{l} \left \| P_J^{K_N}( \mathbf {m}_l) - P_J^{K_N}(l) \right \| = o(N^{-\infty }) $ , and the projection $ P_J^{K_N} := \sum _{l=0}^L P_J^{K_N}(l) $ onto $ \mathcal {H}_J^{K_N} $ is approximated according to
(3.39) $$ \begin{align} \left\| P_J^{K_N}- \sum_{l=0}^LP_J^{K_N}( \mathbf{m}_l) \right\| = o(N^{-\infty}). \end{align} $$
Proof. 1. By definition, we have
The norm of the difference vector is hence bounded according to
The last inequality follows from the bounds on the eigenvalues of $ G $ established in Lemma 3.7. The assertion is therefore a simple consequence of Equation (3.36).
2. The assertions follow immediately from the first item with the help of the triangle inequality.
In order to be able to control the relation of vectors in $ \mathcal {H}_J^{K_N}(\mathbf {m}_l) $ to a slightly fattened version of $ \mathcal {H}_J^{K_N}(\mathbf {m}_{l^{\prime }}) $ with $ l\neq l^{\prime }$ , we also need the following lemma. It will play a crucial role in the truncation procedure below.
Lemma 3.9. Let
and suppose that $ N/2 \geq J \geq N/2 - K_N $ and $ K_N = o(N/ \ln N ) $ . Then for any $ k \in \{0, \dots , K_N\} $ :
Proof. We split the $ k^{\prime } $ -sum in two terms. The first sum to $ K_N $ is estimates with the help of Equation (3.35) from which the claim follows by the same lines of reasoning as above. For the second sum, we have $ k \leq k^{\prime } $ . Abbreviating $ \theta := \sphericalangle ( \mathbf {m}_l, \mathbf {m}_{l^{\prime }}) $ , we estimate this part with the help of Equation (A.4) by
The truncated binomial is bounded by a standard Chernoff bound for any $ t> 0 $
Choosing $ t(\theta ) = \ln \frac {1 - \kappa }{\kappa } \left (\tan \frac {\theta }{2}\right )^2> 0 $ , the right side is of the form $ \exp \left ( -N \alpha (\theta ) \right ) $ with $ \alpha (\theta ) = - \kappa t(\theta ) - \ln \left [ \cos ^2\left (\frac {\theta }{2}\right )/(1-\kappa ) \right ]< 0 $ . Since the remaining summation is estimated trivially by the number of terms $ K_N $ times the maximum of each term, which occurs at $ k = K_N $ with the binomial also trivially bounded, $ \binom {2J}{k} \leq N^k \leq N^{K_N} $ , the result follows.
3.4.2 Truncation
By assumption, the minima of $ h $ have the property $h(\mathbf {m}) \geq h( \mathbf {m}_1 ) = \dots = h( \mathbf {m}_L ) $ for all $\mathbf {m} \in B_1 $ . A substitute for Equation (3.21) is provided by a bound of the form
with $ f_0> 0 $ and a monotone decreasing $ C^2$ -function $ f: [0,1] \to [0,\infty ) $ of the form
with some $ c> 0 $ . The parameter $ \xi \in (0,1) $ is chosen close enough to one such that the supports corresponding to distinct $ l \neq l^{\prime } $ do not overlap, that is, $ f(\mathbf {m}_l\cdot \mathbf {m}) f(\mathbf {m}_{l^{\prime }}\cdot \mathbf {m}) = 0 $ . We will choose
The same strategy as in the proof of Lemma 3.5 immediately yields that for some constant $ C_L \in (0,\infty ) $ and all $ (J,\alpha ) $ :
This would enable us to discard all blocks $ J < N/2 - K_N $ as far as the low-energy spectrum is concerned. For the other blocks, we, however, need a slightly more refined lower bound which distinguishes-patches corresponding to the subspace decomposition in the previous subsection.
Lemma 3.10. In the situation of Theorem 2.5, there are constants $ C , c \in (0,\infty ) $ such that for all $ (J,\alpha ) $ with $ J \geq N/2- K_N $ and $ K_N = o(N/ln N) $ :
where is the orthogonal projection to the complement of $ \mathcal {H}_J^{K_N} $ .
Proof. We start from Equation (3.45). In order to ease the notation, in this proof we abbreviate $ \hat f_l := N f\left (\frac {2}{N} \mathbf {m}_l\cdot \mathbf {S}\right ) $ and we will drop the super-/subscripts on the projection, for example, $ P := P_J^{K_N} $ . We will estimate the block of this operator in the decomposition separately. For the blocks involving $ P $ , we use
The operator $ \hat f_l $ is diagonal in the eigenbasis of $ \mathbf {m}_l\cdot \mathbf {S}$ , i.e.
where the truncation of the $ k $ -sum results from the bounds on the support of $ f $ . Evidently, $ \| \hat f_l \| \leq N f_0 $ . Therefore, for any $ l^{\prime } \neq l $ :
where the last step is Lemma 3.9. Its applicability is ensured by the choice (3.44) of $ \xi $ . From Equation (3.47) and Theorem 3.8, we this conclude $ \left \| \hat f_l P - \hat f_l P( \mathbf {m}_l) \right \| = o(N^{-\infty }) $ . By the spectral representation (3.48), we also have for all sufficiently large $ N $ :
Upon summation over $ l \in \{ 1, \dots , L \} $ and adding $ Nf_0 P $ , this term produces the last term in the right side of Equation (3.46) up to another norm-error of order $ o(N^{-\infty }) $ due to Equation (3.39). These error terms are absorbed in the constant in Equation (3.46).
It thus remains to investigate the block $ Q \hat f_l Q + N( f_0+ h(\mathbf {m}_1)) Q $ . To do so, it is most convenient to switch back to the representation using coherent states. Since $ f \in C^2 $ , this may be done at the expense of another constant thanks to Equation (A.3). We then lower bound
where $ C_N^c = S^2 \backslash \bigcup _{l=1}^L C_N(\mathbf {m}_l) $ is the complement of the union of the spherical caps
These are chosen such that on $ C_N^c $ we have the lower bound $ N h(\mathbf {m}) \geq N h(\mathbf {m}_1) + c K_N $ . Thanks to the decomposition of unity (1.4), to complete the proof, it remains to establish an upper bound for all $ l $ on
The spherical volume of $ C_N(\mathbf {m}_l) $ is $ \pi K_N /N $ . To estimate the norm in the integrand, we fix $ \Omega \in C_N(\mathbf {m}_l) $ and employ the approximate decomposition of unity as expressed in Equation (3.39):
By a unitary rotation, we may assume without loss of generality that $ \mathbf {m}_l = (0,0,1)^T $ . In this case, an estimate on the first term is contained in Proposition A.1 in the appendix. For its application we note that $ 2J \sin ^2(\frac {\theta }{2}) = J ( 1-\cos \theta ) \leq K_N /4 $ . Choosing $ \delta = K_N / [ 8J \sin ^2(\frac {\theta }{2})] \geq 1 $ , we hence conclude
As a consequence of this, we also have for all $ l^{\prime } \neq l $
where the last estimate is due to Theorem 3.8. This completes the proof.
3.4.3 Finishing the proof
Once the above semiclassical reduction to the relevant subspaces in the vicinity of the minima is accomplished, the proof largely follows the strategy of the case of one minimum. In the following. we will therefore only highlight the differences.
We start by setting some notation. By assumption the projection $ D_h^{\perp }(\mathbf {m}_l) $ of the Hessian of $ h $ at each minimum onto the plane perpendicular to $ \mathbf {m}_l $ has two strictly positive eigenvalues,
Throughout the proof, we again use $ K_N = \overline {K}_N = o(N^{1/3}) $ diverging as $ N \to \infty $ .
By assumption (2.4) and Lemma 2.2, on the increasing subspaces $ \mathcal {H}^{K_N}_{J}( \mathbf {m}_l) $ , we approximate $ H_{J,\alpha } $ in terms of the quadratic term $ \widehat Q(\mathbf {m}_l) $ , which involves $ 2 \left ( \omega _{x,l} S_x^2 + \omega _{y,l} S_y^2 \right )/N $ . Proceeding as in Lemma 3.4, we therefore arrive at
where
The operator $ D(l) := \omega _l^2 L_x^2 + L_y^2 $ acts in $ \ell ^2(\mathbb {N}_0) $ and
is the natural injection respectively projection with respect to the $z $ -basis in the $ l $ -direction, that is, $ I_{J,l}^{(N)} | J-k;\mathbf {m}_l \rangle = | k \rangle $ for all $ k \in \{0, \dots , K_N\} $ . Theorem 3.8 ensures the proximity of these $z $ -basis vectors to the orthonormalized basis $ | (k,l) \rangle $ , which compose an orthonormal basis for the joint subspace $ \mathcal {H}_J^{K_N} = \bigvee _{l=1}^L \mathcal {H}_J^{K_N}(\mathbf {m}_l) $ . We therefore replace the above isometric embeddings and projections by
where $ I_J^{(N)} | (k,l) \rangle = | k \rangle _l $ . These embeddings are direct sums, $ I_J^{(N)} = \bigoplus _{l=1}^L I_J^{(N)}(l) $ , of embeddings of
into the $l $ th copy of $ \ell ^2(\mathbb {N}_0) $ ; cf. Equation (3.38). Theorem 3.8 then allows us to replace $ H_{J,l}^{(N)} $ on $ \mathcal {H}^{K_N}_{J}( \mathbf {m}_l) $ by
These operators can be lifted to the direct sum
The above argument, then yields to following modification of Lemma 3.4.
Lemma 3.11. In the situation of Theorem 2.5, if $ K_N = o(N^{1/3}) $ :
where with $ K_N^- = K_N -2 $ .
The proof is straightforward from Equation (3.49) and Theorem 3.8. Equipped with this, we then proceed with the proof of Theorem 2.5 in the same way as for the case of one minimum.
Proof of Theorem 2.5.
In case $ J \leq N/2 - K_N $ , we use Equation (3.45) to conclude that the ground-state of $ H_{J.\alpha } $ is found above $ Nh(\mathbf {m}_1) - C + c K_N $ and hence does not contribute at the energies considered.
In case $ J> N/2 - K_N $ , we consider the projections
Note that this matrix is still a direct sum of matrices associated with the subspaces corresponding to $ P_{J}^{K_N^-}(l) = \sum _{k=0}^{K_N^-} | (k,l)\rangle \langle (k,l) | $ . The matrices forming the direct sum are unitarily equivalent to
on $ P_{K_N^-} \ell ^2(\mathbb {N}_0) = \operatorname {span}\left \{ | k\rangle | k \in \{0, \dots , K_N^-\} \right \} $ . In turn, these operators have been described in detail in Section 3.1.
We now fix $ K \in \mathbb {N} $ arbitrary, and let $ E_{K}^{(N)} $ stand for the orthogonal projection onto the subspace of $ P_{J}^{K_N^-} \mathcal {H}_{J}^{K_N} $ spanned by eigenvectors of the $ K $ lowest eigenvalues of $ \widetilde H_J^{(N)} $ . Its orthogonal complement in will be denoted by .
Lemma 3.11 ensures the validity of the estimates (3.24)–(3.25) (with minor modifications in the notation). It thus remains to again control the block $ F_{K}^{(N)} H_{J,\alpha } F_{K}^{(N)} $ . To do so, we modify the argument in Equation (3.26). With the help of Lemma 3.10, we arrive at
where $ M \in \mathbb {N} $ is arbitrary. Using Theorem 3.8, we can replace the projection $ P_J^{M}(\mathbf {m}_l) $ by $ P_J^{M}(l) $ at the expense of a term which is $ o(N^{-\infty }) $ . The latter can be added to the order one term proportional to $ C $ . Proceeding as in Equation (3.27), it thus remains to estimate
Since $ \widetilde H^{(N)}_J $ is a direct sum, for each of the terms in the $ l $ -sum we are therefore back to Equation (3.28) with $ K $ changed depending on how the $ L $ harmonic oscillator levels interlace. Since Equation (3.28) was identified to be exponentially small in case $ K $ is chosen much larger than $ M $ , this still shows that the last term in the right side of Equation (3.51) is bounded independent of $ M $ . Hence, choosing $ M $ large enough and subsequently $ K $ larger, the ground state energy of the block $ F_{K}^{(N)} H_{J,\alpha } F_{K}^{(N)} $ is seen to be much larger than the energies of interest.
By a Schur-complement analysis (Proposition 3.1) the low-energy spectrum of $ H_{J,\alpha } $ agrees with that of $ \widetilde H^{(N)}_J $ . In the limit of $ N \to \infty $ and using Lemma 3.3, the spectrum of $ \widetilde H^{(N)}_J $ is a direct sum of $ L $ harmonic oscillator spectra as described in Proposition 3.2.
A Miscellanea on spin-coherent states
In this appendix, we collect properties of the spin-coherent states as defined in Equation (1.3). We restrict attention to the semiclassical properties, which were essential for the analysis in this paper. We refer to the textbooks [Reference Perelomov42, Reference Gazeau20, Reference Combescure and Robert16] and [Reference Arecchi, Courtens, Gilmore and Thomas3, Reference Lieb33] for further information and references.
A.1 Semiclassical estimates for the states
The spin-coherent states (1.3) on $ \mathbb {C}^{2J+1} $ are parametrised by the angles $ \Omega =(\theta ,\varphi ) $ on the unit sphere. Their scalar product
shows that, for large values of $ J \in \mathbb {N}/2 $ , they are sharply localised. Denoting by $ \sphericalangle (\Omega ,\Omega ^{\prime }) $ the spherical angle between two points on the unit sphere, one has the Gaussian-type localisation
with width proportional to $ J^{-1/2} $ .
With respect to the orthonormal eigenbasis of $ S_z $ on $ \mathbb {C}^{2J+1} $ , the spin-coherent states are linear combinations with coefficients given by
for any $ k \in \{0,1, \dots , 2J\} $ and $ \Omega =(\theta ,\varphi )$ . Measurement of $ S_z $ will therefore result in a binomial distribution of $ 2J $ independent Bernoulli variables with parameter $ p = \sin ^2\left (\frac {\theta }{2}\right ) $ . The following lemma records the standard upper-tail Chernoff estimate for the binomial distribution.
Proposition A.1. For any $ J \in \mathbb {N} /2 $ and any $ k \in \{0,1, \dots , 2J\} $ and any $ \delta> 0 $ ,
We will also need the following generalisation of the identity (A.1), which involves the unitary $ U(\Omega ) $ defined in (1.3).
Lemma A.2. For any $ k , k' \in \{0,1, \dots , 2J\} $ and any $\Omega =(\theta ,\varphi ) $ ,
Moreover, if $ k\leq k^{\prime } $ (with the convention that $ 0^0 = 1 $ ),
Proof. We use the decomposition of the unitary [Reference Perelomov42, Eq. (4.3.14)] (see also [Reference Arecchi, Courtens, Gilmore and Thomas3]):
The formula (A.3) follows from a straightforward, but tedious, calculation which uses that
and similarly for $ S_- $ (and the usual convention that the binomial is zero if the upper integer is smaller than the lower one); cf. [Reference Perelomov42, Eq. (4.2.3)].
The bound (A.4) then follows by estimating
and similarly
This yields the claim by binomial formula.
A.2 Semiclassical estimates for the symbols
Associated to any linear operator on $ \mathbb {C}^{2J+1} $ are two semiclassical symbols: the lower and upper one. We recall from [Reference Lieb33] the lower symbols of the spin operator
as well as its upper symbol alongside, the symbol of the square of the $ z $ -component,
For operators with a smooth upper symbol, the lower symbol is know to agree with the upper symbol in the semiclassical limit [Reference Lieb33, Reference Duffield17]. Since we need quantitative error estimates, we include the following result, which is tailored to the applications considered here (see also [Reference Landsman, Moretti and van de Ven31, Prop. 4.2]).
Proposition A.3. For any
on $ \mathbb {C}^{2J+1} $ with some $ f \in C^2 $ , there is some $ C < \infty $ such that for all $ J \leq N/2 $
Proof. The proof is based on the standard Taylor estimate
Using the representation (A.6) and Equation (A.5) and the completeness (1.4), we arrive at
The remainder term is estimated similarly
Inserting the Taylor estimate into the integral expression for $ H $ and using the normalization $ {\langle \Omega , J | \Omega , J\rangle = 1} $ , we thus arrive at
from which the claim follows.
The last proposition shows the consistency of the two symbols in the semiclassical limit. We also need the following quantitative version of Duffield’s theorem [Reference Duffield17] on the consistency of the quantisation of noncommuting self-adjoint polynomials with the help of the lower symbol.
Proposition A.4. If $ H = N \ \textrm {P}\Big (\tfrac {2}{N} \mathbf {S} \Big )$ on $ \mathbb {C}^{2J+1} $ with a noncommuting self-adjoint polynomial P, then there is some $ C < \infty $ , which is independent of $ J \leq N/2 $ such that
The proof, which is spelled out at the end of this subsection, will rest on two preparatory lemmas. The first lemma deals with operators of the type $f(2S_z/N)$ .
Lemma A.5. If $ H = N f(2 \mathbf {v} \cdot \mathbf {S} /N)$ on $ \mathbb {C}^{2J+1} $ for some $f \in C^2$ , then there is some $ C < \infty $ , which is independent of $ J \leq N/2 $ , such that for all unit vectors $ \mathbf {v} \in S^2 $ and $ N $ , $ J \leq N/2 $ :
Proof. We first reduce the assertion to the case $ \mathbf {v} = \mathbf {e}_z $ . If $\Omega $ stands for the spherical angles of $\mathbf {v} = \mathbf {e}(\Omega ) $ , we have $ U(\Omega )^* (\mathbf {v}\cdot \mathbf {S} )U(\Omega ) = S_z $ with the unitary from Equation (1.3). Similarly, by the definition of the coherent states, one easily arrives at
for any continuous function f.
In order to establish the claim in case $ \mathbf {v} = \mathbf {e}_z $ , we first show that
s diagonal in the orthonormal basis $|k \rangle $ with $k \in \{ -J,\ldots , J\}$ for which the operator $S_z$ is diagonal. Inserting the explicit expression (A.1) in
the $ \varphi $ -integration of the spherical angle $ \Omega = (\theta ,\varphi )$ immediately yield $ \langle k | H^{\prime } | k^{\prime } \rangle =0 $ if $k \neq k^{\prime }$ .
It remains to control the diagonal elements of $H^{\prime }$ for which we fix $ k =k^{\prime }$ in the above integral. By a standard Taylor approximation, we have
In particular, we have
To estimate the right side, we make use of the explicit operator representations (A.6) for $ S_z $ and Equation (A.7) which immediately yields
with some numerical constant C.
Our second ingredient is an algebraic result, which allows to write any homogeneous polynomial of degree d as sum of linear forms to the power d:
Lemma A.6. Let $Q_{\text {hom}}^d(x_1,\ldots ,x_k)$ be the real vector space of homogeneous polynomials of degree d in the k variables $x_1,\ldots x_k$ . Then,
We remark that this a commutative result in the sense that we distinguish between the order of variables. For example, the polynomials $x_1 x_2$ and $x_2 x_1$ are considered to be the same.
Proof. It is clear that
is a closed subspace of $Q_{\text {hom}}^d(x_1,\ldots ,x_k)$ . Let
. Note that $f_{\alpha } - f_{\alpha ^{\prime }} \in W^d(x_1,\ldots ,x_k)$ , and since $W^d(x_1,\ldots ,x_k)$ is closed, we see that
Similarly, partial derivatives of higher order are element of $W^d(x_1,\ldots ,x_k)$ ; in particular, the d-th order derivatives which agree with the monomials of degree d. Thus, $W^d(x_1,\ldots ,x_k)$ contains all monomials of degree d and, hence, coincides with $Q_{\text {hom}}^d(x_1,\ldots ,x_k)$ .
We are finally ready to spell out the following.
A.3 Semiclassics for the free energy
Proposition A.7. For a Hamiltonian $ H $ of the form (2.1) on $ \mathcal {H}_N $ with a regular symbol $ h \in C^2$ satisfying A0, the pressure at inverse temperature $ \beta> 0 $ is
Proof. We first use the block decomposition (2.1) in the trace to write
By Equation (2.2), the logarithm of the trace in the right side bounded according to
We then use the Berezin–Lieb inequalities (1.6) and Proposition A.3, which ensures that the upper and lower symbol agree up to order one, to replace the last trace by an integral,
In order to use a standard Laplace evaluation for the above integral in the limit $ N \to \infty $ , we set $ J = r N/2 $ with fixed $ r \in (0,1] $ . Since $ h \in C^2 $ , we arrive at
The assertion then follows from the known asymptotics of the binomial coefficient in Equation (1.2) for $ M_{N,J} $ , that is, $ \frac {1}{N} \ln M_{N,J} = I(r) + o(1) $ .
Acknowledgements
We thank the referees for their suggestions to improve the manuscript.
Competing interests
The authors have no competing interest to declare.
Data availability statement
Not applicable.
Funding statement
Funded by the Deutsche Forschungsgemeinschaft (DFG) – TRR 352 Project-ID 470903074.
Ethical standards
The research meets all ethical guidelines, including adherence to the legal requirements of the study country.