1. Introduction
It has long been known that the bounce-averaged drift that a trapped particle experiences is central to both linear and nonlinear stability of gyrokinetic trapped-particle modes (Kadomtsev & Pogutse Reference Kadomtsev and Pogutse1967; Rosenbluth Reference Rosenbluth1968; Helander, Proll & Plunk Reference Helander, Proll and Plunk2013; Helander Reference Helander2017). This motion of trapped particles can serve as an energy source or sink for various instabilities, and thus their study is central to understanding their behaviour in any plasma-field scenario.
The behaviour of trapped particles depends crucially on the class of fields considered. In an effort to study stellarators, so-called omnigeneous fields (Hall & McNamara Reference Hall and McNamara1975; Bernardin, Moses & Tataronis Reference Bernardin, Moses and Tataronis1986; Cary & Shasharina Reference Cary and Shasharina1997; Landreman & Catto Reference Landreman and Catto2012; Helander Reference Helander2014) are of particular interest. In such fields, composed of nested flux surfaces on which field lines live, trapped particles have, by definition, a vanishing averaged drift in the direction normal to flux surfaces (i.e. radially). This restricts the dynamics of trapped particles to an average drift within flux surfaces, often referred to as precession and denoted by $\omega _\alpha$. Many authors have investigated the behaviour of this quantity (White & Chance Reference White and Chance1984; Roach, Connor & Janjua Reference Roach, Connor and Janjua1995), and analytic expressions exist for large-aspect ratio tokamaks with circular cross-sections and small non-axisymmetric perturbations (Connor, Hastie & Martin Reference Connor, Hastie and Martin1983; Hegna Reference Hegna2015). For general omnigeneous stellarators (and even more so, upon relaxing omnigeneity), expressions for $\omega _\alpha$ rarely allow for analytical calculation (Velasco et al. Reference Velasco, Calvo, Sánchez and Parra2023). This ends up impeding the dissection of the underlying physics and effects.
This paper carries out such calculations in a more general scenario. To make the problem tractable, we consider two main simplifications. First, we specialise to quasisymmetric (Boozer Reference Boozer1983; Nührenberg & Zille Reference Nührenberg and Zille1988; Rodriguez, Helander & Bhattacharjee Reference Rodriguez, Helander and Bhattacharjee2020) and axisymmetric fields with stellarator symmetry (Dewar & Hudson Reference Dewar and Hudson1998) and up-down symmetry, respectively, two special sub-classes of the wider class of omnigenous systems. The central feature of both of these classes is the symmetry of their magnetic field magnitude, $|\boldsymbol {B}|$. This reduces the complexity of the particle dynamics significantly, especially in the region close to the magnetic axis (the centremost field line of the field, around which flux surfaces accrue). This leads to the second simplifying consideration in this paper: the asymptotic description near the axis. In this near-axis approach, the field magnitude may be directly parametrised, and the framework developed by Garren & Boozer (Reference Garren and Boozer1991b) and Landreman & Sengupta (Reference Landreman and Sengupta2019) may be employed directly. Both these simplifying considerations enable an explicit description of the trapped particle motion, whose construction and interpretation we present in §§ 2 and 3.
Once the particle precession is known, we next investigate its role on trapped-particle mode stability. We do so by studying the available energy (Æ) of trapped electrons (Helander Reference Helander2020); that is, a measure of the available thermal energy that may be liberated by appropriate rearrangements of the electron distribution function. We compute Æ analytically in § 4, explicitly showing its dependence on various important parameters. This enables a direct comparison with other physically relevant properties such as MHD stability and flux surface shaping.
2. Asymptotic expression for the second adiabatic invariant
The description of the trapped particle precession requires the evaluation of the bounce-averaged drift around flux-surfaces, denoted as $\omega _\alpha$. To calculate such a quantity, we must begin by appropriately defining flux surfaces and a notion of direction over them. To this end, we first introduce the Clebsch representation (D'haeseleer et al. Reference D'haeseleer, Hitchon, Callen and Shohet2012) of the magnetic field; namely, $\boldsymbol {B}=\boldsymbol {\nabla }\psi \times \boldsymbol {\nabla }\alpha$, where $\psi$ is the magnetic toroidal flux divided by $2{\rm \pi}$ and $\alpha$ is an angular potential defined as $\alpha = \theta - \iota \varphi$. Here $\theta$ and $\varphi$ are straight-field line (D'haeseleer et al. Reference D'haeseleer, Hitchon, Callen and Shohet2012) poloidal and toroidal angular coordinates, respectively, and $\iota$ is the rotational transform. The flux surfaces are assumed to be nested and correspond to constant pressure surfaces, following a magnetic field that is in equilibrium, $\boldsymbol {j}\times \boldsymbol {B}=\boldsymbol {\nabla } p$. The angle $\alpha$ can be interpreted as a field line label within flux surfaces (following $\boldsymbol {B}\boldsymbol {\cdot }\boldsymbol {\nabla }\alpha =0$). Thus, we define the precession $\omega _\alpha$ as the rate at which trapped particles change the field line within a flux surface, formally,
This is the common bounce-averaged binormal drift (Kadomtsev & Pogutse Reference Kadomtsev and Pogutse1967; White & Chance Reference White and Chance1984; Helander Reference Helander2014). Here, the overline operator denotes a bounce-averaging: that is, a time average over the back-and-forth motion of the trapped particle along the field line (thus assuming a ‘thin-orbit’),
where $v_\parallel$ is the parallel velocity, the arc-length along a magnetic field-line is parametrised by $\ell$ and the domain of integration is taken to be a simply connected region that satisfies $v_\parallel (\ell ) \geq 0$ (which is typically referred to as a bounce well). This is an integral at constant $\psi$ and $\alpha$, but also particle energy $H$ and first moment $\mu$.
It is convenient to write the bounce-average drift in (2.1) in terms of derivatives of a single scalar quantity, $\mathcal {J}_\parallel$ (Helander Reference Helander2014). This scalar quantity is the so-called second adiabatic invariant,
and is an approximately conserved quantity of trapped particles. Importantly, this quantity serves as the Hamiltonian of the trapped-averaged dynamics of trapped particles, meaning, as can be shown explicitly (Helander Reference Helander2014; Calvo et al. Reference Calvo, Parra, Velasco and Alonso2017), that
Here $q$ is the charge of the particle (which we shall take to be $q=-1$ for electrons), and $\Delta \alpha$ and $\tau _b$ have been defined as the total $\alpha$-excursion and elapsed time in a particle bounce, respectively.
Because $\mathcal {J}_\parallel$ encodes the dynamics of trapped particles in a single scalar expression (and more generally, also allows one to calculate the radial drift), we shall explicitly calculate the asymptotic expression for $\mathcal {J}_\parallel$ as a first step towards finding $\omega _\alpha$.
2.1. Expanding the second adiabatic invariant
Let us write $\mathcal {J}_\parallel$ explicitly as a function of the field line following coordinate $\ell$ (where we have taken the particle mass $m=1$),
Here we have introduced the pitch-angle $\lambda = \mu B_0 / H$ (using for the first adiabatic invariant $\mu =(2H-v_\parallel ^2)/B$), which distinguishes between different trapped particles (deeper or more shallowly trapped), and we have normalised the magnetic field by some reference field strength $\hat {B} = B/B_0$. The integral is taken between bounce points, i.e. between points at which $\hat {B}=1/\lambda$. We are assuming there is no electric field within each flux surface and, as such, no electrostatic potential appears in (2.5).
It will prove convenient to express the field line following coordinate $\ell$ in terms of Boozer angles (Boozer Reference Boozer1981). We define for that purpose the helical angle,
where $N$ is an integer that defines a helical angle, foreseeing the application to quasisymmetric devices with a helical symmetry. Using this angular parametrisation, the Boozer-coordinate Jacobian $\mathcal {J}=B_\alpha (\psi )/B^2$, where $B_\alpha =G+\iota I$ (in the standard Boozer notation (Boozer Reference Boozer1981; Helander Reference Helander2014)), and defining $\bar {\iota }=\iota -N$, the second adiabatic invariant in these coordinates may now be written as
It is crucial to note that $\mathcal {J}_\parallel$ depends directly on the magnetic field magnitude along a field line, with minimal involvement of other geometric elements. This simplifies the calculation of $\mathcal {J}_\parallel$ ostensibly when considering quasi- and axisymmetric configurations, and makes them rather analogous.
To construct a representative $\hat {B}$, we resort to the inverse-coordinate near-axis framework (Garren & Boozer Reference Garren and Boozer1991b; Landreman & Sengupta Reference Landreman and Sengupta2019), in which the asymptotic form of $\hat {B}$ near the axis is rather simple. We will employ essential results from the near-axis theory of quasisymmetric equilibrium fields as needed without re-deriving them and refer to the literature for details (Garren & Boozer Reference Garren and Boozer1991b; Landreman & Sengupta Reference Landreman and Sengupta2019). That way, and to second order in the distance from the magnetic axis, $r=\sqrt {2\psi /B_0}$, we write $\hat {B}$, following Garren & Boozer (Reference Garren and Boozer1991b, (1)) or Landreman & Sengupta (Reference Landreman and Sengupta2019, (2.15)), and noting that this behaviour goes beyond the particular form of equilibrium assumed (Rodríguez, Sengupta & Bhattacharjee Reference Rodríguez, Sengupta and Bhattacharjee2022), as
where we have separated
and the second order
In the ideal quasi- or axisymmetric limit, all parameters ($\eta$, $B_{20}$, $B_{22}^C$ and $B_{22}^S$) are constants instead of functions of the toroidal angle $\varphi$, and for stellarator symmetry $B_{22}^S=0$. From here on, we shall assume that this condition is satisfied exactly. Note that in practice, this condition is only achieved approximately: the near-axis expansion generally fails to find exactly quasisymmetric solutions for equilibria at second order in $r$ (unless exactly axisymmetric fields are considered). This is commonly referred to as the Garren–Boozer overdetermination problem (Garren & Boozer Reference Garren and Boozer1991a), and results from a clash between the symmetry and the equilibrium (Rodriguez & Bhattacharjee Reference Rodriguez and Bhattacharjee2021; Rodríguez et al. Reference Rodríguez, Sengupta and Bhattacharjee2022). In that sense, the idealised framework is only an approximation, but it will prove useful and, as we shall see, practical in approximating quasisymmetric configurations. The constants that define $|\boldsymbol {B}|$ can then be interpreted as parameters that describe different configurations. In fact, with these parameters together with a magnetic axis shape, the field and flux surfaces may be constructed explicitly (Landreman & Sengupta Reference Landreman and Sengupta2019). Thus, expressing the dependence of the second adiabatic invariant on these parameters will provide a direct link to the configuration and its distinguishing features. A note of caution: although we are considering the asymptotic limit in the distance to the magnetic axis, we cannot approach it closer than the gyroradius-related ‘potato-orbit’ size (Helander & Sigmar Reference Helander and Sigmar2005, Ch. 7) without violating the ‘thin orbit limit’ of our $\mathcal {J}_\parallel$ calculation.Footnote 1
The way that we have grouped the terms in (2.8) might be unexpected, given that we have mixed asymptotically unequal terms: the constant on-axis field ($r=0$) with the first-order variation. However, since we are interested in trapped particles, variation in $|\boldsymbol {B}|$ along the field line is necessary, otherwise trapped particles would not exist. Therefore, in our perturbative description of the problem, we must take $\bar {B}$ to constitute the leading order magnetic field magnitude. It would then appear natural to write
where for every $\lambda$, the bounce points $\chi _b^\pm$ (left and right) of the integral are given by
and attempt to expand it in powers of $\delta B$ (i.e. expand around smallness of $\delta B$). There are however two important sources of conflict in doing so. First, and formally, evaluating $\sqrt {1-\lambda \bar {B}}$ at the bounce points $\chi _b^\pm$ can lead to imaginary contributions near these points without additional careful consideration of the bounce points. Second, physically, there are also issues related to the behaviour of classes such as barely trapped particles, which under an infinitesimal perturbation may undergo a finite (non-infinitesimal) change. This translates into diverging expressions in the perturbative construction.
To deal with these issues consistently, we start by defining a correction field $\delta B_b (\lambda )$, defined to be the perturbed field $\delta B(\chi )$ evaluated, for a given particle class $\lambda$, at the bouncing points, see (2.12). We shall assume, for simplicity, that the device is stellarator symmetric about the bottom of the magnetic well (i.e. we ignore the $B^S_{22}$ component), so that $\delta B_b (\lambda )$ is unique (i.e. it does not have left and right values). Introducing this correction, let us rewrite $\mathcal {J}_\parallel$ for a stellarator symmetric field in the following form:
so that $\mathcal {J}_\parallel =\lim _{\epsilon \rightarrow 1^-} \mathcal {I}(\epsilon )$. Expressing the integral in this form ensures that the integrand upholds positive definiteness for all $\epsilon \in [0,1)$, evading the issue of the integrand becoming imaginary. This furthermore circumvents the need to expand the bounce points of the integral. Expressed in this form, the integral may now be Taylor expanded in $\epsilon$,
where we shall take $\epsilon \rightarrow 1$ in the final answer so that $\mathcal {J}_\parallel \approx \mathcal {J}_\parallel ^{(0)}+\mathcal {J}_\parallel ^{(1)}$. If each of these terms is evaluated to the right order, we shall retrieve a consistent expression for the adiabatic invariant $\mathcal {J}_\parallel$.
2.2. Leading order expression
Let us start by investigating the leading order term of the expansion in $\epsilon$. Setting the expansion parameter $\epsilon$ to zero results in the following integral:
This integral is highly reminiscent of that occurring in the magnetic field of a large-aspect-ratio tokamak with circular cross-section, which has been considered by many authors before in various asymptotic regimes (Connor et al. Reference Connor, Hastie and Martin1983; Roach et al. Reference Roach, Connor and Janjua1995; Helander & Sigmar Reference Helander and Sigmar2005; Hegna Reference Hegna2015) and, as such, the derivation closely mirrors these calculations. One may refer to Appendix A for a complete derivation. The main step required is to re-express the integral in terms of a trapping parameter $k$, which we define as
where $\chi =0$ is defined to be the magnetic well minimum. The most deeply trapped particles reside here, and thus have $k^2 = 0$. The most shallowly trapped particles reside at the tops of the well, namely $\chi _b={\rm \pi}$, and thus correspond to $k^2 = 1$. The two trapped particle classes are connected monotonically, in the sense that $\lambda$ and $k$ maintain the order of trapped classes. With this definition, the integral may be expressed in terms of complete elliptic integrals of the first and second kind (also known as Legendre's integrals, see e.g. Olver et al. Reference Olver, Olde Daalhuis, Lozier, Schneider, Boisvert, Clark, Mille, Saunders, Cohl and McClain2020, § 19), which we define as
With these definitions, one can express $\mathcal {J}_\parallel ^{(0)}$ in closed form expanding $\mathcal {I}(0)$ around the smallness of $r$. The result is, to order $O(r^{5/2})$,
where the functions $I_1$ and $I_2$ describe the behaviour of different trapped-particle classes via $k$,
One can readily interpret the leading form of $\mathcal {J}_\parallel ^{(0)}$ by referring back to its basic form in terms of a parallel velocity and bounce distance, $\mathcal {J}_\parallel \sim v_\parallel \ell$. The scaling with $\sqrt {H r\eta }$ is directly related to the reduced parallel velocity that trapped particles must have for them to be trapped.Footnote 2 The factor $B_{\alpha 0}/\bar {\iota }_0B_0$ represents the changes in the ‘connection length’ along the magnetic field line. In terms of geometric quantities and using Ampére's law, one can estimate this length scale to be $B_{\alpha 0}/\bar {\iota }_0B_0\sim R_0/(\iota _0-N)$, where $R_0$ is the major radius of the device and $N$ represents the helicity of the $|\boldsymbol {B}|$ symmetry determined by the shape of the axis (Landreman & Sengupta Reference Landreman and Sengupta2019; Rodriguez, Sengupta & Bhattacharjee Reference Rodriguez, Sengupta and Bhattacharjee2022b). As the major radius increases, the total distance travelled by a trapped particle grows and so does $\mathcal {J}_\parallel$. Similarly, decreasing $\bar {\iota }$ increases the distance between bounce points, as field lines become further misaligned with respect to $|\boldsymbol {B}|$ contours. Finally, we observe that $I_1$, which describes the trapped-particle class dependence of $\mathcal {J}_\parallel$ to leading order, monotonically increases with $k$, as does the bounce distance. Under such a perspective, it is then clear that it must vanish for the deeply trapped particles (i.e. $I_1(k=0)=0$).
To provide a full description of $\mathcal {J}_\parallel$ to next order, it is important to note that although the expression we found is correct to order $O(r^{5/2})$, it does not correspond to the value of $\mathcal {J}_\parallel$ to that order. The expression is incomplete, as it is missing the contribution from $\mathcal {J}_\parallel ^{(1)}$.
2.3. The first-order correction
To evaluate the second-order term, we follow (2.14), for which we need the following integral:
The second term in the integrand is a factor $r$ smaller than the first one (which may be verified by writing expressions explicitly in terms of $k$) and hence, for our current expansion, only the first term needs to be taken into account. This term requires rewriting to involve the integration variable $\chi$ explicitly. Using the near-axis form of $|\boldsymbol {B}|$ for a quasi- and stellarator symmetric field, (2.10),
From this point, the procedure is analogous to the steps followed in the leading order case; the details are given, once again, in Appendix A. We simply denote the central result here, which is the expression for $\mathcal {J}_\parallel ^{(1)}$ expanded to leading order in $r$,
where the function $\mathcal {I}_{22}^C(k)$ is defined to be
Combining this result with (2.18), we may complete the asymptotic form of the second adiabatic invariant to order $O(r^{5/2})$,
3. Trapped particle precession
With the second adiabatic invariant constructed, we are in a position to evaluate the bounce-average precession. We remind ourselves that we considered the exact quasisymmetric limit and stellarator symmetryFootnote 3 (e.g. a tokamak with up-down symmetry) when constructing $\mathcal {J}_\parallel$. Because of the idealised omnigeneous nature of the field, we need not worry about the field-line dependence (i.e. $\alpha$ dependence), as the behaviour is ‘identical’ in all field lines as far as the precession is concerned. This is apparent in (2.24). The formalism presented could however be extended to incorporate a description of said $\alpha$ dependence upon controlled deviations from omnigeneity. We present in Appendix B an explicit estimation of the radial average drift in configurations that only achieve quasisymmetry approximately, providing a previously lacking physically meaningful measure of deviations from quasisymmetry within the near-axis framework, which could aid as an optimisation target (Landreman Reference Landreman2022; Rodriguez, Paul & Bhattacharjee Reference Rodriguez, Paul and Bhattacharjee2022a; Rodriguez et al. Reference Rodriguez, Sengupta and Bhattacharjee2022b).
Let us nevertheless return to the calculation of precession in our idealised scenario, (2.4). We have almost all that is needed to compute $\omega _\alpha$. The only remaining step is taking partial derivatives of (2.24) with respect to $\psi$ and the particle energy $H$. Acknowledging the dependence of the trapped particle label $k$ on both these variables, the result of this calculation may be written as
where the leading term scales like $1/r$ and $\omega _{\alpha,1}\sim O(r^0)$ (details of this derivation may be found in Appendix C).
The leading order term $\omega _{\alpha,0}$ is
which is precisely of the form found by Connor et al. (Reference Connor, Hastie and Martin1983) for a large-aspect-ratio tokamak, without magnetic shear or a pressure gradient. Elements of pressure and shaping are involved in the present approach (although not explicitly) through the next order correction, $\omega _{\alpha,1}$,
where the functions $\mathcal {G}$ may be expressed in terms of elliptic integrals,
3.1. Leading order precession: a tokamak-like behaviour
Let us start by analysing the leading order precession of trapped particles focusing on $\omega _{\alpha,0}$, (3.2). The expression includes physics in two ways: the overall scaling factors in front and the $k$ dependence through $G(k)$, which describes the behaviour of different classes of trapped particles. We first investigate the former.
Precession is proportional to $\eta$, the parameter defined in (2.9) as a measure of the leading order variation of $|\boldsymbol {B}|$ over flux surfaces. This variation is however intimately linked to the near-axis elliptical shaping of cross-sections (see Garren & Boozer Reference Garren and Boozer1991a; Landreman & Sengupta Reference Landreman and Sengupta2019; Rodríguez Reference Rodríguez2023). In fact, for up-down symmetric cross-sections, the elongation along the horizontal axis (i.e. ratio of horizontal to vertical axes of the cross-section) is $\mathcal {E}=(\eta /\kappa )^2$, where $\kappa$ is the curvature of the magnetic axis at the point where the cross-section is being assessed. Thus, for a fixed elongation, $\eta \sim \kappa$. In the special case of an axisymmetric field, this means that $\omega _\alpha \sim \sqrt {\mathcal {E}}/R_0$, where $R_0$ is the major radius. Going back to $\omega _{\alpha,0}$, for a fixed cross-sectional shape, increasing the major radius reduces precession, a consequence of the field becoming more straight and the gradients in $|\boldsymbol {B}|$ (and with it the drift) becoming smaller. In quasisymmetric fields, the local curvature of the axis defines a ‘major radius’, which leads to strongly curved magnetic axis shapes having increased precession. Quasi-helically symmetric fields require more strongly shaped magnetic axes (for a fixed average major radius) and thus will tend to have a stronger precession. This provides a qualitative separation between quasi-axisymmetric (QA) and quasi-helically symmetric (QH) stellarators (Rodriguez et al. (Reference Rodriguez, Sengupta and Bhattacharjee2022b), see some values of $\eta$ in table 1).Footnote 4
We finally take note of the divergent nature of $\omega _\alpha$ with the radial coordinate. The $1/r$ behaviour may initially appear worrying, but it can be easily understood in the following terms. From the form of the poloidal drift, we estimate $\boldsymbol {v}_D \boldsymbol {\cdot } \boldsymbol {\nabla } \theta \sim v_{\boldsymbol {\nabla } B} |\boldsymbol {\nabla } \theta |$, where $v_{\boldsymbol {\nabla } B}$ denotes the gradient drift driven by the radial variation of $B$. As $\boldsymbol {\nabla } \theta \sim 1/r$ and the gradient drift does not scale with $r$ to leading order, the result is the $1/ r$ dependence (the result of a finite drift velocity over an ever shrinking surface).
We next shift our attention to the dependency on the trapped class dependence of (3.2). A plot of $G$ as a function of $k$ is presented in figure 1(b). The plot shows that for the vast majority of trapped particles, the drift of the electrons is positive. Physically, positive values imply that the trapped particles precess in the direction of the diamagnetic drift (see figure 1a), an important feature which will become relevant for the discussion on trapped particle modes later. This behaviour only changes for the barely trapped particle classes which end up spending a significant fraction of their bounce-time near the maximum of $|\boldsymbol {B}|$, where there is ‘good curvature’. The transition point occurs at $k_0$, where $G(k_0)=0$, roughly at $k_0\sim 0.9$. Such a class of particles is, to leading order, stationary. The existence of these groups of trapped particles precessing in opposite directions proves the impossibility of making quasisymmetric configurations exactly maximum-$\mathcal {J}$. This simply follows from the definition of maximum-$\mathcal {J}$ as the property of a field that guarantees $\partial _\psi \mathcal {J}_\parallel <0$ for all trapped particles, which in terms of the electron precession is equivalent to $\omega _\alpha (k)<0$ for all $k$. Of course, this is not to say that the behaviour of a quasisymmetric field cannot become closer to maximum-$\mathcal {J}$, as higher order shaping and equilibrium parameters modify the leading order behaviour above. However, one may not achieve it exactly everywhere and especially close to the axis. In practice, one may only get around this issue at a finite radius if the higher order contributions are strong enough.
Comparing this result against previous investigation, we find, as we already pointed out, the behaviour to be identical to that shown in the work by Connor et al. (Reference Connor, Hastie and Martin1983) for a large-aspect-ratio circular tokamak. That this, correspondence found in the more general quasisymmetric case should not come as a surprise, given the existing isomorphism between axisymmetric and quasisymmetric fields (Boozer Reference Boozer1983). We have, however, gained generality beyond circular-shaped cross-sections as $\eta$ allows for non-unity ellipticity. We also note that the asymptotic considerations here and in the literature are in many respects different. Many previous investigations have focused on employing radially local solutions to the MHD equation (see e.g. Mercier & Luc Reference Mercier and Luc1974; Miller et al. Reference Miller, Chu, Greene, Lin-Liu and Waltz1998; Hegna Reference Hegna2000) to discuss precession, which weakens the coupling between $|\boldsymbol {B}|$ and the geometry of the field that exists in the near-axis treatment. We take that additional coupling in the near-axis consideration to form part of a more globally consistent description of the field. This results in higher order effects showing quantitative differences (though qualitative trends are retained), as we shall now explore.
3.2. The higher order effects on precession
Let us now focus on the effect of second-order elements on precession. In the form presented in (3.3), the order $r$ correction on the leading order precession consists of three different terms: an ‘intrinsic’ term (purely a consequence of consistency of the field with its elliptical shape), a term that is proportional to $B_{20}$ (and thus the radial increase of the average $B$) and another proportional to $B_{22}^C$. The behaviour of each one of these terms with $k$ is illustrated in figure 2(a).
From these three contributions, that coming from $B_{20}$ (often called the magnetic well term) is the simplest: a positive $B_{20}$ pushes particles against the diamagnetic drift. That is, deeply trapped particles decrease their precession rate, while barely trapped ones precess even faster. This behaviour is a direct consequence of the influence of $B_{20}$ on the gradient $\boldsymbol {\nabla } B$. The magnetic well term reinforces the outwards magnetic field gradient everywhere, affecting all particles equally and in the direction opposed to the diamagnetic drift. More precisely, the drift $v_{\boldsymbol {\nabla } B}\sim \boldsymbol {B}\times \boldsymbol {\nabla }(1/B)$ is driven by the gradient of $1/B$ and, thus, it is the change in the gradient of $1/B$ that most directly affects precession. As $\partial _\psi (1/B) \sim \eta ^2/2-B_{20}$, this explains not only the $B_{20}$ contribution, but also the ‘intrinsic’ $\mathcal {G}$ one.
The direct effect of $B_{20}$ relates precession naturally to MHD stability. MHD stability of interchange modes is improved by the enhancement of the so-called magnetic well of the field (Greene Reference Greene1997), which corresponds in the near-axis limit to increasing $B_{20}$ (the radial derivative of the average $B$) (Landreman & Jorge Reference Landreman and Jorge2020; Kim, Jorge & Dorland Reference Kim, Jorge and Dorland2021; Rodríguez Reference Rodríguez2023). Thus, there is a natural synergy between improving MHD stability and making particles precess in the direction opposite to the diamagnetic drift. This behaviour, obvious from the $B_{20}$ dependence of $\omega _{\alpha,1}$, aligns with the general observation made in Helander (Reference Helander2014, § 3.7) relating the ‘averaged’ behaviour of precession over a flux surface and all particle classes to the magnetic well. However, the problem of MHD stability is more subtle, as precession is also affected by the variation of the magnetic field $B_{22}^C$ explicitly, and MHD stability is further influenced by pressure (as becomes clear when considering the Mercier criterion (Mercier Reference Mercier1962, Reference Mercier1974; Greene & Johnson Reference Greene and Johnson1962; Bauer, Betancourt & Garabedian Reference Bauer, Betancourt and Garabedian2012; Freidberg Reference Freidberg2014) to assess it). We shall revisit this relation on more solid grounds later.
Setting $B_{20}$ aside for now, the $B_{22}^C$ contribution to precession introduces a richer $k$ dependence than considered so far at this order. This is so because different trapped particles experience different modifications of the magnetic field through the $\chi$ dependence of $|\boldsymbol {B}|\sim B_{22}^C\cos 2\chi$. Naturally, both the most deeply and shallowly trapped particles, who live at $\chi =0,{\rm \pi}$, respectively, feel the same effect, marked by $\mathcal {G}_{22}^C(0)=\mathcal {G}_{22}^C(1)$. In between these classes, particles perform an average of the gradient over their bounce, leading to a maximum at $k \sim 0.8$.
The components $B_{22}^C$ and $B_{20}$ have proven especially convenient to describe the higher order effects on drifts. However, they do not provide a good sense for what the field looks like in terms of its geometry or its equilibrium. A physical discussion requires us to make this connection, which we shall do within the near-axis framework. To that end, we introduce the pressure gradient supported by the field as $p_2=(B_0\,\mathrm {d}p/\mathrm {d}\psi |_{\psi =0})/2$, as well as the triangularity of cross-sections (normalised by $r$)Footnote 5 as $\delta$. These two parameters can substitute $B_{22}^C$ and $B_{20}$ to write the precession explicitly in terms of $p_2$ and $\delta$.
The details of this linear mapping between parameters were presented by Rodríguez (Reference Rodríguez2023). We include in this paper only the key elements necessary to make that connection. This is particularly important to make sense of what $\delta$ truly represents. For an up-down symmetric cross-section, we define triangularity as the relative displacement of the vertical turning points of a cross-section with respect to its centre-point along the symmetry line normalised to its width (Rodríguez Reference Rodríguez2023, Appendix C), and $\delta$, as its asymptotic form in $r$, normalised by $r$. For simplicity, we define this triangularity in the near-axis, Frenet–Serret frame. This makes the regular notion of triangularity in the lab frame (i.e. the triangularity of the cross-section at a constant cylindrical angle) generally different by an offset when the magnetic axis is not perpendicular to a constant cylindrical angle plane (see some details in Appendix D). However, by changing $\delta$, we are changing triangularity in the lab frame by the same amount, although $\delta$ is generally not the triangularity there. The axisymmetric case is an exception in which this correspondence is exact. We also pick the sign of triangularity to be defined with respect to the direction of curvature of the axis; i.e. positive triangularity, $\delta >0$, indicates a shift of the turning points in the direction of the curvature.Footnote 6
In the general quasisymmetric scenario, following this definition of $\delta$, there appears to be a multitude of ‘triangularities’. Each cross-section around the torus is generally different (but for an $N$-fold symmetry), but it is sufficient to describe the triangularity of any of its cross-sections to describe the field uniquely (given some choice of lower order parameters and a pressure gradient).Footnote 7 Once such a cross-section has been chosen, then one should interpret $\delta$ as a measure of its triangularity in the Frenet–Serret frame and any discussion about the effect of modifying triangularity should be interpreted as the effect of changing the triangularity of that very cross-section. We shall conveniently choose an up-down symmetric cross-section to represent the quasisymmetric stellarator and when it comes to analysing real configurations, we shall take the most vertically elongated one (often referred to as the bean-shaped cross-section Rodríguez Reference Rodríguez2023). We do so by analogy with the axisymmetric scenario. As this cross-section is changed, the remainder of the field must also change as a consequence of satisfying both equilibrium and quasisymmetry simultaneously.
In short, the pressure gradient and the triangularity as defined above provide sufficient information and a minimal second-order parametrisation for both axisymmetric and quasisymmetric configurations.
3.3. Relation to geometric and equilibrium parameters
Let us then proceed to write and analyse the precession of trapped particles $\omega _{\alpha,1}$ in terms of triangularity, $\delta$, and pressure gradient, $p_2$. The details of how to do so are presented in Appendix D and rely heavily on the work of Rodríguez (Reference Rodríguez2023). The result reads
The function $\mathcal {G}_{p_2}$ encodes the effect of the pressure gradient and $\mathcal {G}_\delta$ that of the triangularity, and their full explicit forms may be found in Appendix D. The function $\tilde {\mathcal {G}}$ is a complicated function of lower order quantities that we do not present explicitly and shall ignore for the analysis in this paper. For a discussion on the form and meaning of the other contributions, we specialise now to a generally shaped, up-down symmetric tokamak configuration.
In the axisymmetric limit, the functions become
using the definitions in (3.4). Here the parameter $\bar {\alpha }=(\eta R_0)^4=\mathcal {E}^2$ is the square of the elongation of the flux surfaces along the major radial direction. To arrive at such an expression, we used the relation $\bar {\iota }_0=2\sqrt {\bar {\alpha }}G_0I_2/B_0^2(1+\bar {\alpha })$, which holds true for a tokamak, whose rotational transform must be fully driven by a toroidal plasma current.
Let us analyse the behaviour of the finite pressure term first. It is clear from (3.6a) that $\mathcal {G}_{p_2}^\mathrm {axi}<-2$ for all $k$ and possible combinations of parameters, as $\mathcal {G}_{22}^C\geq -2$ and, therefore, $1+\mathcal {G}_{22}^C/4\geq 1/2$. This negative sign of $\mathcal {G}_{p_2}^\mathrm {axi}$ indicates that the usual peaked pressure profile (i.e. $p_2<0$ for the assumed $\mathrm {sgn}(\psi )=+1$) leads to an increase in precession in the direction opposite to the diamagnetic frequency; a direct consequence of the magnetic well term discussed in the previous section. A finite $\beta$ (at fixed triangularity) assists in making the behaviour of trapped particles more maximum-$\mathcal {J}$. This is a well-known effect (Rosenbluth & Sloan Reference Rosenbluth and Sloan1971; Connor et al. Reference Connor, Hastie and Martin1983), referred to as the diamagnetic effect of the toroidal field. In fact, we note that in the circular tokamak limit, the expression becomes almost identical to the result of Connor et al. (Reference Connor, Hastie and Martin1983), see the details in Appendix D. Although maximum-$\mathcal {J}$ is being approached by increasing $\beta$ and this is generally regarded as a positive effect, at an intermediate stage, particle precession reaches $\omega _\alpha \sim 0$ for many trapped particles. This slow precession scenario is generally associated with enhanced fast particle losses (especially of deeply trapped particles) whenever deviations from QS exist (Nemov et al. Reference Nemov, Kasilov, Kernbichler and Leitold2008; Bader et al. Reference Bader, Anderson, Drevlak, Faber, Hegna, Henneberg, Landreman, Schmitt, Suzuki and Ware2021; Velasco et al. Reference Velasco, Calvo, Mulas, Sánchez, Parra and Cappa2021).
Different trapped particles are affected differently by pressure as a consequence of the evolving Shafranov shift (Shafranov Reference Shafranov1963; Wesson Reference Wesson2011; Rodríguez Reference Rodríguez2023), which perturbs the field in a non-symmetric way. This underlying role of the Shafranov shift is clear from the contribution of the factor $f=B_0^2(1+\bar {\alpha })^2/R_0^2I_2^2(\bar {\alpha }+3)$, which shows an amplified effect of pressure for low currents (i.e. or low rotational transform), larger horizontal elongation or smaller major radius. All of these are known to increase the Shafranov-shift effect and will enhance the counter-precession of particles with respect to the diamagnetic drift (see figure 2).
Because, to leading order, deeply trapped particles co-rotate with the diamagnetic drift, we may estimate when the plasma $\beta$ is sufficient to reverse their direction. We interpret the resulting as the critical plasma $\beta$ for which the field becomes barely maximum-$\mathcal {J}$. Formally, this involves equating two different asymptotic orders, which goes against the very nature of the asymptotic treatment. One may nevertheless interpret this as an estimate of the precession at a ‘finite’ radius.Footnote 8 This shows that one may try to increase the maximum-$\mathcal {J}$ behaviour of a QS by enpowering some of the higher order contributions (pressure and other shaping). In practice, the effective radius in which the leading order is dominant may be small enough that we may refer to the field as maximum-$\mathcal {J}$. As shown in the examples of figure 4, this does not seem to be the natural tendency of QS fields and certainly is not asymptotically.
Focusing on the behaviour of $k=0$ (such deeply trapped particles are typically the least maximum-$\mathcal {J}$), $\omega _{\bar {\alpha },0}=H\eta /rB_0$ from (3.2) and for the pressure, we have $\omega _{\alpha,1}=-(H/B_0)\mu _0|p_2|(2+f)/B_0^2$. Equating the two, we find
For a parabolic pressure profile, $a^2 p_2 = -p_0$, where $a$ is the minor radius and $p_0$ the pressure on axis, one finds that the critical plasma $\beta$ on axis is
We thus see that the most susceptible fields are those with a large-aspect ratio, vertical elongation (small $\mathcal {E}$) and lower current (large $f$). As expected, these finite $\beta$ effects become more pronounced as we move away from the magnetic axis. For further illustration, consider the scenario of a circular-shaped tokamak with a representative safety factor of $q=2$ and aspect ratio $\sim 3$ for which, at the edge, $\beta _\mathrm {crit}\sim (a/R_0)/(1+q^2/2)\sim 11\,\%$. Reversing the precession of deeply trapped particles is thus predicted to require a significant plasma $\beta$.
The effects of triangularity are markedly more involved than those of pressure, which prevents us from writing a parameter-insensitive bound like we did for the effect of pressure (see figure 2). Depending on the value of elongation, $\bar {\alpha }=\mathcal {E}^2$, the precession due to triangularity will tend to be in one direction or the other, a behaviour that also changes depending on the particle class considered. There exists, though, a critical value of $\bar {\alpha }$ beyond which $\mathcal {G}_\delta ^{\mathrm {axi}}>0$ for all $k$. As $\mathcal {G}_{22}^C$ has a maximal value $\max (\mathcal {G}_{22}^C) \approx 1.1$, one can find that this critical point occurs at
Thus, for tokamaks that are horizontally elongated (beyond some $\sim 14\,\%$), negative triangularity tends to make all particles precess against the diamagnetic drift. This distinction regarding the effect of triangularity is reminiscent of the effects of triangularity on MHD stability. In that case and describing stability through the Mercier criterion Solov'ev & Shafranov Reference Solov'ev and Shafranov1970; Lortz & Nührenberg Reference Lortz and Nührenberg1978; Shafranov Reference Shafranov1983; Freidberg Reference Freidberg2014; Rodríguez Reference Rodríguez2023), one can show that for $\bar {\alpha }>\bar {\alpha }_\mathrm {MHD}=1$, negative triangularity contributes positively to stability. Thus, MHD stability seems to align with precession against the diamagnetic drift, at least for sufficiently horizontally elongated configurations. That is, there is some synergy, which is the opposite to the changes due to plasma-$\beta$.
Most commonly, however, most tokamak fields are vertically elongated and thus have $\bar {\alpha }<1<\bar {\alpha }_\mathrm {crit}$. In that usual scenario, different trapped particles respond differently, some tending to precess in one direction and others in the opposite. The most deeply and barely trapped particles are a special case, as taking $k=0,~1$, $\mathcal {G}_\delta = 4 \bar {\alpha }/(3 + \bar {\alpha })>0$ has a maximum value (see figure 2). With a sign independent of elongation, one can conclude that positive-triangularity tokamaks will always tend to make deeply and shallowly trapped particles precess in the direction of the diamagnetic drift. Thus, only through negative triangularity can this shaping be used to reverse the behaviour of deeply trapped particles. In the vertically elongated regime, negative triangularity hampers MHD stability, thus opposing the tendency to improve the maximum-$\mathcal {J}$ behaviour. As noted in the plasma $\beta$ scenario, as precession of particles is reduced, fast-ion confinement can be negatively affected in an imperfect QS stellarator. The different behaviour of each trapped particle makes an assessment of the overall effect complex.
It would be appropriate, as we did to illustrate the effect of plasma $\beta$, to introduce the idea of a critical triangularity: a value of triangularity such that one expects precession of deeply trapped particles to leading order to be significantly affected, i.e. $r \delta \mathcal {G}_\delta (k)/2 \sim G(k)$ for $k=0$. Precisely as in the case of pressure,
The interpretation of $r\delta$ as triangularity of the cross-section in the Frenet frame of the magnetic axis (see Appendix D) is an asymptotic concept. As such, this geometric interpretation of $r\delta$ ceases to be accurate upon approaching unity (especially as a significant bean-shape is developed). This limit of large $r\delta$ is nevertheless a limit of strongly shaped flux surfaces, which could even describe surfaces that self-intersect or intersect with other flux surfaces (Landreman Reference Landreman2021; Rodríguez Reference Rodríguez2023). To make sense of whether $(r\delta )_\mathrm {crit}$ is feasible in practice, we should compare this measure to the maximum triangularity achievable without incurring into these geometric defects. The critical $r_c$ was defined by Landreman (Reference Landreman2021). In any case, (3.10) indicates that a very significant triangularity (of order 1) is needed to significantly affect precession of deeply trapped particles in a tokamak. Hence, we conclude that although triangular shaping may help in achieving the maximum-$\mathcal {J}$ property together with finite $\beta$ effects, achieving it via shaping alone is more difficult in tokamaks.
Before concluding this section, let us briefly consider the full quasisymmetric case, beyond the special case of axisymmetry, through some examples. Unlike in the axisymmetric case, this general scenario preserves a measure of symmetry-breaking of the geometry. The measure $\bar {F}$ (Rodríguez Reference Rodríguez2023), for which explicit expressions are presented in Appendix D, depends on the shape of the axis and modifies the effects of pressure and triangularity. Reminding ourselves that in such a scenario, $\delta$ represents the triangularity of the up-down symmetric cross-section with the largest binormal elongation, we show in figure 3 the values of $\mathcal {G}_\delta$ and $\mathcal {G}_{p_2}$ for a number of QS configurations.Footnote 9 Each segment consists of pairs $(\mathcal {G}_\delta,\mathcal {G}_{p_2})$ for different $k$ for the same configuration, taking the ideal QS limit of the configurations (which are only approximately so).
From the plot, it is clear that for those quasiymmetric configurations analysed, the effect of a finite pressure gradient is the same as in the axisymmetric limit (i.e. an increase in pasma $\beta$ leads to precession in the direction opposite to the diamagnetic drift). The effect of triangularity is analogous to a tokamak that is elongated in the horizontal direction, where negative triangularity pushes particles against the diamagnetic drift. From the MHD stability analysis of these configurations by Rodríguez (Reference Rodríguez2023), one recovers the synergy of horizontally elongated tokamaks for quasisymmetric stellarators. Thus, the behaviour is quite different from that of regularly shaped tokamaks.
An interpretation of the magnitudes of $\mathcal {G}_{\delta }$ and $\mathcal {G}_{p_2}$ may be provided by considering the critical $\beta$ and $r\delta$ once more. As in the tokamak scenario, at the edge of the configuration, $\beta _\mathrm {crit}\sim 2a|\eta |/\mathcal {G}_{p_2}(0)$ and $(r\delta )_\mathrm {crit}\sim 2/\mathcal {G}_\delta$. A complete table for the configurations in figure 3 is included in Appendix E. We note here that in QA configurations, $\beta _\mathrm {crit}\sim 5\,\%$, while QH stellarators generally exhibit a more resilient behaviour with $\beta _\mathrm {crit}\sim 10\unicode{x2013}15\,\%$. Reversing the precession of deeply trapped particles via finite $\beta$ effects thus appears to be a possibility most easily in QA configurations. Given the simple form of $(r\delta )_\mathrm {crit}$, it is straightforward to see that $O(1)$ triangularity is generally required to observe a significant effect. In many of these configurations, such values are indeed achievable without incurring in forbidden flux surface shapes (see Appendix E).
3.4. Verification of the expansion
In the preceding sections, we investigated the analytical behaviour of the trapped particle precession. We derived these under two important assumptions: (i) exact quasisymmetry (or symmetry) of the fields and (ii) the near-axis expansion. It is thus natural to wonder how close trapped particle precession is to the idealised limit in realistic configurations, as these assumptions become increasingly less accurate. We check this through three numerical examples, in which we compare the bounce-averaged drift computed from a global MHD code (in this case, VMEC (Hirshman & Whitson Reference Hirshman and Whitson1983), using numerical methods detailed by Mackenbach et al. Reference Mackenbach, Duff, Gerard, Proll, Helander and Hegna2023a) against the analytical results. For this practical comparison, we employ the definition of $k$ in terms of the bounce point, (2.16), which we compute for the numerical precession calculation. Note that upon significant deviations from QS, this choice ceases to be convenient, especially when there exist differently shaped wells within a flux surface.Footnote 10
This comparison is shown in figure 4, where the bounce-averaged precession is plotted as a function of $k$ and for two radial locations, $\varrho \stackrel {\cdot }{=} \sqrt {\psi /\psi _\mathrm {edge}}$. The correspondence is excellent for all displayed $\varrho$ in the precise quasisymmetric configurations, recently presented by Landreman & Paul (Reference Landreman and Paul2022), which are characterised for having an excellent degree of quasisymmetry. The theory works remarkably well even at larger radii, where one would expect the near-axis expansion to falter, although this near-axis nature of the configurations had been previously noticed (Rodriguez Reference Rodriguez2022) and could be a feature necessary to achieve excellent global quasisymmetry. This numerical comparison evidences that the second-order calculation is also appropriate. For HSX (Anderson et al. Reference Anderson, Almagri, Anderson, Matthews, Talmadge and Shohet1995), we see more significant deviations from the analytical result. This is mainly a consequence of the breaking of QS (for a naive fitting of its near-axis behaviour, $B_{20}$ variations are $\sim 4$), and significant shaping beyond the near-axis behaviour (see $\varrho =1$). Although the trends and magnitude seem correct, there are clear deviations from the idealised limit.
4. Energy available for trapped particle modes
The preceding study of particle precession was strongly motivated by its central role in driving trapped-particle modes (TPMs). In essence, TPM turbulence arises driven by the trapped-particle precession when it resonates and destabilises the diamagnetic drift wave. Simply put, whenever the trapped particles co-drift with the diamagnetic drift wave, energy may be transferred to the drift wave, driving the instability thereof. As we have learnt, a special feature of axisymmetric and quasisymmetric fields is that $\omega _\alpha$ has, to leading order, a zero crossing. That is, there always exists a subgroup of trapped particles (which includes deeply trapped particles) that co-rotate with the drift wave and thus potentially drive TPMs. To assess how the size of this group and the magnitude of its precession feeds TPMs, a more qualitative treatment is necessary. To perform such an analysis, we delve into the stability of TPMs by studying the available energy of the trapped particles (Helander Reference Helander2017, Reference Helander2020).
Available energy (Æ) is an upper bound on the free energy available to the plasma after a constrained rearrangement of the distribution function (called Gardner restacking, see Kolmes, Helander & Fisch Reference Kolmes, Helander and Fisch2020), rearrangement that needs to conserve certain dynamical quantities like $\mathcal {J}_\parallel$. Restricting ourselves to the available energy contained in trapped particles, one obtains a proxy measure of nonlinear turbulent activity of TPMs (and upon specialising to electrons, TEMs Proll et al. Reference Proll, Helander, Connor and Plunk2012). This is, nevertheless, a simplified description of the turbulence, in the best of cases a bound, which does not include a discussion of accessibility. That is, although energy may be available, it might not be possible for a system to evolve to such lower energy state and access the available energy, thus the turbulence activity would be over-estimated. The Æ measure is nevertheless useful (Mackenbach, Proll & Helander Reference Mackenbach, Proll and Helander2022) and it provides insight into TPMs.
The calculation of available energy for trapped electrons in a flux tube was recently presented by Mackenbach et al. (Reference Mackenbach, Proll and Helander2022, Reference Mackenbach, Proll, Wakelkamp and Helander2023c). For a flux tube of length $L$ and elliptical cross-section $\Delta \alpha$ and $\Delta \psi$ in $(\psi,\alpha )$-space (principal axes), the available energy in an omnigeneous ($\omega _\psi =0$) field may be written as
where $\hat {G}^{1/2}=\int (1 - \lambda \hat {B})^{-1/2} \,\mathrm {d} \ell /L$ is the normalised bounce time, $\mathcal {R}$ is the ramp function (indicating that only faster than the diamagnetic drift co-precessing particles contribute to $A$) and the hats denote normalisation of the frequencies $\hat {\omega }=\Delta \psi ~\omega /H$. The integral is performed over $z=H/T$ and $\lambda$, the combination of which constitute all trapped particle energies and classes (the exponential in the integrand is a consequence of the chosen distribution function of which the Æ is to be calculated, a Maxwellian). The sum over wells simply indicates that the available energy is the result of adding the contributions from every well along the flux tube, as trapped particles may be rearranged within each.
Of course, the amount of energy available depends on the size of the flux tube considered; the measure is extensive. To construct an intensive measure, we normalise it to the total thermal energy available in the tube. The total thermal energy in the flux tube for a plasma of temperature $T_0$ and density $n_0$ is (using $\boldsymbol {B}\boldsymbol {\cdot }\boldsymbol {\nabla }\ell =B$)
Hence, the normalised Æ becomes
where the normalising factor in front will henceforth be succinctly referred to as $\mathcal {V}=2\sqrt {{\rm \pi} }\int \mathrm {d}\ell /L\hat {B}$. To make further progress, we realise that the integral over $z$ is analytically tractable if one splits up $\hat {\omega }_*^T$ as
where $\hat {\omega }_{*,z}^T=-\Delta \psi \partial _\psi \ln T$ and $\hat {\omega }_{\star,0}^T=-\Delta \psi (\partial _\psi \ln n-\tfrac {3}{2}\partial _\psi \ln T)$. To ease the calculation (although it may be extended to the more general case), we shall take the temperature gradient to be zero and consider the limit of a peaked density profile (i.e. $\partial _\psi \ln n$ is negative for $\psi >0$); we are specialising to density-driven trapped-particle instabilities. This assumption makes the diamagnetic drift $\omega _\star$ particle-energy independent, leading to
where
and the $\varTheta$ function is a Heaviside function that vanishes for $\omega _\alpha (\lambda )<0$, which physically represents the inability of counter-rotating particles to drive the TPM.
The function $\mathcal {F}$, see figure 5, may be interpreted as a measure of the coupling of different particle classes to the available energy (ignoring further contributions from the normalised bounce time). With the energy dependence averaged out after the integral over $z$, the comparison between the diamagnetic drift and precession is captured by $c_1$. Both trapped particles that are drifting too fast (i.e. $|c_1| \ll 1$) and which are drifting too slow (i.e. $|c_1| \gg 1$) as compared to the drift wave have a vanishing contribution to the Æ, as $\mathcal {F}\rightarrow 0$. This is a formal statement of the resonance requirement for an effective drive of the drift-wave instability, where the coupling is largest at $c_1 \approx 3.9$.
To proceed further, and since the expressions for the precession derived in the preceding section depend on the trapping parameter $k$ explicitly, it will be natural to write Æ in (4.5) as an integral over $k$.
4.1. Leading order form of Æ
Let us begin the asymptotic analysis by considering the first-order expression of the trapped-particle precession $\omega _\alpha \approx \omega _{\alpha,0}(k)$, defined in (3.2). To perform the integral in (4.5), we need a number of ingredients. First of all, we must consider the integral only over trapped-particle classes that co-rotate with $\hat {\omega }_{*,0}^T$ (i.e. the range allowed by the Heaviside). The domain of integration thus runs from the most deeply trapped particles to the transition point of $\omega _{\alpha,0}$, i.e. $k \in [0,k_0]$ (with the definition of $k_0|\omega _{\alpha,0}=0$ from before).
The second ingredient that naturally arises is then the need to express the integration variable $\lambda$ in terms of $k$. The presence of $c_1=\hat {\omega }_{*,0}^T/\hat {\omega }_{\alpha,0}(k)$ as a function of $k$ inside $\mathcal {F}$ makes the integral highly nested and, thus, appears to make it difficult to approach analytically. However, given the form of the precession, $c_1$ is asymptotically small in the sense that $c_1\sim r$. This appears to offer a way to proceed by expanding $\mathcal {F}$ in the small $c_1$ limit. However, that would be wrong, as it would neglect the most important contribution to Æ. Recall that the particle precession vanishes at $k_0$ and, thus, near this value of $k$, the function $c_1$ cannot be considered small. This resonant behaviour is, in the asymptotic limit, the principal contributor. The integral is significant only in a narrow region $\Delta k\sim r$, close to $k_0$, where $c_1 \sim O(1)$ (see a clarifying sketch in figure 6). This teaches us that in this asymptotic limit, the most important class of particles are those with relatively small precession, as in this limit, $\omega _\alpha$ tends to be much larger than the diamagnetic drift. The evaluation of the integral may then be carried out analytically considering a local approximation of the integrand in this contributing narrow band (correct down to a correction $O(r^{1/2})$ on the leading contribution), the details of which are presented in Appendix F.
Before writing the result for the Æ out, one last consideration is required. In this case, one needs to make an explicit assumption regarding the width of the flux tube $\Delta \psi$, on which the Æ will depend. This width should be interpreted as the ‘length’ over which density gradients may be flattened by the turbulence to extract energy. Following the steps taken by Mackenbach et al. (Reference Mackenbach, Proll and Helander2022, Reference Mackenbach, Proll, Wakelkamp and Helander2023c), we estimate such a flattening length scale to be the correlation length and to be of the order of the Larmor radius $\rho$. As such, we write
where $C_r$ may formally be dependent on other equilibrium parameters (e.g. the rotational transform $\iota$ or the flux expansion $|\boldsymbol {\nabla }\psi |$). We shall nevertheless take $C_r$ to be a constant, assuming that the flattening length scale providing free energy to the TPMs is simply proportional to the Larmor radius.
It is customary to define a minor radius $a$ for the field configuration so that the radial coordinate may be normalised as $\varrho = r/a$. This way, we define the density gradient $\hat {\omega }_n \equiv - a \partial _r \ln n = - \partial _\varrho \ln n$, which scales like $\varrho$ for a quadratic (in $\varrho$) density profile. In terms of these variables, the Æ becomes
where the common gyrokinetic expansion parameter is defined as $\rho _* \stackrel {\cdot }{=} \rho /a$.
The leading order expression for $\hat {A}_{w}$ includes important information regarding the behaviour of the available energy near the axis. We highlight various scalings here.
(i) One observes a strong scaling with the distance from the axis $\varrho$, whose origin may be presented in simple terms as follows (see the integral expression in (4.5) for reference). One factor of $\varrho ^{1/2}$ may be accounted for due to the trapping fraction of particles, which leaves one with an overall $\varrho ^3$ scaling. In this scenario, the energy scale is set by the diamagnetic drift (as only precessing particles with speeds analogous to those of the diamagnetic drift contribute to the available energy), which goes like $\hat {\omega }_n\propto \varrho$ near the axis. Thus, two powers of $\varrho$ can be argued to come from the kinetic drive of the diamagnetic drift. The final power of $\varrho$ comes from the small fraction of trapped particles that contribute to the available energy, as the band near $k_0$ scales with $\varrho$.
(ii) Another scaling of interest in (4.8) is its dependency on the stellarator shaping parameter $\eta$. Increased $\eta$ leads to a more restricted access to energy and, thus, a reduced TPM activity (as measured by Æ). Thus, horizontally elongated shapes would seem to favour stability and, in the context of quasisymmetric stellarators, quasi-helically symmetric configurations. In tokamak terms, $a\eta \sim a\sqrt {\mathcal {E}}/R_0\sim \varepsilon \sqrt {\mathcal {E}}$, where $\varepsilon$ is the inverse aspect ratio. Thus, $\hat {A}_{w} \sim 1/(\mathcal {E}^{1/4}\sqrt {\varepsilon })$. Hence, the available energy increases with aspect ratio keeping the minor radius fixed. This dependency becomes even stronger if one chooses $\rho _* \varepsilon = \rho /R_0$ to be constant.
(iii) One finally observes a scaling with $(C_r \rho _*)^2$, which is the square of the assumed length scale over which energy is available. Importantly, we note that an implicit magnetic-field-strength dependency arises here (for fixed minor radius and $C_r$), as $\rho \sim 1/B_0$. Hence, in terms of Æ, it is beneficial to have a strong magnetic field so that the length scale over which energy is available decreases as $\hat {A}_{w} \sim 1/B_0^2$.
As noted before, a full interpretation of these scalings for a comparison between different configurations would have to take into account the form of $C_r$ that most closely describes the volume that is accessible to the rearrangement of energy. This may be particularly important when proceeding to a comparison between highly different configurations. The normalisation with $C_r$ being a constant appears to provide a reasonable description in Æ as a measure of heat transport in practice (see Mackenbach et al. Reference Mackenbach, Proll and Helander2022).Footnote 11
4.2. A strongly driven finite-radius asymptotic regime
In the derivation above, it was key to consider the contribution to available energy from a narrow band of trapped particles with ‘slow’ precession. As such, the particular form of the expression in (4.8) is only valid in the regime where $\omega _{\star,0}^T/ \omega _\alpha$ can be considered small, that is, when the trapped particle drifts are (as a group) much faster than the diamagnetic drift. As these roles revert, either because the turbulence becomes strongly driven (i.e. large density gradient) or the precession diminishes, the ‘weak’ approach presented before will cease to provide us with a good approximation to the available energy.
As the precession becomes smaller relative to the diamagnetic drift, we however find another tractable limit, which we refer to as the ‘strongly driven’ limit. That is, we still consider a near-axis description of the magnetic field and precession, but at the same time, order the diamagnetic drift to be large, i.e. vigorously driven.Footnote 12 Although this might appear inconsistent, it is not, as any ordering assumption about $\omega _n$ only affects the evaluation of the Æ integral, but not the particle precession itself. From the set-up, it should be clear that this ‘strongly driven’ regime gains relevance away from the magnetic axis, where the precession of trapped electrons decreases and the diamagnetic drift increases.Footnote 13
In this new scenario, the integral for available energy may be recomputed (see Appendix F) using standard methods, as there no longer exists a narrow contributing band (see figure 7). All in all, one finds that the integral reduces to
A different regime brings a different scaling with $\varrho$ and $\hat {\omega }_n$, in both cases with weaker dependencies than in the narrow-band regime. These changes are a result of: (i) the particle precession that serves as energy source contributing directly to Æ, and thus introducing a $1/\varrho \hat {\omega }_n$ factor compared with the weak regime (simply because, on average, particles do not quite reach the diamagnetic drift) and (ii) the contributing trapped particle fraction corresponding to the whole population with a positive precession, which no longer is a narrow band and thus does not contribute with an additional $\hat {\omega }_n\varrho$ factor. Importantly, the scaling with $\eta$ inverts compared with the weak regime, $\hat {A}_{w}$. Using the same tokamak estimation for $\eta$ as there, one finds $\hat {A}_{s} \sim \varepsilon ^{3/2} \mathcal {E}^{3/4}$, suggesting that a large-aspect-ratio tokamak which is vertically elongated reduces Æ. Once again, this is under the assumption of keeping the minor radius $a$ fixed. The behaviour will change depending on which features of the equilibrium are kept constant.
The existence of these two different regimes of available energy naturally defines a transition. One can calculate this transition point by equating $\hat {A}_{w}\approx \hat {A}_{s}$, denoting the ‘critical’ transition gradient as $-a \partial _r \ln n|_\mathrm {crit} = a / L_{n}|_\mathrm {crit}$. We find
For a quadratic density profile ($\hat {\omega }_n/\varrho \approx 2$), the radial position $\varrho$ at which this critical transition occurs is
Using some typical tokamak values, we estimate $a \eta \sim \varepsilon \sqrt {\mathcal {E}} \sim 0.3$ and thus $\varrho _\mathrm {crit} \sim 0.2$. Hence, in a standard tokamak, one can transition to the strongly driven regime fairly close to the axis and we expect the strongly driven regime description to be most suitable for most of its volume.
We conclude this leading order Æ analysis by presenting a numerical verification in figure 8, where we compare both asymptotic regimes in one device and the weakly asymptotic regime across multiple devices.Footnote 14 We observe excellent agreement in the asymptotic behaviour between the found results and the numerical calculation.
4.3. Additional dependence of Æ
To learn anything about how triangularity of flux surfaces and pressure may affect this available energy, and thus how TPMs may be affected by them, we need to proceed to higher order in the calculation of $\hat {A}$. We show how to do this to obtain the dependence of $\hat {A}$ on $p_2$ and $\delta$ to leading order at the end of Appendix F.Footnote 15 After such considerations, we may write $\hat {A}\approx \hat {A}_0+\hat {A}_1+\ldots$, where $\hat {A}_0$ is the leading order expression and $\hat {A}_1$ the pressure and triangularity dependent piece. As before, for the discussion in the text, we specialise to the generally shaped up-down symmetric tokamak. Full expressions that apply to the quasisymmetric case may be found in Appendix F.
In the weakly driven regime, one finds
where $\mathcal {R}_{20}\approx 1.37$.
It follows from this that, regardless of the choice of parameters, increasing the pressure gradient always leads to an increase in the available energy. Note that this is the case even if the density gradient, here controlled by the diamagnetic frequency $\hat {\omega }_n$, is fixed.Footnote 16 To picture what is happening, we resort to the discussion on precession presented before. As we increase the pressure gradient, we learnt that the trapped-electron precession goes against the diamagnetic drift, which means that the trapped population is brought further away from resonance. However, from this behaviour, one would expect the drive of the instability to decrease and with it, Æ to do so. So, how is getting further away from the diamagnetic resonance making things worse?
To figure this out, it is illuminating to formally assess the origin of the sign of $\mathcal {R}_{20}$. The sign is, to a large extent, a result of the correction to the integral measure $\mathrm {d}k/\mathrm {d}c_1$ needed when writing the Æ integral in $c_1$ (as it is necessary for the weak regime calculation, see Appendix F). This piece of the integral measures the number of trapped particles that exist with a precession that is similar to the resonant one. The question is then how this population fraction changes as the precession slows down. The answer is that the population that has a near-vanishing precession grows, as most directly seen in the smaller gradient of $\omega _\alpha$ with $k$ (see figure 1b). Because this fraction of the population is the only one that may contribute to the total available energy, the result is the increase of Æ with plasma $\beta$. This is an important feature of available energy, which not only assigns value to the magnitude of $\omega _\alpha (\omega _{*,0}^T-\omega _\alpha )$, but also to the number of particles with a particular value for its precession. As a consequence, we expect this behaviour to change in the strongly driven regime, which we will visit later.
The effect of triangularity, $\delta$, in (4.12) depends critically on whether cross-sections are elongated vertically or horizontally, as we saw MHD stability to do in the preceding discussion. In the most common case of vertically elongated cross-sections, negative triangularity is seen to reduce the Æ (which increases the precession of the trapped-particle class at $k_0$). Thus, the effect is not synergistic with MHD stability, as triangularity has precisely the opposite effect there. This anti-correlation holds also in the more general case of quasisymmetric configurations, which is readily seen by comparing (F26) for $\mathcal {R}_\delta$ directly to Rodríguez (Reference Rodríguez2023, (4.2)) for $\mathcal {T}_\delta$. This intimate relation between MHD stability and what may be interpreted as TPM activity has been observed on many occasions (in fact, could be interpreted as the driver for many reverse triangularity studies in advance tokamak scenarios). Here we have formally proven in the weak asymptotic regime that a compromise between the two properties is needed in this regime. This opposed behaviour is not shared by plasma $\beta$, which acts in a detrimental form on both MHD and TPM activity.
Performing a similar analysis in the strongly driven regime, we find $\hat {A}_1/\hat {A}_0|_{\mathrm {strong}}\approx -2.85\hat {A}_1/\hat {A}_0|_{\mathrm {weak}}$, which presents the opposite sign to the weak regime. That is, an increased plasma $\beta$ (in the form of pressure gradient) always decreases the Æ and in a standard tokamak with $\bar {\alpha }<1$, positive triangularity becomes TEM-stabilising. In the strongly driven regime, reducing precession brings the zero-crossing point closer to $k=0$, thus reducing the total $k$-space available to drive TEMs. In that limit, with both precession and accessible population decreasing with increased pressure gradient and positive triangularity, we expect available energy to decrease and, regarding fast particle confinement due to non-QS behaviour, to momentarily grow before eventually decreasing as precession grows in the direction of maximum-$\mathcal {J}$. The details will depend on how different trapped particles are affected and how important is their contribution to confinement. Unlike in the weak regime, the whole trapped population becomes important and not just a narrow band, figure 6. In the strongly driven regime, there is a synergy between MHD stability and TEM activity with respect to the triangular shaping of cross-sections, but opposed in the effect of plasma $\beta$. The expected behaviour of a stellarator will thus depend critically on the particular regime considered.
In addition to the sign, there is also a difference in magnitude of roughly a factor $3$ between the relative effects of triangularity and plasma $\beta$ in the strong regime compared with the weak regime. For the discussion following, we focus on the strongly driven regime. This effect can be quantified as we did in the discussion of precession, which we do as follows. When the first-order correction significantly affects the available energy, i.e. $\hat {A}_1/\hat {A}_0\sim 1$, we state that we have a critical scenario. At the edge, the critical $\beta$ becomes $\beta _\mathrm {crit}^{\unicode{x00C6}}\sim 2a|\eta |/\mathcal {R}_{20}(1+f)$, with $f$ as defined before (with its QS generalisation, which may be found in (D5a)). This shows that plasma $\beta$ becomes effective in significantly changing Æ in the strong regime for QAs in the regime of $\beta _\mathrm {crit}^{\unicode{x00C6}}\sim 2\unicode{x2013}3\,\%$, while QH $\beta _\mathrm {crit}^{{\unicode{x00C6}} }\sim 4\unicode{x2013}7\,\%$. Finite $\beta$ QA equilibria appear, therefore, to significantly affect the behaviour of TPM-like behaviour, while QHs remain more resilient, as expected from the behaviour of precession. As far as triangularity is concerned, the expression in (3.10) may be used for Æ with $\mathcal {G}_\delta =3[(3+\bar {\alpha })-(\bar {\alpha }+1)\bar {F}]/[(1-\bar {\alpha })+(1+\bar {\alpha })\bar {F}]$ there. The values for QS configurations may be found in Appendix E in the range $(r\delta )_\mathrm {crit}\sim 0.4\unicode{x2013}1.5$, which is a significant triangularity, nevertheless consistently achievable in many configurations (see Appendix E). Note that in the circular tokamak limit, triangularity has no effect on Æ (only a very small one in the weak regime from $\mathcal {R}_{22}^C$).
Given the changes in the weak and the strong regime, the critical transition gradient also changes and may be computed by
This means that the critical gradient decreases for an increased pressure gradient (as the factor multiplying the pressure is always positive) and increases for negative triangularity tokamaks which are vertically elongated.
To close the paper, we make a comparative study of the insight and results presented in this paper with the literature. The comparison to a recent numerical analysis of the Æ for tokamak equilibria described by a Miller (Miller et al. Reference Miller, Chu, Greene, Lin-Liu and Waltz1998) geometry (Mackenbach et al. Reference Mackenbach, Proll, Snoep and Helander2023b) is most straightforward. One can see that many of the trends found there are recovered in the current paper on an analytical basis; in particular, negative triangularity decreases the Æ for vertically elongated tokamaks only if the gradient is sufficiently small and the gradient threshold has the same dependencies as derived here. Of course, our work sheds light on the origin of such behaviour and can be applied beyond axisymmetric configurations.
The comparison to other turbulent study results and experiments requires us to carefully discern between the two distinct regimes where we have shown the Æ exhibits. Depending on which regime a given scenario is in, the beneficial or detrimental nature of various equilibrium shaping parameters may change. In general and bearing this important caveat in mind, the strongly driven regime is most often entered fairly close to the magnetic axis (as argued before). It is also, in magnitude, the principal contributor to Æ and the very character of the weak regime may make it numerically elusive (as the narrow Æ band would have to be resolved in simulations). As such, it is natural to take the Æ in the strongly driven regime as indicative of overall behaviour of a configuration to TPM mediated transport. In terms of leading order effects, the prediction that increasing the vertical elongation in tokamaks improves transport agrees with existing knowledge (Chu, Ott & Manheimer Reference Chu, Ott and Manheimer1978; Qi et al. Reference Qi, Kwon, Hahm, Yi and Choi2019). Concerning higher order effects, the beneficial nature of a pressure gradient on the trapped electron mode has been noted by many authors (Rosenbluth Reference Rosenbluth1968; Connor et al. Reference Connor, Hastie and Martin1983; Li & Kishimoto Reference Li and Kishimoto2002). The effect of triangularity on Æ, however, is more paradoxical. In experiments, it has been shown that negative triangularity exhibits improved confinement whilst remaining in L-mode (Marinoni et al. Reference Marinoni, Austin, Hyatt, Walker, Candy, Chrystal, Lasnier, McKee, Odstrčil and Petty2019). The current model, however, would predict an increase in the Æ in such a scenario and hence more unfavourable transport. Part of this discrepancy may be explained by the role of magnetic shear, which is positive and significant near the edge of the tokamak, but we have not explicitly considered it here. The trapped particle precession increases with increasing magnetic shear, which may push one to a more weakly driven regime in which negative triangularity is beneficial.
However, the discrepancy may also come from the consideration that behaviour within the strongly driven regime may not be the most adequate to describe the turbulent performance of a configuration. To illustrate this, take a scenario in which Æ is large. Turbulence is expected to be virulent in such a scenario, which will enhance transport and ultimately limit the attainable density gradient (as a maximum transport may be supported). This limiting factor to the gradient naturally leads to consider profile stiffness (Garbet et al. Reference Garbet, Mantica, Ryter, Cordey, Imbeaux, Sozzi, Manini, Asp, Parail and Wolf2004); profiles are stuck to the maximal sustainable gradient values, which are fixed by the point at which a regime of increased turbulence is accessed. Under such a prism, it is not that important what occurs within the strong and weak turbulent regimes, but rather what happens to the transition point. In such a context, support of larger gradients is seen as beneficial, as higher central densities and higher confinement times can be achieved. The key is then to understand the behaviour of this threshold. In practice, this transition point is found through gyrokinetic simulations (Dimits et al. Reference Dimits, Bateman, Beer, Cohen, Dorland, Hammett, Kim, Kinsey, Kotschenreuther and Kritz2000) to find when a steep decrease in the nonlinear heat flux is seen when the gradient value is decreased below some threshold value. Recognising an analogous behaviour in Æ, where $\hat {A}_{w} \sim \hat {\omega }_n^3$ below criticality and $\hat {A}_{s} \sim \hat {\omega }_n$ above it, one may postulate that the behaviour in (4.13) is similar to that which one would observe for the common conception of critical gradient. As a consequence of this threshold behaviour, one would then expect that the attained gradient in experiments is the one which we have calculated in (4.13): the system would reside on the weak-to-strong Æ boundary. We do not attempt to verify this relationship here, which for a consistent consideration would require consistent plasma profiles based on (estimated) heat fluxes, which would feedback onto the geometry (e.g. Shafranov shifts). We make no attempt of solving this problem here and leave it to a future investigation, but merely note similarities in the behaviour of our transition threshold and Merlo et al. (Reference Merlo, Brunner, Sauter, Camenen, Görler, Jenko, Marinoni, Told and Villard2015) and Merlo & Jenko (Reference Merlo and Jenko2023) in the increase of gradient thesholds with negative triangularity tokamaks.
5. Conclusions and outlook
In this paper, we explored the trapped-particle precession and its effects on the available energy to trapped particle modes in axisymmetric and quasisymmetric fields. We do so by considering a near-axis description of the fields, in which the magnitude of the magnetic field is directly prescribed and interlinked to other geometric and equilibrium features. As such, this may be regarded as an extension or alternative to previous local considerations (Connor et al. Reference Connor, Hastie and Martin1983; Roach et al. Reference Roach, Connor and Janjua1995; Hegna Reference Hegna2015). The precession of trapped particles is constructed explicitly, and analytic expressions are given to leading order and the first-order correction. This allows us to prove the impossibility of the maximum-$\mathcal {J}$ property in such fields to leading order, as follows from Boozer (Reference Boozer1983). The role of a pressure gradient in helping to attain this property at a finite radius is presented. Investigating the effect of triangularity in axisymmetric devices, we find that negative triangularity may aid in attaining the maximum-$\mathcal {J}$ property, but the influence on different classes of trapped particles is generally different. In the full quasisymmetric case, closed forms are also provided and some practical examples discussed.
The influence of such precession on density-gradient driven trapped particle modes in the system is then analysed by assessing its effect on the available energy (Helander Reference Helander2017, Reference Helander2020; Mackenbach et al. Reference Mackenbach, Proll, Wakelkamp and Helander2023c). Two different asymptotic regimes naturally arise in the form of ‘weakly’ and ‘strongly’ driven regimes, each with a different behaviour and physical mechanism, which furthermore allows one to define a critical transition density gradient. In the weakly driven regime, a narrow band (in $\lambda$-space) of the trapped particle population is responsible for the available energy, whilst in the strongly driven regime, all resonating trapped particles contribute. This physical difference between the two asymptotic regimes leads to a difference in the dependencies of Æ on the field.
This is certainly true for the effects of pressure gradient and triangularity: its beneficial nature depends on the asymptotic regime considered. Interestingly, we find that the dependence of Æ on triangularity is of the same form as that in Mercier's criterion for MHD stability, up to a sign that changes depending on the driving regime. In the strongly driven regime, a synergy is found between improving MHD stability and lowering energy available to the trapped particles through triangularity, meaning that improving one will improve the other (the opposite being true of plasma $\beta$). The reduction in precession of deeply trapped particles behind this behaviour will, whenever deviations from ideal QS exist, lead to increased fast particle losses, at least until a regime of significant precession in the direction of maximum-$\mathcal {J}$ is reached. The synergy between MHD and turbulent activity through triangularity inverts in the weakly driven regime, where it is under plasma $\beta$ that this synergy is observed. This difference in behaviour affects the critical-gradient estimate for the transition between the two regimes. This gradient grows with negative triangularity, which could align with some of the existing observations in advanced tokamak scenarios.
All in all, one finds that the near-axis framework is a convenient model to assess properties of trapped particles in quasisymmetric magnetic fields. The notions of maximum-$\mathcal {J}$, omnigeneity (through the bounce-averaged radial drift) or Æ, for which analytical expression may be found, allows one to readily evaluate several aspects of performance of different stellarator configurations at negligible computational cost (as measured by Æ). This may be helpful as a proxy for turbulence optimisation. Finally, though we have specialised to quasisymmetric configurations, many of the techniques presented may be applied to other contexts (such as quasi-isodynamic fields or Miller geometry models) allowing one to make similar statements. For the quasi-isodynamic case, such an investigation is currently being undertaken and an even more distinct case in which the bounce-averaged radial drift may play a significant role could also benefit from the current framework.
Supplementary material
Supplementary material is available at the Zenodo repositories with DOI/URL 10.5281/zenodo.8344200 and 10.5281/zenodo.8199904.
Acknowledgements
The authors would like to acknowledge fruitful discussion with P. Helander, J. Caballero and I. Calvo.
Editor P. Catto thanks the referees for their advice in evaluating this article.
Funding
E.R. was supported by a grant by Alexander-von-Humboldt-Stiftung, Bonn, Germany, through a postdoctoral research fellowship. R.M. is partly supported by a grant from the Simons Foundation (560651, PH) and through the project ‘Shaping turbulence – building a framework for turbulence optimisation of fusion reactors’, with Project No. OCENW.KLEIN.013 of the research program ‘NWO Open Competition Domain Science’ which is financed by the Dutch Research Council (NWO).
Declaration of interests
The authors report no conflict of interest.
Data availability
The data that support the findings of this study are openly available at the Zenodo repositories with DOI/URL 10.5281/zenodo.8344200.
Appendix A. Calculation of the second adiabatic invariant
In this appendix, we write out the full derivation of the second adiabatic invariant $\mathcal {J}_\parallel$. Our starting point is the leading order integral given in (2.15) after a straightforward expansion in the ordering parameter $\epsilon$, defined in the main text.
To make progress with this integral, start by defining a so-called trapping parameter $k$, which relates to the pitch-angle parameter $\lambda$ via
Such a definition gives $k\in [0,1]$, with the limits corresponding to deeply and barely trapped particles, respectively. It must be noted that despite its simple appearance, this definition hides complexity in the trapped particle class dependence of $\delta B_b$, (2.12, the field perturbation at the bouncing point. With this definition, we rewrite the integral
This integral can be cast into a form close to the one required for elliptic integrals by employing the double-angle identity $\cos \chi = 1 - 2\sin ^2 (\chi /2 )$, with which the integral reduces to
where the integration variable has been set to $\bar {\chi }=\chi /2$. The final subtlety that one needs to take into account is that the limits of integration are currently set by the zeros of the argument of the numerator in the integrand (namely, the bouncing points), which also gives an intuitive (and equivalent) definition of the trapping parameter $k$,
where we remind the reader that $\chi _b$ denotes the bounce angle. This shows, as did (A1), that $k$ includes some of the higher order corrections to $B$. This arises naturally from the integration and, importantly, preserves the meaning of deeply and barely trapped particles at $k=0,~1$, regardless of the perturbation.
A final substitution puts the integral into the standard form required for elliptic integrals. Define
in which case, the bounce points simply become $\zeta =\pm {\rm \pi}/2$, independent of $k$. This transformation is well defined because $k\in [0,1]$. The leading order contribution to the second adiabatic invariant is now equal to
As part of the asymptotic near-axis treatment, $r$ is to be considered small and we may thus expand the non-singular denominator of the integrand in powers of $r$. Including terms up to the first order,Footnote 17 we find
where we have introduced
and define complete elliptic integrals of the first and second kind (Olver et al. Reference Olver, Olde Daalhuis, Lozier, Schneider, Boisvert, Clark, Mille, Saunders, Cohl and McClain2020, § 19),
Our final step is to expand $\bar {\mathcal {J}}$ to the required order in $r$. This expansion is readily done and one can show that, neglecting terms of order $O(r^2)$, this reduces to
Collecting all the terms of order $O(r)$ in $\mathcal {J}_\parallel ^{(0)}$,
We now turn to the next order correction in $\epsilon$ following (2.14),
The second term in the integrand is a factor $r$ smaller than the first and, hence, for our current expansion, only the first term needs to be taken into account. To perform that remaining integral, we ought to express the integrand explicitly as a function of the integration variable $\chi$. Using the near-axis expansion form of $B$ for a quasi- and stellarator symmetric fields, we know that
With these, introducing the trapping parameter, using double angle identities, and employing $\zeta$ as the integration parameter (like in the previous order), the integral reduces to
where we have kept leading order terms in $r$. The function describing the behaviour of different trapped particle classes is $\mathcal {I}_{22}^C(k)$, which we have defined as
Expanding to order $r^2$ and combining the first- and second-order result, (2.14), gives us our final expression for the second adiabatic invariant,
Figure 9 shows the comparison of this analytic expression with the numerical calculation of $\mathcal {J}_\parallel$.
Appendix B. Integral expressions for the radial drift
In this section, we use the near-axis framework to find expressions for the bounce-averaged radial drifts to leading order in the near-axis expansion. Of course, only if the system is not exactly quasisymmetric will the radial drift be non-vanishing. We will assume that quasisymmetry is broken at second order in the near-axis expansion.
As is well known, it is generally not possible to guarantee the symmetry of $|\boldsymbol {B}|$ to second order in the expansion. Thus, generally, to form a consistent description to second order, one formally allows $B_{20}$ to be a function of $\varphi$, rather than a constant. This is the conventional choice and keeps the other pieces of $|\boldsymbol {B}|$ constant. Let us then consider here a field that is quasisymmetric to first order, but at second order, has $B_2=B_{20}(\varphi )+B_{22}^C \cos 2\chi$, which retains stellarator symmetry for an even $B_{20}(\varphi )$. Including the contributions from the other terms if their $\varphi$-independence was relaxed would also be straightforward.
Let us then consider the $\alpha$ derivative of the second adiabatic invariant needed for assessing the averaged radial drift explicitly,
The integrand vanishes at the bouncing points by construction and, therefore, by Leibniz's rule, the boundary terms may be dropped when enacting the derivative of the integral. This is not completely true for barely and deeply trapped particles, as their bounce points may change in a non-smooth way as we move from field line to field line. To picture this, think of a top of a magnetic well coming down on one side of the well as we move to a different field line. The original barely trapped particle ‘leaks’, undergoing a sudden change in behaviour, leading to a new class of trapped particle. Such a behaviour cannot be appropriately captured in a perturbative sense, but the importance of this particle ‘leak’ may be assessed by constructing a measure of the leaked particle fraction $f_\mathrm {leak}=\max _\alpha [|B_{20}(({\rm \pi} -\alpha )/\bar {\iota })-B_{20}(({\rm \pi} +\alpha )/\bar {\iota })|]/(B_\mathrm {max}-B_\mathrm {min})$, where we assumed stellarator symmetry (i.e. $B_{20}(-\varphi )=B_{20}(\varphi )$ at second order).
With this in mind, and ignoring this fringing case, we may pull the derivative through inside the integral. The integral is taken along field lines (i.e. at constant $\alpha$), which means that the Boozer toroidal angle $\varphi =-(\alpha -\chi )/\bar {\iota }_0$ becomes a function of both $\alpha$ and $\chi$. That means that $\partial _\alpha f(\varphi )=-\partial _\varphi f/\bar {\iota }_0$, which, keeping the leading order near-axis term and using the same notation as in Appendix A, gives
where the prime indicates the derivative with respect to the only argument of $B_{20}$, $\varphi$. The integration variable is $\zeta$ and one should interpret $\bar {\chi }(\zeta,k) = \arcsin ( k \sin \zeta )$.
The integral may be readily evaluated using standard numerical methods. To find $\omega _\psi$, the bounce-averaged radial drift, we need to evaluate the leading order bounce time, $\tau _b$. Using the expression for $\mathcal {J}_\parallel ^{(1)}$, the bounce time is equal to
Hence, the bounce-averaged radial drift to leading order can readily be found to be
The averaged radial-drift $\omega _\psi$ serves as a physically meaningful measure of the quasisymmetry quality of a configuration in this near-axis construction, vanishing when it is omnigeneous. The expression for $\omega _\psi$ is however a function of both $\alpha$ and $k$ and, thus, for a single scalar measure that characterises the radial drift performance of a field at a given flux surface, we must reduce it. Note that given the periodicity of $B_{20}$, the average of $\omega _\psi$ over all field lines (i.e. $\alpha$) vanishes. This might suggest that there is no net detrimental effect to having this radial drift, as on average, there is the ‘same’ amount of particles going one way or the other. However, the neoclassical transport in the low collisionality limit as measured by $\epsilon _\mathrm {eff}$ is insensitive to the sign of $\omega _\psi$. To find a single measure of its magnitude, we attempt to find an upper bound for $\omega _\psi$.
Note that the denominator in the integrand of $\mathcal {I}_{20,\alpha }$ is an always positive function within the integration domain
Thus,
with the equality only holding when the field line label $\alpha$ makes the largest gradient $B_{20,\mathrm {max}}'$ match the bottom of the well and deeply trapped particles are considered (see figure 10). Only in this limit, the particle samples the largest non-QS value of $B_{20}'$. Any other value will necessarily be smaller.
This bound provides a simple relation between the derivative of $B_{20}$ and the radial drift of particles. The derivative of $B_{20}$ is thus a more physical form of measuring quasisymmetry breaking within the near-axis framework compared with simply using the peak-to-peak $B_{20}$ variation as is customary (Landreman & Sengupta Reference Landreman and Sengupta2019; Landreman Reference Landreman2022; Rodriguez et al. Reference Rodriguez, Sengupta and Bhattacharjee2022b).
Appendix C. Full expressions for the particle precession
In this appendix, we present some key elements and expression for the calculation of the precession of trapped particles, $\omega _\alpha =\partial _\psi \mathcal {J}_\parallel /\partial _H \mathcal {J}_\parallel$. Obtaining such expressions from the asymptotic form of $\mathcal {J}_\parallel$ is straightforward, but it requires taking care of partial derivatives appropriately.
To that end, let us remind ourselves of the set of independent variables: $\psi$, $\alpha$, $H$ and $\mu$. Because we are using the toroidal flux over $2{\rm \pi}$ as our flux surface label,
Furthermore, as shown in Appendix A, the pitch angle is related to the trapping parameter via
We thus require expressions for $\delta B_b$, which may readily be computed as
With this expression at hand, it is straightforward to compute the radial derivative of $k$ (at constant $\mu$ and $H$, and thus constant $\lambda$),
Similarly, the derivative with respect $H$ can be found as
The particle class label $k$ changes both with radius and particle energy, as it follows from the change in $|\boldsymbol {B}|$.
We first use the above expressions to find the total bounce-averaged excursion in $\alpha$, $\Delta \alpha = \partial _\psi \mathcal {J}_\parallel$. Using the chain rule and keeping the leading two non-zero orders,
Next, the bounce time $\tau _b=\partial _H \mathcal {J}_\parallel$ is
The trapped-particle precession is now readily found by taking the ratio of $\Delta \alpha /\tau _b$ and expanding to include the leading-order and the first correction terms. Doing the expansion, one finds this to be equal to
which is equivalent to (3.1).
Appendix D. Precession in terms of triangularity and pressure gradient
In the main text and Appendix C, we showed how to get an expression for the precession of trapped particles at second order in the near-axis expansion. This involved the parameters $B_{22}^C$ and $B_{20}$, natural parameters in the near-axis expansion which directly describe the shape of $|\boldsymbol {B}|$. Although most natural in the calculations of trapped-particle precession, they do however not offer a clear connection to the physical nor geometric features of the field.
Within the near-axis framework, such a connection can be made. At second order in the near-axis expansion of an exactly quasisymmetric, stellarator-symmetric configuration, $B_{22}^C$ and $B_{20}$ can be re-expressed as linear combinations of pressure gradient, $p_2$, and triangularity of an up-down cross-section, $\delta$. The latter two can then be used as the independent parameter choices at second order in the expansion of the field.
The definition of pressure gradient is straightforward as the flux derivative of the pressure on axis, $p_2=(B_0/2)\,\mathrm {d}p/\mathrm {d}\psi$. The notion of triangularity, $\delta$, requires additional care. We will define $\delta$ as the relative displacement of the vertical turning points of the cross-section to the width of the cross-section along the up-down symmetry line (normalised by $r$), and do so in the $(\kappa,~\tau )$ Frenet frame. That is, in the plane orthogonal to the magnetic axis where the cross-section is up-down symmetric. Asymptotically, and in terms of the near-axis expansion quantities (Landreman & Sengupta Reference Landreman and Sengupta2019; Rodríguez Reference Rodríguez2023),
Here the $X$ and $Y$ expansion parameters describe the flux surface shape in the Frenet–Serret frame, and details on them may be found from Landreman & Sengupta (Reference Landreman and Sengupta2019). Note that dimensionally, the definition in (D1) has units of inverse length. This is so because $\delta$ is defined not as triangularity, but rather as $1/r$ times triangularity, which accounts for the fact that close to the magnetic axis, where cross-sections are elliptical, the triangularity vanishes. This notion of normalised triangularity in the plane normal to the axis, as explained by Rodríguez (Reference Rodríguez2023), is generally different to the common definition of triangularity in the lab frame, $\delta _\mathrm {lab}$. That is, it is not equivalent to the triangularity of the cross-section that results from making a cut of the configuration at a constant cylindrical angle. If the magnetic axis has a relative tilt with respect to the cylindrical coordinate system, then $\delta _\mathrm {lab}=\delta +\varLambda$, where $\varLambda$ is a term that depends only on the axis shape and first-order near-axis shaping.Footnote 18 In the special case of an axisymmetric field, this geometric transformation term $\varLambda$ vanishes. In general, varying $\delta _\mathrm {lab}$ or $\delta$ is equivalent, with all other near-axis features kept constant.
With the notions of triangularity and pressure gradient in place, the equilibrium field of a quasisymmetric configuration can be uniquely defined at second order (Rodríguez Reference Rodríguez2023). In the axisymmetric limit, this is evident following the behaviour of the Grad–Shafranov equation and the set-up of its solutions. In the general quasisymmetric configuration, this is more subtle, as the cross-section shapes change around the torus driven by the asymmetry of the magnetic axis. Specifying the triangularity of a single cross-section might then appear insufficient to describe uniquely the whole field, but the conditions of quasisymmetry and equilibrium are sufficient to grant this uniqueness. Schematically, one may picture the situation as a kind of initial value problem, in which the single cross-section is the initial value and the axis-shape describes the flow of evolution. There are therefore different ways of describing the very same field, as different cross-sections may be chosen.
It should appear clear that the shape of the axis thus plays a crucial role in the problem, especially through its asymmetry. Described by its curvature, $\kappa$, and torsion, $\tau$, it is natural to construct a measure of said asymmetry like $\bar {F}$, introduced by Rodríguez (Reference Rodríguez2023), and defined to be
where $I_2$ is the toroidal current and $\sigma$ is a measure of up-down asymmetry, a solution to the Riccati $\sigma$ equation (see Landreman & Sengupta Reference Landreman and Sengupta2019), and thus not a degree of freedom. The measure $\bar {F}$ must be evaluated at the location of an up-down symmetric cross-section. As we are assuming stellarator symmetry, there are at least two such distinct positions (in the axisymmetric limit, an infinite number of them, but $\bar {F}=0$ in that case). This emphasises the freedom mentioned before about the description of stellarator fields; there are two naturally simple ways of identifying the very same field, depending on with which up-down symmetric cross-section is chosen to identify the configuration. Depending on this choice, the meaning of the ‘effects of changing the cross-section’ (and, in particular, triangularity) will of course change and, with it, the conclusions derived. One must interpret it as the effects of changing the shape of that very cross-section, modifying the remaining of the configuration in a consistent way, keeping the axis and profiles fixed. For consistency and analogy with the typical axisymmetric case, we shall choose to identify configurations with their most vertically elongated, up-down symmetric cross-section, which often exhibit a bean-shape.
With the definitions above, the magnetic parameters $B_{20}$ and $B_{22}^C$ may be written explicitly in terms of the pressure gradient $p_2$ and the triangularity $\delta$ (defined to be positive in the direction of the normal curvature) of the cross-section at $\varphi =0$,
where $\bar {\alpha }=(\eta /\kappa (0))^4$ and the dots denote terms independent of second-order choices on which we shall not focus.
With these expressions in place, we may then rewrite the effects of second order on the trapped-particle precession (noting that we assumed $\eta >0$ in the adiabatic invariant calculation in this paper),
and write
These two expressions describe the effect of the pressure gradient and the cross-section triangularity on the trapped-particle precession as a function of the class of trapped particle. Note that $\tilde {\mathcal {G}}$ is independent of both the pressure gradient and triangularity. As such, when investigating dependencies on these parameters (at fixed $\eta$ and axis), we may safely ignore contributions of $\tilde {\mathcal {G}}$. The derivation may be checked with computer algebra, as provided in the repository associated with this paper.
D.1. Relation to the large-aspect ratio circular tokamak limit
Here we relate the found results to the large-aspect ratio tokamak limit investigated by Connor et al. (Reference Connor, Hastie and Martin1983). Let us start comparing the pressure dependence of precession and, as such, let us define $\alpha _p= - 4 \mu _0 r p_2/ |\eta | B_0^2 \bar {\iota }_0^2$. One may then write
Specialising to the circular tokamak case (i.e. $\bar {\alpha }=1$ and $\bar {F}=0$), we find
In the circular limit, we may write
acknowledging that we are mixing terms of different order in $r$ and not keeping all relevant terms consistently to the right order.
This expression has a similar form to Connor et al. (Reference Connor, Hastie and Martin1983, (9)). The first two terms are exactly identical, while the latter is a different from,
We thus see that, though the functional form is similar, it is not the same, especially near deeply trapped particles. These differences correspond to the difference between the models considered and the meaning of changing a certain parameter thereof. In the approach of Connor, the field is being constructed in a way that locally, at finite radius, equilibrium is satisfied (as explicitly shown byRoach et al. Reference Roach, Connor and Janjua1995). Such a method requires the shape of the flux surface and radial variations of various geometric and equilibrium quantities to fully specify the local equilibrium conditions. Furthermore, many of these parameters are treated independently, and one requires subsidiary assumptions to set these values (in particular, the ‘artifice’ (Connor et al. Reference Connor, Hastie and Martin1983) of ordering the pressure in a particular form). Though this does allow one to investigate the effect of such parameters independently, it is not guaranteed that there exists a global solution adhering to the set of local conditions. The near-axis approach presented, although asymptotic in nature, is valid in some region near the axis and, as such, can perhaps be thought of as a ‘more global’ solution. This translates into a larger degree of coupling between the geometry and the magnitude of the field (due to the singular nature of the axis) compared with the radially local approach, which ends up reducing the number of free parameters while it retains realism.
The difference is especially noticeable in the involvement of the magnetic shear, which appears explicitly in the work of Connor et al. (Reference Connor, Hastie and Martin1983), but becomes a higher order effect in the current treatment. In fact, we may formally obtain a sense of the involvement of the magnetic shear by considering the next order in the expansion for the precession and, focusing on the variation of the field line length due to the change in $\iota$ when taking the derivative of $\mathcal {J}_\parallel$ with respect to $\psi$, it can straightforwardly be shown to give
where we have defined the magnetic shear as $s = - r \partial _r \bar {\iota }/\bar {\iota }_0$ and the term is clearly a second-order effect. This term has precisely the same form as that reported by Connor et al. (Reference Connor, Hastie and Martin1983). To arrive at such a term, we considered the magnetic shear separate from other elements in the model (an independence that is partially correct given the freedom in the toroidal current profile), but hides connections to higher order considerations within the near-axis framework (Rodríguez et al. Reference Rodríguez, Sengupta and Bhattacharjee2022), especially its ties to the third-order harmonics of $|\boldsymbol {B}|$, which we would expect to modify this dependence at least partially.
Appendix E. Critical value estimates for plasma $\beta$ and triangularity
In the main text, we presented some estimates for the magnitude of plasma $\beta$ and cross-section triangularity that made their effect on precession of deeply trapped particles and available energy significant. In this appendix, we present the values for these estimates for the sample near-axis QS configurations in figure 3.
Table 2 includes the following parameters. First, the critical plasma $\beta$ for reversal of the deep-particle precession at the edge of the configuration, (3.8),
with $a$ the minor axis and taking $r=a$. The second is the critical beta but for the available energy (see Appendix F and § 4),
where the denominator is given in (4.12).
For the critical triangularity, the value of $(r\delta )$ to revert the precession of deeply trapped particles, we consider the definition in (3.10),
Similarly, for the available energy, we shall use the same expression but for the reinterpretation of $\mathcal {G}_\delta$ with
which follows directly from (4.12). Finally, for a reference, we shall compare this critical triangularity to $r_c\delta$, the maximum attainable triangularity in a given near-axis configuration without incurring unphysical flux surface shapes.
Finally, to include a sense of the QS quality, we introduce the QS measure motivated by Appendix B, (B7),
where a vanishing value indicates an exactly QS configuration to second order. This measure shows that some near-axis configurations (especially that of HSX) lie far from ideal QS. In the special case of HSX, the near-axis model is constructed to reproduce the leading harmonic content of $|\boldsymbol {B}|$; small variations can however lead to significant effects especially at second order in the near-axis expansion (that is, where we assess QS or compute triangularity). Because we are using these as illustrating examples, we do not however consider a further refinement of these configurations.
The values in Table 2 show that physically relevant finite beta effects can have significant effect on the leading order behaviour of both deeply trapped particle precession and Æ. This effect is stronger in quasi-axisymmetric configurations and, we should remind ourselves, on the outermost surface. As the magnetic axis is approached, the critical $\beta$ needed will grow. Interestingly, Æ is more susceptible than the precession itself. The case of triangularity shows that a significant degree of shaping is required to affect either precession or Æ. This level of shaping is in many configurations present or exceeded, as the relative comparison of the critical $r\delta$ to $r_c\delta$ shows. Once again, these effects become strongest as the outer surface is approached.
Appendix F. Details of asymptotic available energy integral
In this appendix, we provide mathematical and algebraic details concerning the asymptotic evaluation of Æ in the two considered regimes. Note that these considerations may be extended simply to other approximation schemes, such as a large aspect ratio model.
F.1 The weakly driven regime
To perform the available energy integral in this regime, we showed how it is the population of trapped particles that have an almost vanishing precession that contribute. Thus, we first re-define the zero crossing of the trapped particle precession, $k_0$,
To make further progress with the integral, we ought to transform the integration variable $\lambda$ into $k$ and, thus, require $\mathrm {d} \lambda / \mathrm {d} k$. Details of this derivation are included in Appendix C and, to leading order, it reduces to
The final component needed for the integral is the normalised bounce time $\hat {G}^{1/2} \approx \int (1 - \lambda \hat {B})^{-1/2} \mathrm {d}\ell / L$, for which the leading order expression reduces to
which may be calculated analytically or verified via the expressions for the bounce time given in Appendix C.
To exploit the existence of a narrow contributing band, we will then make a local approximation of the integrand about $k_0$. The region can be shown to be small,
so that conveniently using $c_1$ as integration variable,
the remaining integral becomes
In doing so, we approximated the integration domain $c_1 \in ( \hat {\omega }_{*,0}^T B_0 r/\Delta \psi \eta | G'(k_0) | k_0, \infty ]$ as $(0,\infty ]$. Doing so only introduces an error of order $\sqrt {r}$ on the integral, which is the asymptotic value of $\mathcal {F}$ at small argument.
Collecting all the factors involved, the leading order expression of the Æ simply becomes
Computing the dimensionless factor $\mathcal {V}$ to leading order,
and easing notation by approximating
we find the result
One may now retrieve the result stated in the main text by imposing that $\Delta \psi = C_r r B_0 \rho$, $\varrho = r/a$, $\rho _* = \rho /a$ and $-\partial _\varrho \ln n = \hat {\omega }_n$, resulting in
This concludes the analysis of the weakly driven asymptotic regime.
F.2 The strongly driven regime
The integral in the strongly driven regime is straightforward. Our starting point is the full integral given in (4.5),
We again employ the fact that $\mathrm {d} \lambda / \mathrm {d} k \approx -4 k r \eta$ to write
In the strongly driven limit, the main assumption is that $c_1\gg 1$ for all $k$, and thus we may replace $\mathcal {F}$ with its asymptotic form for large $c_1$. Using (Olver et al. Reference Olver, Olde Daalhuis, Lozier, Schneider, Boisvert, Clark, Mille, Saunders, Cohl and McClain2020, §§ 7.2, 7.12), $\mathcal {F}(c_1) \approx 3 \sqrt {{\rm \pi} }/4c_1$, which yields
One can numerically evaluate the integral to find
Introducing the dimensionless variables as we did in the weak regime, we find
which is our final result.
F.3 Dependence of Æ on $B_{20}$ and $B_{22}^C$
Before concluding this appendix, we show how to proceed to assess the effect of $B_{22}^C$ and $B_{20}$ on the expression of available energy. The main idea here is that we would like to evaluate these contributions without having to evaluate all the consistent asymptotic dependence of Æ. For instance, we are not interested in the order $r^{1/2}$ correction to the weak $\hat {A}$ in (4.8) due to the finite extent of $\mathcal {F}$. To see how to proceed, let us write the relevant parts of the available energy integral (setting overall factors aside for simplicity of notation),
To evaluate higher order corrections to the integral in the weak regime, we remind the reader first that the calculation involves the local expansion of the integrand. In that spirit, the changes due to second order on the first factor in the integral is straightforward, and may be simply be read off (C1b). It gives as a correction on the leading order integral
where we have evaluated it at the original $k_0$ to leading order. A similarly simple correction arises from the corrections to the bounce time $\tau _b$, which may simply be read off (C3),
The contributions left come from the $c_1$-dependent terms, $\mathcal {F}(c_1)\,\mathrm {d}k/\mathrm {d}c_1$. Here the change in the zero-crossing of precession due to second-order corrections on $|\boldsymbol {B}|$ contribute to the available energy. We must thus assess where is the new zero. With the precession defined as $\omega _\alpha =\omega _{\alpha,0}+\omega _{\alpha,1}$, (3.1), we rewrite it as
ignoring every term that does not depend on $B_{20}$ or $B_{22}^C$, and recalling that $\hat {\mathcal {G}}=B_{20}\mathcal {G}_{20}+B_{22}^C\mathcal {G}_{2c}$, (3.4). Then, define the new $\omega _\alpha (k^\star ) = 0$ point with $k^\star =k_0+r\delta k$, so that
With this ‘displaced’ $k^\star$, we may assess the change in the available energy by simply considering the perturbation of the leading order $k_0$ dependence, namely $k_0K(k_0)/G'(k_0)$, noting that the denominator comes from $\omega _{\alpha }'$ and thus it has an additional contribution. Thus, the third correction may be written as
Evaluating all these terms numerically,Footnote 19 the total correction due to $B_{22}^C$ and $B_{20}$ is
where this factor should be understood to be a relative modification of the leading order available energy $\hat {A}=\hat {A}_0(1+\mathcal {R})$. That is, the correction to available energy due to $B_{20}$ and $B_{22}^C$, which we denote as $\hat {A}_1$, is equal to
where $\mathcal {R}_{20}\approx 1.37$ and $\mathcal {R}_{22}^C\approx 0.03$.
In the case of the strongly driven scenario, the integrals are over the whole $k$-space, but a term by term analysis can be performed in an analogous way to that above and the various terms numerically evaluated. The result of the calculation is
where $\mathcal {R}_{20}^\mathrm {s} =3.91$. Astonishingly, the $B_{22}^C$ contribution completely drops out. In terms of dependence on $B_{22}^C$, the two limits of the Æ are not that different, considering that $\mathcal {R}_{20}\gg \mathcal {R}_{22}^C$ for the former case (dropping it makes approximately $\sim 2\,\%$ error). For the remaining analysis, we shall thus only consider dependence on $B_{20}$.
As was the case in the discussion of the trapped electron precession, it is most physical to express these effects at second order in terms of the triangularity of a cross-section and the pressure gradient. This is explained in more detail in Appendix D. Splitting $B_{20}$ up as
where we define
the correction in the weakly driven regime reduces to
and in the strongly driven regime,
which are in the form presented in the main text.