1 Introduction
This is the third paper in a series (Helander & Plunk Reference Helander and Plunk2022; Plunk & Helander Reference Plunk and Helander2022), in which we develop a linear and nonlinear stability theory based on gyrokinetic energy balance. The last two papers used Helmholtz free energy, and introduced the concept of optimal mode growth for fully electromagnetic gyrokinetics. The present paper proposes a generalized energetic measure of fluctuations, allowing the inclusion of additional instability mechanisms. We do this first for a simple case, namely the electrostatic limit (low plasma $\beta$) with only one kinetic species (ions), with the electrons being treated adiabatically. These simplifications limit the application to ion-temperature-gradient (ITG) driven turbulence, although the central result of the paper is capable of treating completely general details of the magnetic geometry.
Free energy is a useful concept for understanding nonlinear and linear aspects of plasma turbulence. At the level of linear instabilities it is common to speak of a source of free energy that drives modes. Indeed, without a source of free energy, provided by background plasma gradients (density, temperature, flows), there can be no linear instabilities (nor can there be subcritical turbulence Landreman, Plunk & Dorland Reference Landreman, Plunk and Dorland2015; Plunk & Helander Reference Plunk and Helander2022). However, there is usually another ingredient that arises in the detailed analysis of normal linear instabilities, namely the wave–particle resonance. In gyrokinetic theory, this involves parallel motion (along the magnetic field) and magnetic drift, and the resonance is physically linked to the work that the electrostatic field performs on gyrocentre motion. However, the terms needed to capture this do not contribute to free energy balance, and the influence of resonance therefore cannot be accounted for by the optimal modes that we introduced in our previous works.
In this work, we propose a new measure of gyrokinetic fluctuations, a generalization of the concept of free energy, that incorporates the resonance mechanism, and, via the magnetic drift, the full details of the background magnetic geometry. We demonstrate the existence of a class of quadratic measures closely related to the Helmholtz free energy that behave as positive–definite norms for fluctuations in the distribution function. The corresponding energy balance equation is then used to derive a theory of optimal modes that most efficiently extract this energy from its source. The growth rate of these optimal modes provides a rigorous upper bound on the growth rate of linear instabilities, and this bound is shown to be lower than that obtained previously from the Helmholtz free energy. By studying some simple limits, we show that we recover some expected behaviour of both the slab and toroidal branches of the ITG mode.
2 Definitions and gyrokinetic energy balance
We follow closely the conventions of the first paper in this series (Helander & Plunk Reference Helander and Plunk2022), focusing on local gyrokinetic theory in the geometry of a flux tube, whereby fluctuations in the distribution function may be considered small, and periodic in the coordinates perpendicular to the field line. Essential definitions are summarized in what follows, but more detail and background can be found in §§ 2 and 3 of Helander & Plunk (Reference Helander and Plunk2022) (henceforth also called ‘Part 1’).
The ion gyrokinetic equation in the electrostatic limit is written
where $g$ is the gyrocentre-dependent part of the perturbed ion distribution function, i.e. $f_i = ( 1 - e_i \delta \phi (\boldsymbol {r})/T_i )F_{i0} + g(\boldsymbol {R}, E_i, \mu _i, t).$ Its phase space variables are the energy $E_a = m_a v^2 / 2 + e_a \varPhi (\psi )$ and the magnetic moment $\mu _a = m_a v_\perp ^2 / (2B)$, and the perpendicular wavenumber is ${\boldsymbol {k}} = {\boldsymbol {k}}_\perp = k_\psi \boldsymbol {\nabla } \psi + k_\alpha \boldsymbol {\nabla } \alpha$ with $k_\psi$ and $k_\alpha$ independent of the arc length $l$ along the magnetic field, and $\psi$ and $\alpha$ defined via ${\boldsymbol {B}} = B {\boldsymbol {b}} = \boldsymbol {\nabla } \psi \times \boldsymbol {\nabla } \alpha$. We neglect collisions here,Footnote 1 and used the simplified notation $g_{\boldsymbol {k}} = g_{i, {\boldsymbol {k}}}$, and $\omega _{\ast } = \omega _{\ast i}$, etc. because the adiabatic approximation $g_{e, {\boldsymbol {k}}} = 0$ is assumed throughout; note that it is not necessary here to include the customary correction for the zonal component (i.e. for $k_\alpha = 0$; see Dorland & Hammett Reference Dorland and Hammett1993) because the growth of this component is zero due to the fact that it has no source of free energy (see below). We will also assume $k \rho _i \sim 1$, implying $k\rho _e \ll 1$.
As derived for the general case in § 3 of Part 1, the gyrokinetic free energy balance equation obtained in this limit reads
where the drive term $D$ is
and the free energy, expressed in terms of the gyrocentre distribution function
where the space average is defined as (extensions are discussed in § 3 of Part 1)
The diamagnetic frequencies are
and the magnetic drift frequency is
where the magnetic drift velocity is $\boldsymbol {v}_d = \hat {\boldsymbol {b}}\times ((v_{\perp }^2/2)\boldsymbol {\nabla } \ln B + v_\|^2\boldsymbol {\kappa })/\varOmega _i$, $\boldsymbol {\kappa } = \hat {\boldsymbol {b}}\boldsymbol {\cdot }\boldsymbol {\nabla }\hat {\boldsymbol {b}}$ and $\varOmega _a = e_a B / m_a$ is the gyrofrequency. Noting $\hat {\boldsymbol {b}}\times (\boldsymbol {\nabla }\ln B -\boldsymbol {\kappa } - \mu _0/B^2 \boldsymbol {\nabla } p) = 0$, we can assume $\hat {\boldsymbol {b}}\times \boldsymbol {\nabla }\ln B \approx \hat {\boldsymbol {b}}\times \boldsymbol {\kappa }$ in the appropriate limit of low plasma $\beta$, which allows us to separate the drift frequency into velocity-dependent and space-dependent factors following Plunk et al. (Reference Plunk, Helander, Xanthopoulos and Connor2014):Footnote 2
The gyro-averaged electrostatic potential is denoted
and the quasi-neutrality condition is
where $\textrm {J}_{n} = \textrm {J}_n(k_\perp v_\perp / \varOmega _i)$ is a Bessel function. Following our previous convention, we define the free energy as twice that which appears in some other publications. Henceforth, we suppress the $\boldsymbol {k}$-subscripts.
2.1 Electrostatic energy and positive–definiteness of free energy
It is useful to decompose the free energy into a part associated with a perturbed distribution function and a part associated with fluctuations in the electrostatic field, i.e.
where
Recall the conventional definitions $\varGamma _n(b) = \exp (-b)\textrm {I}_n(b)$ and $b = k_\perp ^2 \rho _i^2 = k_\perp ^2 T_i / (m_i \varOmega _i^2)$, and $\tau = (e T_i)/(e_i T_e)$. Note that $\delta F = g - (e_i \overline {\delta \phi }/T_i) F_0$ is the gyro-averaged perturbed distribution function, and these two contributions to $H$ can be identified as the gyrokinetic perturbed entropy and the gyrokinetic field energy.
Although the general electromagnetic free energy admits a similar form as (2.11) ((3.11) Part 1), we note that the electrostatic limit is distinguished by the fact that the field contribution $E$ is itself a nonlinear invariant of the gyrokinetic system (Schekochihin et al. Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009), by which we mean that it is conserved under the sole action of the nonlinearity, but not by all the linear terms; the same applies for the Helmholtz free energy. The conservation of $E$ may be viewed as an additional constraint on the nonlinear dynamics, with consequences e.g., for the cascade and production of large-scale $E\times B$ flows (Plunk et al. Reference Plunk, Cowley, Schekochihin and Tatsuno2010).
For what follows, we need the electrostatic energy balance equation. This is obtained by multiplying the ion gyrokinetic equation by $e_i \overline {\delta \phi }^\ast$, integrating over velocity, averaging over the parallel coordinate $l$ and summing over perpendicular wavenumber ${\boldsymbol {k}}$, yielding (Helander, Proll & Plunk Reference Helander, Proll and Plunk2013)
where the drive term $K$ is
This is composed to two contributions, one coming from the parallel streaming term, and the other coming from the magnetic drift term. The first contribution has a simple physical interpretation, as the rate of energy exchanged between particles and the parallel electric field (i.e. the volume average of the parallel current multiplied by the parallel electric field), while the second term describes an analogous process in the perpendicular direction associated with the drift motion of gyrocentres.
Equation (2.11) is a physically transparent form that makes it clear that the free energy $H$ is a positive–definite norm for the distribution function $g$,Footnote 3 i.e.
over all of phase space, $\ell$ and $\boldsymbol {v}$. To see this, note that the quantities $G$ and $E$ are both positive, i.e. $G \geqslant 0$, obviously, and $E \geqslant 0$ because $\varGamma _0 \leqslant 1$. Therefore, if $H = 0$ then both $E = 0$ and $G = 0$. The first implies $\delta \phi = 0$ everywhere, while the second implies $\delta F = 0$ over all of phase space; $\delta \phi = 0$ and $\delta F = 0$ obviously implies $g = 0$.
We note that positive–definiteness is a desirable property of an energetic measure that can be useful for setting bounds on the growth rate of fluctuations; if a non-zero fluctuation ($g\neq 0$) has zero measure $M$, while $\textrm {d}M/\textrm {d}t \neq 0$, then the rate $M^{-1}\,\textrm {d}M/\textrm {d}t$ is unbounded, so that the problem of optimal modes (i.e. determining the form of $g$ that maximizes growth) is ill posed. For an example of this, consider $M = E$. One may find functions $g_\epsilon$ such that $|\delta \phi | < \epsilon$ for arbitrarily small $\epsilon > 0$, while $g_\epsilon \sim 1$ is itself not small. In this case $\textrm {d}E/\textrm {d}t \sim \epsilon$ while $E \sim \epsilon ^2$ so the rate $E^{-1}\,\textrm {d}E/\textrm {d}t$ is divergent in the limit $\epsilon \rightarrow 0$.
Although we mainly consider a plasma with a single kinetic ion species and adiabatic electrons, the concepts and the formalism carry over to the more general case of a plasma with an arbitrary number of kinetic species, as shown in Appendix A. An important limitation, however, is that magnetic fluctuations and collisions are neglected.
2.2 Generalized free energy
The positive–definiteness of $H$ suggests a family of related quadratic energetic measures that are also positive–definite. In particular it is clear that something of the form
will be positive–definite, by the same arguments as the previous section, for particular values of the parameter $\varDelta$. For instance the choice $\varDelta < 1$ allows trivial generalization of the arguments, but we will see that the value can be extended beyond this.
To find a range of permissible values of $\varDelta$, and to help simplify subsequent derivations, we will consider a linear transformation on the distribution function, i.e. we define a new distribution function $\tilde {g}$ in terms of which the energy $\skew2\tilde{H}$ may be expressed using the Euclidean (or $L^2$) norm
where we have introduced the inner product
We will refer to (2.18) as a ‘diagonal’ form of the norm, as it does not involve additional linear operations on the distribution function, compared with the form given by (2.11). To find the relationship between $\tilde {g}$ and $g$, we introduce the ansatz $\tilde {g} = g - \nu \textrm {J}_0 F_0 e_i\delta \phi /T_i$, and substitute this into (2.18). By evaluating velocity integrals using quasi-neutrality (2.10) and (2.13), one can find the free parameter $\nu$ that yields the form (2.17), that is
where we have taken the negative root for convenience. Observe that in order for $\nu$ to be real, we must have
The parameter $\varDelta$ can of course be negative, in which case its magnitude is unbounded. Noting that $\varGamma _0$ generally depends on $k$, we may also assume the more restrictive $\varDelta \leqslant (1 + \tau )$ to ensure that $\skew2\tilde{H}$ remains a nonlinear invariant.
We pause to note that the choice $\varDelta = 0$ yields a novel form of the conventional (Helmholtz) free energy, immediately suggesting what can be considered as the phase-space density of free energy, namely the quantity $T_i |\tilde {g}|^2/F_0$, for which there is not yet an expression available.Footnote 4
It is useful now to write quasi-neutrality in terms of $\tilde {g}$
where
Finally, we can show that $\skew2\tilde{H}$ is positive–definite. First, positivity follows from (2.18), and it is obvious from (2.10) that if $g = 0$ then $\delta \phi = 0$ so that $E$ and $H$ both vanish, implying $\skew2\tilde{H} = 0$. On the other hand, if we assume that $\skew2\tilde{H} = 0$, then (2.18) implies that $\tilde {g} = 0$, and (2.22) implies that $\delta \phi = 0$, from which we conclude $g = 0$. In summary, $\skew2\tilde{H} \geqslant 0$ and $\skew2\tilde{H} = 0$ iff $g = 0$.
3 Modes of optimal growth
A key point in introducing the generalization of free energy $\skew2\tilde{H}$ is that this quantity introduces wave–particle effects (parallel resonance and drift resonance) that enter the electrostatic energy balance equation, (2.14).
We note that, for the choice $\varDelta = 0$, the energy $\skew2\tilde{H}$ reduces to the conventional Helmholtz free energy that we have studied in the previous papers. For this choice the modes of optimal growth correspond exactly to the modes introduced in § 6 of Part 1, and studied in fully electromagnetic limit in Plunk & Helander (Reference Plunk and Helander2022) (Part 2). Because those modes are included as a limit of our present analysis, we can be assured that the most stringent bound on growth obtained from the generalized free energy will be at least as good as the known bound obtained from the Helmholtz free energy.
Note that, as long as the parameter $\varDelta$ is independent of ${\boldsymbol {k}}$, the quantity $\skew2\tilde{H}$ is conserved by the nonlinearity, i.e. under summation over ${\boldsymbol {k}}$. This is because it is a linear combination of two nonlinear invariants. One simply combines (2.2) and (2.14) to obtain
i.e. the change of this measure is due to the drive terms of electrostatic and free energy, and is otherwise conserved by the turbulent interactions. It is potentially useful to also consider $\varDelta$ that does depend on ${\boldsymbol {k}}$, for the purpose of obtaining bounds on linear growth, but the nonlinear implications will be less clear in that case.
In direct analogy to how modes of optimal free energy growth were defined, we introduce a rate $\varLambda$
to be optimized over the space of ion distribution functions $g$. We note the bound on conventional gyrokinetic instability growth rates
Having already found a diagonal form of the generalized free energy, (2.18), we need not use a variational approach to find the states of extremal $\varLambda$. We simply identify the Hermitian linear operators associated with the input of free energy and electrostatic energy, i.e.
To obtain these forms note that, from (2.22), $\delta \phi$ can be regarded as the result of a linear operator on $\tilde {g}$. Then, using (2.22) and (2.3) and (2.15), and some straightforward algebra (see Appendix B), we obtain explicit forms for the operators. First we have
where primes denote evaluation at $\boldsymbol {v}^\prime$ and $v_{\textrm {th}} = \sqrt {2 T_i/m_i}$. For convenience, the operator ${\mathcal {K}}$ can be split into its parallel and perpendicular components as ${\mathcal {K}} = {\mathcal {K}}_{\|} + {\mathcal {K}}_d$, for which we obtain
and
In deriving (3.8), it is important to note that the parallel derivative is taken at fixed magnetic moment and particle energy, and that the velocity-space volume element $\textrm {d}^3v$ is proportional to $B/v_\|$ in these variables. More details are given in Appendix A. The kinetic eigenvalue problem can be stated now as
where solutions, i.e. pairs $\varLambda _n$ and $\tilde {g}_n(l, \boldsymbol {v})$, realize optimal growth of $\skew2\tilde{H}$. To see this, we can decompose the distribution function $\tilde {g}$ in terms of the orthogonal eigenmodes of (3.9), $\tilde {g} = \sum _n c_n \tilde {g}_n$ to obtain (choosing $||\tilde {g}_n|| = 1$)
from which it is clear that the rate of energy growth of the system is maximized by setting the distribution function equal to the mode of largest growth rate, i.e.
which bounds the normal growth of the system, $\gamma _L$, in accordance with (3.3).
3.1 Moment form of eigenproblem
The analysis of (3.9) is greatly simplified by adopting a moment form. As found in the preceding papers, there are natural moments that appear in the energy input terms that can be identified to reduce the dimensionality of the problem substantially. Upon inspecting the energy balance equations one finds the following key dimensionless integrals:
where $\kappa _1$ is a density-like moment, $\kappa _2$ and $\kappa _3$ are pressure-like, $\kappa _4$ is parallel ion flow while $\kappa _5$ is more abstract.
It is easy to recognize these integrals on the right-hand side of (3.6), (3.7) and (3.8), and straightforward to rewrite those equations in moment form. The dimensional reduction is achieved by taking moments of the these equations to obtain a coupled set of five fluid equations. These, which are given in Appendix C, can be combined, leading, after lengthy algebra, to a relatively simple second-order ordinary differential equation, the main result of this paper
where $\varphi = e_i \delta \phi /T_i = \lambda \kappa _1$ is the normalized electrostatic potential. The functions $G_{m,n}$, $G^\prime _{m,n}$ and $G^{\prime \prime }_{m,n}$, which depend on arc length via $b(l)$ and $B(l)$, are defined in terms of integrals involving Bessel functions, and are evaluated in Appendix D; see, in particular, (D3)–(D4) and (D9)–(D12). The other $b$-dependent factors ($G_0$–$G_5$) can be expressed in terms of $G_{m,n}$, and are evaluated in terms of more elementary Bessel functions in Appendix D.1.
In (3.17) we see the eigenvalue $\varLambda$ entering quadratically, reflecting the fact that there will be two real roots, one positive and one negative, owing to Hermiticity and time-reversal symmetry of the full eigenproblem, (3.9). Note that the terms arising from the parallel drive of electrostatic energy are placed on the right-hand side. In the following section, we will consider some simple limits of this equation, and leave its more general solution for a future publication.
4 Simple limits
In this section we will consider some simple limits applied to (3.17), and draw some comparison with linear theory of the main instability targeted by limit of this paper, the ITG mode (see for instance Plunk et al. Reference Plunk, Helander, Xanthopoulos and Connor2014). To start, we note that taking $\varDelta = 0$, so that $\skew2\tilde{H}$ becomes the conventional Helmholtz free energy, yields
which matches (6.20) of Part 1.
In considering other simplifications, we first should note that the adiabatic electron approximation already neglects the contribution from a trapped electron population, which requires either large electron collisionality, or uniform magnetic-field strength as measured along the field line. Let us assume the latter for simplicity
Making this assumption simplifies somewhat (3.17), where all the explicit factors of $B$ drop out of the right-hand side. A more significant simplification is achieved by assuming unsheared and uniform magnetic geometry, in particular
In this limit, all of the coefficients of (3.17) are constants, and a simple dispersion relation is the obtained by taking $\partial \varphi /\partial l = \textrm {i} k_\| \varphi$. We find
were we have used $G_{0,2} = G_{0}/2$. As noted in § 2.2, the quantity $\varDelta$ is a free parameter, over which we can optimize $\varLambda$ to improve the bounds on the growth rate of fluctuations.
4.1 Slab ITG mode
Setting $\omega _d = 0$ leaves only the slab branch of the ITG mode, driven by the temperature gradient, and involving ion parallel resonance. Equation (4.5) reduces to
where $\kappa _\| = \eta \omega _\ast /(k_\|v_{\textrm {th}})$. Because $G_0G_2 - G_1^2 \geqslant 0$, the two contributions on the right-hand side are both positive but the solution for which $\varLambda$ is minimal is actually not obtained for $\varDelta = 0$, due to the implicit dependence of $\lambda$ on $\varDelta$ given by (2.23).
Although all values of the parameter $\varDelta$ satisfying (2.21) yield a valid bound on the growth rate of normal modes ($\gamma _L \leqslant \varLambda$), the lowest value is the most stringent and serves as the closest approximation of $\gamma _L$. To obtain this ‘optimal bound’, we can consider the extrema of $\varLambda ^2/(\eta \omega _\ast )^2$, i.e.
This results in a quadratic equation for $\varDelta$ that is still rather complicated so we will consider the limit $b \rightarrow 0$; see Appendix D.2 for the relevant limits of $G_{m,n}$, etc. Applying the limit to (4.6) yields
This solution diverges as $\varDelta$ approaches $1+\tau$; recall that this is the upper limit allowed by (2.21). It also grows in an unbounded fashion as $\varDelta \rightarrow -\infty$. There is an optimal value giving minimal $|\varLambda |$, obtained by solving (4.7) in this limit. This solution, denoted as $\varDelta _\textrm {min}$, is
where the negative root has been selected to be consistent with (2.21). Substituting this solution into (4.8) gives
where we define $\bar {\kappa }_\| = \kappa _\|/(1+\tau )$; see figure 1. This reaches its maximum value in the limit $\bar {\kappa }_\| \rightarrow 0$, and is a decreasing function of $|\bar {\kappa }_\||$, i.e.
Physically, the first result implies that, when drive ($\eta \omega _\ast$) is much smaller than the parallel transit frequency ($k_\parallel v _{\textrm {th}}$), the best bound is equal to that obtained by free energy ($\varDelta = 0$). In this case, the bound is consistent from expectations of the growth rate of a resonant slab ITG mode, i.e. $\gamma _L \sim \eta \omega _\ast$.
In the opposite limit ($\bar {\kappa }_\| \gg 1$), however, when the drive large, i.e. in the so-called non-resonant or ‘fluid’ limit, we obtain a much lower bound, essentially the geometric mean of the drive and the parallel transit frequency $k_\|v_{\textrm {th}}$. We note that this bound is not as low as what is obtained from the non-resonant solution of the dispersion relation (without a density gradient), i.e. $\gamma _L \sim \eta \omega _\ast ^{1/3}(k_\parallel v _{\textrm {th}})^{2/3}$ (Plunk et al. Reference Plunk, Helander, Xanthopoulos and Connor2014), but nevertheless captures the expected weakening (relative to the resonant result) qualitatively. A plot of the optimal bound of (4.10) is provided in figure 2. Note that the resonant stabilization at high $k_\|$ (low $\bar {\kappa }_\|$) is not captured in this case.
It is interesting to observe that this latter limit corresponds to $\varDelta _\textrm {min} \rightarrow -\infty$, making $\skew2\tilde{H}$ in some sense dominated by the electrostatic component.
4.2 Toroidal ITG mode
Now taking $k_\parallel v _{\textrm {th}}$ to be small, we can neglect the right-hand side of (4.5), leaving
To derive the optimal choice of $\varDelta$, we again take the $b \rightarrow 0$ limit and obtain from (4.12)
where we define $\kappa _d = \eta \omega _\ast /\omega _d$. Note the similar qualitative behaviour with $\varDelta$ as (4.6), namely its divergence at the $\varDelta \rightarrow 1 + \tau$, and unbounded growth as $\varDelta \rightarrow -\infty$. The key difference here arises in the linear term in the drive parameter $\kappa$; this is expected from the theory of the toroidal ITG mode since the sign of the drift frequency (associated with so-called ‘good’ and ‘bad’ magnetic curvature) is important for the resonance.
We now find the value of $\varDelta$ that minimizes $\varLambda$
where we define the parameter $\bar {\kappa }_d = \kappa _d/(1 + \tau )$. Substituting this into (4.13) yields
with $\zeta = \sqrt {2 \bar {\kappa }_d^2-8\bar {\kappa }_d/3 + 1}$. This expression for $\varLambda _\textrm {min}$ is naturally separated into a factor that depends only on the instability parameter $\bar {\kappa }_d$, from which we can derive the asymptotic behaviour. To show the behaviour of this factor we plot the quantity $\tau (1 + \tau )\varLambda _\textrm {min}^2/(\eta \omega _\ast )^2$ in figure 2. The overall behaviour of $\varLambda _\textrm {min}$ is captured by the following limits:
where we denote $\sigma = \pm 1$ as the sign of $\kappa _d$. At large drive ($|\bar {\kappa }_d| \gg 1$; $|\eta \omega _\ast | \gg |\omega _d| (1+\tau )$) we recover the expected non-resonant (‘fluid’) behaviour of the toroidal ITG mode with no density gradient, namely $\gamma _L \sim \sqrt {\eta \omega _\ast \omega _d}$. Note that this growth rate is much smaller than the bound found by merely considering the Helmholtz free energy (Helander & Plunk Reference Helander and Plunk2022). Although we do not see the complete stabilization ($\varLambda = 0$) at negative values of $\bar {\kappa }_d$ (opposite sign of $\omega _d$ and $\eta \omega _\ast$) expected from theory, there is a strong asymmetry, with $|\varLambda |$ having its larger values at positive $\bar {\kappa }_d$ and being comparatively much smaller for negative $\bar {\kappa }_d$.
The value $\bar {\kappa }_d = 4/3$ (i.e. $\eta \omega _\ast = 4 (1+\tau )\omega _d/3$) achieves the maximal value of $\varLambda _\textrm {min}$ at fixed $\eta \omega _\ast$, and therefore is evocative of the resonance condition for the toroidal ITG modes $\eta \omega _\ast \sim \omega _d$ (Biglari, Diamond & Rosenbluth Reference Biglari, Diamond and Rosenbluth1989). This value of $\bar {\kappa }_d$ is obtained by solving for $\varDelta _\textrm {min} = 0$, explaining why it produces the worst bound, i.e. that given by optimal growth of Helmholtz free energy.
It is noteworthy that for the limit $\bar {\kappa }_d \rightarrow 0$ ($\omega _d \gg \eta \omega _\ast /(1+\tau )$) our method yields a value of $|\varLambda |$ that is a factor of $1/3$ reduced as compared with the resonant case, again at least qualitatively reproducing the expected stabilization of the toroidal ITG mode in this limit.
5 Conclusion
We have demonstrated that the use of a generalized form of free energy $\skew2\tilde{H}$ introduces some of the physics of wave–particleresonance that is missing in the theory of optimal mode growth of the Helmholtz free energy (Helander & Plunk Reference Helander and Plunk2021, Reference Helander and Plunk2022; Plunk & Helander Reference Plunk and Helander2022). The growth rates of optimal modes of the generalized free energy provide a rigorous upper bound on the growth of conventional gyrokinetic instabilities (‘normal modes’), which, for appropriate choice of $\Delta$ is less than or equal to the Helmholtz bound. Moreover, optimal modes of generalized free energy depend on the magnetic-field geometry to a greater extent than those associated with Helmholtz free energy. The difference in growth rates can be very large. For instance, in the important case of a strongly driven toroidal ITG mode, the Helmholtz bound is larger by a factor of order $\eta \omega _\ast / \omega _d \gg 1$.
A single ordinary differential equation has been derived for optimal modes, allowing general magnetic geometry. We found solutions of this equation in some simple limits to demonstrate that it indeed recovers, at least qualitatively, some of the physical effects expected from the theory of linear ITG modes, including sensitivity to the ratio of the frequencies associated with drive and resonance, and transition of the instability when this ratio is near one. Density gradient dependence of the ITG mode is absent from both the electrostatic and free energy input terms, assuming adiabatic electrons, so its effect is not accounted for by the theory presented here.
The results of this work have possible implications for ‘turbulence optimization’, i.e. the endeavour to shape the equilibrium magnetic geometry of stellarators for low turbulence. The general result, (3.17), allows, in principle, for the inclusion of the complete geometric information that is needed to run gyrokinetic simulations. However, the solution of this equation should be far simpler and more efficient, due to the reduction of velocity space to a single moment. Stellarator optimization requires a large number of calculations to be performed, which so far has severely limited the use of direct gyrokinetic simulations, even linear ones. The simplifications obtained here, which also include the removal of the need for time integration, will need to be balanced against numerical costs associated with eigenvalue problems, and possible costs associated with determining the optimal value of the parameter $\varDelta$, if the absolute strongest bound is desired.
Our results found for the toroidal branch of the ITG mode hint at a possible optimization strategy based on optimal modes. Consider fixed plasma conditions, i.e. a given temperature gradient ($\eta \omega _\ast$) and temperature ratio ($\tau$): at high drive ($\eta \omega _\ast / (1 + \tau ) > 4\omega _d/3$), minimization of the optimal growth rate $|\varLambda |$ is achieved by minimization of the magnetic drift $\omega _d$ (i.e. magnetic curvature), corresponding to minimization of the strongly driven(non-resonant) toroidal ITG mode. On the other hand, at low drive ($\eta \omega _\ast /(1 + \tau ) < 4\omega _d/3$), the increase of $\omega _d$ is favoured, corresponding to a weakening of the marginally unstable ITG mode, i.e. an increase of the threshold of instability. The latter case corresponds to ‘critical gradient’ optimization, an idea which has recently been developed (Roberg-Clark, Plunk & Xanthopoulos Reference Roberg-Clark, Plunk and Xanthopoulos2022a; Roberg-Clark, Xanthopoulos & Plunk Reference Roberg-Clark, Xanthopoulos and Plunk2022b).
It is worth mentioning the larger context in which optimal modes are potentially interesting to study, and applications besides their use to bound or estimate the growth rate of normal instabilities. Although these modes ($\tilde {g}_n$, $\varLambda _n$) do not arise as late-time-asymptotic solutions of initial-value gyrokinetic simulations, as normal modes do, they are still realizable in the sense that a gyrokinetic simulation initialized in one of these states $\tilde {g}_n$ will temporarily exhibit an energy growth rate exactly equal to $2\varLambda _n$. These modes are thus just as ‘real’ as conventional eigenmodes and may also be observed, to some extent, in nonlinear simulations, where fluctuations are continually driven away by nonlinear interactions from the form of normal linear instabilities. They are also potentially useful for studying systems without any unstable normal modes, i.e. ‘subcritical’ turbulence (indeed this is the context in which they were originally formulated). Furthermore, since optimal modes are based on quadratic norms that are nonlinearly conserved by the gyrokinetic equations, they can be used to bound the instantaneous growth rates observed in fully nonlinear solutions; this point was made in Parts 1 and 2, but applies equally well here.
More general solutions of the optimal mode equation, and the application to optimization, will be pursued in future works. Other special limits can also be explored including the limit of large electron-bounce frequency, appropriate for studying trapped-electron modes, or adiabatic-ion limits applied to universal instabilities. Electromagnetic generalizations are also possible: although it is not clear how to construct a positive–definite electromagnetic form of the generalized free energy $\skew2\tilde{H}$ that is a nonlinear invariant, it is certainly possible to consider related measures that focus on linear bounds.
Acknowledgements
Editor Alex Schekochihin thanks the referees for their advice in evaluating this article.
Funding
This work has been carried out within the framework of the EUROfusion Consortium, funded by the European Union via the Euratom Research and Training Programme (Grant Agreement No 101052200 – EUROfusion). Views and opinions expressed are, however, those of the author(s) only and do not necessarily reflect those of the European Union or the European Commission. Neither the European Union nor the European Commission can be held responsible for them. This work was partly supported by a grant from the Simons Foundation (560651, PH).
Declaration of interests
The authors report no conflict of interest.
Appendix A. Several kinetic species
For a plasma with an arbitrary number of particle species, we multiply each gyrokinetic equation
by $e_a \overline {\delta \phi }_{\boldsymbol {k}}^\ast$, integrate over velocity space, take the real part and the average $\langle \cdots \rangle$ over the flux tube and sum over all species $a$ and wave vectors $\boldsymbol {k}$. In other words, we apply the operator
Since the expression
changes sign under an exchange of $\boldsymbol {k}$ and ${\boldsymbol {k}}'$, the nonlinear terms cancel upon summation over $\boldsymbol {k}$ and ${\boldsymbol {k}}'$, and we obtain
The quasi-neutrality equation (2.10) can be used to write the first term as
We thus arrive at the electrostatic energy balance equation
which is the generalization of (2.14) to several species. The right-hand side can be interpreted as minus the work done by the electric field on the various particle species. The first term in this expression contains
which we used in (3.8), with $\sigma = v_\| / | v_\| |$, $E_a = m_av^2/2$ and $\mu _a = m_a v_\perp ^2/(2B)$.
Appendix B. Derivation of operators ${\mathcal {D}}$ and ${\mathcal {K}}$
We first write forms of $D$ and $K$ explicit in $\tilde {g}$, noting that the contribution to $D$ proportional to the density gradient is zero by use of quasi-neutrality with the adiabatic electron approximation (the factor $\eta \omega _\ast \propto \textrm {d}T_i/\textrm {d}\psi$ appears in what follows, but never $\omega _\ast \propto \textrm {d}n_i/\textrm {d}\psi$ alone). The terms proportional to $\nu$, involved in the transformation from $g$ to $\tilde {g}$, are also zero, due to oddness in $v_\|$, so expressing energy input in terms of $\tilde {g}$ merely has the consequence of introducing the overall factor $\lambda$. The expressions are
where we separate $K = K_\| + K_d$ and ‘c.c.’ denotes complex conjugate. These can be re-expressed in terms of linear operators by writing them in the form
Identifying ${\mathcal {D}}$ and ${\mathcal {K}}_d$ is simply a matter of exchanging labels of dummy variables of integration ($v$ for $v^\prime$, etc.). Manipulating the expression for $K_\|$ to reveal ${\mathcal {K}}_\|$ is more involved. We also need to integrate by parts in $l$ and will need to use the fact that $\partial /\partial l$ is performed at fixed phase-space variables $E_i$ and $\mu = m_i v_{\perp }^2/(2B)$. The velocity-space volume element contains an important factor of $1/v_\|$, which generally depends on $l$ and does not itself commute with $\partial /\partial l$
where $\sigma$ denotes the sign of $v_\|$.
Appendix C. Moment form of eigenproblem
The three terms of (3.9) can be rewritten in terms of the moments of $\tilde {g}$ (3.12)
Then, taking moments of (3.9) yields the following five equations:
See the next section where the integrals $G_{m,n}$, etc., are defined and evaluated. Note that the final two equations can be immediately used to eliminate $\kappa _4$ and $\kappa _5$, leaving a system of three equations. The second and third equations are used together to find forms for $\kappa _2$ and $\kappa _3$ in terms of $\kappa _1$, and these forms are substituted into the first equation to obtain the final form, in terms of $\kappa _1$ only, given by (3.17).
Appendix D. Bessel-type integrals
The following definitions, mostly copied from Plunk & Helander (Reference Plunk and Helander2022), are needed to perform the various integrals that appear in the moment equations for our eigenproblem. First, we need a general form of Weber's integral (Iyanaga & Kawada Reference Iyanaga and Kawada1980)
where $\textrm {I}_{n}$ is the modified Bessel function of order $n$. The integrals we need to evaluate can be conveniently found in terms of ${\mathcal {I}}_{n}$. We define
where $m$ is assumed to be even. Now we note that these integrals can be evaluated in terms of Weber's integral
The above relations allow us to evaluate the functions
where
and $\varGamma _E$ is the gamma function. Finally, we can evaluate the integrals $G_{m,n}^{\prime }$ and $G_{m,n}^{\prime }$. We define
Relating $x_\perp$ to the proper gyrokinetic phase-space variable $\mu$, that is $x_\perp = (\mu B/T_i)^{1/2}$, allows the derivatives to be evaluated
so that we can write
These expressions allow us to evaluate
D.1 Explicit expressions for some Bessel integrals
The $b(l)$-dependent factors in (3.17) can be written as
which can be evaluated using the identities of the previous section in terms of the familiar $\varGamma _n(b)$ of gyrokinetic theory (suppressing its argument for succinctness)
where we recall
For completeness, we evaluate the few remaining factors that enter (3.17)
and using (D11)
D.2 Limit $b \rightarrow 0$
In the limit $b \rightarrow 0$ we obtain
and
Appendix E. Linear dispersion relation
For comparison, the following local linear dispersion relation can be solved numerically (here, we neglect finite Larmor radius effects $\textrm {J}_0 = 1$ and assume zero density gradient)
where $\varOmega = \omega /(\omega _* \eta )$. The velocity integrals can be evaluated separately for the $k_\| = 0$ and $\omega _d = 0$ cases in terms of the plasma dispersion function (Kadomtsev & Pogutse Reference Kadomtsev and Pogutse1970; Biglari et al. Reference Biglari, Diamond and Rosenbluth1989).