1 Introduction
While stellarators offer the possibility of stable, steady-state fusion power with minimal recirculating power and immunity from disruptions, particle confinement in stellarators is a challenge. In a general non-axisymmetric magnetic field, even if magnetic surfaces exist, guiding-centre trajectories are not necessarily confined close to a magnetic surface in the absence of turbulence and collisions, as they are in perfect axisymmetry. However, confinement can be improved significantly by optimizing the shaping of the magnetic field. Guiding-centre trajectories are essentially determined by the strength of the magnetic field $B$ in Boozer coordinates $(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ , where $r$ labels magnetic surfaces, and $\unicode[STIX]{x1D703}$ and $\unicode[STIX]{x1D711}$ are poloidal and toroidal angles (Boozer Reference Boozer1981). If $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ has certain forms, such as quasi-symmetry (Nührenberg & Zille Reference Nührenberg and Zille1988) or omnigenity (Cary & Shasharina Reference Cary and Shasharina1997; Landreman & Catto Reference Landreman and Catto2012), the guiding-centre confinement would be as good as in axisymmetry. In principle, $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ is a function of the shapes of the magnetic surfaces through the equations of magnetohydrodynamic (MHD) equilibrium, but this functional relationship is complicated. Given a desired $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ , it is not generally clear whether a three-dimensional magnetic field $\boldsymbol{B}(\boldsymbol{r})$ exists with the desired field strength and which solves the MHD equilibrium equations, much less what this solution $\boldsymbol{B}(\boldsymbol{r})$ is.
Previously, MHD equilibria with desirable $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ have been obtained using optimization (Nührenberg & Zille Reference Nührenberg and Zille1988; Nührenberg, Lotz & Gori Reference Nührenberg, Lotz and Gori1994; Garabedian Reference Garabedian1996; Zarnstorff et al. Reference Zarnstorff, Berry, Brooks, Fredrickson, Fu, Hirshman, Hudson, Ku, Lazarus and Mikkelsen2001). In this approach, an ‘off-the-shelf’ optimization algorithm is applied to minimize an objective function representing the departure from the desired $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ (for instance, the summed squared amplitudes of symmetry-breaking terms in the Fourier series), as some shape parameters of a bounding magnetic surface are varied. For each function evaluation, a three-dimensional MHD equilibrium solution must be calculated numerically and then converted to Boozer coordinates. While this approach has been successful, it has some shortcomings. Since there are multiple local minima, results depend on the initial condition, and one is never sure that all the interesting regions of parameter space have been found. The optimization is computationally expensive, and little insight is gained as to the number of degrees of freedom in the problem.
A complementary approach was taken by Garren & Boozer (Reference Garren and Boozer1991a ,Reference Garren and Boozer b ). Their work is commonly cited as a proof that perfectly quasi-symmetric magnetic fields (apart from truly axisymmetric ones) do not exist, but less well known is that their work contains a practical procedure to directly construct MHD equilibria with desirable $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ , generating ‘optimized’ stellarators without optimization. The Garren–Boozer analysis is based upon an expansion in $r$ , the effective distance from the magnetic axis; while it does not describe the outer region of a low-aspect-ratio device, it does describe some region sufficiently close to the axis of any stellarator, even one with low aspect ratio. (A complementary approach, based on expansion in departure from axisymmetry, was recently developed by Plunk & Helander (Reference Plunk and Helander2018).) The present paper is the first in a series in which we extend the Garren & Boozer framework, to more fully understand the landscape of stellarator shapes with good confinement, and to develop a practical tool for generating good initial conditions for conventional optimization.
In this first paper of the series, we derive the relationship between the shape of the magnetic surfaces in cylindrical coordinates $(R,\unicode[STIX]{x1D719},z)$ and $B$ in Boozer coordinates. (More precisely, we consider surface shapes parameterized by $\{R(\unicode[STIX]{x1D703},\unicode[STIX]{x1D719}),\,Z(\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})\}$ using the Boozer poloidal angle $\unicode[STIX]{x1D703}$ , so our representation is in a sense a hybrid one.) While we use a similar $r$ expansion to Garren & Boozer, our calculation is different because theirs did not use cylindrical coordinates. Instead, Garren & Boozer worked in the Frenet–Serret frame of the magnetic axis. The Frenet–Serret frame is an orthonormal basis $(\boldsymbol{t},\boldsymbol{n},\boldsymbol{b})$ satisfying the equations
where $\boldsymbol{t}=\text{d}\boldsymbol{r}_{0}/\text{d}\ell$ , $\boldsymbol{r}_{0}$ is the position vector along the magnetic axis and $\ell$ denotes the arclength along the curve. The vectors $\boldsymbol{t}$ , $\boldsymbol{n}$ and $\boldsymbol{b}$ are called the tangent, normal and binormal, $\unicode[STIX]{x1D705}$ is the curvature and $\unicode[STIX]{x1D70F}$ is the torsion. Note that the opposite sign convention for torsion is used in Garren & Boozer (Reference Garren and Boozer1991a ,Reference Garren and Boozer b ).
There are two particular motivations for this paper. First, we will (in Part 2 of the series, Landreman, Sengupta & Plunk Reference Landreman, Sengupta and Plunk2018) generate plasma shapes as input for stellarator physics codes that employ cylindrical coordinates, specifically the VMEC code (Hirshman & Whitson Reference Hirshman and Whitson1983; Hirshman, van Rij & Merkel Reference Hirshman, van Rij and Merkel1986). This can be done either using the equations for cylindrical coordinates derived in the present paper (§ 2), or else by solving Garren & Boozer’s equations in the Frenet frame and mapping the results to cylindrical coordinates afterwards, using a transformation that will be derived in § 4. By having these two approaches available, and showing that the results are the same, we can be highly confident that the results are correct. An analytic proof of the equivalence of the two methods will be presented in this paper (§ 4), and numerical solutions will be presented in an accompanying Part 2 (Landreman et al. Reference Landreman, Sengupta and Plunk2018). There, we will show that our approaches can generate quasi-symmetric flux surface shapes in ${<}$ 1 ms on a laptop – 4 orders of magnitude faster than a single VMEC equilibrium calculation, much less a traditional optimization – thus enabling high-resolution mapping of the landscape of possible quasi-symmetric plasma shapes.
Our second motivation in this paper is to modify Garren & Boozer’s analysis to avoid the Frenet–Serret frame because this basis can be pathological in certain situations of interest. The Frenet–Serret frame is known to be problematic if there are any points of vanishing curvature: even smooth curves can have discontinuous Frenet–Serret basis vectors. For instance, for the curve defined by $R(\unicode[STIX]{x1D719})=1+R_{c}\cos (n\unicode[STIX]{x1D719})$ and $z(\unicode[STIX]{x1D719})=z_{s}\sin (n\unicode[STIX]{x1D719})$ , the curvature vanishes if $R_{c}=1/(n^{2}+1)$ , and the Frenet basis is generally discontinuous at these points, as shown in figure 1. Where $\unicode[STIX]{x1D705}=0$ , the torsion is generally not well defined. This situation of vanishing $\unicode[STIX]{x1D705}$ is of particular interest because it is necessary for a desirable $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ optimization: omnigenity with poloidally closed $B$ contours (Cary & Shasharina Reference Cary and Shasharina1997; Subbotin et al. Reference Subbotin, Mikhailov, Shafranov, Isaev, Nührenberg, Nührenberg, Zille, Nemov, Kasilov and Kalyuzhnyj2006; Helander & Nührenberg Reference Helander and Nührenberg2009; Landreman & Catto Reference Landreman and Catto2012) (sometimes called ‘quasi-isodynamic’). In this optimization, which yields good particle confinement at the same time as vanishing bootstrap current (Helander & Nührenberg Reference Helander and Nührenberg2009), the maximum of $B$ on each $r$ surface must be a constant- $\unicode[STIX]{x1D711}$ curve, so $\unicode[STIX]{x2202}B/\unicode[STIX]{x2202}\unicode[STIX]{x1D703}$ must vanish for all $\unicode[STIX]{x1D703}$ at these $\unicode[STIX]{x1D711}$ values. To see that this condition near the axis implies $\unicode[STIX]{x1D705}=0$ , consider that the pressure gradient $\unicode[STIX]{x1D735}p$ vanishes on the magnetic axis, so it follows from the MHD equilibrium relation $(\unicode[STIX]{x1D735}\times \boldsymbol{B})\times \boldsymbol{B}=0$ that
The condition $\unicode[STIX]{x2202}B/\unicode[STIX]{x2202}\unicode[STIX]{x1D703}$ on the maximum- $B$ curves near the axis implies $\unicode[STIX]{x1D735}_{\bot }B=0$ there, implying $\unicode[STIX]{x1D705}=0$ . While one would have to grapple with discontinuities and ill-defined torsion to apply the Frenet–Serret approach to construct omnigenous fields with poloidally closed $B$ contours, all quantities remain smooth in cylindrical coordinates. Construction of omnigenous magnetic fields will be considered in Part 3 of this series.
The Frenet–Serret frame has also been used in another important stellarator calculation: Mercier’s result that rotational transform on the magnetic axis arises from a combination of axis torsion, rotating elongation and current density (Mercier Reference Mercier1964; Helander Reference Helander2014). This result was also derived by Garren & Boozer (Reference Garren and Boozer1991a ) as part of their quasi-symmetry analysis, as their equation (77). Just as Garren & Boozer’s quasi-symmetry equation acquires singularities if the axis curvature ever vanishes, so does Mercier’s expression for the rotational transform, as it includes torsion explicitly. As part of our analysis, we will re-derive Mercier’s result in cylindrical coordinates, resulting in an expression that does not become singular if the axis curvature vanishes.
The main content of this paper begins in § 2 with the calculation of the relationship between $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ and flux surface shape directly in cylindrical coordinates. The analogous results of the Garren–Boozer calculation in the Frenet–Serret frame are then reviewed in § 3. The transformation between the two coordinate systems is derived in § 4.1, and this transformation is used in the remainder of § 4 to prove that the cylindrical and Frenet–Serret equations are equivalent, when the latter are valid. Some reductions of the equations for the particular case of quasi-symmetry are discussed in § 5, and we will conclude in § 6.
2 Direct calculation in cylindrical coordinates
We now present the calculation in which the field strength in Boozer coordinates is directly related to the magnetic surface shape in cylindrical coordinates. Aside from the fact that we describe the magnetic surface shapes in cylindrical coordinates rather than by the projections along the Frenet–Serret vectors, our approach is similar in structure to the one in Garren & Boozer (Reference Garren and Boozer1991a ). The covariant and contravariant expressions for $\boldsymbol{B}$ in Boozer coordinates are equated, giving three independent equations. The square of either expression for $\boldsymbol{B}$ gives an additional equation for $B$ . These four equations are then expanded in the distance $r$ from the magnetic axis. Here we will carry out the expansion to sufficient order that the first-order quantities in $r$ are determined.
2.1 Starting equations
In any straight-field-line coordinates, including Boozer coordinates, the magnetic field can be written
where $2\unicode[STIX]{x03C0}\unicode[STIX]{x1D713}$ is the toroidal flux, $\unicode[STIX]{x1D704}$ is the rotational transform and $\unicode[STIX]{x1D703}$ and $\unicode[STIX]{x1D711}$ are the poloidal and toroidal angles. In the particular case of Boozer coordinates, $\boldsymbol{B}$ can also be written
Here $I(\unicode[STIX]{x1D713})$ is $\unicode[STIX]{x1D707}_{0}/(2\unicode[STIX]{x03C0})$ times the toroidal current enclosed by the flux surface, and $G(\unicode[STIX]{x1D713})$ is $\unicode[STIX]{x1D707}_{0}/(2\unicode[STIX]{x03C0})$ times the poloidal current outside the flux surface. The Boozer toroidal angle $\unicode[STIX]{x1D711}$ differs from the cylindrical azimuthal angle $\unicode[STIX]{x1D719}$ , and we will keep track of the difference, denoted $\unicode[STIX]{x1D708}$ :
(By assuming this equation, our analysis will not pertain to certain unconventional configurations such as knots in which $\unicode[STIX]{x1D719}$ increases by an integer ${>}1$ multiple of $2\unicode[STIX]{x03C0}$ when $\unicode[STIX]{x1D711}$ increases by $2\unicode[STIX]{x03C0}$ .) We will consider the independent variables to be $(\unicode[STIX]{x1D713},\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})$ . From the product of (2.1) and (2.2), the Jacobian of these coordinates is
We will assume $\unicode[STIX]{x2202}\unicode[STIX]{x1D708}/\unicode[STIX]{x2202}\unicode[STIX]{x1D719}>-1$ so this Jacobian remains non-zero. Physically, this assumption means the direction of $\boldsymbol{B}$ always points toward increasing $\unicode[STIX]{x1D719}$ or always points towards decreasing $\unicode[STIX]{x1D719}$ , never reversing direction. This same assumption is made in the VMEC code (Hirshman & Whitson Reference Hirshman and Whitson1983), and it is not restrictive in practice.
Using the dual relations
where $\boldsymbol{r}$ is the position vector, we can write (2.1) as
and write (2.2) as
The derivatives of $\boldsymbol{r}(\unicode[STIX]{x1D713},\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})=R\boldsymbol{e}_{R}+z\boldsymbol{e}_{z}$ can be evaluated using $\text{d}\boldsymbol{e}_{R}/\text{d}\unicode[STIX]{x1D719}=\boldsymbol{e}_{\unicode[STIX]{x1D719}}$ , where $(\boldsymbol{e}_{R},\boldsymbol{e}_{\unicode[STIX]{x1D719}},\boldsymbol{e}_{z})$ are cylindrical unit basis vectors. Equating the three cylindrical components of (2.6) and (2.7), we obtain
To get (2.9) we have added (2.8) times $\unicode[STIX]{x2202}R/\unicode[STIX]{x2202}\unicode[STIX]{x1D719}$ and (2.10) times $\unicode[STIX]{x2202}z/\unicode[STIX]{x2202}\unicode[STIX]{x1D719}$ to the $\boldsymbol{e}_{\unicode[STIX]{x1D719}}$ components. In these expressions, we have changed the flux surface label coordinate from $\unicode[STIX]{x1D713}$ to the effective minor radius $r(\unicode[STIX]{x1D713})$ defined by $2\unicode[STIX]{x03C0}\unicode[STIX]{x1D713}=\unicode[STIX]{x03C0}r^{2}\bar{B}$ , where $\bar{B}$ is an arbitrary reference magnitude of magnetic field. (Since $\unicode[STIX]{x1D713}$ can be negative, $\bar{B}$ may be negative.) Also, a relation for $B$ can be obtained by squaring (2.6):
Equations (2.8)–(2.11) are the basis of the remainder of the analysis, in which these equations will be systematically expanded.
2.2 Expansion about the magnetic axis
We take the magnetic axis to be described by its cylindrical coordinates $R_{0}(\unicode[STIX]{x1D719})$ and $z_{0}(\unicode[STIX]{x1D719})$ . Regularity considerations near the axis imply we can write the cylindrical coordinate $R(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})$ for a general point near the axis in the form of an expansion
where
Expansions of the same form are made for $z$ , $\unicode[STIX]{x1D708}$ and $B$ :
These expansions are justified in appendix A. We also have
Using these expansions, we proceed to systematically consider the terms of each order in (2.8)–(2.11).
2.3 Magnitude of $B$ : zeroth order
We first consider the $O(r^{0})$ terms in (2.11). These terms give
where $s_{G}=\pm 1$ , primes denote $\text{d}/\text{d}\unicode[STIX]{x1D719}$ , and $\ell ^{\prime }>0$ is the differential length of the magnetic axis:
Integrating (2.20) in $\unicode[STIX]{x1D719}$ ,
Thus, $s_{G}$ is the sign of $G_{0}$ , $+1$ if $\boldsymbol{B}$ points in the direction of increasing $\unicode[STIX]{x1D719}$ and $-1$ otherwise. Equations (2.20)–(2.22) allow us to eliminate $\unicode[STIX]{x1D708}_{0}$ and $G_{0}$ in favour of $R_{0}$ , $z_{0}$ and $B_{0}$ .
2.4 Equating representations of the field: first order
Next, the leading-order terms in the $r$ expansion of (2.9) are $O(r^{1})$ , giving
We can eliminate $\unicode[STIX]{x1D708}_{0}$ in this equation using (2.20) to obtain
This equation, which is analogous to (53) in Garren & Boozer (Reference Garren and Boozer1991a ), expresses the fact that the toroidal flux within the magnetic surface $r$ should be $2\unicode[STIX]{x03C0}\unicode[STIX]{x1D713}=\unicode[STIX]{x03C0}r^{2}\bar{B}$ . To see this, consider that the toroidal field on the magnetic axis is $\boldsymbol{B}\boldsymbol{\cdot }\boldsymbol{e}_{\unicode[STIX]{x1D719}}=s_{G}B_{0}\boldsymbol{t}\boldsymbol{\cdot }\boldsymbol{e}_{\unicode[STIX]{x1D719}}=B_{0}R_{0}/(s_{G}\ell ^{\prime })$ , and as shown in appendix B, the area of the flux surface in the constant- $\unicode[STIX]{x1D719}$ plane is $\unicode[STIX]{x03C0}r^{2}$ times the right-hand side of (2.24).
Similarly, the leading terms in (2.8) and (2.10) are $O(r^{1})$ and give
Solving for $\unicode[STIX]{x1D708}_{1c}$ and $\unicode[STIX]{x1D708}_{1s}$ and applying (2.24), we find
2.5 Magnitude of $B$ : first order
Another pair of equations is obtained from the $O(r^{1})$ terms in (2.11). These terms can be found by applying $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}r$ to (2.11) and evaluating the result at $r\rightarrow 0$ . We find
In this equation, the terms that include a factor of $\unicode[STIX]{x1D704}_{0}$ can be written
which can be seen to vanish in light of (2.27) and (2.20). Eliminating $\unicode[STIX]{x1D708}_{0}$ and $\unicode[STIX]{x1D708}_{1}$ in the remaining terms using (2.20) and (2.27), one finds
where
Noting from the first line of (1.1) that $\unicode[STIX]{x1D705}\boldsymbol{n}\ell ^{\prime }=\boldsymbol{t}^{\prime }=[(\ell ^{\prime })^{-1}\boldsymbol{r}_{0}^{\prime }]^{\prime }$ , and evaluating the result in cylindrical coordinates, it can be seen that equivalent expressions to (2.31)–(2.32) are
Note that the $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ components of $B_{1}$ , $R_{1}$ and $z_{1}$ each satisfy (2.30) separately. Equations (2.30)–(2.32) are analogous to (70) in Garren & Boozer (Reference Garren and Boozer1991a ). These equations reflect (1.2). In the limit of a circular magnetic axis, $R_{0}^{\prime }=0$ and $z_{0}^{\prime }=0$ , equation (2.30)–(2.32) reduce to $B_{1}/B_{0}=-R_{1}/R_{0}$ , reflecting the expected $B\propto 1/R$ variation.
2.6 Equating representations of the field: second order
The highest-order terms in the $r$ expansion we will consider are the $O(r^{2})$ terms in (2.8)–(2.10). The expressions at this order become rather lengthy and so details are left to appendix C. At $O(r^{2})$ , the three equations (2.8)–(2.10) each have a $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ component, so there are six independent equations. Although nine second-order quantities ( $R_{2s}$ , $R_{2c}$ , $R_{20}$ and similar $\unicode[STIX]{x1D708}$ and $z$ terms) appear, they only enter through five linearly independent combinations. Therefore the second-order quantities can be annihilated by forming a certain linear combination of the six equations, equation (C 10). What remains is an equation relating zeroth- and first-order quantities:
where
and
Our (2.34)–(2.36) play an analogous role to (63) and (67) in Garren & Boozer (Reference Garren and Boozer1991a ). Note that (2.34) can be integrated to give $\unicode[STIX]{x1D704}_{0}=(\oint w\,\text{d}\unicode[STIX]{x1D719})^{-1}\oint (wT/V)\,\text{d}\unicode[STIX]{x1D719}$ for any $w(\unicode[STIX]{x1D719})$ , analogous to Garren & Boozer’s (77). Encoded in these equations is the classic result by Mercier (Reference Mercier1964), Helander (Reference Helander2014): rotational transform on the magnetic axis arises due to axis torsion, rotating elongation and toroidal current. Indeed, in Part 2 we will compute $\unicode[STIX]{x1D704}_{0}$ numerically by solving (2.34)–(2.36) or its Frenet–Serret analogue. The toroidal current contribution to $\unicode[STIX]{x1D704}_{0}$ is the $I_{2}$ term in $T$ , while the axis torsion and rotating elongation contributions are evidently contained in the remaining terms. Interestingly, while the torsion in Mercier’s expression involves the third derivative of the axis shape, the highest derivative of the axis shape appearing in (2.34)–(2.36) is the second. If there are any points where the axis curvature vanishes, the torsion becomes ill defined, so Mercier’s expression for $\unicode[STIX]{x1D704}$ (which explicitly depends on $\unicode[STIX]{x1D70F}$ ) becomes awkward; equation (2.34) has no such problem.
Another perspective on rotational transform and torsion in cases with vanishing curvature (without effects of elongation) has been discussed by Pfefferlé et al. (Reference Pfefferlé, Gunderson, Hudson and Noakes2018).
3 Frenet–Serret approach
The analogous calculation using the Frenet–Serret frame is clearly explained in Garren & Boozer (Reference Garren and Boozer1991a ,Reference Garren and Boozer b ), so we will not repeat it here, only quote the main results. The position vector is written
where $\boldsymbol{r}_{0}$ , $\boldsymbol{n}$ , $\boldsymbol{b}$ and $\boldsymbol{t}$ refer to the magnetic axis. The quantities $X$ , $Y$ and $Z$ are expanded similarly to (2.12)–(2.14) but with $\unicode[STIX]{x1D719}\rightarrow \unicode[STIX]{x1D711}$ :
where regularity requires
and analogous expansions are made for $Y$ and $Z$ . The expansion of $B$ is written in terms of $\unicode[STIX]{x1D711}$ rather than $\unicode[STIX]{x1D719}$ , so
where $\hat{B}_{1}(\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})=\hat{B}_{1s}(\unicode[STIX]{x1D711})\sin \unicode[STIX]{x1D703}+\hat{B}_{1c}(\unicode[STIX]{x1D711})\cos \unicode[STIX]{x1D703}$ .
Instead of (2.20), one obtains $G_{0}=s_{G}B_{0}\,\text{d}\ell /\text{d}\unicode[STIX]{x1D711}$ . Instead of (2.24), one finds $Z_{1}=0$ and
Noting from appendix B that the left-hand side of this equation is the cross-sectional area of the flux surface in a plane perpendicular to the on-axis $\boldsymbol{B}$ , equation (3.6) represents the fact that the toroidal flux inside the flux surface is $\unicode[STIX]{x03C0}r^{2}\bar{B}$ . Instead of (2.30), one finds
where this equation holds separately for $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ components. Instead of (2.34)–(2.36), Garren & Boozer obtain
where
and
These equations correspond to (63) in Garren & Boozer (Reference Garren and Boozer1991a ), but with an extra $I_{2}$ term since a vacuum field was assumed in that work. The fact that a $2$ appears in the $\unicode[STIX]{x1D70F}$ term here whereas a 4 appears in Garren & Boozer (Reference Garren and Boozer1991a ) is due to the normalization used in the latter, and $\unicode[STIX]{x1D70F}$ enters with the opposite sign due to the opposite sign convention for torsion.
Combining the above equations to eliminate unknowns, the system can be reduced to a single equation. To this end, we introduce a variable $\unicode[STIX]{x1D70E}(\unicode[STIX]{x1D711})$ related to the flux surface shape, defined by
From this definition and (3.6)–(3.7),
Substituting these results and (3.7) into (3.8), we obtain
Considering $\unicode[STIX]{x1D705}$ , $\unicode[STIX]{x1D70F}$ , $I_{2}$ , $B_{0}$ , $\hat{B}_{1s}$ and $\hat{B}_{1c}$ to be known, this result is a first-order nonlinear ordinary differential equation for $\unicode[STIX]{x1D70E}$ . Once $\unicode[STIX]{x1D70E}$ is obtained, $Y_{1s}$ and $Y_{1c}$ can be found from (3.12), and $X_{1s}$ and $X_{1c}$ are known from (3.7), so the flux surface shape can be reconstructed from (3.1).
4 Equivalence of the two approaches
4.1 Relating representations of the surface shape
Let us now prove that if the curvature of the magnetic axis does not vanish, the Frenet–Serret approach and the direct calculation in cylindrical coordinates are equivalent, as they should be. To begin, we must relate $X_{1}$ and $Y_{1}$ to $R_{1}$ and $z_{1}$ . This can be done by equating the position vector in the two approaches, expanding (3.1) using $\unicode[STIX]{x1D711}(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})=\unicode[STIX]{x1D711}_{0}(\unicode[STIX]{x1D719})+r\unicode[STIX]{x1D708}_{1}(\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})+O(r^{2})$ where $\unicode[STIX]{x1D711}_{0}(\unicode[STIX]{x1D719})=\unicode[STIX]{x1D719}+\unicode[STIX]{x1D708}_{0}(\unicode[STIX]{x1D719})$ :
Equating the $O(r^{0})$ terms gives $\boldsymbol{r}_{0}(\unicode[STIX]{x1D711}_{0})=R_{0}(\unicode[STIX]{x1D719})\boldsymbol{e}_{R}(\unicode[STIX]{x1D719})+z_{0}(\unicode[STIX]{x1D719})\boldsymbol{e}_{z}$ . Then applying $\boldsymbol{n}(\unicode[STIX]{x1D711}_{0})\boldsymbol{\cdot }(\ldots )$ and $\boldsymbol{b}(\unicode[STIX]{x1D711}_{0})\boldsymbol{\cdot }(\ldots )$ to the $O(r)$ terms in (4.1), we obtain two equations that can be represented
Here and for the rest of this section, $n_{R}=\boldsymbol{n}(\unicode[STIX]{x1D711}_{0})\boldsymbol{\cdot }\boldsymbol{e}_{R}(\unicode[STIX]{x1D719})$ , $b_{R}=\boldsymbol{b}(\unicode[STIX]{x1D711}_{0})\boldsymbol{\cdot }\boldsymbol{e}_{R}(\unicode[STIX]{x1D719})$ , analogous expressions hold for $n_{z}$ and $b_{z}$ and $X_{1}$ and $Y_{1}$ are understood to be evaluated at $\unicode[STIX]{x1D711}_{0}$ . (The $\boldsymbol{t}(\unicode[STIX]{x1D711}_{0})\boldsymbol{\cdot }(\ldots )$ component of (4.1) yields (2.27).)
Noting the components of the tangent vector in cylindrical coordinates,
the determinant of the matrix in (4.2) is
Hence the inverse transformation is
This relation enables the solution of the quasi-symmetry equations in the Frenet–Serret basis to be mapped to cylindrical coordinates. Note that by applying (4.5) and (4.4) to (2.24), we obtain (3.6), and so these equations from the Frenet–Serret and cylindrical coordinates analyses are consistent.
4.2 Equivalence of the $B_{1}$ equations
Next let us show that (2.30) and (3.7) are equivalent. Expanding (3.5) about $\unicode[STIX]{x1D711}\approx \unicode[STIX]{x1D711}_{0}$ , and equating the result to the $B$ analogue of (2.12), we obtain
The $O(r^{0})$ terms give $B_{0}(\unicode[STIX]{x1D719})=\hat{B}_{0}(\unicode[STIX]{x1D711}_{0})$ , which upon differentiation gives
Combining this result with (2.20), (2.27) and the $O(r^{1})$ terms of (4.6), we find
Then using the top row of (4.2), (3.7) and (2.30) are equivalent. Note that using (4.8), (2.30) can be written in terms of $\hat{B}_{1}$ rather than $B_{1}$ , yielding a relation between the flux surface shape in cylindrical coordinates and the field strength in Boozer coordinates:
4.3 Equivalence of the $\unicode[STIX]{x1D704}_{0}$ equations
Finally, let us show that equations (2.34)–(2.36), which determine $\unicode[STIX]{x1D704}_{0}$ in cylindrical coordinates, can be independently derived from the analogous Frenet–Serret equations (3.8)–(3.10) by applying the transformation (4.2). We first note the following relations between components of the normal and binormal vectors:
Using these results and (4.5), then
An analogous expression holds for the subscript- $1c$ ( $\cos \unicode[STIX]{x1D703}$ ) terms. Thus, it can be seen that $V^{FS}=V$ .
It remains to show $T^{FS}=T$ . To show this equivalence we first apply (4.2) and then (4.10) to the first four terms of $T^{FS}$ , giving
where
In the last term of (4.13), $\bar{B}$ is eliminated using (2.24). Applying the last two lines of (1.1),
where we have used $n_{z}b_{R}-b_{z}n_{R}=t_{\unicode[STIX]{x1D719}}=R_{0}/\ell ^{\prime }$ . Applying
and
we find
Thus, $T^{FS}=T$ as desired. This concludes the proof that whenever the curvature of the magnetic axis does not vanish, so the Frenet–Serret approach is free of singularities, all the equations derived directly in cylindrical coordinates in § 2 are equivalent to the analogous equations derived in the Frenet–Serret frame by Garren & Boozer (Reference Garren and Boozer1991a ).
5 Quasi-symmetry
Next, let us consider how the equations for the magnetic field strength reduce in an important case, that of quasi-symmetry. (The more general condition of omnigenity will be considered in Part 3.) As shown by Garren & Boozer (Reference Garren and Boozer1991a ), for quasi-symmetry to $O(r^{1})$ , the curvature of the magnetic axis can never vanish, or else the elongation of the first-order flux surfaces diverges. Since the curvature does not vanish, the Frenet–Serret frame is non-singular, and the torsion can be defined. Therefore the reduced equation (3.13) should be free of singularities. We will consider the cases of quasi-axisymmetry and quasi-helical symmetry in turn. We will not consider quasi-poloidal symmetry, $B=B(r,\unicode[STIX]{x1D703})$ , since it cannot exist at $O(r^{1})$ .
5.1 Quasi-axisymmetry
Quasi-axisymmetry is the condition $\unicode[STIX]{x2202}B/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}=0$ . At $O(r^{0})$ , quasi-axisymmetry implies $B_{0}^{\prime }=0$ . It is convenient then to take the normalizing field $\bar{B}$ equal to the constant $s_{\unicode[STIX]{x1D713}}B_{0}$ , where $s_{\unicode[STIX]{x1D713}}=\text{sign}(\unicode[STIX]{x1D713})=\pm 1$ . A consequence of $B_{0}^{\prime }=0$ is $B_{1c}(\unicode[STIX]{x1D719})=\hat{B}_{1c}(\unicode[STIX]{x1D711}_{0})$ and $B_{1s}(\unicode[STIX]{x1D719})=\hat{B}_{1s}(\unicode[STIX]{x1D711}_{0})$ .
At $O(r^{1})$ , quasi-axisymmetry implies $\text{d}\hat{B}_{1s}/\text{d}\unicode[STIX]{x1D711}=0$ and $\text{d}\hat{B}_{1c}/\text{d}\unicode[STIX]{x1D711}=0$ . We are free to shift the origin of the $\unicode[STIX]{x1D703}$ coordinate so $\hat{B}_{1s}=0$ , leaving the first-order magnetic field strength completely described by the single constant $\hat{B}_{1c}$ . In this case, equation (3.13) simplifies to
where $\unicode[STIX]{x1D70E}(\unicode[STIX]{x1D711})=\hat{B}_{1c}Y_{1c}(\unicode[STIX]{x1D711})/(s_{G}s_{\unicode[STIX]{x1D713}}B_{0}\unicode[STIX]{x1D705}(\unicode[STIX]{x1D711}))$ . This result is equivalent to (82) in Garren & Boozer (Reference Garren and Boozer1991a ) and to (A6) in Garren & Boozer (Reference Garren and Boozer1991b ). In the appendix of Part 2 (Landreman et al. Reference Landreman, Sengupta and Plunk2018), we prove that for any given $\unicode[STIX]{x1D70E}(0)$ , $I_{2}/B_{0}$ , $G_{0}/B_{0}$ , $\hat{B}_{1c}/(B_{0}\unicode[STIX]{x1D705})$ and $s_{\unicode[STIX]{x1D713}}\unicode[STIX]{x1D70F}$ , precisely one periodic solution $\unicode[STIX]{x1D70E}(\unicode[STIX]{x1D711})$ and associated $\unicode[STIX]{x1D704}_{0}$ exist, even though (5.1) is nonlinear in $\unicode[STIX]{x1D70E}$ .
5.2 Quasi-helical symmetry
Quasi-helical symmetry is the condition $B=B(r,M\unicode[STIX]{x1D703}-N\unicode[STIX]{x1D711})$ for some non-zero integers $M$ and $N$ . At $O(r^{0})$ , this condition implies $B_{0}^{\prime }=0$ , so again we can take $\bar{B}=s_{\unicode[STIX]{x1D713}}B_{0}$ to normalize by the on-axis field. The fact that only $\propto \cos \unicode[STIX]{x1D703}$ and $\propto \sin \unicode[STIX]{x1D703}$ terms are permitted in first-order quantities like $\hat{B}_{1}$ means that $M=1$ is required at this order. We are free to choose the origin of the $\unicode[STIX]{x1D703}$ coordinate so $\hat{B}_{1}(\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})=\bar{\unicode[STIX]{x1D702}}B_{0}\cos (\unicode[STIX]{x1D703}-N\unicode[STIX]{x1D711})$ (for some constant $\bar{\unicode[STIX]{x1D702}}$ ), meaning $\hat{B}_{1c}=\bar{\unicode[STIX]{x1D702}}B_{0}\cos (N\unicode[STIX]{x1D711})$ and $\hat{B}_{1s}=\bar{\unicode[STIX]{x1D702}}B_{0}\sin (N\unicode[STIX]{x1D711})$ . Substituting this $\hat{B}_{1s}$ and $\hat{B}_{1c}$ into (3.13), we find
Observe that (5.2) is the same as the quasi-axisymmetry equation (5.1) up to the generalizations $\hat{B}_{1c}\rightarrow \bar{\unicode[STIX]{x1D702}}B_{0}$ and $\unicode[STIX]{x1D704}_{0}\rightarrow \unicode[STIX]{x1D704}_{0}-N$ . The same result can also be obtained by noting that if a helical angle $\unicode[STIX]{x1D717}=\unicode[STIX]{x1D703}-N\unicode[STIX]{x1D711}$ is introduced, (2.1)–(2.2) become
These equations differ in form from (2.1)–(2.2) only through $\unicode[STIX]{x1D703}\rightarrow \unicode[STIX]{x1D717}$ , $\unicode[STIX]{x1D704}\rightarrow \unicode[STIX]{x1D704}-N$ and $G\rightarrow G+NI$ , with the latter replacement only having an effect at $O(r^{2})$ . Therefore, for $B$ to possess a single helicity in $\unicode[STIX]{x1D703}_{h}$ to the relevant order, the equations must be the same as for quasi-axisymmetry (in $\unicode[STIX]{x1D703}$ ) except for $\unicode[STIX]{x1D704}\rightarrow \unicode[STIX]{x1D704}-N$ .
Furthermore, given a particular magnetic axis shape, it is possible to determine $N$ (including the quasi-axisymmetry case $N=0$ ) before solving (5.1) or (5.2), by the following reasoning. Consider the general quasi-symmetry condition $\hat{B}_{1}(\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})=\bar{\unicode[STIX]{x1D702}}B_{0}\cos (\unicode[STIX]{x1D703}-N\unicode[STIX]{x1D711})$ for constant $\bar{\unicode[STIX]{x1D702}}$ , where $N$ is allowed to be zero or non-zero, and let us take $\bar{\unicode[STIX]{x1D702}}>0$ without loss of generality. Now consider a vector pointing perpendicularly from the axis to the $\unicode[STIX]{x1D703}-N\unicode[STIX]{x1D711}=0$ curve on the first-order-in- $r$ flux surface, which equivalently points to the maximum- $B$ contour on the surface. From (3.1) and (3.7), this vector is $\boldsymbol{n}r\bar{\unicode[STIX]{x1D702}}/\unicode[STIX]{x1D705}+\boldsymbol{b}rY_{1}$ , which has a positive projection along $\boldsymbol{n}$ at all $\unicode[STIX]{x1D711}$ . Therefore this vector to the maximum- $B$ curve never points in a direction more than $90^{\circ }$ away from the normal vector $\boldsymbol{n}$ . Hence, in a full toroidal transit around the axis, the $\unicode[STIX]{x1D703}-N\unicode[STIX]{x1D711}=0$ curve must wrap poloidally around the magnetic axis the same number of times $\boldsymbol{n}$ does so. Therefore, $N$ is the number of times $\boldsymbol{n}$ rotates poloidally around the axis in a full toroidal transit of the axis. If $\boldsymbol{n}$ does not have such a net rotation for a given axis shape, then all quasi-symmetric solutions for this axis shape will be quasi-axisymmetric, whereas if $\boldsymbol{n}$ does have this net rotation, all quasi-symmetric solutions for this axis shape will be quasi-helically symmetric.
For another perspective on $N$ , consider that because of (5.3), in the derivation of (3.13), (5.1) and (5.2), it was never imposed that $\unicode[STIX]{x1D703}$ must be a poloidal angle rather than a helical angle. The choice of $N$ in the previous paragraph finally eliminates this redundancy. If one solves the quasi-axisymmetry equation (5.1) for an axis shape that ‘really’ should have quasi-helical symmetry rather than quasi-axisymmetry, one finds that the $\unicode[STIX]{x1D703}=0$ curve on each flux surface wraps around the axis poloidally as you traverse the axis toroidally, i.e. $\unicode[STIX]{x1D703}$ turns out to be a helical angle rather than a poloidal angle.
Numerical solution of (5.1)–(5.2) as a practical method to construct and parameterize quasi-symmetric equilibria will be demonstrated in Part 2.
5.3 Necessity of axis torsion
Note that $\unicode[STIX]{x1D70F}=0$ implies the magnetic axis and $\boldsymbol{n}$ are confined to a plane, so $\boldsymbol{n}$ cannot rotate poloidally about the magnetic axis. Then by the argument in the preceding section, $\unicode[STIX]{x1D70F}=0$ can only be consistent with quasi-axisymmetry, not quasi-helical symmetry. Moreover, in a stellarator, $I_{2}$ (which represents the on-axis density of toroidal current) is typically zero, as the bootstrap current vanishes on axis. In this case, if $\unicode[STIX]{x1D70F}=0$ , the integral of (5.1) gives
The integral is positive–definite, so $\unicode[STIX]{x1D704}_{0}$ must vanish. Therefore, torsion of the magnetic axis is essential in a quasi-symmetric stellarator in order to have rotational transform on axis.
6 Discussion and conclusions
In this paper, we have derived the relationship near the magnetic axis between the flux surface shape in cylindrical coordinates and the magnetic field strength $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ in Boozer coordinates. This relationship is important for stellarator design since $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D711})$ essentially determines the guiding-centre confinement, but it is the flux surface shape in three dimensions that determines the coils and engineering design. As part of this calculation, we have also derived the relationship between the flux surface shape in cylindrical coordinates and the rotational transform. No matter how low the aspect ratio of a stellarator, the analysis here applies in a region sufficiently close to the axis. The result of this analysis is the system of equations (2.24), (2.30)–(2.32) or (4.9), and (2.34)–(2.36). These equations can be derived directly in cylindrical coordinates, as in § 2, or by the appropriate transformation of Garren & Boozer’s equations, using the transformation of § 4.1. In contrast to the calculation of Garren & Boozer (Reference Garren and Boozer1991a ), the equations here remain regular on segments or points where the axis torsion vanishes, which always occurs for omnigenous fields with poloidally closed $B$ contours. The torsion, which may not be well defined in this circumstance, does not appear in our analysis since we avoid using the Frenet–Serret frame.
Consistent with Garren & Boozer (Reference Garren and Boozer1991a ), we find that at $O(r^{1})$ , for a prescribed $B_{1}$ , there are two more $\unicode[STIX]{x1D719}$ -dependent degrees of freedom than there are equations. Specifically, the six $\unicode[STIX]{x1D719}$ -dependent unknowns ( $R_{0}$ , $z_{0}$ , $R_{1c}$ , $R_{1s}$ , $z_{1c}$ and $z_{1s}$ ) are constrained by four equations: (2.24), the $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ components of (2.30), and (2.34). Thus, two of these six functions can be viewed as inputs. Choosing $R_{0}$ and $z_{0}$ as the two inputs amounts to specifying the magnetic axis shape, and the four aforementioned equations then give the flux surface shape that yields the desired $B_{1}$ .
Acknowledgements
We acknowledge illuminating conversations about this work with G. Plunk. This work was supported by the U.S. Department of Energy, Office of Science, Office of Fusion Energy Science, under award numbers DE-FG02-93ER54197 and DE-FG02-86ER53223. This work was also supported by a grant from the Simons Foundation (560651, M.L.)
Appendix A. Regularity near the magnetic axis
In this section we will derive the form of the expansion (2.12)–(2.14) for $R$ , $z$ , $\unicode[STIX]{x1D708}$ and $B$ . As an alternative to the argument based on analyticity in Garren & Boozer (Reference Garren and Boozer1991a ), here we give a constructive demonstration, proceeding in several steps. First, we will derive the form (2.12)–(2.14) for $R$ and $z$ but with a non-straight-field-line poloidal angle $\unicode[STIX]{x1D6FC}$ in place of the Boozer angle $\unicode[STIX]{x1D703}$ . Then we will derive the form (2.12)–(2.14) for $R$ and $z$ but with the poloidal angle $\unicode[STIX]{x1D709}$ defined such that field lines are straight in the $\unicode[STIX]{x1D709}$ – $\unicode[STIX]{x1D719}$ plane. Next, we will derive (2.12)–(2.14) for $\unicode[STIX]{x1D703}$ . Finally, we extend the proof to $\unicode[STIX]{x1D708}$ and $B$ .
Assuming good flux surfaces exist near the axis, a Taylor expansion exists for $\unicode[STIX]{x1D713}(R,z)$ :
where quantities such as $\unicode[STIX]{x1D713}_{RR}$ refer to partial derivatives evaluated at the axis $(R_{0},z_{0})$ , and dependence on the independent variable $\unicode[STIX]{x1D719}$ is not displayed to simplify notation. Note $A>0$ where $A=\unicode[STIX]{x1D713}_{RR}\unicode[STIX]{x1D713}_{zz}-\unicode[STIX]{x1D713}_{Rz}^{2}$ , since the axis is an extremum of $\unicode[STIX]{x1D713}$ rather than a saddle point. For this section we assume $\unicode[STIX]{x1D713}_{RR}$ and $\unicode[STIX]{x1D713}_{zz}$ are positive for simplicity, so $\unicode[STIX]{x1D713}\geqslant 0$ . We then seek a solution of the desired form:
Substituting (A 2) into (A 1), terms can be collected based on their order in $r$ and $\unicode[STIX]{x1D6FC}$ dependence. The number of equations that result at a given order in $r$ is smaller than the number of associated coefficients in (A 2), reflecting the non-uniqueness of the poloidal angle; for instance the $\unicode[STIX]{x1D6FC}=0$ direction can be shifted. One solution satisfying (A 1) through $O(r^{3})$ is $z_{1c}^{\unicode[STIX]{x1D6FC}}=0$ , $R_{20}^{\unicode[STIX]{x1D6FC}}=0$ ,
Thus, given the Taylor series for $\unicode[STIX]{x1D713}(R,z)$ , we can construct expansions of the form (2.12)–(2.14), but with $\unicode[STIX]{x1D703}\rightarrow \unicode[STIX]{x1D6FC}$ , for $R$ and $z$ . The $O(r)$ terms in (A 2) can be manipulated to write the poloidal angle explicitly as
where atan2 is the arctangent with range $(-\unicode[STIX]{x03C0},\unicode[STIX]{x03C0}]$ .
Next we construct the straight-field-line angle $\unicode[STIX]{x1D709}=\unicode[STIX]{x1D6FC}+\unicode[STIX]{x1D706}$ where $\unicode[STIX]{x1D706}(r,\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D719})$ is single valued. From the $\unicode[STIX]{x1D735}\unicode[STIX]{x1D719}$ component of $\boldsymbol{B}=\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D709}+\unicode[STIX]{x1D704}\unicode[STIX]{x1D735}\unicode[STIX]{x1D719}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$ , we find
for some $f(r,\unicode[STIX]{x1D719})$ . The Jacobian in this expression can be evaluated using derivatives of $\boldsymbol{r}=R\boldsymbol{e}_{R}+z\boldsymbol{e}_{z}$ ; substitution of (A 2) then yields
where $J_{1s}$ and $J_{1c}$ are complicated algebraic functions of the Taylor coefficients in (A 1). Also, in (A 5), $\boldsymbol{B}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D719}$ is smooth so it has a Taylor series
Using the $O(r)$ terms in (A 2) and (A 3) in (B 3) for the area of an ellipse, flux surfaces near the axis have an area $2\unicode[STIX]{x03C0}\unicode[STIX]{x1D713}/\sqrt{A}$ in the $R$ – $z$ plane, so $b_{0}=\sqrt{A}/R_{0}$ . Evaluating the integral in (A 5) then gives
where $\unicode[STIX]{x1D706}_{1s}=b_{R}R_{1c}^{\unicode[STIX]{x1D6FC}}R_{0}/\sqrt{A}+J_{1c}$ , $\unicode[STIX]{x1D706}_{1c}=-J_{1s}-(b_{R}R_{1s}^{\unicode[STIX]{x1D6FC}}+b_{z}z_{1s}^{\unicode[STIX]{x1D6FC}})R_{0}/\sqrt{A}$ and $\hat{f}=f-r\unicode[STIX]{x1D706}_{1c}$ . To constrain the form of $\hat{f}$ , we use the $\unicode[STIX]{x1D735}\unicode[STIX]{x1D6FC}$ component of $\boldsymbol{B}=\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D709}+\unicode[STIX]{x1D704}\unicode[STIX]{x1D735}\unicode[STIX]{x1D719}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$ to write
In the last term, note that $\boldsymbol{B}$ has a Taylor expansion in $R-R_{0}$ and $z-z_{0}$ like (A 7) but with vector coefficients; the leading term is parallel to $\unicode[STIX]{x2202}\boldsymbol{r}/\unicode[STIX]{x2202}\unicode[STIX]{x1D719}$ , so the last term in (A 9) is finite on the axis. Evaluating the last term in (A 9) by differentiating $\boldsymbol{r}=R\boldsymbol{e}_{R}+z\boldsymbol{e}_{z}$ and substituting (A 2), and applying $\int _{0}^{2\unicode[STIX]{x03C0}}\text{d}\unicode[STIX]{x1D6FC}\;\unicode[STIX]{x2202}(\ldots )/\unicode[STIX]{x2202}r$ to (A 9), we find $\int _{0}^{2\unicode[STIX]{x03C0}}\text{d}\unicode[STIX]{x1D6FC}\;\unicode[STIX]{x2202}^{2}\unicode[STIX]{x1D706}/\unicode[STIX]{x2202}r\unicode[STIX]{x2202}\unicode[STIX]{x1D719}=0$ at $r=0$ , which implies the $O(r)$ term of $\hat{f}$ is independent of $\unicode[STIX]{x1D719}$ . This term can therefore be set to 0, since $\unicode[STIX]{x1D706}$ can be shifted by any function of only $r$ . Hence,
for some $\unicode[STIX]{x1D706}_{0}(\unicode[STIX]{x1D719})$ . Substituting $\unicode[STIX]{x1D6FC}=\unicode[STIX]{x1D709}-\unicode[STIX]{x1D706}$ and (A 10) into (A 2), we obtain an expansion of the desired form:
where the $R^{\unicode[STIX]{x1D709}}$ and $z^{\unicode[STIX]{x1D709}}$ coefficients are functions of the $R^{\unicode[STIX]{x1D6FC}}$ and $z^{\unicode[STIX]{x1D6FC}}$ coefficients, e.g. $R_{1s}^{\unicode[STIX]{x1D709}}=R_{1s}^{\unicode[STIX]{x1D6FC}}\cos \unicode[STIX]{x1D706}_{0}+R_{1c}^{\unicode[STIX]{x1D6FC}}\sin \unicode[STIX]{x1D706}_{0}$ .
Next we transform to Boozer coordinates. The magnetic field can be written (Helander Reference Helander2014) as
for some $\hat{\unicode[STIX]{x1D6FD}}$ , where the transformation to Boozer coordinates is given by $\unicode[STIX]{x1D711}=\unicode[STIX]{x1D719}+\unicode[STIX]{x1D708}$ and $\unicode[STIX]{x1D703}=\unicode[STIX]{x1D709}+\unicode[STIX]{x1D704}\unicode[STIX]{x1D708}$ . Applying $\unicode[STIX]{x1D735}\unicode[STIX]{x1D719}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\boldsymbol{\cdot }(\ldots )$ to (A 12), we find
The denominator is smooth (and non-vanishing near the axis for cases of interest in this paper), with the expansion (A 7). The numerator is a product of three quantities that are smooth near the axis and so it too is smooth, vanishing on the axis since $\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}=0$ there. Noting $I$ is smooth function of $\unicode[STIX]{x1D713}$ and $I=0$ on axis, then the quantity in square brackets in (A 13) is smooth and so has a Taylor expansion
for some coefficients $H_{\ldots }$ . Substituting (A 11) and integrating in $\unicode[STIX]{x1D709}$ , (A 13) gives
where ${\hat{g}}$ is the sum of $g$ and terms from the lower integration bound. To constrain the form of ${\hat{g}}$ we apply $\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D709}\boldsymbol{\cdot }(\ldots )$ to (A 12), with the result
The right-hand side is manifestly smooth near the axis and so it has a Taylor series in $R$ and $z$ , into which we substitute (A 11). Applying $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}r$ and integrating over $\unicode[STIX]{x1D709}$ , we find $\int _{0}^{2\unicode[STIX]{x03C0}}\text{d}\unicode[STIX]{x1D709}\;\unicode[STIX]{x2202}^{2}\unicode[STIX]{x1D708}/\unicode[STIX]{x2202}r\unicode[STIX]{x2202}\unicode[STIX]{x1D719}=0$ at $r=0$ . It follows that the $\unicode[STIX]{x2202}{\hat{g}}/\unicode[STIX]{x2202}\unicode[STIX]{x1D719}$ has no term linear in $r$ . Then since we are free to shift $\unicode[STIX]{x1D708}$ by any function of only $r$ , we can choose ${\hat{g}}$ so $\unicode[STIX]{x1D708}$ has the form
Substitution of $\unicode[STIX]{x1D709}=\unicode[STIX]{x1D703}-\unicode[STIX]{x1D704}\unicode[STIX]{x1D708}$ and (A 17) in (A 11) yields the desired expansions for $R$ and $z$ , equations (2.12)–(2.14). The same substitutions applied to (A 17) give the desired expansion for $\unicode[STIX]{x1D708}(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})$ .
Finally, $B$ is smooth near the axis and so it has a Taylor expansion
Substitution of (2.12)–(2.14) for $R(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})$ and the analogous expansion for $z(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})$ into (A 18) gives the desired expansion for $B(r,\unicode[STIX]{x1D703},\unicode[STIX]{x1D719})$ .
Appendix B. Geometric properties of flux surfaces
Here we relate several geometric properties of the flux surfaces – specifically the cross-sectional area and elongation – to the variables $(R_{1},z_{1})$ used elsewhere in the paper. We consider a cross-section of the flux surfaces in a constant- $\unicode[STIX]{x1D719}$ plane. All results of this section apply to cross-sections perpendicular to the magnetic axis if $(R_{1},z_{1})$ are replaced by $(X_{1},Y_{1})$ . Several geometric quantities are defined in figure 2. To $O(r)$ , the flux surfaces are elliptical, with semi-major axis $a$ and semi-minor axis $b$ . Axes $u$ and $v$ are aligned with the minor and major axes, and $\unicode[STIX]{x1D6FE}$ is the angle between the $u$ and $R_{1}$ axes. The $\unicode[STIX]{x1D703}=0$ line is not generally aligned with any of these axes, and we let $\unicode[STIX]{x1D703}_{0}$ denote the angle between this line and the $R_{1}$ axis. Any point in the plane, such as the black dot in the figure, makes an angle $\unicode[STIX]{x1D703}+\unicode[STIX]{x1D703}_{0}$ relative to the $R_{1}$ axis and an angle $\unicode[STIX]{x1D712}$ relative to the $u$ axis, with $\unicode[STIX]{x1D712}=\unicode[STIX]{x1D703}+\unicode[STIX]{x1D703}_{0}+\unicode[STIX]{x1D6FE}$ . Substituting $u=b\cos \unicode[STIX]{x1D712}$ and $v=a\sin \unicode[STIX]{x1D712}$ into
applying the angle sum formula to $\unicode[STIX]{x1D712}$ and equating $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ terms using (2.13), we find
Using (B 2), the right-hand side of (2.24) is found to be
which is (minus) the area of the ellipse divided by $\unicode[STIX]{x03C0}$ .
Another important property of the flux surfaces is their elongation, $a/b$ . In practice, many solutions of (3.13) are uninteresting since they correspond to impractically large values of elongation, so to discard these solutions it is valuable to derive an expression for the elongation in terms of $R_{1}$ and $z_{1}$ . Such a formula can be obtained by first defining $p=R_{1s}^{2}+R_{1c}^{2}+z_{1s}^{2}+z_{1c}^{2}$ , and noting from (B 2) that $p=a^{2}+b^{2}$ . Then defining $q=R_{1s}z_{1c}-R_{1c}z_{1s}=-ab$ , we can solve $a^{4}-pa^{2}+q^{2}=0$ for $a$ , noting the larger positive root is $a$ and the smaller is $b$ , since $b$ satisfies the same quadratic equation. Then the elongation is
Appendix C. Equating representations of the field: second order
Here the derivation of (2.34)–(2.36) is presented. The $O(r^{2})$ terms in (2.8)–(2.10) can be obtained by applying $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}r$ twice and evaluating the results at $r\rightarrow 0$ . We find
In (C 2), the terms including a factor of $\unicode[STIX]{x1D704}_{0}$ can be written in the combination (2.29), which vanishes as before. Plugging in (2.13)–(2.14), it can be seen that (C 1)–(C 3) each have only $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ Fourier components. These $\sin \unicode[STIX]{x1D703}$ and $\cos \unicode[STIX]{x1D703}$ components give the following six equations:
In the last two equations we have used (2.24).
While these six equations contain $R_{2}$ , $\unicode[STIX]{x1D708}_{2}$ and $z_{2}$ , all these subscript-2 quantities can be eliminated to give a constraint on the subscript-1 quantities by forming
The $\unicode[STIX]{x1D6FD}_{0}$ terms happen to vanish as well in this combination. Multiplying the result through by $2G_{0}R_{0}/\bar{B}$ , we obtain
Eliminating $\unicode[STIX]{x1D708}_{0}$ , we find $(T-\unicode[STIX]{x1D704}_{0}V)(\ell ^{\prime })^{2}B_{0}^{2}/G_{0}^{2}=0$ where
and
Eliminating $\unicode[STIX]{x1D708}_{1s}$ and $\unicode[STIX]{x1D708}_{1c}$ using (2.27) results in (2.35)–(2.36).