1. Introduction
In the presence of a strong magnetic field, charged particles execute helical gyromotion around the magnetic field lines. The Lorentz force acts on particles in a direction perpendicular to both the magnetic field lines and the particle motion through them, causing the particles to accelerate, and in turn to release energy in the form of electromagnetic waves, ‘cyclotron radiation’. Plasmas with sufficiently large particle density, such as fusion plasmas, are optically thick to such emissions, meaning that any energy released through this process is simply reabsorbed back into the plasma. However, certain plasma systems are optically thin to cyclotron emission and this radiated energy can therefore be lost to the surroundings. These optically thin plasmas are the focus of this current work. It is the purpose of the present paper to show how these emissions lead to the manifestation of anisotropy in the plasma distribution function, and how the distribution evolves in these regimes.
In this work, paper (I) of the series, our contributions are: (a) to show that cyclotron emission results in strongly anisotropic distribution functions on the radiation time scale; (b) to calculate the evolution of the distribution function under collisional scattering which, in the absence of any radiation terms, acts to drive the plasma towards Maxwellian behaviour; (c) to show that this behaviour manifests itself under very general conditions; and (d) to present the applications and the limitations of this theory. In the companion work, paper II, we apply this theory to the first laboratory electron–positron plasma experiment. We begin in § 2 by briefly recapitulating the single-particle picture of an electron emitting cyclotron radiation, before using this description to explore the relaxation of the distribution function in a collisionless plasma. In § 3, collisions are introduced back into the picture and used to show that an initially isotropic distribution will relax to a distribution that is Maxwellian over the perpendicular velocity. In computing this distribution we introduce a free function $T_{\perp }$; in §§ 4 and 5, the evolution of this free function is explored. In § 6, we show that the conclusions drawn in the previous sections actually hold in quite general magnetic geometry; subsequently, we discuss some of the broad classes of applications in § 7.
2. The collisionless system in brief
Whilst we endeavour to provide a physical understanding of this theory in general magnetic geometries, let us first understand the basic premise before embarking on a more detailed calculation. To this end, we begin with a straight, constant magnetic field and, for the time being, neglect collisions in the plasma.
The fundamental aim of this section is thus to understand the following: (a) in the most simple case, cyclotron emission is responsible for an exponential decay of the perpendicular kinetic energy on the radiation time scale and (b) the inclusion of cyclotron emission, and the absence of collisions, results in a strongly anisotropic distribution function.
2.1. Cyclotron cooling
The emission of cyclotron radiation by particles in a plasma gives rise to a reaction force which must be included in the kinetic equation. The theory of relativistic plasmas, accounting for this radiation reaction by including the Abraham–Lorentz reaction force in the kinetic equation, has been developed by Andersson, Helander & Eriksson (Reference Andersson, Helander and Eriksson2001), Hazeltine & Mahajan (Reference Hazeltine and Mahajan2004) and subsequent authors. Here, we begin by specialising these results to a non-relativistic plasma.
The non-relativistic Abraham–Lorentz reaction force is given by
where $\boldsymbol {a}$ is the acceleration vector and $e$ is the charge. Following the arguments of the aforementioned authors, it can be seen that, to leading order in a small-gyroradius expansion, the change in perpendicular energy, $w_{\perp }$, of a non-relativistic point charge as it accelerates in a magnetic field is given by Larmor's formula
where $\boldsymbol {v}$ is the particle velocity vector and $m$ is the rest mass. Throughout this work, the subscripts $\perp$ and $\parallel$ indicate directions perpendicular and parallel to the magnetic field, respectively. The centripetal acceleration of a particle undergoing gyromotion is given by
and hence it follows that
Simply put, the plasma will radiate its perpendicular kinetic energy on a time scale given by the radiation time:
It is perhaps easiest to understand the influence of radiative emission when the change of energy is rewritten in terms of the perpendicular and parallel components of the velocity. We thus obtain
Equations (2.6a,b) summarise the average effect of radiation reaction on the plasma. Fundamentally, it is these equations which will give rise to anisotropy in the distribution function.
It is pertinent to point out at this stage that collisions, whose effect is to isotropise the plasma, can mediate this cooling process due to the scattering of the velocity vector, as a result allowing the conversion of energy between perpendicular and parallel components. We will return to this discussion in § 3, following a discourse on the simpler collisionless case.
2.2. Collisionless kinetic equation
The kinetic equation governing the evolution of the distribution function $f(\boldsymbol {r},\boldsymbol {v},t)$ in the absence of collisions is given by
where $\boldsymbol {\nabla }_{v}$ stands for the velocity derivative:
Cyclotron emission is included in the kinetic equation through the inclusion of (2.6a,b) in the usual collisionless Vlasov equation.
We will immediately specialise to the case where the magnetic field is constant, $\boldsymbol {B} = B\hat {{\boldsymbol {b}}}$, and there is zero electric field, $\boldsymbol {E} = \boldsymbol {0}$. Note that, in this limit, we have assumed the plasma is homogeneous and there is no spatial dependence of the distribution function. We then write (2.7) in cylindrical $\boldsymbol {v}$-space coordinates $(v_{\perp },\alpha ,v_{\parallel })$ to obtain
where we have used that $\dot {{v}}_{\parallel } = 0$. The final term in this equation can then be eliminated by averaging over the gyroangle $\alpha$.
It is fruitful to write the resulting equation in terms of the perpendicular energy of the particles, so as to obtain
This equation can then be easily solved via the method of characteristics; the general solution is given by
That is, the distribution function will be a function of the parallel and perpendicular kinetic energies, the latter of which will decay exponentially quickly on the radiation time scale.
As an illustrative example, if the initial distribution is Maxwellian, then
which corresponds to a bi-Maxwellian
with $T_{\parallel } = T = \text {const.}$ and
At this stage of our investigation, we have been able to accomplish the first fundamental aim of this paper; namely, we now understand the following. Firstly, in the most simple case, cyclotron emission is responsible for an exponential decay of the perpendicular energy on the radiation time scale whilst the parallel energy is kept constant. Secondly, the inclusion of radiative emission, and the absence of collisions, results in a strongly anisotropic distribution function on the radiation time scale.
2.3. Validity of the collisionless approach
A caveat here is that, of course, many plasmas are in fact collisional. It is therefore expected that eventually collisions will come into play and must be taken into account.
Following Braginskii (Reference Braginskii1965), the conventional definitions of the electron–ion and electron–electron collision times are given by
respectively, where $\ln \varLambda$ is the Coulomb logarithm. We will also assume bulk neutrality throughout this work so that $n_{e} = Zn_{i}$. From the above expressions, we can see that the collision time will decrease as the plasma cools.
We might therefore envision the following scenario. A plasma might begin in a regime where the radiation time $\tau _{r}$ is smaller than the initial collision time $\tau _{c} = \text {min}(\tau _{ee},\tau _{ei})$ and hence the collisionless theory will be applicable at first. However, the collision time itself will decrease as the plasma cools and hence this assumption will be violated after sufficient time has elapsed. As a result, collisional effects must be taken into consideration.
We thus imminently turn our attention to the collisional problem. However, before doing so, let us allow ourselves a small digression to say something about the nature of cyclotron radiation.
2.4. Cyclotron cooling and plasma waves
Before we embark on solving the collisional kinetic equation, let us first turn our attention to the nature of the electromagnetic waves in play. The cyclotron radiation that is being generated has to propagate through our optically thin plasma as some electromagnetic plasma wave (Krall & Trivelpiece Reference Krall and Trivelpiece1986).
For the reasons outlined in the previous section, we declare an interest in optically thin plasmas where the radiation time is (at least initially) much shorter than the collision time. That is, we declare an interest in the limit
where have introduced the two characteristic frequencies in the plasma, namely the plasma frequency and the cyclotron frequency:
Thus, it is easy to see from this ordering that electromagnetic waves with frequencies comparable to the cyclotron frequency are almost light waves:
That is, our cyclotron radiation is simply photon emission which can immediately escape the plasma because of the optical thinness.
We now turn our attention to the collisional problem.
3. Lowest-order collisional kinetic equation
Our aim now is to deduce the class of distribution functions to which a radiating collisional plasma can relax. When collisions are retained, the kinetic equation satisfied by the distribution function $f$ in straight field lines is
where we have opted to write this equation in terms of the magnetic moment $\mu$ instead of the perpendicular velocity. The distribution function can now be changed both via radiation (the second term on the left-hand side) and also via collisions (the term on the right-hand side). The Landau collision operator for Coulomb interaction (Landau Reference Landau1936) is ${C} = {C}_{ee} + {C}_{ei}$ with
and
if $f$ is normalised so that
Here we have also introduced ${\boldsymbol{\mathsf{U}}}$, the second-rank tensor:
where $\boldsymbol {u} = \boldsymbol {v} - \boldsymbol {v}^{\prime }$ is the difference in velocity vectors between colliding particles and ${\boldsymbol{\mathsf{I}}}$ is the identity matrix.
The collision frequency is of order $\sigma /v_{\text {th}a}^{3}$ and we are interested in the limit ${\sigma \tau _{r}/v_{\text {th}a}^{3} \ll 1}$, where $v_{\text {th}a}$ is the thermal velocity of species $a$. That is, for the reasons outlined in § 2, we are declaring an interest in optically thin plasmas where the radiation time is (at least initially) much shorter than the collision time. The key idea here is that, in this limit, by the time collisions enter the picture our plasma will have already radiated a large fraction of its perpendicular energy through the cyclotron cooling process outlined earlier. In this limit $f$ will be strongly anisotropic, due to the cyclotron cooling process, and we write
Our strategy is then to expand the collision operators in terms of this small parameter.
It is important to point out here that $\epsilon$, effectively a measure of $v_{\perp }/v_{\parallel },$ might itself be expected to change in time, which could have ramifications later in our analysis. However, these concerns can be mitigated by noting that the window of time in which this theory is valid is only a few collision times. Thus we expect $\epsilon$ will not change appreciably and will certainly remain $\epsilon \ll 1$. This is explored further in § 5.3. In what follows, $\epsilon$ is treated as a constant.
We begin with the electron–electron collision operator. To lowest order in $\epsilon$ we obtain
Great care must be taken when dealing with the integral in (3.7) which is divergent when $v_{\parallel } = v_{\parallel }^{\prime }.$ We may evaluate
since, by partial integration, for any suitably well-behaved $f$ we have
where
remains finite as $\epsilon \rightarrow 0$. Thus, it follows that
The electron–electron collision term is larger than the electron–ion collision term,
provided $Z = O(1).$ This can be clearly seen by noting that
The lowest-order kinetic equation thus becomes
which can be integrated to give
The integration constant $C$ is determined by the requirement that
should equal $g(v_{\parallel })/2|\ln \epsilon |$, which gives $C = 1/(8{\rm \pi} \sigma \tau _{r}|\ln \epsilon |)$ and thus
It is interesting to note that the distribution of particles over perpendicular velocities is Maxwellian, but not for the usual reason. Normally, this happens because of a balance between two terms in the collision operator, describing friction (drag) and energy diffusion, respectively. The former term slows particles down and the latter increases their average energy. The two terms balance exactly for a Maxwellian distribution. In (3.19), radiative energy loss has replaced the collisional friction, and the result is a Maxwellian in $v_{\perp }$ with a different (lower) perpendicular temperature than in the purely collisional case.
We note that the distribution function only depends on $v_{\parallel }$ through the function $T_{\perp }(v_{\parallel },t)$. We can determine this function by taking a moment of the kinetic equation.
4. Evolution of the perpendicular density
The moment $N$ defined in (3.18) can be seen to satisfy
In solving this equation, we obtain the temperature $T_{\perp }$ through (3.18) and can thus ascertain the distribution function $f$ through use of (3.19). Let us first remark on the nature of (4.1). In deriving (4.1), we have integrated the kinetic equation (3.1a,b) over perpendicular velocities. In doing so, we have eliminated the radiation term and instead isolated the slow evolution of the distribution over parallel velocities. We can think of the evolution of an initial isotropic distribution function, in the collisional regime, as being partitioned into two disparate stages:
(i) Firstly, the distribution function will quickly radiate most of its perpendicular energy and relax to a form which is Maxwellian over perpendicular velocity as per (3.19).
(ii) Secondly, on a longer time scale, the distribution over parallel velocities will evolve according to (4.1).
It is this slow behaviour (ii) that we are now interested in probing. We will soon see that the precise nature of this evolution depends on the specific ordering satisfied by the plasma parameters.
In order to ascertain the time evolution of $N$, we thus require the following moments of the collision operators:
It can be shown that the leading-order contributions to these two terms are given by
The details of the calculation are relegated to appendix A. It then follows that (4.1) becomes
or, more compactly, we can write
Once again, we have been able to successfully reduce our integro-differential evolution equation (4.1) into an equation involving only differential operators (4.5a–c). The two terms on the right-hand side of (4.5a–c) have very different character. The first one describes friction on the electrons from collisions with the ions, which causes the former to slow down in the parallel direction. As usual for Coulomb collisions, the collision frequency decreases with increasing speed, which here causes the singularity at $v_{\parallel } = 0.$ The second term describes electron–electron collisions, whose effect is to scatter the velocity in a peculiar way, causing ‘anti-diffusion’ in $v_{\parallel }$. This phenomenon can be understood as follows. Most electron–electron collisions occur between particles with similar velocities $v_{\parallel } \simeq v_{\parallel }^{\prime }$, and the local (in parallel velocity space) collision frequency is thus proportional to $N$, making this term quadratic in $N$ (particles with given $v_{\parallel }$ mostly collide with each other). The effect of such collisions is to convert parallel kinetic energy to perpendicular energy, and thus to increase $T_{{\perp }}$, which according to (3.18) leads to an increase in $N$. A local accumulation of particles somewhere in parallel velocity space will thus tend to grow at the expense of neighbouring regions, making the distribution function undergo ‘anti-diffusion’ in $v_{\parallel }$.
Despite its relatively simple form, (4.5a–c) is a nonlinear parabolic partial differential equation. It seems impossible to make analytical progress in full generality here, but nevertheless some insight into the system can be gleaned by appealing to various limiting forms.
5. Limiting forms of the perpendicular density
We note that, from (4.3a,b), it is easy to see that the moments of the collision operator (which gives rise to the different terms in (4.5a–c)) are ordered:
This is to be compared and contrasted to the ordering of the collision operators themselves:
Based on these different orderings, we can turn to solving equation (4.5a–c) in two distinct limits:
In both of these limits, we are justified in dropping the electron–ion collision operator at lowest order as discussed in § 3. This approximation gives rise to the rapid evolution of the distribution function towards (3.19) at lowest order.
5.1. Limit (A): neglecting the contribution from electron–electron collisions
In limit (A), we are justified in neglecting the contribution of $I_{ee}$ whilst retaining that of $I_{ei}$, noting that this is still consistent with the derivation of (3.19) providing the required ordering is satisfied. In this instance, (4.5a–c) becomes
The equation has been reduced to a quasilinear partial differential equation and so is amenable to solution by the method of characteristics. We thus obtain the general solution
So, if we start from a Maxwellian distribution initially
then
From this equation, we can also obtain an expression for the perpendicular temperature:
It is important to recall that this solution was for $v_{\parallel } > 0$, which means that, despite appearances, both $T_{\perp }$ and $N$ are continuous. For $v_{\parallel } < 0$ we obtain the same solution as above with $v_{\parallel } \rightarrow -v_{\parallel }$.
5.1.1. Rate of energy loss
A quantity of direct experimental interest is the amount of power radiated by the plasma. This can be calculated by looking at the rate of change of energy loss:
where we have defined the total thermal energy stored in the plasma (per unit volume):
The energy stored can be seen to satisfy the equation
Thus, it suffices to calculate $W_{\perp }$ in order to find the power radiated by the plasma. This is given by evaluating
where we have used (3.19) to carry out the $v_{\perp }$ integral. We arrive at
where we have normalised $v_{\parallel }$ and $\tau$ to the electron thermal velocity by writing
The integral (5.14) cannot be expressed in terms of standard mathematical functions. Instead, we must turn to numerically evaluating this function. Firstly, however, we can determine the behaviour of (5.14) in the long-time and short-time limits.
In the limit $y \gg 1$ we obtain (see appendix 1)
In the limit $y \ll 1$ we obtain (see appendix 2)
In figure 1 we show the full numerical solution of (5.14) as well as the analytic solution for the long-term and short-term behaviour, given by (5.16) and (5.17), respectively. Some caution must be taken in interpreting this figure, and one must remember that the solution of (5.14) is only valid on time scales longer than the radiation time. Note that the quantity plotted on the vertical axis, $W_{\perp }(y)/W_{\perp }(0),$ is directly proportional to the emitted power as can be seen from (5.12a,b). The abscissa is simply a scaled time coordinate: $y = (9\sqrt {{\rm \pi} }/4)t/\tau _{ei}$.
Although at a first glance it may seem as though the energy loss rate would be faster than exponential, this is of course not the case when the solution is restricted to the region where the orderings are valid.
5.2. Limit (B): neglecting the contribution from electron–ion collisions
In limit (B), when $n_{i} Z^{2} / n_{e} \ll | \ln \epsilon |$, we are justified in neglecting the contribution from $I_{ei}$ and obtain a relatively simple equation:
This is a nonlinear parabolic partial differential equation which essentially describes backwards diffusion in $v_{\parallel }$ space. Indeed, this is simply the backwards heat equation in a medium where the diffusion coefficient varies linearly with temperature. As such, this problem is ill-posed.
A simple solution exists (given suitable initial data) but the general solution will not depend continuously on the auxiliary data as it describes a reverse diffusion process meaning that arbitrarily small perturbations in the initial conditions will be amplified and can lead to even infinitesimal-time singularities. This means that it is possible to prepare arbitrarily small perturbations to the initial conditions (likely in any given Sobolev norm), which are sufficiently wiggly in higher derivatives to ensure the solution blows up before any given time $t > 0$.
In order to find this singular solution, we consider the problem
with
Equation (5.19a,b) admits a similarity solution. To see this, we introduce a dilation transformation:
Our problem then becomes
From this, we see that (5.19a,b) is invariant under the dilation transformation provided $b -c = 2(a-c)$, i.e. $c = 2a - b$.
We therefore pose the self-similar ansatz
The condition (5.20) then gives
This condition must hold for all $\tau > 0$ and hence we have $3a=b$. So we can write
It is of course no coincidence to have arrived at this particular self-similar ansatz. We could have also posited this solution simply by noting that the quantity $\xi = v_{\parallel }^{3}/\tau$ is dimensionless, which we could have deduced from (5.6).
The derivatives then transform as
where the prime notation now denotes differentiation with respect to $\xi$.
Equation (5.19a,b) becomes
or equivalently
Direct integration gives
The integration constant $k_{0}$ is chosen such that the flux of particles $N ({{\mathrm {d}} N}/{{\mathrm {d}} v_{\parallel }})$ is continuous at the edge of the support where $N$ goes to zero. In our case, this is the same constant that one gets from imposing zero flux at the symmetry point because the flux has to vanish either when $N=0$ (at the edge) or when ${{\mathrm {d}} N}/{{\mathrm {d}} v_{\parallel }} = 0$ (at a symmetry point).
This equation can then be integrated once again:
In order to satisfy the requirement that $N \rightarrow 0$ as $| v_{\parallel } | \rightarrow \infty ,$ we write the solution as
where $A = \sqrt [3]{9\beta /2}$ so as to satisfy (5.20).
We can thus obtain a weak, in the sense that the first derivative is discontinuous, solution of (4.5a–c) in the limit where electron–ion collisions are neglected. It is imperative to mention that this solution holds only when $\tau < 0.$ In essence, what we have done in this section is to solve an ill-posed backwards diffusion equation (5.18) by solving instead a well-posed forward diffusion equation (5.19a,b) and running the solution backwards in time. The rationale in adopting this approach is to motivate the correct boundary conditions to apply to the equation.
As this problem is ill-posed, our self-similar solution requires very specific initial conditions. An arbitrarily small perturbation (in any given Sobolev norm) can cause the solution to fail and develop an infinitesimal-time singularity. Of course the self-similar solution is still of value, and the formation of singularities is somewhat fictitious since our assumption that $\boldsymbol {\nabla }_{{\parallel }} \ll \boldsymbol {\nabla }_{{\perp }}$ will certainly break down before any singularity arises. The solution obtained is of the correct general form (i.e. it attains a maximum for $v_{\parallel } = 0,$ and satisfies $N \rightarrow 0$ as $|v_{\parallel }|\rightarrow 0$) as to possibly be approached from more general initial conditions. Such solutions are not uncommon in plasma physics and the generation of sharp structures by resistive or ambipolar diffusion has also been investigated (Low Reference Low1973).
It is also fitting to remark here that including higher powers of the expansion parameter $\epsilon$ in our expansion of the collision operator (i.e. in the calculation of $I_{ee}$) would usually bring higher-order derivatives of $N$ into (5.18) which would regularise the partial differential equation. This is a possible avenue of further exploration with the caveat that the underlying equations would then likely be rendered too difficult to solve with analytical techniques.
As before, one can calculate the perpendicular temperature:
5.2.1. Rate of energy loss
Following the previous subsection, one can calculate the rate at which energy is radiated from the plasma by first calculating $W_{\perp }$ and then appealing to equation (5.12a,b). This is given by
which can be evaluated exactly to give
This leads to an energy loss rate of
which can be written in terms of the Braginskii electron–electron collision time $\tau _{ee}$ as
At first glance, this equation might appear somewhat peculiar. We must first of course recall that our solution was valid only when time is run backwards, and hence the term $t$ appearing on the right-hand side is negative; as a result, the total energy does indeed decrease as expected.
5.3. Validity of the collisional approach; a self-accelerating process
We have found the solution to (4.5a–c) in two different limits based on the plasma parameters, thus elucidating the plasma distribution function in the collisional regime (3.19). Throughout this work, a number of orderings have been invoked and we thus turn our attention to the validity of our solution; specifically, when it is valid to treat $\epsilon$ as constant. Thus far we have focused on the evolution of the plasma on time scales $t$ with
In this regime, we treat $\epsilon$ as constant seeing as though any variation is very small. However, as per our earlier remarks in § 2.3, we must take care to recall that $\tau _c$ is not constant and will itself decrease as the plasma cools via cyclotron emission. Our entire process (radiative cooling and collisional scattering) is therefore self-accelerating. Collisional scattering converts parallel kinetic energy to perpendicular kinetic energy. This perpendicular energy is then radiated, lowering the temperature of the plasma and in turn increasing the collisional scattering rate. As a result, the plasma will only remain in this regime for a few collision times, before entering a regime where
In this final regime, any remaining parallel kinetic energy will be converted to perpendicular kinetic energy and then radiated. The total energy of the plasma will thus decay exponentially quickly on the radiation time scale.
In essence, we can partition the evolution of the plasma energy into three distinct regimes:
(i) Initially, $0 < \tau _{r} < t < \tau _{c}.$ In this regime the behaviour is collisionless as described in § 2. The perpendicular energy decays exponentially quickly on the radiation time scale.
(ii) After some time, $\tau _{r} < t \sim \tau _{c}$ and collisional scattering becomes important. This regime is what we have studied in §§ 3–5. This is a self-accelerating process and the plasma will only remain in this regime for a few collision times.
(iii) Eventually, $\tau _{c} < \tau _{r} < t.$ In this regime, the distribution function is isotropic (Maxwellian), and any remaining parallel kinetic energy will be converted to perpendicular kinetic energy via collisional scattering and then radiated.
We have now accomplished our second aim; namely, we have been able to calculate the evolution of the distribution function under collisional scattering. Unusually for plasma physics, this was possible even though the distribution function is far from Maxwellian. Thus far, we have made the restrictive assumption of straight-field-line geometry. We will now ask ourselves whether we can relax this assumption.
6. General magnetic geometry
Lurking within this theory was an assumption of straight-field-line geometry which considerably simplified the derivations involved. It is clear that this assumption will generally not hold in any plasma of physical interest, so let us address this point.
We begin in general magnetic geometry, although we will later find it necessary to discuss trapped and circulating particles. The kinetic equation becomes
where $l$ parametrises the length of a magnetic field line and we retain the assumption that the parallel electric field is negligible. Here we can exploit the existence of two time scales by expanding in the small parameter
where $\tau _{b} \sim L/v$ denotes the bounce time, i.e. the typical time it takes to travel the macroscopic distance $L$ along the field. Thus, writing $f = f_{0} + f_{1} + \cdots$, one obtains at leading order
One can solve this equation by making the change of variables $(l,\mu ,v_{\parallel }) \rightarrow (l,\mu ,w)$, where we have introduced the total particle energy
so that the kinetic equation becomes at leading order
which has the solution for the lowest-order distribution function
At the next order, the kinetic equation becomes
6.1. The bounce-averaged kinetic equation
In order to remove the dependence on $f_{1}$, we define the bounce average of a function $Q(\mu ,w,l)$ by
with
The integral in (6.8) is taken between consecutive bounce points, defined by $v_{\parallel } = 0$, for trapped particles. For circulating particles, the integral is taken once around a field line if the field line is closed. If the field line is not closed, but instead traces out a magnetic surface, as in a stellarator or tokamak, then the bounce average for circulating particles is given by
Taking the bounce average of (6.7) then yields
Now, here we must be careful when bounce-averaging as $\tau _{r} \propto B^{-2}$. Let us write
where $B_{0}$ and $\tau _{0}$ appearing on the right-hand side of this equation are constants. We thus arrive at the bounce-averaged kinetic equation
Since $\overline {B^{2}}$ and $\overline {B^{3}}$ are, in general, complicated functions of $\mu /w,$ one cannot hope to solve this equation in full generality. However, we can deduce the distribution function on a significantly long time scale $\tau$ which is larger than the radiation time and comparable to the collision time $\tau _{r} \ll \tau \lesssim \tau _{c}$. In this instance, the perpendicular energy will have been radiated away to leading order and we have
where we have introduced $\lambda = \mu /w$. Trapped particles will thus be absent from the population. In this regime, one can neglect the third term in (6.13) and thus obtain
Moreover, we can also evaluate
where the last equivalence follows from the choice to simply define $B_{0}^{2}$ to be the average of $B^{2}$ over a field line.
Hence, we arrive at
6.2. The bounce-averaged collision operator
We have already shown that at leading order, the contribution from electron–ion collisions is formally smaller than that from electron–electron collisions provided $n_{i}Z^{2}/n_{e} = O(1).$ Let us thus restrict our attention to the electron–electron collision operator.
We know that at leading order
We must be cautious as $v_{\perp }$ varies along the orbit and should not be used as a coordinate in the bounce-averaged equation. When the collision operator is instead written using the magnetic moment $\mu$ as a coordinate, we obtain
and thus
It is of course no accident that this equation looks remarkably similar to the equation obtained in the straight-field-line limit. One can make the analogy exact and state that the leading-order dynamics of both systems are governed by the differential equation provided we associate the $B^{2}$ appearing in Larmor's formula, and the function $g(v_{\parallel })$ arising from the electron–electron collision operator, with their averages over a magnetic field line.
That is, on time scales $\tau$ with
the distribution function will become strongly anisotropic in general magnetic geometry.
7. Applicability of this theory
The theory developed in the preceding sections applies to any optically thin plasma where the collision time exceeds the radiation emission time, which is always true if the density is sufficiently small. Two broad classes of applications can be envisaged.
7.1. Cyclotron sources in the laboratory
Non-neutral plasmas, specifically plasmas consisting of charged particles with a single sign of charge can be confined in Penning–Malmberg traps (Dubin & O'Neil Reference Dubin and O'Neil1999; Danielson et al. Reference Danielson, Dubin, Greaves and Surko2015). Such a trap consists of a vacuum region inside an electrode structure consisting of a stack of hollow, metal cylinders. A uniform axial magnetic field is then applied to inhibit particle motion in the radial direction. Voltages must also be imposed on the end electrodes to prevent particle loss in the magnetic field direction.
It is frequently useful to compress plasmas radially; for instance, to increase the plasma density. This is usually accomplished by applying a torque on the plasma using rotating electric fields, the so-called ‘rotating wall technique’ (Anderegg, Hollmann & Driscoll Reference Anderegg, Hollmann and Driscoll1998). Very long confinement times (of the order of hour or days) can be achieved using these techniques, making their use highly desirable. Particle cooling is often necessary to maintain good confinement by mitigating the heating caused by the torque using the rotating wall method. In the case of electrons or positrons, if the magnetic field is sufficiently strong, the particles will cool by cyclotron radiation (O'Neil Reference O'Neil1985). An example of where cyclotron cooling is employed successfully to this end is in the production of anti-hydrogen, where this process is used cool pure electron plasma to sub-eV temperatures (Amoretti et al. Reference Amoretti, Carraro, Lagomarsino, Manuzio, Testera and Variola2002).
Another major application of the theory developed in this work will be for the first laboratory experiment, currently under development, to create and confine the first terrestrial electron–positron plasmas in the laboratory. This is done by first accumulating positrons from a powerful source and then injecting these into a pure electron plasma confined by the dipolar magnetic field of a current-carrying circular coil, so that a stationary, quasi-neutral electron–positron plasma is formed (Sunn Pedersen et al. Reference Sunn Pedersen, Danielson, Hugenschmidt, Marx, Sarasola, Schauer, Schweikhard, Surko and Winkler2012). This system will satisfy the necessary conditions of being optically thin and with a radiation time which should be initially shorter than the collision time. However, there is an additional complication in that the plasma will be so strongly magnetised that Coulomb collisions are no longer effectively described by the Landau operator. The theory of scattering in strongly magnetised anisotropic pair plasmas is developed in the companion paper (II).
7.2. Synchrotron sources in astrophysics
This work was built on the fact that a charged particle moving in a magnetic field radiates energy. At non-relativistic energies, the focus of this paper, this process is called cyclotron cooling. At relativistic velocities it is known as synchrotron radiation.
Synchrotron sources are ubiquitous and the emission of relativistic and ultra-relativistic electrons gyrating in a magnetic field is a process that dominates much of high-energy astrophysics. Indeed, it is known that synchrotron radiation is responsible for the non-thermal optical and X-ray emission observed in the Crab Nebula (Rees & Gunn Reference Rees and Gunn1974) and that pulsars are strong synchrotron sources (Sturrock Reference Sturrock1971).
Of course, the physics of these systems should also be treated carefully using a model taking relativistic effects into account in the modelling of the radiation term and the collision terms. This is an area of active research and a potential axis along which this current work could be further developed. There also likely exist analogous systems in astrophysics where the electrons are subrelativistic but nevertheless are still strongly radiating and weakly collisional in the sense discussed here. Such a system would of course still be required to satisfy
The easiest way to satisfy this is of course to allow relativistic plasmas, but we might also envision some exotic astrophysical plasma which is sufficiently nebulous (very low density) but also sufficiently strongly magnetised (very strong magnetic fields) as to allow this condition to be met in a non-relativistic plasma. In such systems, the theory in this paper will be directly relevant.
However, these is another caveat here which must be carefully considered. We have presented here a mechanism through which plasmas can become strongly anisotropic in velocity space. In actuality, there is a plethora of instabilities which could act on astrophysical plasmas and restore isotropy on time scales shorter than the collision time. It seems to be that the question of (in)stability could depend sensitively on plasma parameters and that considerations of isotropy-restoring instabilities do not necessarily preclude the types of systems studied in this work.
8. Conclusions
In this paper, we have developed a theory for collisional scattering in strongly anisotropic plasmas. Such plasmas arise due to the emission of radiation when charged particles move in magnetic fields, which leads to rapid depletion of the perpendicular kinetic energy. We have derived equations which describe the evolution of the electron distribution function in such plasmas. Unusually for plasma physics, the collision operator could be calculated analytically, albeit only to logarithmic accuracy, although the distribution function is far from Maxwellian.
It was found that in such strongly anisotropic populations, the evolution of the lowest-order distribution function is dominated by electron–electron collisions unless $n_{i}Z^{2}/n_{e} \gg 1$. Such collisions lead to a distribution that is Maxwellian in $v_{\perp }$ for any value of $v_{\parallel }$.
The distribution over $v_{\parallel }$ can be ascertained from an equation governing the density (in $v_{\parallel }$ space) $N,$ the integral of the distribution function over perpendicular velocities. We found that that this quantity satisfies an ill-posed, nonlinear, parabolic partial differential equation, reminiscent of the backwards diffusion equation.
This equation can be solved in two limits: firstly, in the case where the contribution from electron–ion collisions can be neglected, in which case a similarity solution is found; and secondly, in the case where the contribution from electron–electron collisions can be neglected and the equation can be solved by integration along the characteristic curves. In this latter limit, the equation is well-posed. In both cases, the remaining energy is radiated on the time scale of the ordinary collision frequency divided by $|\ln \epsilon |$.
Several candidates for areas of application of this theory were presented. These included both astrophysical applications and experimental applications. The latter of these applications will be developed in a companion paper (II).
Editor William Dorland thanks the referees for their advice in evaluating this article.
Declaration of interests
The authors report no conflict of interest.
Appendix A. Moments of the collision operator
Let us now evaluate the contributions to (4.1) arising from the moments of the collision operator.
A.1. Calcuation of $I_{ee}$
The contribution to (4.1) arising from electron–electron collisions is given by
where $T_{{\perp }}^{\prime } = T_{{\perp }}(v_{\parallel }^{\prime })$.
In order to carry out the integrals over $\boldsymbol {v}_{{\perp }}$ and $\boldsymbol {v}_{{\perp }}^{\prime }$ we note that
where $\theta$ is the angle between $\boldsymbol {v}_{{\perp }}$ and $\boldsymbol {v}_{{\perp }}^{\prime }.$ We note further that upon integration, any terms involving $\cos \theta$ will vanish. Hence, we may write
The leading-order contribution to this integral comes from the final term in (A 5) and so we have
We perform the $v_{\parallel }^{\prime }$-integral first and are thus led to consider
which is an integral of the form
and thus becomes
Hence, we arrive at
A.2. Calculation of $I_{ei}$
The contribution arising from electron–ion collisions gives
We remark here that, although the electron–ion collision operator was ordered much smaller than the electron–electron collision operator, it is not necessary for $I_{ei}$ to be ordered much smaller than $I_{ee}$.
Appendix B. Asymptotic forms of $N$
In this appendix, we calculate the long-time $(y \gg 1)$ and short-time $(y\ll 1)$ asymptotic behaviour of (5.14).
B.1. Long-time limit
For $y\gg 1$, most of the contribution to this integral comes from $x^{3} \ll y$, so that
We now make the substitution $s = 2x^{3}/3y^{1/3}$ to obtain
Thus, recalling the definition of the Gamma function,
we arrive at
B.2. Short-time limit
In the short-time limit, $y \ll 1$, we have
where most of the contribution comes from the region $x \sim y^{1/3}$ where the integrand is of order
and we thus expect the integral to be of order $y^{-2/3}$. Indeed,
where we have made the substitution $u = x^{3}$. Upon making a further substitution $p = u/y,$ we obtain
where we recognise
Thus, we may conclude that
which leads to
This equation can be rewritten in terms of the time $t$ and the Braginskii electron–ion collision time $\tau _{ei}$ as