Introduction
Aberration correction in the electron microscope is a topic that started when, in 1936, Otto Scherzer proved that chromatic and spherical aberrations are unavoidable in cylindrically symmetric static electron lenses (later on called “Scherzer Theorem” due to its importance) (Scherzer, Reference Scherzer1936). Shortly after, in 1947, Scherzer realized that using an array of multipolar lenses could allow phase manipulation beyond his theorem's rotationally symmetric constraints, thus providing a pathway for correcting the formerly mentioned aberrations (Scherzer, Reference Scherzer1947). Attempts toward experimental realization saw many iterations throughout the following decades (Seeliger, Reference Seeliger1951; Archard, Reference Archard1954; Mollenstedt, Reference Mollenstedt1954; Meyer, Reference Meyer1961; Kelman & Yavor, Reference Kelman and Yavor1962; Dymnikov & Yavor, Reference Dymnikov and Yavor1964; Hardy, Reference Hardy1969; Rose, Reference Rose1971; Koops et al., Reference Koops, Kuck and Scherzer1977; Bernhard, Reference Bernhard1980). However, the successful implementation of an aberration corrector for TEM did not come until 50 years later with the Haider-Rose-Urban project (Rose, Reference Rose1990; Haider et al., Reference Haider, Braunshausen and Schwan1995, Reference Haider, Uhlemann, Schwan, Rose, Kabius and Urban1998). The implementation of this corrector represented a breakthrough in the field, allowing for beyond-Ångström resolution in both TEM and STEM and up to atomic resolution in analytical methods due to a significantly increased current density.
More than 20 years after this breakthrough in the field of TEM, we want to explore the possibility of implementing a different idea to correct for third-order spherical aberration ($C_{s}$). To achieve this, we take inspiration from the field of optics, more specifically spatial light modulators, that allow to freely program the wavefront of coherent light, making use of a range of different technologies (Efron, Reference Efron1994; Maurer et al., Reference Maurer, Jesacher, Bernet and Ritsch-Marte2011). “Spatial Electron Modulators,” as opposed to their light optic counterparts, are unfortunately still far from achieving a similar level of technological advancement.
The concepts and reasons why a programmable phase plate is attractive in general have been discussed earlier (Guzzinati et al., Reference Guzzinati, Clark, Béché, Juchtmans, Boxem, Mazilu and Verbeeck2015; García de Abajo & Konečná, Reference García de Abajo and Konečná2021). We have opted for a technological approach that uses an array of electrostatic Einzel lenses as programmable phase shifters. We demonstrated a $2\times 2$ proof of concept (Verbeeck et al., Reference Verbeeck, Béché, Müller-Caspary, Guzzinati, Luong and Den Hertog2018) and have since then put significant efforts into extending the concept to the current state of the art where a 48-pixel programmable phase is produced by lithographic means (Verbeeck et al., Reference Verbeeck, Vega Ibáñez and Béché2021). In the meanwhile, other groups have explored different means to achieve similar freedom in phase shaping of electron beams using miniaturized multipolar lenses (Grillo et al., Reference Grillo, Karimi, Gazzadi, Frabboni, Dennis and Boyd2014), interaction with optical near fields (Maxson et al., Reference Maxson, Lee, Bartnik, Kiefer and Bazarov2015; Konečná & García de Abajo, Reference Konečná and García de Abajo2020; Wang et al., Reference Wang, Dahan, Shentcis, Kauffmann, Ben Hayun, Reinhardt, Tsesses and Kaminer2020; García de Abajo & Konečná, Reference García de Abajo and Konečná2021; Constantin Chirita Mihaila et al., Reference Constantin Chirita Mihaila, Weber, Schneller, Grandits, Nimmrichter and Juffmann2022), electrostatic nanofabricated elements applying a projected potential to a region of free space (Schultheiss et al., Reference Schultheiss, Pérez-Willard, Barton, Gerthsen and Schröder2006; Cambie et al., Reference Cambie, Downing, Typke, Glaeser and Jin2007; Nagayama & Danev, Reference Nagayama and Danev2009; Hettler et al., Reference Hettler, Gamm, Dries, Frindt, Schröder and Gerthsen2012; Béché et al., Reference Béché, Juchtmans and Verbeeck2017), and many others.
This technological progress allows to ponder potential applications such phase plate could provide. In this paper, we will focus on whether an array of electrostatic phase shifters could be made into an acceptable $C_{s}$ corrector for TEM. Proving this to be possible, would provide the option of introducing a small, integrated device into the TEM column with little change to its configuration. Furthermore, it would offer a rapid response tool, which combined with adaptive algorithms can auto-tune and respond to instrument or specimen-induced drifts, potentially reducing the overall complication of an experiment (dose and material efficiency).
As simple as the idea may sound, the devil is in the details, and we attempt in this paper to give an overview of the design parameters that have to be balanced between manufacturability and expected performance to evaluate if aberration correction with a programmable electrostatic phase plate could have a future in TEM.
Methods
Electron Beam Parameters to Be Optimized
In order to evaluate different designs for programmable phase plates, we need to first agree on the beam parameters to optimize.
For aberration correction, we are interested in spatial resolution and current density in the electron probe. It is convenient to define the spatial resolution as $d_{50}$, the diameter of the probe containing 50% of the beam intensity (Kohl & Rose, Reference Kohl and Rose1985). This definition comes very close to the FWHM for very sharp beams, while it offers the advantage of accounting for the effect of the beam tails. For the case of current density, we assume the phase plate to be homogeneously and coherently illuminated with a current density that would lead to a total probe current of $I_0$ if a circular aperture would replace the phase plate with the same total diameter. As the electrostatic phase plate will block part of this beam inherent to the construction of the segments making up for it (sketched in Fig. 1), we get for the beam current with phase plate $I' = I_0 \zeta$ with $\zeta$ the fill factor of the specific phase plate. Ideally, this fill factor should be as close to 1 as possible, meaning no blocking of the electron beam, but practical design constraints will determine what is realistic to achieve.
The average current density, $J_{50}$, in the $d_{50}$ probe diameter is then given as:
In comparison, for an ideal aberration-corrected system with circular aperture and convergence half angle $\alpha$, this becomes:
Note that in this theoretical design exercise, we ignore other sources of experimental probe broadening, such as those caused by source size (Michael & Williams, Reference Michael and Williams1987), vibrations, partial coherence, and any sources of electronic noise or thermal drifts that could affect the final probe size (Von Harrach, Reference Von Harrach1995).
To keep the arguments as general as possible, we focus on high-level design parameters and avoid going into details and technical challenges arising from manufacturing.
Phase Plate Design Parameters
After listing the probe parameters we aim to study and optimize, we now will give an insight into some phase plate design considerations to achieve probe size reduction and current density increase. We want to pay special attention to the number of phase-shifting elements, their width, and the percentage of the aperture they will block (all correlated through the interconnections that deliver the potential to each individual element). A simple rule of thumb would be that more segments come with more interconnections, thus blocking more of the incoming electron beam. However, as mentioned before, we will discuss the mathematical implications of the segment/phase plate design leaving the manufacturing of the phase plate itself out of the scope of this study.
The Role of the Fill Factor
Putting material in the path of the beam is inevitable for these phase plates since it is the only way to break the symmetry constraints imposed by the Maxwell equations on fields in free space. Adding material to create the structure of the plate segments leads to the concept of the fill factor ($\zeta$), the amount of optically transparent versus nontransparent parts in the phase plate. This modulation of the local amplitude of the electron beam will inevitably lead to a broadening of the probe as high spatial frequency tails are introduced.
If we try to estimate this effect, we can begin with a wave function on the probe forming aperture with a constant amplitude:
With $a( \vec {k})$ a function defining the aperture's shape being either 1 ($\vec {k}< \alpha$) or 0 ($\vec {k}> \alpha$), $A = \int a( \vec {k}) d^2\vec {k}$ is the total area of the aperture, $\phi$ is the local phase, and $\vec {k}$ is a vector in the aperture plane. The wave function in real space then becomes:
We consider as the ideal case a circular aperture and a flat phase (i.e., the diffraction limit). In this case, the probe will be the sharpest and the wave will have a value at the central point:
Now, if we want to describe the situation of a pixelated phase plate with the same outer dimensions, we can put a mask $M$ over the ideal aperture, which is either 1 (electron transparent) or 0 (not electron transparent). If we assume the ideal case where the pixelated phase plate can still provide a flat phase in those areas where the plate is electron transparent, we get:
This mask changes the maximum of the real space wave function to:
With $\zeta = \int ( {M( \vec {k}) }/{a( \vec {k}) }) d^2\vec {k}$ the fill factor of the phase plate.
Now, let us consider the resulting probe which consists of the sum of the ideal corrected wave and an unwanted tail part:
Where the scale factor $\zeta$ describes the scaling of the central maxima with respect to the ideal corrected case. We can now write the intensity of the probe as:
We assume that the current of the tails does not overlap with the central spot. This assumption is reasonable given that the ideal probe is a maximally compact function near the center, and the tails come from the high spatial frequencies of the mask, which are much smaller than the total aperture radius.
If we normalize the total current illuminating the round aperture $I_{{\rm ideal}, {\rm total}} = I_0 = 1$ for simplicity, the total intensity in the probe then becomes:
If we normalize the tails relative to the total intensity in the probe, we get:
In other words, the unwanted tail part of the probe formed by a pixelated phase plate scales approximately as $1-\zeta$. These tails will form a low-resolution background signal to any scanned probe setup. This background is highly unwanted as it will increase the counting noise, which is especially bad for spectroscopic methods since it will bring signal from areas away from the probe center. To prevent these tails, we want to create a mask having the highest possible fill factor. For the same reason, to optimize the value of $d_{50}$, we need a $\zeta > 0.5$, and the ideal case would be to bring this value as close to 1 as possible. Cutting off the tails with an aperture placed lower in the TEM column seems another option, but this would require cutting apertures with an equivalent real space diameter only a few orders larger than the probe size, which seems extremely difficult to obtain if we aim for Å probes, especially when considering that working in another (magnified) plane than the sample plane will introduce inevitable lens aberrations.
Phase Plate Pixel Pattern
In order to best compensate for the lens aberrations in a pixelated phase plate, it is important that each phase-changing segment can locally correct for the phase error of the other lenses in the microscope as well as possible. This will naturally lead to pixel patterns that will mimic the symmetry of the aberration function. Starting from the aberration function $\chi ( \theta )$ and considering only the defocus ($\Delta f$) and $C_{s}$ terms we have:
We now look for the highest angle that still can be corrected by a segment in the phase plate, and we assume a spherically symmetric phase plate segment covering an angular range between $\theta _i$ and $\theta _{i + 1}$. Taylor expanding the aberration function to second-order around $\theta _i$ gives:
Zeroth-Order Phase Correction
Suppose we use a zeroth-order phase plate which produces a constant phase shift that is programmable per segment. If we allow for a maximum phase error $\epsilon$ within each segment, we get for the maximum angle up to which we can correct:
This puts an upper limit on the maximum angle that can be corrected depending on how small we can make $\Delta \theta$. If we assume only $C_s$ needs correction, we can always choose $\Delta f = 0$, and we also take the first-order Taylor expansion as sufficient we get:
For a typical $C_s = 1\, {\rm mm}$, $\theta _i = 15\, {\rm mrad}$, and $\epsilon = {2\pi }/{10}$, we obtain a $\Delta \theta < 58\, \mu {\rm rad}$. This would require feature sizes of the segments of only 0.3% of the total aperture diameter and could become rather difficult to manufacture. Alternatively, we can express the maximum angle for a given minimum size of $\Delta \theta$:
This leads to 5.8 mrad for $\Delta \theta = 1$ mrad, giving us the maximum aperture angle we can correct with a flat phase within the given error $\epsilon$.
First-Order Phase Correction
If, on the other hand, we allow for first-order correction in each phase segment, meaning a linear projected potential ramp in the radial direction and thus requiring at least two independent potential electrodes per segment, the situation changes. In this case, the phase could be corrected up to first order and we get as phase error:
We choose $\Delta f = 0$ , which yields:
For a typical $C_s = 1\, {\rm mm}$, $\theta _i = 15\, {\rm mrad}$, and $\epsilon = {2\pi }/{10}$, we get $\Delta \theta < 0.76\, {\rm mrad}$ which is ${\approx }13$ times larger as compared with zeroth-order correction. Following the steps of the previous section, we can express the maximum angle for a given minimum size of $\Delta \theta$:
This leads to 11.46 mrad for $\Delta \theta = 1\, {\rm mrad}$, nearly double its zeroth-order counterpart.
We give a simplified sketch of the main building blocks needed to make up for both a zeroth- and first-order phase-shifting elements in Figure 2. Furthermore, we show a plot of equations (17) and (22) in Figure 3 for two different phase errors $\epsilon$. To put this into perspective, we give the resolution ranges for some manufacturing techniques (shaded regions).
In order to translate the previous results to meters, we can take a scaling factor to relate angle (mrad) and physical distance (meters), assuming that the widest area we can coherently illuminate is in the order of $100\, \mu {\rm m}$ (so, despite the maximum aperture angle $\theta$, we still illuminate the same area in meters). With this in mind, and looking at the right axis scale on Figure 3, we can get a value for the physical dimension corresponding to the minimum $\Delta \theta$ needed to keep the phase error under a specific error ($\epsilon$).
Results
Integrating the design rules discussed above, we numerically simulated a set of different electrostatic phase plate designs to test their capabilities to correct $C_s$ at 300 keV with two approaches, (1) applying constant phase shift (zeroth-order) and (2) a combination of constant and linear ramp shift (first-order) segments. As $C_{s}$ is rotationally symmetric, the proposed designs all consist of concentric rings to make maximum use of the symmetry of the problem.
For reference, these proposed concentric segments shown in Figures 4a–4d are analogous to those labeled in Figure 1 as “individual phase segments,” with the only difference being that we reduce the spacing between segments arbitrarily for our study, and their working principle is the same as the one sketched in Figure 2.
After some design iterations, we narrowed down the study to compare three apertures: zeroth-order concentric rings (Fig. 4b), a hybrid design (Fig. 4c), and a simplified version of the latter (Fig. 4d). We show the resulting probe profiles from the apertures in Figure 4(i). This figure shows how the probe from all proposed designs approaches that of a corrected instrument, visibly improving over a noncorrected instrument. It is essential to mention that Figure 4(i) only gives a view of an azimuthally integrated intensity which is normalized to the maximum intensity of each probe for scale ($y$-axis) and shows only the tails of the lower spatial frequency features ($x$-axis).
The fill factor ($\zeta$) displayed in Figure 4 was calculated by counting the number of pixels in the matrix with a value different than 0 and dividing it by the total number of pixels a round aperture with the same radius will have. Furthermore, the spacing between holes shown in Figure 1 is arbitrarily reduced for simplicity. The calculated probe size ($d_{50}$) for the different proposed designs is shown in Figure 5. We see the simulated $d_{50}$ value for each of the plates with an increasing opening angle. We found that all the proposed designs offer some $C_s$ correction. However, the linear phase profile obtained by applying first-order correction can keep the phase relatively flat for bigger opening angles, further reducing the probe size. More specifically, we reach a $d_{50}$ value of 0.93 Å at 15 mrad for the zeroth-order phase plate, a $d_{50}$ of 0.66 Å at 21 mrad for the hybrid design, and a $d_{50}$ of 0.75 Å at 18 mrad for the simplified hybrid design. These values represent a 40%, 57%, and 52% improvement in spatial resolution, respectively, compared with a noncorrected instrument. At higher aperture values (i.e., larger than 21 mrad), we must reduce the width of the segment in order to reduce the phase error, and this will eventually become an issue in terms of fabrication.
The relative current density can be calculated from equation (1) and is plotted in Figure 6 assuming $I_{0} = 50\, {\rm pA}$, all the proposed designs increase this value. More specifically, 6.4$\times$ for the zeroth-order phase plate, 28$\times$ for the hybrid phase plate shown, and 16.4$\times$ for the simpler hybrid design compared with a noncorrected round aperture at 10 mrad. This relative current density is highly important for, e.g., spectroscopic methods where the increased current in a small probe can lead to a vastly improved signal-to-noise ratio on top of the gain in spatial resolution.
Discussion
This design exercise shows that having an adaptive phase plate in the condenser aperture plane can correct $C_{s}$. Not only did we numerically obtain a probe size below the 1 Å limit, but we also increased the relative current density more than 20$\times$. However, implementing a device like the one proposed in this study poses several challenges. The most critical issue is the possibility of manufacturing a device with all the necessary electrical connections to control each phase segment separately. As the aperture angle increases, we quickly approach regions where the aberration function changes rapidly; this change requires a narrow segment to keep the phase error within a reasonable range. However, reducing this segment size or going from zeroth-order segments to first-order segments will increase the number of interconnections needed to control such implementation, ultimately reducing the attainable fill factor.
We show the relation between probe size ($d_{50}$) and fill factor ($\zeta$) in Figure 7. We can observe how the probe size for a plate with a $\zeta < 0.5$ cannot even match the performance of a noncorrected instrument in terms of $d_{50}$ for small angles, but at angles ${> }11\, {\rm mrad}$ this less than ideal phase plate can still significantly improve $d_{50}$ and hence increase $J_{50}$ when compared with the noncorrected situation. This may be of interest in cases where current density is more important than ultimate resolution.
It is important to note that the fill factor has a double negative effect on the probe current. On the one hand, it lowers the current in the beam due to partial blocking by a factor $\zeta$. On top of this, the beam that gets through is split into the desired part (central spot) and a tail part, which further lowers the intensity of the desired part of the beam to $\zeta ^2$. Often there is more than enough beam current available, and the sample may limit how much current can be used. In such cases, the initial loss due to finite transparency of the phase plate is not a problem, but the tail argument still is, as it will provide a degraded image contrast while still doing full probe current beam damage. Increasing the number of segments (more interconnections) or reducing their size brings the fabrication process closer to the photolithography resolution limit (${\approx }1\, \mu {\rm m}$) as indicated in Figure 3. For this reason, one must be careful with the phase plate design since high complexity designs will require higher-resolution methods such as EUV or e-beam lithography (Engstrom et al., Reference Engstrom, Porter, Pacios and Bhaskaran2014). For instance, a zeroth-order correction requires only one electrode to create an Einzel lens inside the region of the segment, while a first-order implementation would require a plate capacitor-like arrangement to achieve the required phase ramp. Another way to increase the fill factor is to tie the different electrodes together with a fixed resistor string, saving multiple interconnection lines. The drawback of this solution is that it would take away the ability to fine-tune each segment individually, and if the projected potentials are not quite what was expected, there is no easy way to correct them unless resistor values could be, e.g., laser trimmed. Furthermore, having a significant $C_s$ correction seems unreasonable for phase plates with less than $50\percnt$ fill factor since most of the current will land on the tails of the probe, as shown above. However, we demonstrate here that the number of segments needed to achieve sub-1 Å correction can be kept relatively low, thus reducing the design complexity significantly over, e.g., the design proposed in our previous paper (Verbeeck et al., Reference Verbeeck, Béché, Müller-Caspary, Guzzinati, Luong and Den Hertog2018). The predicted resolution should be taken with some reservation as effects like source size broadening, vibrations, chromatic aberration, higher-order aberrations, and other non-idealities were not considered for the theoretical study and will further lower the actual attainable resolution in practice.
An advantage of the proposed designs is that, as indicated earlier, the performance of an electrostatic phase plate is relatively insensitive to the quality of the voltage sources driving the segments (Verbeeck et al., Reference Verbeeck, Béché, Müller-Caspary, Guzzinati, Luong and Den Hertog2018). We test this claim in the zeroth-order design in Figure 8 by adding different normally distributed phase noise to each of the phase segments and calculating the resulting $d_{50}$ and the difference between a case where no noise is present. We note that the increase in probe size to such noise remains under 0.1 Å for most aperture angles. Assuming an electronic system is designed to provide, e.g., a maximum of $200\pi$ phase shift, such precision and noise requirements would easily be met even, e.g., with a humble 12-bit digital to analog converter.
Conclusion
We have numerically demonstrated how a programmable electrostatic phase plate can correct third-order spherical aberration in the TEM with phase plates of modest complexity consisting of 19 segments (zeroth-order) and as few as 8 segments for a hybrid design. All the proposed designs are capable of flattening the electron wavefront up to relatively high opening angles and can provide sub-1 Å probe sizes.
We discussed the benefit of moving from zeroth-order to first-order phase shifters to contain the phase error within some error margin. On top of this, we showed how the segment width for a first-order element does not necessarily need to have unreasonably small dimensions to correct $C_{s}$ with acceptable performance. In terms of shape, we naturally confirm that a circularly symmetric design compensates for the phase shift caused by $C_{s}$ since it mimics its symmetry; thus, correcting the aberrations more efficiently within each segment. Nevertheless, it is likely that in practice also some breaking of this symmetry would be useful to compensate for nonround aberration. This would bring a further increase in the number of phase-shifting elements and the complexity of getting interconnects to each.
We found that to achieve $C_{s}$ correction to any significant extent, the phase plate has to have a fill factor $\zeta \geq 0.75$ to achieve $d_{50} \leq 1$ Å.
We demonstrated a significant increase in current density, crucial for applications such as STEM EELS, even for cases where the fill factor is low. When the absolute current density is important, we argued that it scales as $\zeta ^2$ as both the total current and the amount of probe tails scale with the fill factor.
We investigated the robustness of the correction versus noise on the electrostatic potentials of the phase-shifting elements and showed that, for a zeroth-order phase plate, we get excellent results for $\epsilon \leq 2\pi /24$, which is well within reach of even simple digital–analog converter circuits.
This study demonstrates how an adaptive optical device can potentially be implemented in a noncorrected instrument, improving its performance without any significant changes in the optical setup of the column. Having adaptive optical elements in an electron microscope allows for increased flexibility and performance, which opens up a wide variety of experimental setups, ultimately expanding the microscope's capabilities.
Besides (adaptive) aberration correction, one of the potential applications for electrostatic phase plates is to shape the beam to either enlarge the depth of focus or increase the $z$-resolution. This first idea has been demonstrated before using a spatial light modulator that can generate Bessel-like beams invariant with propagation length (Chattrapiban et al., Reference Chattrapiban, Rogers, Cofield, Hill and Roy2003; Tao et al., Reference Tao, Lee and Yuan2003). However, we can also attempt to correct higher-order aberrations with a phase plate to increase the opening angle, potentially enhancing the $z$-resolution (which scales as the inverse of the aperture squared).
Another possible application is to use the phase plate for phase retrieval experiments. This idea has already been studied in optics (Zhang et al., Reference Zhang, Pedrini and Osten2007; Falldorf et al., Reference Falldorf, Agour, von Kopylow and Bergmann2010; Zhang & Rodenburg, Reference Zhang and Rodenburg2010), and the possibilities of fast and reliable wavefront tuning with the electrostatic phase plate can allow us to do the same in the electron microscope.
Increasing selectivity in inelastic scattering experiments by differential experiments changing the probe rapidly between two or more configurations is another class of applications that could shed light on, e.g., the magnetic, chiral, and optical response of materials at the nanoscale. Furthermore, having such adaptive apertures allows for automatic correction and optimization, self-tuning the phase of each of the segments to match the user's needs.
However, it is evident that the performance of the proposed designs in terms of $C_{s}$ correction capabilities is inferior to that of a modern multipole corrector. Still, the proposed setup would offer several significant advantages, such as small size (1 mm scale), low power consumption (1 W scale), high speed (up to 100 kHz, extendable to much higher), no hysteresis, vastly reduced precision constraints on drive electronics (12 bit suffices), negligible drift, and potentially low production cost.
These observations will guide further practical implementations with experimental realization of phase plate-based aberration correction on the nearby horizon.
Acknowledgments
All authors acknowledge funding from the Flemish Research Fund under contract G042820N “Exploring adaptive optics in transmission electron microscopy”. J.V. acknowledges funding from the European Union's Horizon 2020 Research Infrastructure – Integrating Activities for Advanced Communities under grant agreement No 823717 – ESTEEM3 and from the University of Antwerp through a TOP BOF project.