1. Introduction
In contrast to the consistent decrease in Arctic sea-ice extent (SIE) during the satellite era, Antarctic SIE increased gradually until 2015 but has declined abruptly in recent years (e.g. Stuecker and others, 2017; Turner and Comiso, 2017; Turner and others, 2017; Kusahara and others, 2018; Schlosser and others, 2018; Wang and others, 2019). Although previous studies suggest that trends in Antarctic SIE may result from changes in the atmospheric circulation or ocean stratification (e.g. Hobbs and others, 2016; Jones and others, 2016; Meehl and others, 2019), it is difficult to characterize how Antarctic sea ice is responding to a changing climate without reliable sea-ice thickness (SIT) and sea-ice volume (SIV) estimates. Sea-ice data assimilation, which merges information from observations with that from models, can provide more accurate and useful estimates of sea-ice conditions than either observations or models alone (Buehner and others, 2017).
Because Antarctic sea-ice concentration (SIC) is observed successfully by satellite-based passive microwave instruments, whereas retrieving Antarctic SIT from satellites remains difficult (Turner and Comiso, 2017), many efforts have been devoted to constraining simulations of Antarctic sea ice with SIC observations. A simple nudging of model SIC to observations was shown to minimize the effect of observational errors in the interior of the ice pack (e.g. Lindsay and Zhang, 2006). More recently, SIC observations have also been assimilated with more advanced data assimilation methods, such as the ensemble Kalman filter (e.g. Massonnet and others, 2013) and 4-D variational analysis (e.g. Mazloff and others, 2010), in Antarctic sea ice–ocean models, which is likely to provide more balanced estimates of the model state. However, there are still large uncertainties in estimates of Antarctic SIT obtained by assimilating SIC only (e.g. Shi and others, 2021). With recent advances in retrieval methods, Soil Moisture and Ocean Salinity (SMOS) satellite data have been used to derive Antarctic SIT, which appears to be more accurate in the thin-ice regime (Tian-Kunze and others, 2014; Tian-Kunze and Kaleschke, 2018). Although this new kind of observation provides a unique opportunity to investigate how SIT observations constrain Antarctic sea-ice estimates, how best to assimilate SMOS-derived Antarctic SIT remains an open question.
In this study, a new Data Assimilation System for the Southern Ocean (DASSO) is established based on a regional Southern Ocean coupled sea ice–ocean model. The system allows for multivariate model updates in dynamical balance and can assimilate all available sea-ice observations directly. To the authors' knowledge, this is the first study to assimilate Antarctic SIT observations into a coupled sea ice–ocean model. The system therefore offers a unique opportunity to evaluate the impact of assimilating SIT together with SIC on Antarctic sea-ice modeling. In addition, because our previous studies suggest that an ensemble of atmospheric forcing can account for atmospheric uncertainties and thus mitigate the need for ensemble inflation in Arctic sea-ice data assimilation (e.g. Yang and others, 2016a, 2016b; Mu and others, 2018), we examine whether this also holds in the Antarctic.
2. Methodology
The DASSO system is based on the Massachusetts Institute of Technology general circulation model (MITgcm, Marshall and others, 1997) and the Parallel Data Assimilation Framework (PDAF, Nerger and Hiller, 2013). The model used in this study is a coupled sea ice–ocean model with the same physical configuration as Verdy and Mazloff (2017), which extends from the equator to 78° S. It has an average 1/3° horizontal grid spacing with 52 unevenly spaced vertical levels from 2.1 to 5800 m. The sea-ice component of the model is the viscous-plastic dynamic-thermodynamic sea-ice model (Losch and others, 2010). The dynamic part of the sea-ice model is solved by line successive over-relaxation (Zhang and Hibler, 1997) on a C grid, and the thermodynamic counterpart is a ‘zero-layer’ model (Semtner, 1976). The fifth generation of ECMWF atmospheric reanalyses (ERA5, Hersbach and others, 2020) provides the ensemble of atmospheric forcing used as surface boundary conditions, including air temperature and dewpoint temperature at 2 m, zonal and meridional wind speed at 10 m, surface downward shortwave and longwave radiation flux, surface pressure and total precipitation.
The assimilation scheme adopted in this study is the local error subspace transform Kalman filter (LESTKF, Nerger and others, 2012) provided in PDAF. The LESTKF provides not only consistent projections between the ensemble space and the error subspace, but also minimum transformations of the ensemble members. The solution is given by:
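The display equation did not survive in this copy. Assuming the standard LESTKF analysis step, which uses the symbols defined in the surrounding text (this is a reconstruction under that assumption, not a transcription of the authors' typesetting), Eqn (1) would read:

```latex
X^{\rm a} = \overline{X^{\rm f}}\,1_m
  + L\left[ A\,(HL)^{\rm T} R^{-1}\left(y^{\rm o} - H\overline{X^{\rm f}}\right) 1_m
  + \sqrt{m-1}\;C\,\Omega^{\rm T} \right] \qquad (1)
```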
where $X$ represents the ensemble of model states, $y^{\rm o}$ the observation vector, $R$ the observation error covariance matrix and $H$ the observation operator that interpolates from the model grid to observation locations. $L = X^{\rm f}\Omega$ is the ensemble projected onto the error subspace, and $\Omega$ is a projection matrix generated by Householder reflections. $A$ is a transform matrix defined by
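This equation is likewise missing here. In the standard LESTKF formulation (again an assumption about what the authors printed), the transform matrix is defined through its inverse, with $I$ the identity matrix of size $(m-1)\times(m-1)$:

```latex
A^{-1} = \rho\,(m-1)\,I + (HL)^{\rm T} R^{-1} HL \qquad (2)
```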
where $\rho \in (0, 1]$ is the forgetting factor, used as a tuning parameter of the analysis step to stabilize the filter performance. As $\rho$ decreases, the background error covariance is inflated, which reduces $A^{-1}$ in Eqn (2) and thereby draws $X^{\rm a}$ toward the observations through Eqn (1). $C$ is the square root of $A$, and $1_m = [1, 1, \ldots, 1] \in \mathbb{R}^{1\times m}$. The ensemble size $m$ is 10 here because only 10 ensemble members are available in ERA5. The overbar denotes the ensemble mean. The superscripts a, f, T and ′ denote analysis, background, matrix transpose and ensemble perturbation, respectively. Only horizontal localization is adopted in this study. The localization radius is about 100 km and the localization function is the Gaspari–Cohn function. To avoid inconsistencies introduced by the assimilation, we adopt the postprocessing suggested by Tietsche and others (2013) and Kimmritz and others (2018) to ensure the physical consistency of the sea-ice and ocean states. Initial ensemble perturbations are generated by second-order exact sampling (Pham, 2001) of daily output from a 3-month free run immediately preceding the start of the analysis.
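As an illustration of the localization weighting, the following minimal sketch evaluates the fifth-order piecewise rational function usually meant by the Gaspari–Cohn function. The function name `gaspari_cohn` and the mapping of the 100 km radius onto the function are assumptions: here 100 km is taken as the cutoff distance at which the weight reaches zero, so the half-width is 50 km. How PDAF applies the weights inside DASSO is not shown.

```python
def gaspari_cohn(distance_km: float, cutoff_km: float = 100.0) -> float:
    """Localization weight in [0, 1] for an observation at a given distance.

    Assumption: cutoff_km is the distance where the weight becomes zero,
    so the half-width c of the function is cutoff_km / 2.
    """
    if distance_km < 0.0 or cutoff_km <= 0.0:
        raise ValueError("distance must be >= 0 and cutoff must be > 0")
    c = cutoff_km / 2.0
    r = distance_km / c
    if r <= 1.0:
        # Inner branch: smooth decay from 1 at r = 0 to 5/24 at r = 1.
        return -0.25 * r**5 + 0.5 * r**4 + 0.625 * r**3 - (5.0 / 3.0) * r**2 + 1.0
    if r < 2.0:
        # Outer branch: continues the decay and reaches 0 at r = 2.
        return (r**5 / 12.0 - 0.5 * r**4 + 0.625 * r**3
                + (5.0 / 3.0) * r**2 - 5.0 * r + 4.0 - 2.0 / (3.0 * r))
    # Beyond the cutoff the observation has no influence.
    return 0.0


if __name__ == "__main__":
    for d in (0.0, 25.0, 50.0, 75.0, 100.0):
        print(f"{d:6.1f} km -> weight {gaspari_cohn(d):.4f}")
```

The compact support is the point of this choice: observations beyond the cutoff receive exactly zero weight, which keeps each local analysis small.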
Thin-ice SIT from SMOS and SIC from the Ocean and Sea Ice Satellite Application Facility (OSISAF, Lavergne and others, 2019) are used in data assimilation and, due to a lack of additional data, are also largely used to validate the simulations. Operational thin-ice SIT data retrieved from SMOS L-band brightness temperatures (Tian-Kunze and others, 2014) have been disseminated for the Arctic from 2010 to the present. The same retrieval algorithm has been applied to Antarctic sea ice, and the product is still under validation (Tian-Kunze and Kaleschke, 2018). Several validation activities have been carried out in the Arctic with promising results for the SMOS SIT product (Tian-Kunze and others, 2014). The uncertainty of the retrieved thickness increases with ice thickness, and the SMOS L-band signal loses its sensitivity for ice thicker than 1 m. Furthermore, the retrieved ice thickness is quite sensitive to weather changes because of the immediate thermodynamic equivalence assumption made in the retrieval algorithm (Tietsche and others, 2018). The thickness product is strictly limited to cold periods, i.e. from April to October for the Antarctic. The second version of the SIC interim climate data record (OSI-430-b), released by the European Meteorological Satellite Agency for both the Antarctic and the Arctic from 2016 onward, is retrieved from passive microwave instruments on board the Defense Meteorological Satellite Program (DMSP) satellites. Similar to the first version (OSI-409/OSI-409-a), OSI-430-b features an explicit correction of the satellite signal for weather contamination, dynamic adaptation of algorithm tie points and spatiotemporally varying maps of uncertainties.
In addition, OSI-430-b greatly reduces the occurrence of missing data in the final SIC fields by using all DMSP platforms available at any time and taking advantage of the overlap of satellite missions. As in our study for the Arctic (e.g. Yang and others, 2016b), the observation error of SIT used in the data assimilation is provided with the SMOS product and varies from 0.004 to 0.790 m, whereas that of SIC is set to a constant value of 0.25.
To illustrate the impact of sea-ice data assimilation on Antarctic sea-ice simulations and the extent of the model uncertainty as represented by the ensemble spread, three experiments are initialized from the Iteration 121 solution of the Biogeochemical Southern Ocean State Estimate (Verdy and Mazloff, 2017) and are conducted over the period from 15 April 2016 to 14 October 2016, which preceded an unprecedented retreat of Antarctic sea ice in the following austral spring. The control experiment for this study is a free run without data assimilation (denoted Ctrl). Two experiments that assimilate both SIC and SIT are then carried out, one with a forgetting factor of 1 (denoted F100) and the other with a forgetting factor of 0.5 (denoted F50). By comparing F100 with F50, we can identify how to assimilate Antarctic sea-ice observations effectively, which will lay the groundwork for further studies on Antarctic sea-ice data assimilation.
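To make the difference between the two forgetting factors concrete, the sketch below applies a scalar Kalman update in which the background variance is divided by the forgetting factor before the gain is computed. This is a one-variable analogue only, not the LESTKF used in DASSO; the function name `scalar_analysis` and the toy numbers are hypothetical.

```python
def scalar_analysis(background: float, obs: float,
                    bg_var: float, obs_var: float,
                    forgetting: float = 1.0) -> float:
    """Scalar Kalman analysis with forgetting-factor inflation.

    The background variance is inflated by 1 / forgetting, which raises
    the weight given to the observation.
    """
    if not 0.0 < forgetting <= 1.0:
        raise ValueError("forgetting factor must lie in (0, 1]")
    if bg_var < 0.0 or obs_var <= 0.0:
        raise ValueError("variances must be non-negative (obs_var > 0)")
    inflated_var = bg_var / forgetting
    gain = inflated_var / (inflated_var + obs_var)
    return background + gain * (obs - background)


if __name__ == "__main__":
    # Toy case: background 1.0, observation 0.0, equal variances.
    print(scalar_analysis(1.0, 0.0, 1.0, 1.0, forgetting=1.0))  # gain 1/2
    print(scalar_analysis(1.0, 0.0, 1.0, 1.0, forgetting=0.5))  # gain 2/3
```

With equal background and observation variances, halving the forgetting factor raises the gain from 1/2 to 2/3, so the analysis moves closer to the observation.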
3. Results
The experimental results are examined from both a deterministic and a probabilistic perspective. Figure 1 displays the root mean square error (RMSE) of SIC and SIT against observations. In Ctrl, the RMSE of SIC increases quickly until mid-June and levels off thereafter, whereas that of SIT increases throughout the simulation period. These discrepancies between simulations and observations can usually be attributed to model errors and initial errors. Considering the long memory of sea ice, it is essential to reduce initial errors through data assimilation. Assimilating observational data effectively reduces the RMSE of the Antarctic sea-ice simulation at all times. In F100, the RMSE of SIC increases before 17 June and then decreases to 0.088 (Fig. 1a), whereas that of SIT increases until early July and then remains approximately constant (Fig. 1d). The spatial distribution of RMSE differences shows that the data assimilation reduces the SIC error at the sea-ice edge (Fig. 1b) and the SIT error in the Weddell Sea sector as well as in coastal areas of the Antarctic continent (Fig. 1e). These improvements stress the impact of data assimilation on Antarctic sea-ice modeling. The RMSE of F50 is less than that of F100, as expected, but the RMSE reduction differs between SIC and SIT. First, the difference in RMSE between F50 and F100 is greater for SIT than for SIC. Second, in F50, the RMSE of SIT stays small with no trend (Fig. 1d), whereas that of SIC behaves similarly to F100 (Fig. 1a). Third, in terms of spatial distribution, F50 improves over F100 in the Southern Ocean for SIT (Fig. 1f), but not much for SIC (Fig. 1c).
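For reference, the RMSE of a simulated field against observations can be computed as in the minimal sketch below. It assumes an unweighted mean over grid cells where both values are present; whether the published figures use area weighting is not stated, and the function name `rmse` and the sample values are hypothetical.

```python
import math
from typing import Optional, Sequence


def rmse(model: Sequence[Optional[float]],
         obs: Sequence[Optional[float]]) -> float:
    """Root mean square error over cells where both values exist.

    None marks a missing value (e.g. no valid retrieval in that cell).
    """
    if len(model) != len(obs):
        raise ValueError("model and obs must have the same length")
    squared = [(m - o) ** 2 for m, o in zip(model, obs)
               if m is not None and o is not None]
    if not squared:
        raise ValueError("no cells with both model and observation")
    return math.sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    simulated_sic = [0.5, 0.9, 1.0]
    observed_sic = [0.5, 0.5, None]  # third cell has no observation
    print(rmse(simulated_sic, observed_sic))
```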
To quantify the difference between F100 and F50, the analysis increment vector (${\rm increment} = H\overline {X^{\rm a}} -H\overline {X^{\rm f}}$) is projected onto the innovation vector (${\rm innovation} = y^{\rm o}-H\overline {X^{\rm f}}$), which is defined as
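The defining equation is absent from this copy. Read literally as the scalar projection of one vector onto another, and with the choice of normalization flagged as an assumption, Eqn (3) would be:

```latex
{\rm projection} = \frac{{\rm increment} \cdot {\rm innovation}}{|{\rm innovation}|} \qquad (3)
```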
where the dot between increment and innovation denotes the inner product, and || denotes the norm of a vector. This projection measures the extent to which the observation information is absorbed in DASSO. As shown in Figs 2a, b, the projections of SIC and SIT are similar in F100. The projection of SIC (SIT) decreases rapidly in the first 4 (14) analysis cycles and then remains low until the end of the experiment, suggesting that most of the observation information absorbed in F100 comes from the first few analysis cycles. Combined with the results in Fig. 1, this indicates that Antarctic sea ice has a long memory, so that initial conditions should be important for its simulation and prediction. The evolutions of the SIC and SIT projections are quite different in F50. The projection of SIC decreases rapidly in the beginning, then decreases slowly until mid-June, followed by a gradual increase toward the end of the experiment. Meanwhile, the projection of SIT stays at a relatively high level. These discrepancies result in the different RMSE evolutions of SIC and SIT in F50 (Figs 1a, d).
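The projection diagnostic can be sketched as follows. The normalization is an assumption: `scalar_projection` divides the inner product by the norm of the innovation, and `absorbed_fraction` divides once more to give a dimensionless value that is 1 when the increment equals the innovation. Both function names and the sample vectors are hypothetical.

```python
import math
from typing import Sequence


def scalar_projection(increment: Sequence[float],
                      innovation: Sequence[float]) -> float:
    """Length of the increment's component along the innovation."""
    if len(increment) != len(innovation):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(increment, innovation))
    norm = math.sqrt(sum(b * b for b in innovation))
    if norm == 0.0:
        raise ValueError("innovation vector has zero norm")
    return dot / norm


def absorbed_fraction(increment: Sequence[float],
                      innovation: Sequence[float]) -> float:
    """Scalar projection divided by the innovation norm (dimensionless)."""
    norm = math.sqrt(sum(b * b for b in innovation))
    return scalar_projection(increment, innovation) / norm


if __name__ == "__main__":
    innovation = [0.2, -0.1, 0.4]
    increment = [0.5 * v for v in innovation]  # half of the innovation
    print(absorbed_fraction(increment, innovation))
```

An increment orthogonal to the innovation gives zero in both measures, meaning the analysis changed the state without moving it toward the observations.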
To further illustrate the influence of data assimilation on sea-ice simulation, Figs 2c, d show the evolution of SIC and SIT at (9.8° E, 68.1° S), where both variables have considerable variability and the observed SIT is always <1 m during the experiment. Compared with Ctrl and F100, F50 can reproduce the high-frequency fluctuations of SIC and SIT seen in OSISAF and SMOS. The observed high-frequency fluctuations of sea ice may be due to atmospheric processes on the synoptic time scale, which are not captured in Ctrl because of limitations in understanding and modeling air–ice–sea interaction (Swart and others, 2019). In addition, the obvious differences in sea-ice evolution between F100 and F50 can be attributed to whether a covariance inflation procedure is adopted in the assimilation, which directly affects the probability distribution of the ensemble model states.
To evaluate whether a collection of ensemble model states for a scalar variable satisfies the consistency condition, Talagrand diagrams for SIC and SIT are shown in Figs 3a, b. The Talagrand diagram is generated by ranking the values of the m ensemble members in increasing order, which divides the range of the target variable into m + 1 intervals; the verifying observation then falls into one of these m + 1 categories. The Talagrand diagrams of Ctrl and F100 both show a U-shaped distribution, suggesting that the ensemble members tend to be too similar to each other but different from the observation. Thus, the ensembles of Ctrl and F100 are underdispersed. The Talagrand diagram of F50 is rather flat, indicating that the probability distribution of the observations is quite well represented by the ensemble. In addition, there is an overpopulation of the smallest ranks in the Talagrand diagram of Ctrl, implying that both SIC and SIT are overestimated in Ctrl. This overestimation is reduced in the assimilation experiments, especially in F50.
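The ranking procedure described above can be sketched as below. The function name `rank_histogram` and the toy data are hypothetical, and ties are handled by a simple assumption: an observation equal to a member is counted in the lower interval. For bounded fields such as SIC, where many values coincide at 0 or 1, a randomized tie-break is often preferred.

```python
from typing import List, Sequence


def rank_histogram(ensembles: Sequence[Sequence[float]],
                   observations: Sequence[float]) -> List[int]:
    """Count how often the observation falls in each of the m + 1 intervals.

    ensembles[i] holds the m member values verifying against observations[i].
    Bin 0 means the observation lies below all members; bin m means above all.
    """
    if len(ensembles) != len(observations):
        raise ValueError("need one observation per ensemble")
    if not ensembles:
        raise ValueError("no data")
    m = len(ensembles[0])
    counts = [0] * (m + 1)
    for members, obs in zip(ensembles, observations):
        if len(members) != m:
            raise ValueError("all ensembles must have the same size")
        # Rank = number of members strictly below the observation.
        rank = sum(1 for value in members if value < obs)
        counts[rank] += 1
    return counts


if __name__ == "__main__":
    members = [[0.2, 0.4, 0.6]] * 3
    observed = [0.1, 0.5, 0.9]
    print(rank_histogram(members, observed))
```

A U shape corresponds to large counts in the first and last bins; an excess in the first bin alone means the observation tends to lie below every member, i.e. the ensemble overestimates.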
The temporal evolution of Antarctic SIE and SIV is shown in Figs 3c, d. The SIE is calculated as the sum of the area of all gridcells with SIC >15%. The SIV is calculated as the sum of the volume of all gridcells with SIT <1 m because of the saturation of SMOS brightness temperatures over thicker ice (Tian-Kunze and others, 2014). The observed SIE increases gradually and reaches a maximum on 27 August. In contrast, the observed SIV increases slowly during the whole simulation period. The evolution of SIE and SIV in Ctrl is similar to the observations, but there are significant positive biases, which are reduced in the assimilation experiments, especially in F50. However, although assimilating sea-ice observations reduces the RMSEs of SIV and SIE, the two quantities differ in terms of the anomaly correlation coefficient (ACC). Compared to Ctrl, the ACC of SIV is increased, whereas that of SIE is decreased, in the assimilation experiments. It is worth noting that SIC (SIT) observations play an important role in the simulation of SIE (SIV) during analysis cycles, and that the only difference in configuration between assimilating SIC and assimilating SIT in this study is the choice of the observation error. Thus, the differences in ACC between SIE and SIV might result from the different choices of observation error for SIC and SIT.
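The two integrated quantities defined above can be sketched as follows. The function names, units and sample values are hypothetical, and one assumption matters for the volume: SIT is treated as a grid-cell mean thickness, so the volume of a cell is thickness times cell area without an extra SIC factor.

```python
from typing import Sequence


def sea_ice_extent(sic: Sequence[float], cell_area_km2: Sequence[float],
                   threshold: float = 0.15) -> float:
    """Total area (km^2) of grid cells whose concentration exceeds 15%."""
    if len(sic) != len(cell_area_km2):
        raise ValueError("inputs must have the same length")
    return sum(area for conc, area in zip(sic, cell_area_km2)
               if conc > threshold)


def thin_ice_volume(sit_m: Sequence[float], cell_area_km2: Sequence[float],
                    max_thickness_m: float = 1.0) -> float:
    """Total volume (km^3) of grid cells with thickness below 1 m.

    Assumption: sit_m is the grid-cell mean thickness in metres.
    """
    if len(sit_m) != len(cell_area_km2):
        raise ValueError("inputs must have the same length")
    return sum((thick / 1000.0) * area  # metres -> kilometres
               for thick, area in zip(sit_m, cell_area_km2)
               if thick < max_thickness_m)


if __name__ == "__main__":
    sic = [0.10, 0.50, 0.90]
    sit = [0.05, 0.40, 1.50]
    area = [100.0, 100.0, 200.0]
    print(sea_ice_extent(sic, area))   # cells 2 and 3 count toward extent
    print(thin_ice_volume(sit, area))  # cell 3 is excluded (thicker than 1 m)
```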
To further validate the performance of DASSO, independent observations are required, and they should come from a different observation platform than the assimilated observations. However, independent data are lacking during the assimilation experiments (i.e. from 15 April to 14 October 2016), whereas in situ sea-ice observations provided by the Antarctic Sea Ice Processes & Climate program (ASPeCt) are available from 4 to 23 November 2016 (Fig. 4a). Hence, the experiments are extended from 15 October to 23 November, during which satellite-based observations are no longer assimilated. The correlations between the simulations and ASPeCt are low, owing to the difference in spatial resolution between the simulations (1/3°) and ASPeCt (~11.1 km) and to the significant variation of sea ice over horizontal distances as small as a few meters. Thus, bias and RMSE, which reflect the average features of the navigation area, are more suitable metrics for comparing the simulations with ASPeCt. As Figs 4d, e show, biases and RMSEs are effectively reduced in the experiments initialized from the assimilation runs (i.e. F100 and F50), compared with the experiment without assimilation (i.e. Ctrl). Moreover, compared to F100, F50 further reduces biases and RMSEs, especially for SIT. These results suggest that Antarctic sea ice has a long memory, and that a more reasonable sea-ice data assimilation scheme will play an essential role in both the simulation and the prediction of Antarctic sea ice.
4. Concluding remarks
This study introduces a data assimilation system called DASSO, based on a regional Southern Ocean coupled sea ice–ocean model, and presents a set of assimilation experiments to assess the impact of SIT as well as SIC observations on reproducing sea-ice conditions and variations during the period from 15 April to 14 October 2016. Generally, assimilating SIC and SIT improves the simulation of Antarctic sea ice, and in particular suppresses the positive biases of SIT and SIC in the free model run. Even though atmospheric uncertainties are partially accounted for by using the ERA5 atmospheric state ensemble, the ERA5 ensemble spread is usually too small because it mostly accounts for random uncertainties, whereas systematic model errors are not considered (Hersbach and others, 2020). Therefore, the uncertainty in modeled sea ice is also underestimated in the model ensemble, which eventually leads to filter divergence. Thus, how to fully account for the model uncertainty is of great importance in sea-ice data assimilation. Although Yang and others (2016a) and Mu and others (2018) found that additional inflation is unnecessary when the UKMO ensemble forcing is applied in Arctic sea-ice data assimilation, here a covariance inflation procedure is necessary in Antarctic sea-ice data assimilation to counteract the underestimation of ensemble variance resulting from the underdispersed ERA5 ensemble. This inflation significantly improves the simulation of Antarctic sea ice and lays the groundwork for further studies on Antarctic sea-ice data assimilation. It also suggests that there are clear differences in sea-ice data assimilation between the two poles, and that experience from Arctic sea-ice data assimilation cannot simply be transplanted to the Antarctic.
The results described here represent our first attempt to develop DASSO to improve Antarctic sea-ice simulation and estimation. Given the limitations of the present Antarctic SIT reanalysis data (Shi and others, 2021), further refinements and extensions are very much needed. For example, the difference between SIC and SIT RMSE in the assimilation experiments suggests that the observation error should be estimated in a more reasonable way. In addition, it is important to isolate the impact of different sea-ice observations on reproducing Antarctic sea ice–ocean variability. In future research, we would like to address this problem systematically based on DASSO. On a final note, since the experimental Antarctic Envisat and CryoSat-2 SIT data have recently been released, although with large uncertainties (Hendricks and others, 2018), and since the Antarctic ICESat-2 SIT data are also due to be released in the near future (Kacimi and Kwok, 2020), the prospects look bright for reconstructing long-term Antarctic SIT and volume through sea-ice data assimilation.
Acknowledgements
The authors thank two anonymous reviewers for their very helpful comments and suggestions. We thank Petra Heil of the Australian Antarctic Division and the Australian Antarctic Program Partnership for providing ASPeCt sea-ice observations from 4 to 23 November 2016. We also thank Guokun Lyv of Shanghai Jiao Tong University and Chao-Yuan Yang of Sun Yat-sen University for useful discussions. This is a contribution to the Year of Polar Prediction (YOPP), a flagship activity of the Polar Prediction Project (PPP), initiated by the World Weather Research Programme (WWRP) of the World Meteorological Organisation (WMO). We acknowledge the WMO WWRP for its role in coordinating this international research activity. This study is supported by the National Natural Science Foundation of China (Nos. 41941009, 41922044 and 42006191), and the Guangdong Basic and Applied Basic Research Foundation (No. 2020B1515020025). SMOS Sea Ice Thickness data are produced within the framework of the ‘SMOS sea ice thickness processing and dissemination service’ supported by ESA under contract No. 4000124731/18/I-EF. We thank the National Supercomputer Center in Guangzhou for providing computing resources.