Deep learning based bias correction of TRMM precipitation estimates using IMD-gridded precipitation as ground observation

Sumanta Chandra Mishra Sharma; Adway Mitra

doi:10.1017/eds.2024.39

Part of: Climate Informatics 2024

Published online by Cambridge University Press: 02 January 2025

Sumanta Chandra Mishra Sharma*: Affiliation:
Department of Artificial Intelligence, Indian Institute of Technology Kharagpur, Kharagpur, India
Adway Mitra: Affiliation:
Department of Artificial Intelligence, Indian Institute of Technology Kharagpur, Kharagpur, India
*: Corresponding author: Sumanta Chandra Mishra Sharma; Email: sumantamishra22@gmail.com

Abstract

Bias correction is a critical aspect of data-centric climate studies, as it aims to improve the consistency between observational data and simulations by climate models or estimates by remote sensing. Satellite-based estimates of climatic variables like precipitation often exhibit systematic bias when compared to ground observations. To address this issue, the application of bias correction techniques becomes necessary. This research work examines the use of deep learning to reduce the systematic bias of satellite estimations at each grid location while maintaining the spatial dependency across grid points. More specifically, we try to calibrate daily precipitation values of tropical rainfall measuring mission based TRMM_3B42_Daily precipitation data over Indian landmass with ground observations recorded by India Meteorological Department (IMD). We have focused on the precipitation estimates of the Indian Summer Monsoon Rainfall (ISMR) period (June–September) since India gets more than 75% of its annual rainfall in this period. We have benchmarked these deep learning methods against standard statistical methods like quantile mapping and quantile delta mapping on the above datasets. The comparative analysis shows the effectiveness of the deep learning architecture in bias correction.

Keywords

bias correction; deep learning; ISMR; precipitation; satellite precipitation estimates

Information

Type: Application Paper
Information: Environmental Data Science, Volume 3, 2024, e41

DOI: https://doi.org/10.1017/eds.2024.39
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2024. Published by Cambridge University Press

Impact Statement

The application of deep learning techniques for bias correction of satellite precipitation data has significantly advanced our ability to obtain more accurate and reliable information for weather monitoring and analysis. This innovative approach addresses inherent biases in satellite precipitation estimates, enhancing the precision of meteorological data and thereby improving the quality of forecasts and climate studies. By mitigating biases in satellite precipitation, this deep learning-based correction method contributes to more informed decision-making processes, ultimately benefiting various sectors reliant on precise and unbiased meteorological information.

1. Introduction

Satellite-based precipitation estimates (SPEs) play a crucial role in providing valuable rainfall data for various applications, including climate research and weather monitoring. However, these rainfall data may be subject to biases due to multiple reasons. Some of the common reasons that introduce bias in SPEs include (i) imperfections in calibration and validation, (ii) sensor limitations, (iii) zonal bias, (iv) topographical effects, and (v) seasonal and regional variability. Hence, researchers and climate scientists often employ bias correction techniques and validation methods to mitigate these issues and improve the reliability of satellite-based rainfall data for various applications, including climate modeling, hydrological studies, and disaster risk assessment. Effective bias correction methods play a pivotal role in addressing systematic biases within climate model outputs and satellite estimations. The primary objective of these correction techniques is to align climate model simulations or SPEs with observational data, to produce reliable precipitation estimates (Tong et al., 2021; Yang et al., 2022).

Many bias correction techniques have been introduced in the literature to improve the accuracy of SPEs (Iqbal et al., 2022; Katiraie-Boroujerdy et al., 2020; Sun et al., 2021). The most prevalent statistical methods used in this field can be grouped into two broad categories, i.e., mean-based approaches and distribution-based approaches (Jaiswal et al., 2022; Wei et al., 2022). The mean-based techniques include linear and local intensity-based scaling methods, while the distribution-based approaches deal with cumulative distribution functions (Dinh and Aires, 2023; Holthuijzen et al., 2022; Pierce et al., 2015). Quantile mapping and quantile delta mapping are the most popular distribution-based bias correction techniques. These techniques try to establish a functional relationship between the climate model outputs or SPEs and the ground observation (Guo et al., 2019; Irwandi et al., 2023; Passow and Donner, 2020).

The advancement of artificial intelligence techniques and the availability of meteorological data (SPEs and climate model outputs) have introduced a new way to visualize and analyze climate variables. Many researchers have expanded their research area to address global and local climate issues (Kumar et al., 2023; Mishra Sharma and Mitra, 2022; Mitra, 2021; Sharma et al., 2023). The application of satellite-based products in climate informatics has further boosted this research by providing relevant climate data (Chen et al., 2022). The tropical rainfall measuring mission (TRMM)-based precipitation products are one such valuable source of climate information that helps research studies to understand the characteristics of this hydrometeorological variable.

Along with other climatic research domains, bias correction has become a prominent domain for AI-based research. Different research groups of data scientists (Fulton et al., 2023; Kim et al., 2021; Wang and Tian, 2022; Wang et al., 2023) have demonstrated their interest in tackling this climate science challenge. Recent studies on bias correction have highlighted the effectiveness of machine learning and deep learning techniques in rectifying the bias associated with spatiotemporal climate data (Han et al., 2021; Hu et al., 2021). Wang and Tian (2022) have used convolutional neural networks to correct the bias present in climate model outputs. Similarly, Chen et al. (2022) have used CNN-based models to correct the SPEs.

This research aims to examine the effectiveness and usefulness of deep learning-based architectures for bias correction. Here, the authors have tried to rectify the inherent bias present in the TRMM precipitation estimates by using gauge station-based ground observations. Figure 1 shows the proposed workflow of this work. As Figure 1 indicates, the proposed work gives a comparative analysis of different bias correction techniques. Mainly, the bias correction ability of both statistical and CNN-based models is examined here.

Figure 1. Flow diagram showing bias correction of SPEs.

The rest of this article is organized as follows: Section 2 introduces the study area and the dataset used for this work. Section 3 discusses the methodologies employed. Section 4 presents the comparative analysis of results for different bias correction techniques using various performance measures. Finally, Section 5 draws the conclusion.

2. Study area and dataset

2.1 Study area

This article concentrates on a designated study area, namely, the mainland of India. The study domain encompasses the latitude range of 6.75°N–38.5°N and the longitude range of 66.5°E–98.25°E. This research specifically incorporates precipitation data solely from the mainland of India, while precipitation values outside the specified area are treated as zero.

2.2 Dataset

In this research, the TRMM_3B42_Daily dataset (Huffman et al., 2016) is employed to illustrate the bias correction techniques. The TRMM_3B42_Daily dataset provides gridded rainfall values with a spatial resolution of 0.25° × 0.25°. This dataset is produced by NASA GES DISC from the research-quality 3-hourly TRMM Multi-Satellite Precipitation Analysis (TMPA_3B42). The other dataset used in this work is the IMD daily gridded rainfall dataset with a spatial resolution of 0.25° × 0.25° (Pai et al., 2014). The IMD precipitation dataset is prepared by the India Meteorological Department from rain gauge-based ground observations. Since this study focuses on the bias correction of gridded precipitation estimates and India receives more than 75% of its annual rainfall in the ISMR period, we have collected the gridded precipitation data for the ISMR period (June, July, August, and September) only. Here, the TRMM dataset is used as the biased data, and the IMD-gridded rainfall dataset serves as the ground observation. Based on the availability of TRMM data samples, we have collected both the biased data and observation samples for the period 1998–2019. The data from 1998–2014 are used for training the deep learning models, and the remaining data are used for testing. The initial samples collected from the IMD-gridded dataset have the shape of 129 × 135. Since we are concentrating on the Indian landmass only, we have excluded some unwanted rows and columns so as to get the required data for the study area mentioned above. The shape of each sample became 128 × 128 after this preprocessing.
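The 128 × 128 sample shape is consistent with the stated study domain at 0.25° spacing. The short sketch below checks that arithmetic; it assumes the domain bounds given in Section 2.1 are the coordinates of the first and last grid points (inclusive), which the text does not state explicitly, and the helper name `axis` is hypothetical.

```python
# Domain bounds and grid spacing as stated in the text (degrees).
RES = 0.25
LAT_MIN, LAT_MAX = 6.75, 38.5
LON_MIN, LON_MAX = 66.5, 98.25


def axis(start, stop, step):
    """Return evenly spaced coordinates from start to stop, both inclusive.

    Assumption of this sketch: the bounds are grid-point coordinates.
    """
    n = round((stop - start) / step) + 1
    return [start + i * step for i in range(n)]


lats = axis(LAT_MIN, LAT_MAX, RES)
lons = axis(LON_MIN, LON_MAX, RES)
print(len(lats), len(lons))  # 128 128
```

Under this reading, each axis spans 31.75° in 127 steps of 0.25°, which gives 128 grid points per axis.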

The proposed work aims to utilize the temporal relation in the data, so data samples are prepared by considering the temporal axis of the dataset. The three-dimensional convolutional model used in this work has a temporal depth of 10. Hence, each input sample for the model is prepared by taking ten consecutive samples from the TRMM dataset. The proposed work tries to correct the bias present in the Kth sample of TRMM by considering the Kth sample along with nine previous samples of TRMM. The targeted observation for this TRMM sample is the Kth observation of IMD. With this approach, we have prepared 113 input samples per year from the 122 samples available in the ISMR period. The corresponding 113 daily rainfall samples are used for the other bias correction techniques employed in this work. For the quantile-based mapping approaches, the calibration period is considered from 2010 to 2014, while the projection period spans from 2015 to 2019.
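The sample preparation described above can be sketched as a sliding window over one monsoon season. This is a minimal illustration, not the authors' code: the function name `make_windows` is hypothetical, integer day indices stand in for the 128 × 128 rainfall fields, and windows are assumed to be built season by season so that no window spans two years.

```python
DAYS_PER_SEASON = 122  # June-September: 30 + 31 + 31 + 30
DEPTH = 10             # temporal depth of the 3D convolutional model


def make_windows(trmm_days, imd_days, depth=DEPTH):
    """Pair each run of `depth` consecutive TRMM days with the IMD field of the last day."""
    if len(trmm_days) != len(imd_days):
        raise ValueError("TRMM and IMD series must have the same length")
    samples = []
    for k in range(depth - 1, len(trmm_days)):
        window = trmm_days[k - depth + 1 : k + 1]  # day K and the nine days before it
        samples.append((window, imd_days[k]))      # target: IMD observation of day K
    return samples


# Stand-in data: day indices instead of gridded rainfall fields.
trmm = list(range(DAYS_PER_SEASON))
imd = list(range(DAYS_PER_SEASON))
samples = make_windows(trmm, imd)
print(len(samples))  # 113
```

The first nine days of each season have no complete window, which is why 122 daily fields yield 122 − 9 = 113 samples per year.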

3. Methodology

3.1 Statistical methods for bias correction

The widely adopted statistical method for bias correction is the quantile mapping (QM) method. This approach aims to align the distribution of biased data with the distribution of observed data samples. If D_x,y denotes the cumulative distribution function of the dataset x during a time period y, then the bias-corrected satellite data for the projection period are expressed as,

(1)

$$ {R}_{BC}={D}_{o,h}^{-1}\left[{D}_{s,p}\left({R}_{s,p}\right)\right] $$

where R represents the variable of interest, and D ⁻¹ denotes the inverse of the cumulative distribution function D. The subscripts p and h signify the projection period and historical period respectively. Additionally, s is employed to represent satellite data, while o is used for observational data.
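Equation (1) can be illustrated at a single grid point with empirical distribution functions. The sketch below is one possible reading, not the implementation used in this study: the helper names, the plotting positions i/(n − 1), the linear interpolation, and the rainfall values are all assumptions made for illustration.

```python
from bisect import bisect_left


def ecdf(sorted_sample, x):
    """Empirical CDF D(x) with plotting positions i / (n - 1), clipped to [0, 1]."""
    n = len(sorted_sample)
    return min(bisect_left(sorted_sample, x) / (n - 1), 1.0)


def inverse_ecdf(sorted_sample, p):
    """Empirical inverse CDF D^{-1}(p), linearly interpolated between order statistics."""
    pos = min(max(p, 0.0), 1.0) * (len(sorted_sample) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_sample) - 1)
    return sorted_sample[lo] + (pos - lo) * (sorted_sample[hi] - sorted_sample[lo])


def quantile_mapping(obs_hist, sat_proj):
    """Equation (1): R_BC = D_{o,h}^{-1}[ D_{s,p}(R_{s,p}) ] for one grid point."""
    if len(obs_hist) < 2 or len(sat_proj) < 2:
        raise ValueError("each sample needs at least two values")
    obs_sorted, sat_sorted = sorted(obs_hist), sorted(sat_proj)
    return [inverse_ecdf(obs_sorted, ecdf(sat_sorted, r)) for r in sat_proj]


# Made-up values (mm/day): the satellite series is twice as wet as the gauges.
obs_hist = [0.0, 1.0, 2.0, 3.0, 4.0]
sat_proj = [8.0, 0.0, 4.0, 2.0, 6.0]
print(quantile_mapping(obs_hist, sat_proj))
```

In this toy case each satellite value is replaced by the observed value of the same rank, so the corrected series follows the observed distribution while keeping the day-to-day ordering of the satellite series.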

Another approach in statistical bias correction is the quantile delta mapping (QDM). In QDM, initially, the model projections or biased data undergo detrending based on quantiles, and then the simulated values are bias-corrected using QM with the transfer function established during the calibration period. Subsequently, the relative changes (for precipitation) in quantiles are multiplied with the bias-corrected model outputs to produce the final results.

Mathematically the bias-corrected output for climate variable ‘R’ at time ‘t’ using QDM is given by,

(2)

$$ {R}_{BC}=\left[{D}_{o,h}^{-1}\left[{D}_{s,p}^{(t)}\left({R}_{s,p}(t)\right)\right]\right]\times \left[\frac{R_{s,p}(t)}{D_{s,h}^{-1}\left[{D}_{s,p}^{(t)}\left({R}_{s,p}(t)\right)\right]}\right] $$

In the above multiplication, the first term represents the QM-based bias-corrected value at time t and the second term shows the relative change in quantiles.
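A small single-grid-point sketch of Equation (2) follows. It is an illustration under stated assumptions rather than the procedure used in this study: a single empirical CDF over the whole projection period replaces the time-dependent CDF, a zero denominator is handled by setting the relative change to 1, and all helper names and rainfall values are hypothetical.

```python
from bisect import bisect_left


def ecdf(sorted_sample, x):
    """Empirical CDF with plotting positions i / (n - 1), clipped to [0, 1]."""
    n = len(sorted_sample)
    return min(bisect_left(sorted_sample, x) / (n - 1), 1.0)


def inverse_ecdf(sorted_sample, p):
    """Empirical inverse CDF, linearly interpolated between order statistics."""
    pos = min(max(p, 0.0), 1.0) * (len(sorted_sample) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_sample) - 1)
    return sorted_sample[lo] + (pos - lo) * (sorted_sample[hi] - sorted_sample[lo])


def quantile_delta_mapping(obs_hist, sat_hist, sat_proj):
    """Equation (2): QM-corrected value times the relative change in quantiles."""
    obs_sorted = sorted(obs_hist)
    hist_sorted = sorted(sat_hist)
    proj_sorted = sorted(sat_proj)
    corrected = []
    for r in sat_proj:
        tau = ecdf(proj_sorted, r)                   # D_{s,p}(R_{s,p}(t))
        qm_value = inverse_ecdf(obs_sorted, tau)     # first factor of Equation (2)
        hist_value = inverse_ecdf(hist_sorted, tau)  # D_{s,h}^{-1}[tau]
        # Assumption: on dry quantiles (zero denominator) keep the QM value.
        delta = r / hist_value if hist_value > 0 else 1.0
        corrected.append(qm_value * delta)
    return corrected


# Made-up values (mm/day): the satellite is twice too wet in the historical
# period, and the projection period is 1.5 times wetter than the historical one.
obs_hist = [1.0, 2.0, 3.0, 4.0, 5.0]
sat_hist = [2.0, 4.0, 6.0, 8.0, 10.0]
sat_proj = [3.0, 6.0, 9.0, 12.0, 15.0]
print(quantile_delta_mapping(obs_hist, sat_hist, sat_proj))
```

Here the output equals the observed values scaled by 1.5: the factor-of-two bias is removed, while the relative change between the historical and projection periods is preserved.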

3.2 Deep learning based bias correction

Super resolution deep residual network

The SRDRN, or super resolution deep residual network, is a deep learning architecture used for bias correction as well as downscaling of climate data (Wang and Tian, 2022). The SRDRN architecture used in this work has sixteen residual blocks in its encoder part, and the encoder receives low-resolution biased data as input. For this model, the low-resolution input samples are prepared from the TRMM data by using bilinear interpolation. The interpolated low-resolution input samples of this model have the shape of 32 × 32. The decoder part of this network has two upsampling blocks, and this part enhances the resolution to get the final bias-corrected high-resolution output sample with dimensions 128 × 128.

Convolutional neural network for bias correction (CNNBC)

Convolutional neural network for bias correction is a 3D-CNN-based model for bias correction. We have proposed this model in our recent work on bias correction of CFS simulations (Mishra Sharma et al., 2024). This model takes a three-dimensional input with depth = 10 and produces the targeted bias-corrected output with depth = 1. The architecture of this model is shown in Figure 2.

Figure 2. Deep learning based CNNBC model.

As shown in Figure 2, this model uses four types of convolutional blocks along with averaging nodes and skip connections. In Figure 2, ‘f’ indicates the number of filters, while ‘k’ and ‘s’ represent the kernel shape and stride, respectively. The first type of convolutional block in CNNBC has a 3D kernel with a shape of (9,9,9) and a stride of (1,1,1). The convolutional layer of this block generates 64 feature maps from the single input sample. The second type of block takes the 64 feature maps as input and produces a lower-dimensional feature set by using 32 filters. The kernel used in this block has a shape of (3,3,3), and it uses a stride of one in all dimensions. The first and second types of convolutional blocks use ReLU as the activation function. These blocks also use dropout layers, which regularize the model and help to avoid overfitting. The third type of convolutional block contains a 3D convolution layer with linear activation. The convolution layer of this block has a single filter and a (5,5,5) kernel, with the same stride as the first and second blocks. The fourth type of convolutional block has a 3D convolution layer with a kernel shape of (10,1,1) and a stride of (10,1,1), and it is used as the final block of the CNNBC model. This block uses a ReLU activation function that provides nonlinearity and truncates negative estimates to zero. The final layer of CNNBC produces a bias-corrected output sample with a depth of 1. The model is trained with mean squared error (MSE) as the loss function and Adam as the optimizer. To regularize the training process, we have used an early stopping criterion with patience set to 20.
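To see how the kernel and stride settings take a depth-10 input to a depth-1 output, the shapes can be traced with the usual convolution output-size formula. The sketch below is only a shape calculation under assumptions that the text does not confirm: one block of each type applied in sequence (the averaging nodes and skip connections of Figure 2 are ignored), 'same' padding in the first three block types, 'valid' padding and a single filter in the fourth, and an input of shape (10, 128, 128).

```python
import math


def conv_out(size, kernel, stride, padding):
    """Output length along one axis for 'same' or 'valid' padding."""
    if padding == "same":
        return math.ceil(size / stride)
    return (size - kernel) // stride + 1


# (name, filters, kernel, stride, padding). Kernels, strides, and the filter
# counts of blocks 1-3 follow the text; padding and the single filter of
# block 4 are assumptions of this sketch.
BLOCKS = [
    ("block type 1", 64, (9, 9, 9), (1, 1, 1), "same"),
    ("block type 2", 32, (3, 3, 3), (1, 1, 1), "same"),
    ("block type 3", 1, (5, 5, 5), (1, 1, 1), "same"),
    ("block type 4", 1, (10, 1, 1), (10, 1, 1), "valid"),
]


def trace_shapes(input_shape=(10, 128, 128)):
    """Return (name, (depth, height, width), filters) after each block."""
    shape = input_shape
    trace = []
    for name, filters, kernel, stride, padding in BLOCKS:
        shape = tuple(
            conv_out(n, k, s, padding) for n, k, s in zip(shape, kernel, stride)
        )
        trace.append((name, shape, filters))
    return trace


for name, shape, filters in trace_shapes():
    print(name, shape, "filters:", filters)
```

Under these assumptions the first three block types keep the (10, 128, 128) shape, and the (10,1,1) kernel with stride (10,1,1) collapses the ten time steps into one, leaving a single 128 × 128 field.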

4. Result and discussion

In this section, the trained deep learning models and calibrated statistical techniques are evaluated using two standard performance measures, namely root-mean-square error (RMSE) and Pearson’s correlation coefficient (R). With these performance measures, a model can be considered most suitable for bias correction if it has a low RMSE value and a high correlation.

In this work, the performance measures are calculated at each valid grid point, as well as for the spatial mean rainfall values, by comparing the predicted values with the ground observations. In the first step, we calculated these performance measures at each valid grid location. Let $ {M}_{\left(i,j\right)}^t $ represent the model output or predicted value and $ {O}_{\left(i,j\right)}^t $ represent the observed value for a grid location (i, j) at time t (or test sample t). Then the RMSE and correlation (R) values calculated for the grid location (i, j) are given by

(3)

$$ {RMSE}_{\left(i,j\right)}=\sqrt{\frac{\sum \limits_{t=1}^T{\left({M}_{\left(i,j\right)}^t-{O}_{\left(i,j\right)}^t\right)}^2}{T}} $$

(4)

$$ {R}_{\left(i,j\right)}=\frac{\sum \limits_{t=1}^T\left({O}_{\left(i,j\right)}^t-\hat{O_{\left(i,j\right)}}\right)\left({M}_{\left(i,j\right)}^t-\hat{M_{\left(i,j\right)}}\right)}{\sqrt{\sum \limits_{t=1}^T{\left({O}_{\left(i,j\right)}^t-\hat{O_{\left(i,j\right)}}\right)}^2\sum \limits_{t=1}^T{\left({M}_{\left(i,j\right)}^t-\hat{M_{\left(i,j\right)}}\right)}^2}} $$

Where,

(5)

$$ \hat{M_{\left(i,j\right)}}=\frac{\sum \limits_{t=1}^T{M}_{\left(i,j\right)}^t}{T} $$

and,

(6)

$$ \hat{O_{\left(i,j\right)}}=\frac{\sum \limits_{t=1}^T{O}_{\left(i,j\right)}^t}{T} $$

where $ \hat{M_{\left(i,j\right)}} $ and $ \hat{O_{\left(i,j\right)}} $ denote the mean values of predicted rainfall (bias-corrected rainfall) and observed rainfall, respectively, for the grid location (i, j). Here, T indicates the total number of daily rainfall samples present in the test set.
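Equations (3) to (6) can be sketched in plain Python as follows. This is a minimal illustration with hypothetical function names and made-up values; fields are stored as nested lists indexed as `field[i][j]`, `valid` is a boolean mask of grid locations, and returning NaN for a constant series is a choice made for this sketch.

```python
import math


def rmse(pred, obs):
    """Equation (3): root-mean-square error over the T samples of one grid point."""
    return math.sqrt(sum((m - o) ** 2 for m, o in zip(pred, obs)) / len(pred))


def pearson_r(pred, obs):
    """Equation (4), using the means of Equations (5) and (6).

    Returns NaN when either series is constant (zero variance).
    """
    m_mean = sum(pred) / len(pred)
    o_mean = sum(obs) / len(obs)
    cov = sum((o - o_mean) * (m - m_mean) for m, o in zip(pred, obs))
    var_o = sum((o - o_mean) ** 2 for o in obs)
    var_m = sum((m - m_mean) ** 2 for m in pred)
    denom = math.sqrt(var_o * var_m)
    return cov / denom if denom > 0 else float("nan")


def gridwise(metric, pred, obs, valid):
    """Apply `metric` to the time series at every valid grid location (i, j)."""
    rows, cols = len(valid), len(valid[0])
    out = [[None] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            if valid[i][j]:
                out[i][j] = metric(
                    [field[i][j] for field in pred],
                    [field[i][j] for field in obs],
                )
    return out


# Made-up example: T = 3 daily fields on a 1 x 2 grid, second point invalid.
pred = [[[1.0, 0.0]], [[2.0, 0.0]], [[3.0, 0.0]]]
obs = [[[2.0, 0.0]], [[3.0, 0.0]], [[4.0, 0.0]]]
valid = [[True, False]]
print(gridwise(rmse, pred, obs, valid))
print(gridwise(pearson_r, pred, obs, valid))
```

In this toy case the prediction is always 1 mm/day too low, so the RMSE at the valid point is 1 while the correlation is 1, which shows why the two measures are reported together.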

The grid-wise RMSE values calculated for the biased data as well as for the bias-corrected outputs are depicted in Figure 3. These plots indicate that the deep learning based approaches, especially the CNNBC model, have lower errors than the other models. The CNNBC model effectively reduces the error present in the biased data for most of the regions across India. Similarly, the gridded correlation values obtained by comparing the model outputs with the observation samples are presented in Figure 4. Here also, we found that the CNNBC model outperforms the statistical methods and the other deep learning approach. Figure 4 shows that the bias-corrected rainfall values obtained by the CNNBC model are highly correlated with the observed precipitation values. The results also indicate that the statistical methods lag behind deep learning-based approaches in maintaining a good correlation between bias-corrected output and observation samples. To further analyze the model performance, we computed the average RMSE and correlation values over the gridded RMSE and correlation values presented in Figures 3 and 4, respectively. These average RMSE and correlation values are shown in Table 1. From Table 1, it can be observed that the CNNBC model has a low mean RMSE value and a high mean correlation value compared to the other approaches.

Figure 3. Spatial plots showing RMSE values calculated at each grid location.

Figure 4. Spatial plots showing correlation coefficient values calculated at each grid location.

Table 1. Comparison of mean values of gridded RMSE and gridded correlation coefficients

Apart from the grid-wise performance evaluation, we have also carried out the performance evaluation for spatial mean rainfall values. To prepare the mean rainfall samples, we have analyzed the gridded data samples and result samples. We took the area averaged value for each data sample by considering the valid grid locations. If ‘T’ represents the number of result samples and ‘Y’ indicates the number of valid grid locations in each sample, then the spatial mean rainfall value can be represented by a vector X with dimension T, i.e., X = [X₁, X₂,…,X_T]. Here, each X_t indicates the average value of Y grid points at time t.
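The construction of the spatial mean vector X can be sketched as follows. The function name `spatial_mean_series`, the nested-list layout `field[i][j]`, and the example values are assumptions made for illustration.

```python
def spatial_mean_series(fields, valid):
    """Return X = [X_1, ..., X_T]: the mean over the Y valid grid points of each field."""
    points = [
        (i, j)
        for i, row in enumerate(valid)
        for j, ok in enumerate(row)
        if ok
    ]
    if not points:
        raise ValueError("mask contains no valid grid locations")
    return [sum(field[i][j] for i, j in points) / len(points) for field in fields]


# Made-up example: T = 2 daily fields on a 2 x 2 grid with Y = 3 valid points.
fields = [
    [[1.0, 3.0], [0.0, 5.0]],
    [[2.0, 4.0], [0.0, 6.0]],
]
valid = [[True, True], [False, True]]
print(spatial_mean_series(fields, valid))  # [3.0, 4.0]
```

Grid locations outside the mask do not enter the average, so values that were set to zero outside the study area do not dilute the mean.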

After preparing the mean rainfall vectors for all the model outputs and observation samples, we calculated the RMSE and correlation coefficient. Let the spatial mean rainfall vector for a model output be represented by $ M $ and the mean rainfall vector of the observations by $ O $. Then the RMSE and correlation coefficient (R) can be calculated by using the following formulas,

(7)

$$ RMSE=\sqrt{\frac{\sum \limits_{i=1}^T{\left({M}_i-{O}_i\right)}^2}{T}} $$

(8)

$$ R=\frac{\sum \limits_{i=1}^T\left({O}_i-\hat{O}\right)\left({M}_i-\hat{M}\right)}{\sqrt{\sum \limits_{i=1}^T{\left({O}_i-\hat{O}\right)}^2\sum \limits_{i=1}^T{\left({M}_i-\hat{M}\right)}^2}} $$

where T indicates the number of values stored in the vector $ M $ or $ O $, and $ \hat{O} $ and $ \hat{M} $ denote the mean values of observations and predictions, respectively. The results obtained in this examination are shown in Table 2. The results indicate that the use of deep learning models for bias correction reduces the error in the daily mean rainfall values and improves the correlation between the bias-corrected and observed mean rainfall values.

Table 2. Performance measures calculated for the daily spatial mean rainfall values

The above analysis indicates that the CNNBC model effectively corrects the bias within the SPEs both at the grid level and for the spatial mean rainfall values. Furthermore, upon comparing the deep learning models used in this study in terms of their learnable parameters and floating-point operations (FLOPs), it is evident that SRDRN contains nearly 39 times more learnable parameters than CNNBC. This suggests that in terms of storage space requirements for the model parameters, CNNBC is much more economical than SRDRN. However, it is also observed that CNNBC requires more FLOPs compared to SRDRN due to its architecture and input–output shape. This highlights a potential avenue for future research, wherein researchers could enhance the CNNBC model to reduce the FLOPs while maintaining performance requirements. Overall, CNNBC appears to be a superior model for bias correction compared to others.
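The contrast between few parameters and many FLOPs can be illustrated with the first convolutional layer of CNNBC alone. The following is a back-of-the-envelope sketch, not the parameter or FLOP accounting used in this study: it assumes 'same' padding, so that the layer output keeps the shape (10, 128, 128), and it counts multiply-accumulate operations (one common convention takes FLOPs to be roughly twice this number). The function names are hypothetical.

```python
def conv3d_params(kernel, c_in, c_out):
    """Learnable parameters of a 3D convolution layer: weights plus biases."""
    kd, kh, kw = kernel
    return kd * kh * kw * c_in * c_out + c_out


def conv3d_macs(kernel, c_in, c_out, out_shape):
    """Multiply-accumulate operations: every filter is applied at every output position."""
    kd, kh, kw = kernel
    d, h, w = out_shape
    return kd * kh * kw * c_in * c_out * d * h * w


# First block type: (9,9,9) kernel, one input channel, 64 feature maps.
params = conv3d_params((9, 9, 9), 1, 64)
# Assumed output shape (depth, height, width) under 'same' padding.
macs = conv3d_macs((9, 9, 9), 1, 64, (10, 128, 128))
print(params, macs)
```

Because the same small set of weights is reused at each of the 10 × 128 × 128 output positions, the operation count of a layer grows with the size of its feature maps while the parameter count does not.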

5. Conclusion

The application of deep learning techniques in bias correction of satellite-based daily precipitation estimates offers promising advancements in enhancing the accuracy and reliability of precipitation estimates. Through the utilization of sophisticated neural network architectures, such as artificial neural networks and convolutional neural networks, significant improvements in bias correction performance can be achieved. By utilizing the vast amounts of data available from satellite estimations, these models can effectively learn complex spatiotemporal patterns inherent in precipitation distributions. To utilize and analyze the power of spatiotemporal convolutional architectures in bias correction, this study performs a comparative analysis between different statistical and deep learning-based bias correction techniques. The main objective of this work is to correct the systematic bias present in the satellite-based TRMM precipitation products. To achieve this goal, four different bias correction techniques, namely, QM, QDM, SRDRN, and CNNBC, are applied to the biased data to get the bias-corrected outputs. The comparative analysis of these models indicates that the CNNBC model is the most suitable model for correcting the daily TRMM precipitation estimates for the specified study area. This research can be further extended by optimizing the proposed architecture to get more reliable and improved bias-corrected results. Another area of future research is the use of these models for bias correction of different climatic variables in different geographical locations.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/eds.2024.39.

Data availability statement

The TRMM data used in this work are collected from: https://disc.gsfc.nasa.gov/datasets/TRMM_3B42_Daily_7/summary and the IMD-gridded data are available at: https://www.imdpune.gov.in/cmpg/Griddata/Rainfall_25_NetCDF.html.

Author contribution

Conceptualization: S.C.M.S.; A.M. Methodology & Experiments: S.C.M.S. Writing First Draft: S.C.M.S. Providing Technical Advice: A.M. Arranging Funds: A.M. Polishing the Draft: A.M.

Provenance

This article was accepted into the Climate Informatics 2024 (CI2024) Conference. It has been published in Environmental Data Science on the strength of the CI2024 review process.

Funding statement

This research has been partially funded by ISIRD Grant to Adway Mitra by Sponsored Research and Industrial Consultancy (SRIC), IIT Kharagpur, Grant No. IIT/SRIC/ISIRD/2020-2021/11.

Competing interest

The authors declare that no competing interests exist.

Ethics statement

The research meets all ethical guidelines, including adherence to the legal requirements of the study country.

References

Chen, H, Sun, L, Cifelli, R and Xie, P (2022) Deep learning for bias correction of satellite retrievals of orographic precipitation. IEEE Transactions on Geoscience and Remote Sensing 60, 4104611. https://doi.org/10.1109/TGRS.2021.3105438.Google Scholar

Dinh, TLA and Aires, F (2023) Revisiting the bias correction of climate models for impact studies. Climatic Change 176, 140. https://doi.org/10.1007/s10584-023-03597-y.CrossRef Google Scholar

Fulton, DJ, Clarke, BJ and Hegerl, GC (2023) Bias correcting climate model simulations using unpaired image-to-image translation networks. Artificial Intelligence for the Earth Systems 2, e220031. https://doi.org/10.1175/AIES-D-22-0031.1.CrossRef Google Scholar

Guo, Q, Chen, J, Zhang, X, Shen, M, Chen, H and Guo, S (2019) A new two-stage multivariate quantile mapping method for bias correcting climate model outputs. Climate Dynamics 53, 3603–3623. https://doi.org/10.1007/s00382-019-04729-w.CrossRef Google Scholar

Han, L, Chen, M, Chen, K, Chen, H, Zhang, Y, Lu, B, Song, L and Qin, R (2021) A deep learning method for bias correction of ECMWF 24–240 h forecasts. Advances in Atmospheric Sciences 38(9), 1444–1459. https://doi.org/10.1007/s00376-021-0215-y.CrossRef Google Scholar

Holthuijzen, M, Beckage, B, Clemins, PJ, Higdon, D and Winter, JM (2022) Robust bias-correction of precipitation extremes using a novel hybrid empirical quantile-mapping method. Theoretical and Applied Climatology 149, 863–882. https://doi.org/10.1007/s00704-022-04035-2.CrossRef Google Scholar

Hu, YF, Yin, FK and Zhang, WM (2021) Deep learning-based precipitation bias correction approach for Yin–he global spectral model. Meteorological Applications 28(5), e2032. https://doi.org/10.1002/met.2032.CrossRef Google Scholar

Huffman, GJ, Bolvin, DT, Nelkin, EJ and Adler, RF (2016) TRMM (TMPA) precipitation L3 1 day 0.25 degree x 0.25 degree V7. In Savtchenko, A (ed.), Goddard Earth Sciences Data and Information Services Center (GES DISC). https://doi.org/10.5067/TRMM/TMPA/DAY/7 (accessed 26 April 2023).CrossRef Google Scholar

Iqbal, Z, Shahid, S, Ahmed, K, Wang, X, Ismail, T and Gabriel, HF (2022) Bias correction method of high-resolution satellite-based precipitation product for peninsular Malaysia. Theoretical and Applied Climatology 148, 1429–1446. https://doi.org/10.1007/s00704-022-04007-6.CrossRef Google Scholar

Irwandi, H, Rosid, MS and Mart, T (2023) Effects of climate change on temperature and precipitation in the Lake Toba region, Indonesia, based on ERA5-land data with quantile mapping bias correction. Scientific Reports 13, 2542. https://doi.org/10.1038/s41598-023-29592-y.CrossRef Google Scholar PubMed

Jaiswal, R, Mall, RK, Singh, N, Lakshmi Kumar, TV and Niyogi, D (2022) Evaluation of bias correction methods for regional climate models: Downscaled rainfall analysis over diverse agroclimatic zones of India. Earth and Space Science 9, e2021EA001981. https://doi.org/10.1029/2021EA001981.CrossRef Google Scholar

Katiraie-Boroujerdy, P-S, Rahnamay Naeini, M, Akbari Asanjan, A, Chavoshian, A, Hsu, K-l and Sorooshian, S (2020) Bias correction of satellite-based precipitation estimations using Quantile mapping approach in different climate regions of Iran. Remote Sensing 12(13), 2102. https://doi.org/10.3390/rs12132102.CrossRef Google Scholar

Kim, H, Ham, YG, Joo, YS and Son, SW (2021) Deep learning for bias correction of MJO prediction. Nature Communications 12, 3807. https://doi.org/10.1038/s41467-021-23406-3.Google Scholar PubMed

Kumar, B, Atey, K, Singh, B, Chattopadhyay, R, Acharya, N, Singh, M, Nanjundiah, R and Rao, A (2023) On the modern deep learning approaches for precipitation downscaling. Earth Science Informatics 16, 1459–1472. https://doi.org/10.1007/s12145-023-00970-4.CrossRef Google Scholar

Mishra Sharma, SC, Kumar, B, Mitra, A and Saha, SK (2024) Deep learning-based bias correction of ISMR simulated by GCM. Atmospheric Research 309, 107589. https://doi.org/10.1016/j.atmosres.2024.107589.CrossRef Google Scholar

Mishra Sharma, SC and Mitra, A (2022) Resdeepd: A residual super-resolution network for deep downscaling of daily precipitation over India. Environmental Data Science 1, e19. https://doi.org/10.1017/eds.2022.23.CrossRef Google Scholar

Mitra, A (2021) A comparative study on the skill of CMIP6 models to preserve daily spatial patterns of monsoon rainfall over India. Frontiers in Climate 3, 654763. https://doi.org/10.3389/fclim.2021.654763.CrossRef Google Scholar

Pai, DS, Sridhar, L, Rajeevan, M, Sreejith, OP, Satbhai, NS and Mukhopadhyay, B (2014) Development of a new high spatial resolution (0.25° X 0.25°) long period (1901-2010) daily gridded rainfall data set over India and its comparison with existing data sets over the region. MAUSAM 65(1), 1–18.CrossRef Google Scholar

Passow, C and Donner, RV (2020) Regression-based distribution mapping for bias correction of climate model outputs using linear quantile regression. Stochastic Environmental Research and Risk Assessment 34, 87–102. https://doi.org/10.1007/s00477-019-01750-7.CrossRef Google Scholar

Pierce, DW, Cayan, DR, Maurer, EP, Abatzoglou, JT and Hegewisch, KC (2015) Improved bias correction techniques for hydrological simulations of climate change. Journal of Hydrometeorology 16, 2421–2442. https://doi.org/10.1175/JHM-D-14-0236.1.CrossRef Google Scholar

Sharma, D, Das, S, Chakraborty, D, Mitra, A and Goswami, B (2023) Physics-based 18-month lead forecast of Indian Summer Monsoon Rainfall. https://doi.org/10.21203/rs.3.rs-2923543/v1.CrossRef Google Scholar

Sun, L, Chen, H and Han, L (2021, July) Bias correction of satellite retrievals of orographic precipitation. In 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS. IEEE, pp. 7240–7243.

Tong, Y, Gao, X, Han, Z, Xu, Y, Xu, Y and Giorgi, F (2021) Bias correction of temperature and precipitation over China for RCM simulations using the QM and QDM methods. Climate Dynamics 57, 1425–1443. https://doi.org/10.1007/s00382-020-05447-4.

Wang, F and Tian, D (2022) On deep learning-based bias correction and downscaling of multiple climate models simulations. Climate Dynamics 59, 3451–3468. https://doi.org/10.1007/s00382-022-06277-2.

Wang, F, Tian, D and Carroll, M (2023) Customized deep learning for precipitation bias correction and downscaling. Geoscientific Model Development 16(2), 535–556. https://doi.org/10.5194/GMD-16-535-2023.

Wei, L, Jiang, S, Ren, L, Zhang, L, Wang, M, Liu, Y and Duan, Z (2022) Bias correction of GPM IMERG early run daily precipitation product using near real-time CPC global measurements. Atmospheric Research 279, 106403. https://doi.org/10.1016/j.atmosres.2022.106403.

Yang, X, Yang, S, Tan, ML, Pan, H, Zhang, H, Wang, G, He, R and Wang, Z (2022) Correcting the bias of daily satellite precipitation estimates in tropical regions using deep neural network. Journal of Hydrology 608, 127656. https://doi.org/10.1016/j.jhydrol.2022.127656.

Figure 1. Flow diagram showing bias correction of SPEs.

Figure 2. Deep learning based CNNBC model.

Figure 3. Spatial plots showing RMSE values calculated at each grid location.

Figure 4. Spatial plots showing correlation coefficient values calculated at each grid location.

Table 1. Comparison of mean values of gridded RMSE and gridded correlation coefficients

Table 2. Performance measures calculated for the daily spatial mean rainfall values

Author comment: Deep learning based bias correction of TRMM precipitation estimates using IMD-gridded precipitation as ground observation — R0/PR1

Published online by Cambridge University Press: 02 January 2025

DOI: https://doi.org/10.1017/eds.2024.39.pr1

Sumanta Chandra Mishra Sharma

Centre of Excellence in Artificial Intelligence, Indian Institute of Technology Kharagpur, India

Revision round: 0

Role: author

Comments

Sumanta Chandra Mishra Sharma

(Corresponding Author)

Date: 26/07/2024

Editorial Team

Environmental Data Science

Cambridge University Press

Dear Editors,

I am pleased to submit the final version of our manuscript, titled “Deep Learning based bias correction of TRMM precipitation estimates using IMD gridded precipitation as ground observation,” for publication in the special issue on Climate Informatics 2024 of the journal Environmental Data Science. The manuscript has been prepared according to the journal’s guidelines.

Our research examines the use of Deep Learning to reduce systematic bias in satellite estimations while maintaining spatial dependency across grid points. We calibrate daily precipitation values from TRMM_3B42_Daily data over the Indian landmass with ground observations from the India Meteorological Department (IMD), focusing on the Indian Summer Monsoon Rainfall (June-September). Benchmarking against standard statistical methods like quantile mapping and quantile delta mapping, our comparative analysis demonstrates the effectiveness of deep learning in bias correction. Our findings contribute to a deeper understanding of bias correction in satellite precipitation estimates and offer innovative approaches to improve the accuracy of such estimations, which is crucial for climate informatics and related applications.

We believe our findings offer valuable insights for climate informatics and align well with the goals of this special issue. Thank you for considering our manuscript for publication.

Sincerely,

Sumanta Chandra Mishra Sharma

Centre of Excellence in Artificial Intelligence,

Indian Institute of Technology Kharagpur,

Kharagpur, 721302, India.

E-mail: sumantamishra22@gmail.com

Review: Deep learning based bias correction of TRMM precipitation estimates using IMD-gridded precipitation as ground observation — R0/PR2

Published online by Cambridge University Press: 02 January 2025

DOI: https://doi.org/10.1017/eds.2024.39.pr2

Andrew Hyde, Cambridge University Press, United Kingdom of Great Britain and Northern Ireland

Date of review: 28 September 2024

Revision round: 0

Role: reviewer

Recommendation/decision: minor-revision

Conflict of interest statement

Reviewer declares none.

Comments

>Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

This work demonstrates the utility of a deep learning framework to apply bias corrections to a remotely sensed precipitation dataset, TRMM daily observations, using gridded ground-based observations as truth. The work focuses on the mainland of India during monsoon season (June-September for 1998-2019). Results are compared with commonly employed statistical bias correction techniques (e.g. Quantile Mapping, Quantile Delta Mapping).

>Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

The topic of advancing and improving bias correction techniques for remotely sensed precipitation data is highly relevant to open interdisciplinary climate informatics problems. However, the techniques presented here do not seem to significantly improve over current methods. It is not clear whether the investment in computational cost is worth the incremental improvement in RMSE and correlation coefficients presented in this work. Admittedly, the spatial maps appear more impressive than the aggregated numbers summarized in the tables. I think it would be interesting to detail the computational costs of the multiple methods presented. Overall, I think it is worth exploring these different methodologies for bias correction, and I think the work should be shared amongst the community for general awareness.

> Detailed Comments

I would recommend editorial review by native English speakers for minor corrections in grammar.

Recommendation: Deep learning based bias correction of TRMM precipitation estimates using IMD-gridded precipitation as ground observation — R0/PR3

Published online by Cambridge University Press: 02 January 2025

DOI: https://doi.org/10.1017/eds.2024.39.pr3

Douglas Rao, NC Institute for Climate Studies, North Carolina State University, United States

Date of review: 28 September 2024

Revision round: 0

Role: Editor

Recommendation/decision: accept

Comments

This article was accepted into Climate Informatics 2024 Conference after the authors addressed the comments in the reviews provided. It has been accepted for publication in Environmental Data Science on the strength of the Climate Informatics Review Process.

Decision: Deep learning based bias correction of TRMM precipitation estimates using IMD-gridded precipitation as ground observation — R0/PR4

Published online by Cambridge University Press: 02 January 2025

DOI: https://doi.org/10.1017/eds.2024.39.pr4

Claire Monteleoni, University of Colorado Boulder, United States

Revision round: 0

Role: Editor in Chief

Recommendation/decision: accept

Comments

No accompanying comment.
