Remotely Sensed Precipitation Estimates Using Hybrid Machine Learning Models in a Monsoon-Dominated Climate

Swaranjit Roy
Md. Helal Ahmmed
Susmith Kundu
Abu Reza Md. Towfiqul I

Abstract

Precise precipitation estimation is vital for effective water resource management, disaster planning, and climate adaptation, particularly in data-deficient, monsoon-dominated subtropical countries such as Bangladesh. However, conventional approaches that rely on rain gauge networks or meteorological models face challenges such as sparse spatial coverage and high installation and maintenance costs. To fill this gap, this study proposes a hybrid machine learning model that estimates rainfall from remote sensing (RS) datasets (CHIRPS, PERSIANN, and ERA5) and applies gradient-boosting-based bias correction. The model combines XGBoost, linear regression (LR), and K-Nearest Neighbors (KNN) in a stacked ensemble. The bias correction used monthly rainfall data from nine locations in Bangladesh (1990–2019) to fix common RS issues, such as poor detection of seasonal shifts and underestimation of peak rainfall. The meta-model outperformed the individual ML models (LR, RF, XGBoost, KNN), with R² values consistently above 0.75. Combining all three RS datasets improved performance (R² = 0.9) compared with using two (R² > 0.87) or one (R² > 0.8). Bias correction substantially enhanced predictive accuracy across geographical locations: after correction, R² increased by 6.5–13.1%, RMSE decreased by 22–60.1%, and MAE fell by 49.6–71.4%. It also reduced pre-monsoon and monsoon errors, increasing robustness. The model achieved a median R² of 0.95, RMSE of 25 mm, and MAE of 15 mm. Overall, the hybrid meta-model outperformed all individual ML models in predicting rainfall from RS datasets, and bias correction significantly improved performance across contexts. This study is the first to compare various ML models with the proposed meta-model while integrating multiple RS datasets to improve accuracy. The model’s limited ability to capture localized precipitation in complex terrain and over short timeframes highlights the need to incorporate more climatic variables and advanced neural networks to improve accuracy and scalability.
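The stacked-ensemble idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses synthetic data in place of the CHIRPS, PERSIANN, ERA5, and gauge records, it stacks only a linear-regression and a KNN base learner (the XGBoost learner and the bias-correction step are omitted to keep the example dependent on NumPy alone), and the function names `fit_linear`, `fit_knn`, and `fit_stack` are hypothetical. The meta-learner is trained on out-of-fold base predictions, which is the usual way to keep it from simply rewarding base learners that overfit.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns a predict function."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return lambda Xn: np.column_stack([np.ones(len(Xn)), Xn]) @ coef

def fit_knn(X, y, k=5):
    """KNN regressor: mean target of the k closest training rows."""
    def predict(Xn):
        dist = np.linalg.norm(Xn[:, None, :] - X[None, :, :], axis=2)
        nearest = np.argsort(dist, axis=1)[:, :k]
        return y[nearest].mean(axis=1)
    return predict

# Base learners in this sketch (assumption: no gradient-boosted learner here).
BASE_FITTERS = [fit_linear, fit_knn]

def fit_stack(X, y, n_folds=5, seed=0):
    """Train base learners, then a linear meta-learner on out-of-fold predictions."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(X)), n_folds)
    oof = np.zeros((len(X), len(BASE_FITTERS)))
    for held_out in folds:
        train = np.setdiff1d(np.arange(len(X)), held_out)
        for j, fit in enumerate(BASE_FITTERS):
            oof[held_out, j] = fit(X[train], y[train])(X[held_out])
    meta = fit_linear(oof, y)                       # meta-learner
    base = [fit(X, y) for fit in BASE_FITTERS]      # refit on all training data
    return lambda Xn: meta(np.column_stack([m(Xn) for m in base]))

def r2_score(y_true, y_pred):
    """Coefficient of determination."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

# Synthetic stand-in data: 360 "monthly" gauge totals (mm) and three noisy,
# biased estimates playing the role of the three RS products.
rng = np.random.default_rng(42)
n = 360
truth = rng.gamma(shape=2.0, scale=100.0, size=n)
X = np.column_stack([
    0.8 * truth + rng.normal(0, 20, n),
    1.1 * truth + rng.normal(0, 30, n),
    0.9 * truth + rng.normal(0, 25, n),
])

# Chronological split: first 288 months for training, last 72 for testing.
X_train, X_test = X[:288], X[288:]
y_train, y_test = truth[:288], truth[288:]

predict = fit_stack(X_train, y_train)
y_hat = predict(X_test)
r2 = r2_score(y_test, y_hat)
rmse = float(np.sqrt(np.mean((y_test - y_hat) ** 2)))
mae = float(np.mean(np.abs(y_test - y_hat)))
print(f"R2={r2:.3f}  RMSE={rmse:.1f} mm  MAE={mae:.1f} mm")
```

The scores printed here describe the synthetic data only and say nothing about the results reported in the abstract. A gradient-boosted learner could be added as a third entry in `BASE_FITTERS`, and a residual model fitted to the gap between stacked predictions and gauge values would be one plausible way to sketch the bias-correction stage.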

Version published to 10.21203/rs.3.rs-7652794/v1 on Research Square
Sep 22, 2025

Related articles

Seasonal Weather Pattern Prediction From Enso Indices Using Machine Learning

This article has 5 authors:
1. M. Mohsin
2. T. Ghosh
3. F. Akter
4. S. Sarkar
5. Md. R.A. Mullick
This article has no evaluations. Latest version: Oct 17, 2025

EWLR – A New Method for Interpolating Elevation-Driven Variables: Annual Rainfall in Erbil Governorate

This article has 1 author:
1. Azad Rasul
This article has no evaluations. Latest version: Nov 12, 2025

Imputing Missing Precipitation Data at Benin Synoptic Stations (West Africa) by Using Machine Learning Methods

This article has 3 authors:
1. Mawinesso Gnonyi N’Kaina
2. Noukpo Médard Agbazo
3. Gabin Koto N’Gobi
This article has no evaluations. Latest version: Oct 14, 2025
