An Interpretable Machine Learning Approach for Quantitative Precipitation Estimation from Multi-Source Remote Sensing Data

Kefeng Deng
Dawei Li
Di Zhang
Hongze Leng
Yudi Liu
Junqiang Song

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Machine Learning (ML) models are powerful tools for meteorological applications but often operate as "black boxes", hindering scientific understanding. This study addresses this challenge by implementing an interpretable ML approach Rulefit for Quantitative Precipitation Estimation (QPE). The algorithm was applied to a multisource dataset from eastern China, incorporating composite radar reflectivity from China’s New Generation DopplerWeather Radar, Digital Elevation Model data, and Himawari-8 satellite bands 7-10. A geographical mask based on China’s national borders was applied to exclude data points outside the land area. Additionally, standard scaling normalization was applied to all input features to account for different units and value ranges across datasets. The model’s performance was evaluated against six baseline models and traditional Z-R relationship. The RuleFit model demonstrated strong predictive performance, achieving a Critical Success Index (CSI) of 0.6015 and a Probability of Detection (POD) of 0.9359 for 10-minute precipitation events exceeding a 2mm threshold. This accuracy was comparable to other ML models and significantly surpassed traditional Z-R relationship methods. Crucially, the generated decision rules and Partial Dependence Plots(PDP) provided transparent insights into the model’s logic, revealing key non-linear interactions between radar and satellite features and showing an emphasis on predicting heavy rainfall. Our findings show that RuleFit is not only an accurate QPE tool but also a framework for uncovering meteorological relationships, thereby building researcher confidence and addressing the critical need for interpretability in ML-based atmospheric science.

Version published to 10.20944/preprints202511.0506.v1
Nov 10, 2025

A Machine Learning-Based Quality Control Algorithm for Heavy Rainfall Using Multi-Source Data

This article has 10 authors:
1. Hao Sun
2. Qing Zhou
3. Lijuan Shi
4. Cuina Li
5. Shiguang Qin
6. Dan Yao
7. Mingyi Xu
8. Yang Huang
9. Qin Hu
10. Yunong Guan
This article has no evaluationsLatest version Oct 16, 2025
Remotely Sensed Precipitation Estimates Using Hyrbid Machine Learning Models in a Monsoon-Dominated Climate

This article has 4 authors:
1. Swaranjit Roy
2. Md. Helal Ahmmed
3. Susmith Kundu
4. Abu Reza Md. Towfiqul I
This article has no evaluationsLatest version Sep 22, 2025
Imputing Missing Precipitation Data at Benin Synoptic Stations (West Africa) by Using Machine Learning Methods

This article has 3 authors:
1. Mawinesso Gnonyi N’Kaina
2. Noukpo Médard Agbazo
3. Gabin Koto N’Gobi
This article has no evaluationsLatest version Oct 14, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Machine Learning-Based Quality Control Algorithm for Heavy Rainfall Using Multi-Source Data

Remotely Sensed Precipitation Estimates Using Hyrbid Machine Learning Models in a Monsoon-Dominated Climate

Imputing Missing Precipitation Data at Benin Synoptic Stations (West Africa) by Using Machine Learning Methods