Machine Learning-based Hydrological Models for Flash Floods: A Systematic Literature Review

Leonardo Santos
Luiz Satolo
Ricardo Oyarzabal
Elton Escobar-Silva
Michael Diniz
Rogério Negri
Glauston Lima
Stephan Stephany
Jaqueline Soares
Johan Duque
Fernando Saraiva-Filho
Luiz Bacelar

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

@rashi's saved articles (rashi)

Abstract

Flash floods are critical events for emergency management, yet their modeling remains highly challenging, even in smart cities approaches. Physically based hydrological models are often unsuitable at small spatiotemporal scales due to their computational complexity and dependence on detailed local parameters, which are rarely available during flash floods. With the growing availability of hydrological data, machine learning (ML) has emerged as a promising alternative. This work performs a Systematic Literature Review (SLR) to improve our understanding of the research landscape on ML applications for flash flood forecasting, a significant subset of flash flood modeling. From more than 1,200 papers published until January 2024 in Web of Science, SCOPUS/Elsevier, Springer/Nature, and Wiley, 50 were selected following PRISMA guidelines. The inclusion and exclusion criteria removed reviews, retractions, papers focused on post-flood damage assessment (not forecasting), and those with time resolutions of 6 hours or more, retaining only studies with fine-scale temporal data (<6 hours). For each paper, we extracted information on forecasting horizon, study area size, input data, ML techniques, and outcomes (regression or classification). Results show a sharp rise in ML-based flash flood research, with China leading (38%). Nearly all studies rely on rainfall, discharge, and water level data - often in combination. Long short-term memory (LSTM) networks dominate (60%). Unfortunately, only 10% of the selected studies provide access to their datasets. This lack of transparency poses a major barrier to reproducibility, inhibits fair comparative evaluation of models, and ultimately slows methodological progress in flash flood forecasting. Furthermore, our review highlights that no method consistently outperforms others. This variability in performance is likely influenced by factors such as regional hydrological characteristics (e.g., differences between arid and tropical basins), variations in input data quality, and the length of the forecast horizon (e.g., 1- vs. 6-hour prediction). Lastly, we recommend advancing this field through integration with early warning systems, creation of benchmarks, open data practices, and stronger multidisciplinary collaboration.

Version published to 10.31223/x5c699
Sep 30, 2025

Advanced Hydrological Forecasting with Machine Learning

This article has 4 authors:
1. Kundan Meshram
2. Umank MISHRA
3. Vikram Kumar
4. Maya Rajnarayan Ray
This article has no evaluationsLatest version Sep 18, 2025
Rapid Identification of Flood Inundation Areas and Dominant Drivers in Compound Floods Using Explainable Machine Learning

This article has 6 authors:
1. Jiqiang Xie
2. Bing Yu
3. Heng Lyu
4. Shengnan Fu
5. Chen Yang
6. Chi Zhang
This article has no evaluationsLatest version Oct 9, 2025
Flood Prediction with Artificial Intelligence An Exploratory Data Analysis Approach

This article has 7 authors:
1. Arya Vithal Mane
2. Rashmi Ravindra Halkarni
3. Pallavi Mahesh Bhat
4. Amarnath Mahesh Kakatikar
5. Rajkumar Raikar
6. Rajashri Khanai
7. Salma Shamashoddin Shahapur
This article has no evaluationsLatest version Sep 17, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Advanced Hydrological Forecasting with Machine Learning

Rapid Identification of Flood Inundation Areas and Dominant Drivers in Compound Floods Using Explainable Machine Learning

Flood Prediction with Artificial Intelligence An Exploratory Data Analysis Approach