Source identification of sudden water pollution events in the Dongliao River using a hybrid AI framework

Yanchen Wang
Yu Wang
Peng Shi
Jian min Bian
Caidie Chen

Abstract

This study presents a novel hybrid framework for rapid and robust identification of sudden water pollution sources by integrating machine learning (ML) with numerical modeling, enabling high-precision inversion of source parameters while quantifying their uncertainties. A MIKE21 hydrodynamic-water quality model of the Dongliao River was developed to generate a synthetic dataset, which was used to train and evaluate long short-term memory (LSTM), kernel extreme learning machine (KELM), and support vector machine (SVM) surrogate models. Among them, the LSTM achieved the highest accuracy (R² = 0.98, RMSE = 0.03) and was selected for further integration. For deterministic source identification, a whale optimization algorithm (WOA)-LSTM model was developed, reducing the average inversion error to 6.89% (source location error < 3%) and computation time to 233 seconds. A probabilistic inversion system was subsequently established by coupling the WOA-LSTM model with a Bayesian framework, which characterized the posterior probability distributions of source parameters with an average error of 5.26%. To assess robustness, a comparative analysis under a 5% data noise scenario revealed that the probabilistic approach achieved an average relative error of 5.39%, representing a 47.2% improvement over the deterministic method’s 10.22% error. These findings demonstrate that integrating a physics-informed ML surrogate with Bayesian inference effectively addresses uncertainty and computational cost in environmental inverse problems, offering a powerful tool for intelligent early warning and precise management of sudden water pollution incidents.
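
To make the idea of coupling a fast surrogate with Bayesian inference concrete, here is a minimal sketch in Python. It is not the authors' WOA-LSTM or MIKE21 setup. As a stand-in for the trained surrogate it uses the analytical 1D advection-dispersion response to an instantaneous release, and as a stand-in for the Bayesian sampler it uses a plain random-walk Metropolis algorithm. All constants (velocity, dispersion coefficient, cross-section, station position, prior bounds, step sizes) and all function names are hypothetical, chosen only for illustration. The noise level is assumed to be 5% of the peak concentration.

```python
import numpy as np

# Hypothetical stand-in for a trained surrogate: analytical 1D
# advection-dispersion response to an instantaneous release at t = 0.
U, D, AREA = 0.5, 20.0, 50.0   # velocity (m/s), dispersion (m^2/s), cross-section (m^2)
X_OBS = 10_000.0               # assumed monitoring station position (m)
T_OBS = np.linspace(8_000.0, 24_000.0, 40)  # assumed sampling times (s)

# Assumed uniform prior bounds for (mass in kg, source location in m).
LOWER = np.array([200.0, 0.0])
UPPER = np.array([3000.0, 5000.0])


def surrogate(mass, x0):
    """Concentration (kg/m^3) at the station for `mass` kg released at `x0` m."""
    spread = 4.0 * D * T_OBS
    peak = mass / (AREA * np.sqrt(np.pi * spread))
    return peak * np.exp(-((X_OBS - x0 - U * T_OBS) ** 2) / spread)


def log_posterior(theta, observed, sigma):
    """Uniform prior inside the bounds plus a Gaussian likelihood."""
    if np.any(theta < LOWER) or np.any(theta > UPPER):
        return -np.inf
    resid = observed - surrogate(*theta)
    return -0.5 * np.sum(resid ** 2) / sigma ** 2


def metropolis(observed, sigma, n_iter=20_000, step=(20.0, 20.0), seed=0):
    """Random-walk Metropolis sampler over (mass, x0)."""
    rng = np.random.default_rng(seed)
    theta = np.array([1600.0, 2500.0])  # start at the prior centre
    logp = log_posterior(theta, observed, sigma)
    chain = np.empty((n_iter, 2))
    for i in range(n_iter):
        proposal = theta + rng.normal(0.0, step)
        logp_new = log_posterior(proposal, observed, sigma)
        # Accept with probability min(1, posterior ratio).
        if np.log(rng.uniform()) < logp_new - logp:
            theta, logp = proposal, logp_new
        chain[i] = theta
    return chain


# Synthetic "observations": true source plus noise at 5% of the peak value.
truth = np.array([1000.0, 2000.0])
clean = surrogate(*truth)
sigma = 0.05 * clean.max()
observed = clean + np.random.default_rng(42).normal(0.0, sigma, size=clean.shape)

chain = metropolis(observed, sigma)
posterior = chain[5_000:]            # discard burn-in samples
post_mean = posterior.mean(axis=0)
post_std = posterior.std(axis=0)
rel_err = 100.0 * np.abs(post_mean - truth) / truth
print("posterior mean (mass, x0):", post_mean)
print("posterior std  (mass, x0):", post_std)
print("relative error (%):", rel_err)
```

The posterior standard deviations are what a probabilistic inversion adds over a single deterministic estimate: they indicate how tightly the noisy observations constrain each source parameter. Because every posterior evaluation calls the forward model, a cheap surrogate is what makes tens of thousands of samples affordable.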

Version published to 10.21203/rs.3.rs-7711289/v1 on Research Square
Sep 26, 2025

Related articles

A novel hybrid machine learning approach for suspended sediment load forecasting: A case study of Mazandaran rivers basins

This article has 3 authors:
1. Ali Akbar Eatesam
2. Khosrow Hosseini
3. Hojat Karami
This article has no evaluations. Latest version: Oct 7, 2025

Rapid Identification of Flood Inundation Areas and Dominant Drivers in Compound Floods Using Explainable Machine Learning

This article has 6 authors:
1. Jiqiang Xie
2. Bing Yu
3. Heng Lyu
4. Shengnan Fu
5. Chen Yang
6. Chi Zhang
This article has no evaluations. Latest version: Oct 9, 2025

A Comparative Study of TabNet and Classical Machine Learning Models for Landslide Prediction

This article has 2 authors:
1. Ali Aalianvari
2. Shirin Jahanmiri
This article has no evaluations. Latest version: Oct 18, 2025
