A clustering-based surrogate model leveraging random forest for rapid pumping optimization in saltwater-intruded heterogeneous aquifers

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The efficient approximation of time-variant outputs from high-fidelity numerical models is essential for sustainable groundwater management in coastal aquifers. While surrogate models are increasingly deployed to bypass the computational costs of simulation-optimization (S-O) under heterogeneity, their performance often degrades in high-dimensional input spaces. This study proposes a novel ensemble clustering framework integrating hierarchical algorithms coupled with a Random Forest (RF) surrogate model to optimize pumping strategies across extensive well networks. The framework utilizes MODFLOW and SEAWAT to generate a foundational dataset of hydraulic drawdown and saltwater intrusion (SI) distributions. A primary innovation lies in our clustering-based dimensionality reduction, which effectively reduces 52 physical pumping wells to 10 representative proxy wells. This strategy significantly reduces input dimensionality while identifying near-optimal pumping patterns. To train the RF model, targeted SEAWAT simulations were subsequently implemented to generate 5,000 training samples, for each of which 100 realizations of hydraulic conductivity fields were generated. Results indicate that this integrated clustering-RF approach achieves 95% computational savings over traditional surrogate-numerical hybrids. This efficiency is realized through a drastic reduction in input variables via well-field classification and focused sampling near optimal extraction patterns. The resulting scalable framework provides a robust tool for decision-makers managing complex, saltwater-intruded aquifer systems.

Article activity feed