Interpretable Machine Learning Framework for Geochemical Classification: Advancing Mineral and Geothermal Resource Assessment

Thien Thuan Huynh
Quoc Lap Nguyen

Abstract

Accurate lithological classification from geochemical data is fundamental to quantitative resource exploration, evaluation, and risk reduction. This study develops an explainable ensemble learning framework that integrates Random Forest, XGBoost, CatBoost, and Multi-Layer Perceptron models to classify 3,868 igneous rock samples from their major oxide compositions. CatBoost performed best, with 89.9% accuracy and a macro-averaged F1 score of 85.7%, ahead of the other optimized models. Explainability analysis using SHAP (SHapley Additive exPlanations) quantitatively validated model outputs against petrological theory: SiO2 emerged as the dominant discriminator (importance: 1.026), followed by CaO and MgO, consistent with magmatic differentiation processes. The framework also reports prediction confidence to quantify geological uncertainty in resource assessment. Rapid, interpretable geochemical classification of this kind supports subsurface mapping and reduces exploration uncertainty in mineral and geothermal resource evaluation, and sub-second inference times make field deployment in exploration programs feasible. By linking machine learning outputs to geological understanding, the work offers transparent, high-accuracy classification suitable for mineral prospectivity mapping, geothermal reservoir characterization, and exploration risk assessment.
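The abstract reports accuracy and macro-averaged F1 separately and mentions a prediction confidence. The sketch below illustrates, on made-up data, why the two metrics can differ when rock classes are imbalanced, and one common way to turn class probabilities into a confidence score. The rock labels, the sample predictions, and the choice of top-class probability plus normalized entropy as the confidence measure are all assumptions for illustration; the preprint's abstract does not say how its confidence is computed.

```python
import math


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, so rare classes weigh as much as common ones."""
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); define it as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def confidence(probs):
    """Return (top-class probability, normalized entropy in [0, 1]) for one sample.

    Normalized entropy is 0 for a one-hot distribution and 1 for a uniform one.
    This is an assumed confidence measure, not necessarily the preprint's.
    """
    top = max(probs)
    entropy = -sum(p * math.log(p) for p in probs if p > 0)
    return top, entropy / math.log(len(probs))


# Hypothetical, imbalanced toy labels (not data from the preprint).
y_true = ["basalt"] * 6 + ["andesite"] * 3 + ["rhyolite"]
y_pred = ["basalt"] * 6 + ["andesite", "andesite", "basalt"] + ["andesite"]

acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
top, norm_entropy = confidence([0.90, 0.05, 0.05])
```

On this toy set the single misclassified rhyolite sample drags its class F1 to zero, so macro F1 falls well below accuracy even though most samples are classified correctly. A gap in the same direction appears between the reported 89.9% and 85.7%, although the abstract does not state its cause.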

Version published to 10.21203/rs.3.rs-8062402/v1 on Research Square, Nov 10, 2025

Related articles

An Automatic Classification Method for Igneous Rock Fractures Based on an Interpretable Ensemble Machine Learning Model

This article has 3 authors:
1. Meng Wang
2. Lu Yin
3. Quan Zhou
This article has no evaluations
Latest version Sep 26, 2025

Soil Geochemistry and Contamination Zoning in Northeastern Ghana: Insights From the Bongo and Talensi Districts

This article has 6 authors:
1. Belinda S. Berdie
2. Raymond W. Kazapoe
3. Darwin A. Awog-Badek
4. Blestmond A. Brako
5. Gordon Foli
6. Simon K. Y. Gawu
This article has no evaluations
Latest version Oct 27, 2025

A Comparative Study of TabNet and Classical Machine Learning Models for Landslide Prediction

This article has 2 authors:
1. Ali Aalianvari
2. Shirin Jahanmiri
This article has no evaluations
Latest version Oct 18, 2025
