Ensemble Machine Learning for Predicting TBM Penetration Rate with Limited Geotechnical Data

Halil Karahan
Devrim Alkaya

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Accurate prediction of TBM penetration rate (ROP) is of critical importance for the planning of tunneling operations and performance assessment. In this study, both classical multiple linear regression (MLR) and machine learning approaches—namely Random Forest, Bagged Trees, Support Vector Machine, and LSBoost—were employed to investigate the contributions of BI, UCS, DPW, α, and BTS parameters to ROP prediction. Univariate and MLR analyses exhibited limited explanatory power (R2 = 0.365), confirming that ROP is governed by complex, multivariate, and nonlinear interactions. Comparative machine learning analyses revealed that LSBoost provides the most reliable predictions, achieving the highest accuracy (R2 = 0.9565) and the lowest error metrics (RMSE = 0.1794; MAPE = 5.63%) for both original and normalized datasets. While Random Forest and Bagged Trees demonstrated comparable performance, SVM showed limited predictive capability on the original dataset (R2 = 0.452; RMSE = 0.637; MAPE = 18.60). However, its performance improved substantially following data normalization, approaching that of LSBoost (R2 = 0.936; RMSE = 0.218; MAPE = 4.87). Feature importance analyses based on PDP-driven Jacobian sensitivity and SHAP methods indicate that UCS, BI, and DPW are the dominant factors governing TBM penetration performance, while also demonstrating that model outputs remain interpretable in an interaction-aware manner. These findings highlight that machine learning-based approaches can deliver both reliable prediction and interpretability even with small and heterogeneous datasets, and suggest that future research should focus on integrating larger datasets, hybrid modeling strategies, and advanced explainability techniques.

Version published to 10.3390/app16073451
Apr 2, 2026
Version published to 10.20944/preprints202602.2047.v1
Feb 28, 2026

Interpretable Machine Learning for Predicting Splitting Strength of Asphalt Concrete: Insights from SHAP Analysis

This article has 7 authors:
1. Jianglei Xing
2. Xiao Tan
3. Yihao Li
4. Dongzhao Jin
5. Pengwei Guo
6. Yuhuan Wang
7. Huiya Niu
This article has no evaluationsLatest version Mar 30, 2026
Predictive Modeling of Soil Electrical Resistivity Using Ensemble Machine Learning Algorithms with Geotechnical Parameters

This article has 3 authors:
1. Kornkanok Sangprasat
2. Avirut Puttiwongrak
3. Shinya Inazumi
This article has no evaluationsLatest version Mar 4, 2026
A Hybrid GPR Framework with Feature Selection for Enhanced Prediction and Explainability of Subgrade Soil Strength

This article has 5 authors:
1. Billel Bouguedra
2. Khaled Sandjak
3. Mouloud Ouanani
4. Ahmed Hassan Backar
5. Hegazy Rezk
This article has no evaluationsLatest version Mar 12, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Interpretable Machine Learning for Predicting Splitting Strength of Asphalt Concrete: Insights from SHAP Analysis

Predictive Modeling of Soil Electrical Resistivity Using Ensemble Machine Learning Algorithms with Geotechnical Parameters

A Hybrid GPR Framework with Feature Selection for Enhanced Prediction and Explainability of Subgrade Soil Strength