A Hybrid GPR Framework with Feature Selection for Enhanced Prediction and Explainability of Subgrade Soil Strength

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

In order to overcome the drawbacks of the labour-intensive California Bearing Ratio (CBR) test for pavement design, a Hybrid Gaussian Process Regression (GPR) model for predicting subgrade soil strength was developed. The model is optimised using the Bayesian Tree Parzen Estimator and uses Recursive Feature Elimination for feature selection. Several machine learning techniques, such as Multi-Linear Regression (MLR), Multi-Layer Perceptron of Artificial Neural Networks (MLPNN), and Random Forest (RF), were used in the first stage of evaluation on a regional geotechnical database with over 1000 samples. In the second stage, performant hybrid models based on Dense Neural Network and tree-based ensembles models were used. All of the machine learning models utilised in this study were outperformed by the Hybrid GPR model, according to the trial results. With the training and testing subsets' MSE, MAE, MAPE, and R2 ranging from 0.0768 to 0.2974, 0.2107 to 0.4364, 5.32% to 13.79%, and 0.9982 to 0.9935, respectively, it attained the highest accuracy. The SHAP interpretability method is used to analyse the prediction outcomes in order to further investigate the "black box" problem of the prediction model. According to the SHAP study, dry density, gravel content, and optimum moisture content are the main factors influencing subgrade soil strength based on CBR values. The results of the study show that the Hybrid Gaussian Process Regression model effectively captures the intricate nonlinear interactions between the feature parameters and offers a crucial foundation for CBR value estimation.

Article activity feed