Bandgap Prediction of Binary Compounds via a Machine Learning Approach Utilizing Gradient Boosting Regression
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
This study developed a predictive model for the bandgap of binary materials using a machine learning approach based on the Gradient Boosting Regressor (GBR). The model showed exceptional performance on the training set, achieving a Mean Absolute Error (MAE) of 0.087, a Root Mean Squared Error (RMSE) of 0.118, and an R 2 of 98%. While the model's performance on the unseen test data was lower, with an R 2 of 77%, this still indicates a strong capability to predict bandgaps, explaining over 77% of the variation. The performance drop suggests a minor degree of overfitting to the training data. Visualization of the results shows that while most predictions align closely with the ideal reference line, a few outliers lead to significant errors for certain materials. This is likely because compositional features alone are insufficient to capture the intricate physical properties governing these materials.