A Comparative Study of Ensemble Models for Thyroid Disease Prediction under Class Imbalance

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Thyroid disease is a significant medical condition affecting approximately 20 million Americans. The thyroid gland regulates metabolism through hormones such as triiodothyronine (T3) and thyroxine (T4), with disorders that typically manifest as hypothyroidism or hyperthyroidism. This study evaluates the performance of various machine learning models in predicting and diagnosing thyroid disease, including logistic regression, decision trees, random forest, XGBoost, support vector machines, neural networks, bagging and stackingmethods. The bagging model that used three decision trees achieved the highest F1 score of 0.9766, outperforming both Random Forest and XGBoost. Furthermore, experiments on class balancing through undersampling and regrouping significantly improved model performance, particularly for stacking models with XGBoost, which attained an F1 Score of 0.9944.

Article activity feed