Evaluation of Classical and Ensemble Machine Learning Algorithms for Thyroid Cancer Diagnosis: A Comparative Evaluation

Kamorudeen Amuda

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Thyroid cancer is a growing global health concern, necessitating reliable and accurate diagnostic tools to support early detection and clinical decision-making. This study aims to develop and implement classical and ensemble machine learning models based on clinical, demographic, and biochemical data to predict thyroid cancer risk. Pearson correlation analysis was employed to identify and select the most relevant features for model training. A range of classifiers was optimized using hyperparameter tuning and cross-validation strategies. To assess robustness and generalizability, model performance was evaluated using accuracy, precision, recall, and F1-score across two independent datasets. Results show that ensemble models, particularly CatBoost, Bagging (Random Forest), and XGBoost, achieved the highest performance, with accuracies of up to 98.70% and F1-scores of 0.99 on Dataset 2, while maintaining consistent performance on Dataset 1 with accuracies around 82.51%. Classical models such as Logistic Regression, LDA, and SVM also performed competitively, achieving up to 97.40% accuracy on Dataset 2 and 82.51% on Dataset 1. These findings demonstrate the effectiveness of combining feature selection with optimized machine learning models and highlight the potential of ensemble approaches for improving thyroid cancer risk as- sessment in clinical practice.

Version published to 10.20944/preprints202507.1436.v1
Jul 17, 2025

Implementation and Evaluation of Support Vector Machine-Based Models for Cancer Detection Using Multi-Omic Data: A Systematic Review

This article has 12 authors:
1. Zhina Mohamadi
2. Erfan Abtahi
3. Zahra sadat Shayegh
4. Mehrafrin Ataei Kachouei
5. Amin Fakhar
6. Mohammad Mahdi Shirani
7. Mohammadhosein Malekian
8. Amir Zinatshoar
9. Mahdi Biglari
10. Fatemeh Rezaei
11. Armin Zarinkhat
12. Rozhina Mohammadi
This article has no evaluationsLatest version Jul 11, 2025
Using Machine Learning to Improve Cancer Diagnosis Accuracy Through Genetic Data Analysis

This article has 4 authors:
1. Bassam Elzaghmouri
2. Marwan Abo zanoneh
3. Feras Fares AL-Mashakbah
4. Saad Mamoun AbdelRahman Ahmed
This article has no evaluationsLatest version Jul 18, 2025
A Comparative Study of Ensemble Models for Thyroid Disease Prediction under Class Imbalance

This article has 2 authors:
1. Jiachen Zhong
2. Yiting Wang
This article has no evaluationsLatest version Jul 18, 2025

Listed in

Abstract

Article activity feed

Related articles

Implementation and Evaluation of Support Vector Machine-Based Models for Cancer Detection Using Multi-Omic Data: A Systematic Review

Using Machine Learning to Improve Cancer Diagnosis Accuracy Through Genetic Data Analysis

A Comparative Study of Ensemble Models for Thyroid Disease Prediction under Class Imbalance