Efficient Assessment of the Risk of Elevated Aspartate Aminotransferase Using Machine Learning Methods Based on Routine Biochemical Markers

Natalya Maxutova
Akmaral Kassymova
Kuanysh Kadirkulov
Aisulu Ismailova
Gulkiz Zhidekulova
Zhanar Azhibekova
Jamalbek Tussupov
Quvvatali Ortikovich Rakhimov
Zhanat Kenzhebayeva

Abstract

This study proposes an interpretable and high-accuracy ensemble learning framework for predicting aspartate aminotransferase (AST) levels using open-access biomedical datasets. Using a structured pipeline of preprocessing, feature selection, and model ensembling, we evaluated a series of regression algorithms including Random Forest, XGBoost, CatBoost, and three stacking architectures. The best-performing ensemble (Stacking_v2) achieved R² = 0.98 and RMSE = 1.23 on the validation set, surpassing conventional and single-model approaches. Feature importance was assessed using SHAP values, mutual information, and correlation analysis, revealing that gamma-glutamyl transferase, ferritin, and anthropometric markers had the greatest predictive impact. The proposed stacking-based model demonstrates excellent generalization, robust calibration, and high interpretability, and can serve as a benchmark for algorithmic evaluation in medical data modeling. The work highlights the effectiveness of ensemble regression and interpretable AI in real-world clinical prediction tasks using routine biomarkers.
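The two validation metrics quoted in the abstract, R² and RMSE, can be illustrated with a minimal sketch in plain Python. The function names `rmse` and `r_squared` and the sample values are assumptions made for illustration only; they are not the authors' code or data, and the numbers they produce are unrelated to the reported results.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all observed values are equal")
    return 1.0 - ss_res / ss_tot


# Hypothetical AST values (U/L), for illustration only; not data from the study.
observed = [20.0, 25.0, 30.0, 35.0]
predicted = [21.0, 24.0, 31.0, 34.0]

print("RMSE:", rmse(observed, predicted))
print("R^2:", r_squared(observed, predicted))
```

RMSE is expressed in the same units as the target, so it reads directly as a typical prediction error, while R² is unitless and describes the share of variance in the observed values that the predictions account for.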

Version published to 10.20944/preprints202506.2273.v1
Jun 27, 2025

Related articles

Evaluation of Classical and Ensemble Machine Learning Algorithms for Thyroid Cancer Diagnosis: A Comparative Evaluation

This article has 1 author:
1. Kamorudeen Amuda
This article has no evaluations. Latest version: Jul 17, 2025
Development of a Machine Learning-Based Interface for Insulin Dependency Prediction Using Clinical Data

This article has 7 authors:
1. Om Pritam Das
2. B. V. S. Lakshmi
3. M. Vaishnavi
4. Mohd Arif Uddin
5. Aparna Srikan
6. Vinod Kumar Yata
7. Sarad Pawar Naik Bukke
This article has no evaluations. Latest version: Jul 1, 2025
Auto-MedCalc: Automated Biomarkers Discovery and Risk Score Generation with AI Agents

This article has 3 authors:
1. Sirui Ding
2. Sanchita Bhattacharya
3. Atul J. Butte
This article has no evaluations. Latest version: Jul 16, 2025
