Two-Stage Machine Learning Based Prediction of Thrombophilia Management

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Thrombophilia diagnosis and management rely on the nuanced interpretation of clinical history, risk factors, and laboratory data, yet significant variability exists in clinical practice due to subjective assessment, institutional differences, and limited consensus guidelines. In this retrospective study, we evaluated the use of a two-stage machine learning (ML) approach to predict thrombophilia diagnosis and subsequent anticoagulation treatment. The study included data (14 clinical and 26 laboratory features) from 496 patients evaluated at the University Hospital Bonn between 2019 and 2024. The hybrid nature of the diagnostic categories combining ordinal (including thrombophilia diagnosis severity and treatment recommendations) and categorical (e.g. antiphospholipid syndrome) classes necessitated a two-stage approach. XGBClassifier was used to distinguish categorical from ordinal classes, followed by an ordinal-specific XGBOrdinalV2 model. A sliding window approach improved classification performance across all ordinal categories reaching sensitivities above 89% across all ordinal classes. Feature importance analysis revealed that age at first thrombosis and antiphospholipid antibody status were key predictive variables. Of the 496 patients, 362 (73%) experienced no discrepancy between the ML-based predictions and the practitioner diagnosis and subsequent treatment recommendations. Re-evaluation of the remaining 134 patients revealed that, while the ML models correctly classified 36 patients, it underestimated or overestimated the thrombophilia severity in 71 and 27 patients, respectively. This study highlights the potential of interpretable ML models to support standardized thrombophilia management and improve diagnostic accuracy. Future prospective studies and external validation are needed to assess generalizability and clinical impact.

Article activity feed