Predictive Modeling of Acute Graft-versus-Host-Disease using Machine Learning on Immune Cell and Cytokine Profiles at Engraftment

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

Acute Graft-versus-Host-Disease (aGvHD) is a major immune complication following allogenic hematopoietic stem cell transplantation (Allo-HSCT), initiated by conditioning regimen-associated tissue damage. It involves the complex interplay of immune cells and cytokines. Our study aims to leverage machine learning (ML) algorithms on the immune and cytokine profile of Allo-HSCT recipients to develop biomarker-based classification models to predict the onset of aGvHD at the time of engraftment.

Materials and Methods

Seventy patients diagnosed with hematological disorders who had undergone I st Allo-HSCT were recruited from All India Institute of Medical Sciences, New Delhi, India. Peripheral blood (PB) was collected from the patients at the time of engraftment, and the immune cell subtypes and cytokine profiles were analyzed using flow cytometry and ELISA respectively. The individual cell counts were then processed using basic ML models, including support vector classifier with RBF kernel, Decision Tree, and Random Forest, chosen for their mathematical simplicity and feature importance advantage of Decision Trees and Random Forests. Various data settings were utilized in the study: combined immune and cytokine counts, immune cell counts only, cytokine counts only, T-cell counts only, NK cell counts only, dendritic cell counts only, and B-cell counts only. These configurations were selected to investigate how different data sets impact the prediction of aGvHD before its onset.

Results

At the engraftment flow cytometric analysis of reconstituted lymphocytes in patients who developed aGvHD revealed that there was a remarkable decrease in the ratio of CD4 + /CD8 + T-cell and Tregs, with an increase in the cytotoxic regulatory NK-cell, dendritic cells, and B-cell. The levels of pro-inflammatory cytokines (IFN-γ, IL-1β, IP-10, TNF-α, IL-17α, IL-12p70, MIP-1α, MIP-1β, RANTES), and Th17-and Th1-cells were elevated with consequent decline of the levels of anti-inflammatory cytokine IL-10, IL-2, IL-4 and Th2-, Th9-cells. Machine learning based on 48 parameters [all immune cell subsets n=34 and all cytokines (n=14)]. The correlation heat map shows a higher correlation of aGvHD with the cytokine profile with or without immune cells (accuracy: 1), T-cell alone (accuracy: 0.96); NK-cell alone (accuracy: 0.93); dendritic cells alone (accuracy: 0.90), B-cell alone (accuracy: 0.86).

Conclusion

The current models classify perfectly, indicating the potential for a ML algorithm in predicting the onset of aGvHD. However, a study with a larger sample size is required to validate these classification models and mitigate the risk of overfitting observed due to the consistently high performance. The study also highlights the potential of cytokine profiles as a viable alternative to T-cell counts, as evidenced by the correlation heat map and classifier models. These findings provide valuable insights into dataset requirements and future directions for integrating ML models into aGvHD prediction.

Article activity feed