Enhanced Language Models for Predicting and Understanding HIV Care Disengagement: A Case Study in Tanzania

Waverly Wei
Junzhe Shao
Rita Qiuran Lyu
Rebecca Hemono
Xinwei Ma
Joseph Giorgio
Zeyu Zheng
Feng Ji
Xiaoya Zhang
Emmanuel Katabaro
Matilda Mlowe
Amon Sabasaba
Caroline Lister
Siraji Shabani
Prosper Njau
Sandra I. McCoy
Jingshen Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Sustained engagement in HIV care and adherence to ART are crucial for meeting the UNAIDS "95-95-95" targets. Disengagement from care remains a significant issue, especially in sub-Saharan Africa. Traditional machine learning (ML) models have had moderate success in predicting disengagement, enabling early intervention. We developed an enhanced large language model (LLM) fine-tuned with electronic medical records (EMRs) to predict individuals at risk of disengaging from HIV care in Tanzania. Using 4.8 million EMR records from the National HIV Care and Treatment Program (2018–2023), we identified risks of ART non-adherence, non-suppressed viral load, and loss to follow-up. Our enhanced LLM may outperform traditional machine learning models and zero-shot LLMs. HIV physicians in Tanzania evaluated the model’s predictions and justifications, finding 65% alignment with expert assessments, and 92.3% of the aligned cases were considered clinically relevant. This model can support data-driven decisions and may improve patient outcomes and reduce HIV transmission.

Version published to 10.21203/rs.3.rs-6608559/v2 on Research Square
Jan 14, 2026
Version published to 10.21203/rs.3.rs-6608559/v1 on Research Square
May 8, 2025

Performance Evaluation of Large Language Models in Real-World Perinatal Medication Consultations: A Cross-Sectional Study

This article has 4 authors:
1. RAN WANG
2. Yifan Li
3. Xuewei Feng
4. Xin Feng
This article has no evaluationsLatest version Feb 4, 2026
A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies

This article has 11 authors:
1. Panagiotis Kaliosis
2. Adithya V. Ganesan
3. Oscar N.E. Kjell
4. Whitney Ringwald
5. Scott Feltman
6. Melissa A. Carr
7. Dimitris Samaras
8. Camilo Ruggero
9. Benjamin J. Luft
10. Roman Kotov
11. H. Andrew Schwartz
This article has no evaluationsLatest version Dec 25, 2025
Machine Learning-Based Classification of HIV Viral Load Suppression in Low-Resource Settings

This article has 4 authors:
1. Abraham Keffale Mengistu
2. Aynadis Worku Shime
3. Muluken Belachew Mengistie
4. Andualem Enyew Gedefaw
This article has no evaluationsLatest version Jan 6, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Performance Evaluation of Large Language Models in Real-World Perinatal Medication Consultations: A Cross-Sectional Study

A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies

Machine Learning-Based Classification of HIV Viral Load Suppression in Low-Resource Settings