Simplifying the Diagnosis of Tuberculous Pleural Effusion: A Machine Learning Analysis of ADA and Lymphocyte Percentage in 1134 Patients

Xiaomei Hai
Bofei Liu
Yuankui Chu
Sujie Zheng
Xiuqin Chang
Guoren Ma
Jing Liu
Meng Hao
Tao Liu
Xiangjun Ye

Abstract

Background: Diagnosing tuberculous pleural effusion (TPE) is often complicated by features that overlap with other causes of pleural effusion. Adenosine deaminase (ADA) and lymphocyte percentage (LYM%) are widely used biomarkers, but the diagnostic value of each in isolation remains limited.

Methods: We retrospectively enrolled 1134 patients with confirmed pleural effusion (615 TPE, 519 non-TPE) from two Chinese hospitals between 2021 and 2025. Nine pleural fluid parameters were analyzed. The dataset was divided into training (70%), validation (15%), and test (15%) sets. We developed four machine learning (ML) models: logistic regression (LR), random forest (RF), Light Gradient Boosting Machine (LightGBM), and support vector machine (SVM). We compared their diagnostic performance with that of logistic models based on ADA alone, LYM% alone, and their combination. The DeLong test was used to compare areas under the ROC curve (AUCs).

Results: All pleural fluid parameters, including red blood cells, differed significantly between the TPE and non-TPE groups (p < 0.05). The RF model achieved the highest AUC (0.946), followed by LightGBM (0.945), SVM (0.945), and LR (0.934). ADA + LYM% (AUC = 0.928) outperformed ADA alone (0.815) and LYM% alone (0.905), and showed no significant difference from the full-feature RF model (p = 0.181). Both ADA and LYM% were strong positive predictors in all models.

Conclusions: A minimal logistic model based on ADA and LYM% demonstrates excellent diagnostic performance for TPE, comparable to that of more complex machine learning models. This simple and interpretable approach is well suited to routine clinical application.

Trial registration: Not applicable. This retrospective diagnostic study was not registered as a clinical trial.
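The 70/15/15 partition and the AUC metric mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the abstract does not say whether the split was random or stratified, so a plain seeded shuffle is assumed here, and the function names `split_70_15_15` and `auc` are hypothetical. The AUC is computed as the fraction of (positive, negative) pairs in which the positive case receives the higher score, which is one standard way to define it.

```python
import random


def split_70_15_15(records, seed=0):
    """Shuffle records and split them into training, validation, and test sets.

    Assumption: a simple random split; the abstract does not specify
    whether the authors stratified by diagnosis.
    """
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    n_train = round(0.70 * n)
    n_val = round(0.15 * n)
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # remainder, roughly 15%
    return train, val, test


def auc(scores, labels):
    """AUC as the share of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Split sizes for a cohort of 1134 patient indices (placeholders, not data).
train, val, test = split_70_15_15(range(1134))
print(len(train), len(val), len(test))

# Toy scores and labels, made up purely to exercise the function.
print(auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))
```

With 1134 records this rounding scheme gives 794, 170, and 170 cases. The pairwise AUC loop is quadratic in cohort size, which is acceptable at this scale; a rank-based formulation would be the usual choice for larger datasets.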

Version published to 10.21203/rs.3.rs-7278771/v1 on Research Square
Sep 30, 2025

Related articles

Diagnostic role of cancer ratio in suspected malignant pleural effusion

This article has 8 authors:
1. Vijaya Lakshmi V V
2. Sake Vasavi Sai
3. Nagaraja C L
4. Vivek U
5. Deepa A S
6. Arun B J
7. Mohan J
8. Basavaraju T J
This article has no evaluations. Latest version: Sep 26, 2025

An interpretable machine learning model for assessing the risk of Talaromycosis in HIV patients lacking skin lesions

This article has 14 authors:
1. Jiaguang Hu
2. Wenming He
3. Qun Tian
4. Yanqiu Lu
5. Peng Zhang
6. Jinyu Qin
7. Chuan Qin
8. Ying Wu
9. Cheng Huang
10. Xu Li
11. Luhuai Feng
12. Linghua Li
13. Zhongsheng Jiang
14. Jianning Jiang
This article has no evaluations. Latest version: Oct 8, 2025

A multi-stage machine learning framework for stepwise prediction of tuberculosis treatment outcomes: Integrating gradient boosted decision trees and feature-level analysis for clinical decision support

This article has 4 authors:
1. Linfeng Wang
2. Susana Campino
3. Taane G. Clark
4. Jody E. Phelan
This article has no evaluations. Latest version: Oct 19, 2025
