Machine Learning for Differentiating Dengue from Chikungunya in Northern Brazil

Victor Hugo Ovani Marchetti

Abstract

Purpose: Dengue and chikungunya, viral diseases spread by Aedes mosquitoes, are prevalent in northern Brazil, where overlapping symptoms hinder accurate diagnosis. This study aims to develop machine learning models that differentiate the two diseases, supporting early management and reducing underreporting in resource-limited settings.

Methods: We used clinical symptom data from the Brazilian Notifiable Diseases Information System (SINAN, 2021–2023) to train machine learning models. The dataset comprised 4,874 PCR-confirmed cases among adults (18–59 years), split by notification year into a training set (2021–2022, n=2,437) and a testing set (2023, n=2,437). Five algorithms (Random Forest, XGBoost, LightGBM, CatBoost, and TabPFN 2) were evaluated using AUC-ROC, precision, and recall. Feature importance was analyzed with SHAP and Boruta.

Results: The Random Forest model performed best, achieving an AUC-ROC of 0.782, precision of 0.734, and recall of 0.733 for dengue. Setting the classification threshold to the training prevalence (62.4%) optimized performance, supporting early triage in primary care.

Conclusion: Machine learning can improve the sensitivity and efficiency of dengue-chikungunya differential diagnosis. Because they rely only on clinical symptoms, these models offer a practical, cost-effective tool for resource-constrained settings and can improve arbovirus management.
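The threshold adjustment described in the Results can be illustrated with a minimal sketch. It is not the authors' code. It assumes that the reported prevalence refers to the share of dengue cases (coded 1) in the training set and that the threshold is applied to the model's predicted probability of dengue. The function names and the toy labels and probabilities are hypothetical and chosen only so the arithmetic is easy to follow.

```python
from typing import List, Tuple


def prevalence(labels: List[int]) -> float:
    """Share of positive cases (assumed here: dengue = 1) in a list of labels."""
    return sum(labels) / len(labels)


def classify(probs: List[float], threshold: float) -> List[int]:
    """Label a case positive when its predicted probability reaches the threshold."""
    return [1 if p >= threshold else 0 for p in probs]


def precision_recall(y_true: List[int], y_pred: List[int]) -> Tuple[float, float]:
    """Precision and recall for the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical toy data, not taken from SINAN.
train_labels = [1, 1, 1, 0, 0, 1, 0, 1]            # earlier years (training)
test_labels = [1, 0, 1, 1, 0, 0]                   # later year (testing)
test_probs = [0.90, 0.55, 0.70, 0.60, 0.30, 0.65]  # predicted P(dengue)

# Use the training prevalence as the cut-off instead of the default 0.5.
threshold = prevalence(train_labels)
preds = classify(test_probs, threshold)
precision, recall = precision_recall(test_labels, preds)

print(f"threshold={threshold:.3f} precision={precision:.3f} recall={recall:.3f}")
```

The threshold is derived from the training labels only, so choosing it does not draw on any information from the held-out test year.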

Version published to 10.21203/rs.3.rs-6702129/v1 on Research Square
May 21, 2025

Related articles

Predicting Acute Respiratory Infection Risk in Under–Five Children Using Machine Learning: Evidence from Bangladesh

This article has 7 authors:
1. Samrat Kumar Dev Sharma
2. Md. Yusuf Hossain Ador
3. Md. Rokunuzzaman
4. Md. Kamruzzaman
5. Jakir Hossain
6. Mahmud Hossen
7. Futanta Chakma
This article has no evaluations. Latest version: Jul 11, 2025

Integrating Google Trends and Hybrid Statistical-Machine Learning Models for Dengue Surveillance in an Inland Vietnamese Province: A 9-Year Evaluation with Media Bias Assessment

This article has 2 authors:
1. Dang Anh Tuan
2. Pham Vu Nhat Uyen
This article has no evaluations. Latest version: Jun 5, 2025

Staged Identification of CAP in Fever Patients Across Epidemic Environments: Modeling & Validation

This article has 7 authors:
1. Gao Ziheng
2. Chen Tengfei
3. Ha Yanxiang
4. Shi Yifan
5. Xu Xiaolong
6. Li Bo
7. Liu Qingquan
This article has no evaluations. Latest version: Jun 29, 2025
