Convolutional neural networks quantify antibiotic resistance in Mycobacterium tuberculosis with diagnostic grade accuracy and predict treatment response

Sanjana G. Kulkarni
Anna G. Green
Brendon C. Mann
Samantha Malatesta
Suchitra Kulkarni Goodwin
Nina Cesare
Shandukani Mulaudzi
Noorjahn Rawoot
MIC-ML Consortium
Robin Warren
Karen R. Jacobson
Maha R. Farhat

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

There is considerable interest in training machine learning (ML) models on genomic data that achieve clinical grade diagnostic accuracy. Many successful ML models have been trained and validated on binary tasks because predicting biomedically relevant continuous variables is difficult to optimize. In this work, we present convolutional neural networks (CNNs) that predict minimum inhibitory concentrations (MICs) for eight antibiotics from Mycobacterium tuberculosis (Mtb) gene sequences. By including evolutionary information, protein biochemical properties, and data augmentation for rare variants, we build models that predict 89% of MICs within one drug concentration doubling. Although trained on ≤ 52% of the World Health Organization’s (WHO) drug resistance mutation catalogue data, the CNNs accurately predict the effects of 97% of its graded mutations. In a cohort of 373 patients with rifampicin-susceptible Mtb infections, higher CNN-predicted rifampicin MICs are associated with unfavorable treatment outcomes, suggesting that subtle differences in MIC below the resistance threshold are clinically relevant. These results demonstrate the value of encoding multiple dimensions of biological data in machine learning of function or cellular phenotypes and that domain knowledge-inspired machine learning models can be both interpretable and reach clinical grade accuracy.

Version published to 10.1101/2025.08.05.25333066 on medRxiv
Aug 7, 2025

Development and Validation of a Residual Deep Neural Network for Predicting Vancomycin Trough Concentration Categories in Pediatric Patients

This article has 7 authors:
1. Jin Hee Kim
2. Bongjin Lee
3. Wonjin Jang
4. You Sun Kim
5. Yonghyuk Jeon
6. Chunggang Jung
7. June Dong Park
This article has no evaluationsLatest version Jan 29, 2026
AI-Driven Two-Component System Classifier for Pediatric MDR Pathogens

This article has 6 authors:
1. Rajeswari Rajavel
2. Dharani Pandi
3. Grahalakshmi Arunagiri
4. Prithiga Veerasamy
5. Ganesh Irisappan
6. Gurudeeban Selvaraj
This article has no evaluationsLatest version Jan 9, 2026
Cross-Geographic Validation Demonstrates Universal Transcriptomic Signatures for Tuberculosis Diagnosis: A Machine Learning Study

This article has 1 author:
1. Siddalingaiah H.S.
This article has no evaluationsLatest version Dec 31, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Development and Validation of a Residual Deep Neural Network for Predicting Vancomycin Trough Concentration Categories in Pediatric Patients

AI-Driven Two-Component System Classifier for Pediatric MDR Pathogens

Cross-Geographic Validation Demonstrates Universal Transcriptomic Signatures for Tuberculosis Diagnosis: A Machine Learning Study