Evaluation of the ‘qXR’ Software for the Detection of Pulmonary Nodules, Cardiomegaly and Pleural Effusion: A Comparative Analysis in a Latin American General Hospital
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background/Objectives: AI-based tools for chest radiograph interpretation are increasingly used as decision-support systems, yet their performance must be validated in local clinical environments before deployment. This study evaluated the diagnostic performance of qXR (Qure.ai, v3.2) for detecting pulmonary nodules, cardiomegaly, and pleural effusion in adult patients at Hospital Clínica Bíblica, San José, Costa Rica. Methods: Three radiologists independently interpreted 225 chest radiographs, providing the reference standard. qXR outputs were compared against radiologist assessments for each finding. The sensitivity, specificity, Cohen’s kappa, and area under the ROC curve (AUC) were calculated. Due to the convenience-stratified sampling design, predictive values were not used for clinical interpretation. Results: For pulmonary nodules, qXR achieved a sensitivity of 0.71, specificity of 0.90, Cohen’s kappa of 0.51, and AUC of 0.80. For pleural effusion, sensitivity and specificity were both 0.86, with a kappa of 0.63 and AUC of 0.86. Cardiomegaly showed the lowest agreement, with a sensitivity of 0.64, specificity of 0.91, kappa of 0.57, and AUC of 0.77. Conclusions: qXR demonstrated moderate diagnostic agreement with radiologist assessments for pulmonary nodules and pleural effusion, and lower agreement for cardiomegaly under local imaging conditions. These results reflect technical concordance between the AI system and individual radiologists and do not constitute evidence of clinical utility or real-world impact. Context-specific validation is essential prior to integrating AI tools into routine radiological workflows.