ChestX-VQA: AI Tool for Multimodal Chest X-ray Analysis and Clinical QA
Abstract
Chest radiographs are central to medical diagnosis, but accurate interpretation often requires combining the image with relevant clinical context. This work presents a chatbot based on a multimodal large language model (M-LLM) that performs visual question answering (VQA) over chest X-ray images and associated clinical text. The publicly available VQA-RAD dataset, which contains chest radiographs with corresponding question–answer pairs, is used for evaluation. The study compares GIT, CLIP, BLIP, FLAVA, and VLIT on overall BERTScore, readability, and response time. In addition to these automatic metrics, medical practitioners assess the clinical relevance and accuracy of the responses. The combination of GIT and T5 performs best, with an overall BERTScore of 0.92. The resulting chatbot lets users upload chest radiographs together with clinical notes and receive clear, context-sensitive responses.