X-QSViT: Explainable Quantum-Self-Supervised Vision Transformer for Lung Classification


Abstract

Context: Histopathological image analysis remains critical for the early and accurate diagnosis of lung and colon cancers. However, class imbalance, scarcity of labeled data, computational inefficiency, and a lack of interpretability hinder the deployment of AI systems in clinical settings.

Objective: This study proposes a hybrid quantum-classical framework, H-QSVT-X, to improve classification accuracy, computational efficiency, and clinical explainability in lung and colon cancer diagnosis from histopathological images.

Methodology: The framework is built around a quantum-inspired, self-supervised Vision Transformer. It combines a Quantum GAN (QGAN), simulated on classical hardware, for class-imbalance mitigation; a Masked Autoencoder (MAE) and SimCLR for self-supervised feature extraction; and quantum-inspired self-attention for efficient long-range dependency modeling. Additional edge and texture analysis using depth-aware Canny and LBP features augments fine-grained tissue characterization, and Grad-CAM provides visual explainability.

Results: The model achieved 98.4% classification accuracy, 98.1% precision, 97.8% recall, and a 98.0% F1-score. QGAN improved the imbalance ratio from 0.6 to 1.0 (perfect balance), MAE attained a reconstruction loss of 0.024, and SimCLR yielded a contrastive loss of 0.012 with a latent similarity ratio of 7.58. The quantum attention mechanism improved precision by 4.2% and reduced computation time by 33%. Grad-CAM achieved 97.6% salient-region coverage, with a 15.3% increase in classification confidence.

Future Scope: Future work includes extending the model to multi-modal cancer analysis, integrating federated learning for privacy preservation, and validation on diverse clinical datasets to improve generalizability.
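The imbalance figures above use the minority-to-majority class-count ratio, where 1.0 means a perfectly balanced dataset. A minimal sketch of that metric (the function name is ours, not the paper's):

```python
from collections import Counter

def imbalance_ratio(labels):
    """Minority-to-majority class-count ratio; 1.0 means perfectly balanced."""
    counts = Counter(labels)
    return min(counts.values()) / max(counts.values())

# A 60-vs-100 split gives the pre-augmentation ratio of 0.6;
# equal counts give the post-QGAN ratio of 1.0.
before = imbalance_ratio([0] * 60 + [1] * 100)
after = imbalance_ratio([0] * 100 + [1] * 100)
```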
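SimCLR's contrastive objective is the NT-Xent loss: two augmented views of the same image are pulled together in embedding space while all other images in the batch act as negatives. A minimal NumPy sketch, assuming cosine similarity over L2-normalized embeddings and a temperature of 0.5 (the hyperparameters and function name are illustrative, not taken from the paper):

```python
import numpy as np

def nt_xent_loss(z1, z2, temperature=0.5):
    """NT-Xent (normalized temperature-scaled cross-entropy) loss.

    z1, z2: (N, D) embeddings of two augmented views of the same N images.
    """
    n = z1.shape[0]
    z = np.concatenate([z1, z2], axis=0)              # (2N, D)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # L2-normalize
    sim = z @ z.T / temperature                       # scaled cosine similarities
    np.fill_diagonal(sim, -np.inf)                    # exclude self-similarity
    # The positive partner of sample i is i+N (and vice versa).
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(n)])
    log_prob = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))
    return -log_prob[np.arange(2 * n), pos].mean()

rng = np.random.default_rng(0)
z1 = rng.normal(size=(4, 8))
loss_aligned = nt_xent_loss(z1, z1 + 0.01)            # near-identical views: low loss
loss_random = nt_xent_loss(z1, rng.normal(size=(4, 8)))  # unrelated views: higher loss
```

Minimizing this loss drives the latent similarity of positive pairs up relative to negatives, which is what the latent similarity ratio reported above summarizes.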
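Grad-CAM produces its saliency map by weighting each convolutional feature map with the globally averaged gradient of the target class score, then applying a ReLU so only positive evidence remains. A minimal sketch over raw NumPy arrays (the shapes and function name are assumptions; the paper's choice of layer is not specified here):

```python
import numpy as np

def grad_cam(feature_maps, gradients):
    """Grad-CAM heatmap from one conv layer.

    feature_maps: (C, H, W) activations of the chosen layer
    gradients:    (C, H, W) d(class score)/d(activation)
    Returns an (H, W) map normalized to [0, 1].
    """
    weights = gradients.mean(axis=(1, 2))             # global-average-pool the grads
    cam = np.tensordot(weights, feature_maps, axes=1) # channel-weighted sum -> (H, W)
    cam = np.maximum(cam, 0.0)                        # ReLU: keep positive evidence
    if cam.max() > 0:
        cam = cam / cam.max()                         # normalize for overlay
    return cam

fmaps = np.random.default_rng(0).random((3, 4, 4))
grads = np.ones_like(fmaps)
cam = grad_cam(fmaps, grads)
```

Thresholding a map like this against pathologist-annotated regions is one common way to quantify the salient-region coverage reported above.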
