Diffusion Models vs. DCGANs for Class-Imbalanced Lung Cancer CT Classification: A Comparative Study

Masoud Tabibian
Tahereh Razmpour
Rajib Saha

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Effective lung cancer detection from CT scans remains critically challenged by class imbalance where benign and normal cases are underrepresented, leading to biased machine learning models with reduced sensitivity for minority classes and potentially missed diagnoses in cancer screening applications. We present a comprehensive comparative analysis of Diffusion Models and Deep Convolutional Generative Adversarial Networks (DCGANs), both incorporating modern architectural enhancements including spectral normalization, self-attention mechanisms, and conditional generation, for addressing class imbalance in lung cancer CT classification. Using the IQ-OTH/NCCD dataset comprising 1,097 CT images across normal, benign, and malignant categories with statistical validation across 10 independent runs, we evaluated both approaches through quantitative image quality metrics (Fréchet Inception Distance, Kullback-Leibler divergence, Kernel Inception Distance, and Inception Score) and downstream classification performance. While Diffusion models consistently outperformed DCGANs across most image quality measures, the clinical significance was confirmed through task-based validation. Both generative approaches successfully addressed class imbalance: DCGAN-augmented datasets achieved overall accuracy of 0.9760 ± 0.0116 with benign recall improvement from 0.833 to 0.933, while Diffusion-augmented datasets reached superior performance of 0.9959 ± 0.0068 with perfect benign recall (1.000 ± 0.000). Critically for cancer screening where false negatives carry severe consequences, Diffusion maintained the highest malignant detection sensitivity (0.997 ± 0.008) with substantially lower performance variance, demonstrating more consistent synthetic data quality. These findings establish that while both modern architectures can mitigate class imbalance, Diffusion models’ superior recall performance and lower variability position them as the preferred approach for high-stakes clinical applications, demonstrating that ultimate validation must prioritize downstream clinical task performance over image quality metrics alone.

Version published to 10.1101/2025.11.19.689331 on bioRxiv
Nov 20, 2025

Pixels to Prognosis: ResNet50 Hyper-parameter Analysis for Predicting Benign vs. Malignant Breast Cancer from Biopsy Scans

This article has 1 author:
1. Naman Dhariwal
This article has no evaluationsLatest version Jan 27, 2026
Improving Normal/Abnormal and Benign/Malignant Classifications in Mammography with ROI-Stratified Deep Learning

This article has 3 authors:
1. Kenji Yoshitsugu
2. Kazumasa Kishimoto
3. Tadamasa Takemura
This article has no evaluationsLatest version Dec 31, 2025
Adaptive Bandelet Transform and Transfer Learning for Geometry‑Aware Thyroid Cancer Ultrasound Classification

This article has 4 authors:
1. Yassine Habchi
2. Hamza Kheddar
3. Mohamed Chahine Ghanem
4. Jamal Hwaidi
This article has no evaluationsLatest version Dec 18, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Pixels to Prognosis: ResNet50 Hyper-parameter Analysis for Predicting Benign vs. Malignant Breast Cancer from Biopsy Scans

Improving Normal/Abnormal and Benign/Malignant Classifications in Mammography with ROI-Stratified Deep Learning

Adaptive Bandelet Transform and Transfer Learning for Geometry‑Aware Thyroid Cancer Ultrasound Classification