Mammo-SAE: Interpreting Breast Cancer Concept Learning with Sparse Autoencoders
Abstract
Interpretability is critical in high-stakes domains such as medical imaging, where understanding model decisions is essential for clinical adoption. In this work, we introduce Sparse Autoencoder (SAE)-based interpretability to breast imaging by analyzing Mammo-CLIP, a vision-language foundation model pretrained on large-scale mammogram image–radiology report pairs. We train a patch-level Mammo-SAE on Mammo-CLIP visual features to identify and probe latent neurons associated with clinically relevant breast concepts such as mass and suspicious calcification. We show that the top-activated class-level latent neurons often align with ground-truth regions, and we also uncover several confounding factors influencing the model’s decision-making process. Furthermore, we demonstrate that fine-tuning Mammo-CLIP leads to larger concept separation in the latent space, improving both interpretability and predictive performance. Our findings suggest that sparse latent representations offer a powerful lens into the internal behavior of breast foundation models. The code will be released at https://krishnakanthnakka.github.io/MammoSAE/.
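To make the described setup concrete, the sketch below shows a generic patch-level sparse autoencoder trained on frozen vision features, in the spirit of the Mammo-SAE pipeline outlined in the abstract. All names, dimensions, and hyperparameters (e.g. PatchSAE, the expansion factor, the L1 coefficient) are illustrative assumptions and not the authors' exact implementation.

```python
# Minimal sketch of a patch-level sparse autoencoder (SAE) over frozen
# Mammo-CLIP patch features. Dimensions and hyperparameters are assumptions.
import torch
import torch.nn as nn


class PatchSAE(nn.Module):
    def __init__(self, d_model: int = 768, expansion: int = 8):
        super().__init__()
        d_latent = d_model * expansion           # overcomplete latent dictionary
        self.encoder = nn.Linear(d_model, d_latent)
        self.decoder = nn.Linear(d_latent, d_model)

    def forward(self, x: torch.Tensor):
        # x: (N, d_model) patch-level features from a frozen vision encoder
        z = torch.relu(self.encoder(x))          # sparse, non-negative latent codes
        x_hat = self.decoder(z)                  # reconstruction of the input features
        return x_hat, z


def sae_loss(x, x_hat, z, l1_coeff: float = 1e-3):
    # Reconstruction error plus an L1 penalty encouraging sparse latents.
    recon = (x - x_hat).pow(2).mean()
    sparsity = z.abs().mean()
    return recon + l1_coeff * sparsity


# Hypothetical single training step on cached patch features.
sae = PatchSAE()
opt = torch.optim.Adam(sae.parameters(), lr=1e-4)
patch_feats = torch.randn(4096, 768)             # stand-in for Mammo-CLIP patch features
x_hat, z = sae(patch_feats)
loss = sae_loss(patch_feats, x_hat, z)
loss.backward()
opt.step()
```

In such a setup, individual latent dimensions of `z` can then be ranked by their activation on patches from a given class (e.g. mass) and compared against ground-truth regions, which is the kind of probing the abstract refers to.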