Systematic evaluation of single-cell foundation model interpretability reveals attention captures co-expression rather than unique regulatory signal

Ihor Kendiukhov

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background : Single-cell foundation models such as scGPT and Geneformer are increasingly used for gene regulatory network (GRN) inference, with attention-derived edge scores routinely interpreted as regulatory proxies. However, whether attention patterns capture causal regulatory relationships—rather than statistical associations already present in expression data—has not been systematically tested. This gap is critical because the NLP interpretability literature has established that attention weights do not reliably indicate feature importance, yet this finding has not been rigorously evaluated in biological foundation models. Results : We present a systematic evaluation framework comprising thirty-seven analyses, 153 statistical tests, four cell types (K562, RPE1, T cells, iPSC neurons), and two perturbation modalities (CRISPRi, CRISPRa). Attention patterns encode layer-specific biological structure—protein–protein interactions in early layers, transcriptional regulation in late layers—but this structure provides no incremental value for perturbation prediction: trivial gene-level baselines outperform both attention and correlation edges (AUROC 0.81–0.88 versus 0.70), pairwise edge scores add zero predictive contribution beyond gene-level features (∆AUROC = −0.0004 to −0.002; 559,720 observations), and causal ablation of regulatory heads produces no degradation across three independent intervention channels. The attention–correlation relationship is context-dependent (equal in K562, worse in CRISPRa, better in RPE1), but gene-level dominance is universal. Cell-State Stratified Interpretability (CSSI) addresses an attention-specific scaling failure, improving GRN recovery up to 1.85×. Conclusions : Attention patterns in single-cell foundation models encode structured biological information but not the causal regulatory signal they are commonly interpreted as capturing. The evaluation framework establishes reusable quality-control standards for the field, and CSSI provides an immediately deployable tool for improved edge recovery from heterogeneous populations.

Version published to 10.21203/rs.3.rs-9082476/v1 on Research Square
Mar 26, 2026

Sparse autoencoders reveal organized biological knowledge but minimal regulatory logic in single-cell foundation models: a comparative atlas of Geneformer and scGPT

This article has 1 author:
1. Ihor Kendiukhov
This article has no evaluationsLatest version Mar 25, 2026
Three Classes of Confound in Gene-Regulatory-Network Inference: A Systematic Audit and Open-Source Diagnostic Toolkit

This article has 1 author:
1. Ihor Kendiukhov
This article has no evaluationsLatest version Mar 26, 2026
BARNO: a batch-aware regulatory network optimization framework reveals a RAN-ENO1-NONO regulatory core in melanoma

This article has 1 author:
1. Xi Zhang
This article has no evaluationsLatest version Mar 24, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Sparse autoencoders reveal organized biological knowledge but minimal regulatory logic in single-cell foundation models: a comparative atlas of Geneformer and scGPT

Three Classes of Confound in Gene-Regulatory-Network Inference: A Systematic Audit and Open-Source Diagnostic Toolkit

BARNO: a batch-aware regulatory network optimization framework reveals a RAN-ENO1-NONO regulatory core in melanoma