Single Cell Foundation Models Evaluation (scFME) for In-Silico Perturbation

James Boylan
Elizaveta Solovyeva
Theophile Bouiller
Xiong Liu
Sebastian Hoersch
Bulent Ataman
Jeremy Jenkins
Murthy Devarakonda

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Foundation models pre-trained on large single-cell RNA atlases offer a compelling alternative to in-vitro experimentation for understanding gene regulatory networks and conducting gene perturbation analyses, with significant implications for target identification. Numerous foundation models have been developed, building upon early efforts such as Geneformer and scGPT. Hyperparameter optimization also results in multiple variants which require comparative analysis. Current benchmarking approaches focus on feature-based assessments or intuitive biological and statistical tasks, which may not align with the models’ training objectives. A recent study proposed a systematic benchmarking framework; however, its scope was limited to pre-trained (zero-shot) models. To address these limitations, we propose Single-Cell Foundation Model Evaluation (scFME)—a systematic method designed to benchmark fine-tuned foundation models for insilico perturbation (ISP). scFME ensures comprehensive and robust assessment by requiring sufficient separation between control and perturbed cells at the outset and by quantifying ISP accuracy against zero and random perturbation baselines. Furthermore, scFME enables exploration of model performance across distinct gene categories, facilitating biological interpretation and functional relevance. Using this framework, we evaluated several commonly used models (and some of their variants) and demonstrated that the methodology effectively characterizes their performance in ISP studies. Our results position scFME as a versatile and rigorous methodology for evaluating and comparing current and future foundation models.

Version published to 10.1101/2025.09.22.677811 on bioRxiv
Sep 24, 2025

Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

This article has 2 authors:
1. Xiuwei Zhang
2. Yuqi Cheng
This article has no evaluationsLatest version Dec 10, 2025
Comprehensive benchmarking of RNA velocity methods across single-cell datasets

This article has 6 authors:
1. Jin Liu
2. Yida Wu
3. Chuihan Kong
4. Xu Liao
5. Zhixiang Lin
6. Xiaobo Sun
This article has no evaluationsLatest version Feb 2, 2026
Accurate, scalable, and unified single-cell atlas integration with scBIOT

This article has 2 authors:
1. Haihui Zhang
2. Peiwu Qin
This article has no evaluationsLatest version Jan 19, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

Comprehensive benchmarking of RNA velocity methods across single-cell datasets

Accurate, scalable, and unified single-cell atlas integration with scBIOT