Machine Learning-Based Prediction of Base Editor sgRNA fitness score

Alessandro Orro
Arianna Consiglio
Maria Ilaria Curci
Martina Scichilone
Faiza Hasin
Michele Minervini
Corrado Mencar
Gianluca De Bellis
Cinzia Cocola
Paride Pelucchi
Tommaso Selmi

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

CRISPR Base Editors enable precise single-nucleotide modifications, offering advantages over CRISPR-Cas9 knock-out in programming the desired genetic effect. However, in pooled screens targeting essential genes, discrepancies between expected genetic and phenotypic outcomes are frequent: single guide RNAs (sgRNAs), expected to be disruptive, often appear “phenotypically silent” likely due to inefficient editing rather than absence of functional impact. Here, we investigate if Cas9-based gene-level sgRNA depletion data can help to predict the probability that an sgRNA used in base editing will yield the expected fitness effect in pooled proliferation screening. We analysed proliferative effects (z-scores) from high-throughput CRISPR screens using cytosine Base Editors (BEs) and trained machine learning models to predict fitness effects. Our models integrate sequence features, edited strand, mutation type, predicted editing efficiencies and Cas9 gene essentiality scores. Our models discriminate BE sgRNAs that generate a strong phenotypic effect (depletion) in pooled screening, with AUC-ROC greater than 93% in different cell lines. We provide exhaustive analysis of feature importance highlighting the significant impact of sequence features for predicting BE-associated fitness effects. We found that editor-associated fitness predictions are primarily driven by sgRNA sequence features rather than predicted editing efficiency. Moreover, Cas9-derived gene essentiality partially contributes to predictions.

Version published to 10.21203/rs.3.rs-9224117/v1 on Research Square
Apr 10, 2026

Single nucleotide polymorphisms genotyping via an ultrasensitive CRISPR-based assay

This article has 15 authors:
1. Qiupeng Lin
2. Jiaying Huang
3. Linsha Ma
4. Ziyi Wang
5. Yuan Zhang
6. Juntao Wang
7. Jie Chen
8. Yuxin Yuan
9. Xiujie Liu
10. Chun Liu
11. Nan Chai
12. Zhiming Xiang
13. Jisen Zhang
14. Qinlong Zhu
15. Bin Hu
This article has no evaluationsLatest version Mar 27, 2026
TETGen: A Transcriptome-Guided Transformer for Targeted Molecule Generation

This article has 11 authors:
1. Jiahui Gao
2. Yanyun Cao
3. Rui Zhou
4. Yinzi Li
5. Jiale Wang
6. Wanyue Ding
7. Mingzhi Ma
8. Danchen Wu
9. Yan Zhou
10. Bin Yuan
11. Qichao Luo
This article has no evaluationsLatest version Mar 30, 2026
Structure-aware graph learning predicts RNA editability across tissues and species

This article has 4 authors:
1. Gal Oren
2. Zohar Rosenwasser
3. Michael Levitt
4. Erez Levanon
This article has no evaluationsLatest version Mar 19, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Single nucleotide polymorphisms genotyping via an ultrasensitive CRISPR-based assay

TETGen: A Transcriptome-Guided Transformer for Targeted Molecule Generation

Structure-aware graph learning predicts RNA editability across tissues and species