simCRISPR: Modeling Experimental Complexity in Pooled CRISPR Screens

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Pooled CRISPR screens are widely used to investigate gene function and uncover genetic interactions. However, benchmarking computational methods for detecting gene-by-environment (GxE) interactions remains difficult because ground truth is rarely available and existing simulation tools are not designed for GxE screening contexts. To address this, we developed simCRISPR , a flexible simulation framework for generating pooled CRISPR screen data under complex experimental designs. Using simulated datasets informed by empirical CRISPR screen designs, we evaluated commonly used analysis methods, comparing normalization strategies based on safe-harbor versus non-targeting sgRNAs and assessing empirical log 2 FC thresholds as an additional effect-size criterion. We found that safe-harbor-based normalization improved interaction detection when DNA damage-related effects were present, particularly when combined with empirical log 2 FC thresholding for DESeq2. Application of this workflow to a doxorubicin GxE screen further showed that safe-harbor-based normalization reduced bias in log 2 FC distributions and identified additional biologically relevant candidates. simCRISPR is available at https://github.com/bachergroup/simCRISPR .

Article activity feed