Double-CRISPR Knockout Simulation (DKOsim): A Monte-Carlo Randomization System to Model Cell Growth Behavior and Infer the Optimal Library Design for Growth-Based Double Knockout Screens
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Advances in functional genomic technology, notably CRISPR using Cas9 or Cas12, now allow for large-scale double perturbation screens in which pairs of genes are inactivated, allowing for the experimental detection of genetic interactions (GIs). However, as it is not possible to validate GIs in high-throughput, there is no gold standard dataset where true interactions are known. Hence, we constructed a Double-CRISPR Knockout Simulation (DKOsim), which allows users to reproducibly generate synthetic simulation data where the single gene fitness effect of each gene and the interaction of each gene pair can be specified by the investigator. We adapted Monte-Carlo randomization methods to extend single knockout simulation methods to double knockout designs, which simulate the gene-gene interactions between all possible combinations of the input genes. Using DKOsim, we generated simulated datasets that closely resemble real double knockout CRISPR datasets in terms of Log Fold Change (LFC), GI distribution, and replicate correlation. We further inferred optimal CRISPR library designs by systematically investigating critical experimental parameters including depth of coverage, guide efficiency, and the variance of initial guide distribution. This simulation scheme will help to identify optimal computational methods for GI detection and aid in the design of future dual knockout CRISPR screens.
Author Summary
We designed DKOsim to simulate CRISPR double knockout screens by modeling cell division behavior with both single knockout (SKO) and double knockout (DKO) constructs via Monte-Carlo randomization samplers. Running DKOsim at large scale, we identified the asymptotic tuning points that optimize genetic interaction (GI) identification performance by the delta-LFC (dLFC) method compared to the simulated truth. We show that DKOsim is tunable to approximate actual dual-CRISPR knockout screening data. Comparing replicate correlation from DKOsim with experimentally generated data, DKOsim can be tuned based on users’ desires to reproduce a similar level of randomness to that observed in variety CRISPR screening conditions.