Assessing the inference of single-cell phylogenies and population dynamics from CRISPR lineage recordings
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Multicellular organisms develop from a single cell by repeated rounds of cell division, differentiation, and death, which can be represented as a single-cell phylogenetic tree. Genetic lineage tracing allows us to investigate this development by tracking the ancestry of individual cells as populations grow and change over time. However, accurate reconstruction of the cell phylogeny and quantification of the corresponding phylodynamic parameters - cell division, differentiation and death rates - from this tracking data remains challenging and needs to be systematically evaluated.
We perform simulations and assess, using the Bayesian framework, the joint inference of time-scaled cell phylogenies and phylodynamic parameters from CRISPR lineage recordings with random or sequential edits. Principally, we characterize the inference improvements as the recorder capacity increases. We observe more accurate phylogenetic reconstruction from sequential compared to random recordings, but no substantial improvement in phylodynamic inference when using the additional information contained in the order of edits. Overall, we find that CRISPR lineage recordings carry a strong signal on the rates of cell division when appropriate models are used. However, we detect biases in the inferred rates of cell division and death under phylodynamic model misspecification, i.e. when fitting classic memoryless birth-death processes to synchronous cell divisions.
Moreover, for scenarios when cells differentiate into distinct types, we demonstrate that Bayesian phylodynamic analysis of sparse end-point measurements can resolve these cell differentiation trajectories by lineage and time. Under prototypical dynamics, we recover cell type-specific division and death rates, and cell type transition rates in over 80% of simulations.
Overall, this simulation study explores how much information on cellular development can be extracted from state-of-the-art genetic lineage tracing data using phylogenetic and phylodynamic methodology.
Author summary
Novel technologies provide means to trace the development of cell populations over time by introducing heritable and editable genetic sequences that record lineage information in the cells’ genome. Reconstructing a population’s history from such sequences sparsely sampled at a single time point is, however, computationally challenging. In this work, we use simulations and statistical inference to evaluate how accurately we can recover the relationships among cells and estimate the temporal dynamics of cell populations from genetic lineage tracing data generated from distinct recording systems, and compare their information content. Our results show that it is possible to quantify how cells divide, differentiate, and die based on such data, though certain statistical limitations remain. Addressing these limitations in future research will be essential for deepening our understanding of cell development in complex tissues and organisms, in both health and disease.