Estimating the completeness of large-scale single-cell sequencing projects
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
During embryonic development, cells undergo differentiation into highly specialized cell types. Capitalizing on single-cell RNA sequencing, many initiatives and substantial resources have been established for cataloguing these differentiated cell types by their transcriptomic profiles. Despite the extensive efforts to profile various organs and their cellular compositions, we lack metrics to assess the completeness of the sequencing projects. In this cellular biodiversity analysis, we leveraged the increasingly available single-cell data together with statistical methods, originally developed for assessing the species richness of ecological communities, to estimate the cellular diversity of any organ based on current data from single-cell profiling technologies. Deriving from such cellular richness estimates, we established a practical statistical framework that enables reliable assessment of the completeness of any large-scale single-cell profiling projects, after which additional sequencing efforts do not anymore reveal new insights into an organ’s cellular composition. Such estimates can serve as stoppage-points for the ongoing sequencing projects, hence guiding a more cost-efficient completion of the profiling of various human tissues.