Diploid genome assembly of human fibroblast cell lines enables clone specific variant calling, improved read mapping and accurate phasing

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Human cell lines are fundamental tools in biomedical research and are widely used in disease modeling, drug development, and many other domains. Here, we present chromosome-level, phased diploid genome assemblies of two popular human cell lines: the BJ foreskin fibroblast line and the IMR-90 fetal lung fibroblast line. Our high-quality assemblies, generated using long-read and Hi-C sequencing data, reveal substantial structural variation, including more than 50,000 insertions, deletions, duplications, and inversions compared to the recent T2T-CHM13v2.0 reference. Our assemblies provide detailed maps of genetic variation, enabling more accurate variant calling and the ability to phase reads when using newly generated or historical sequencing data on these cell lines or their derivatives. All assemblies and associated data have been made available as a resource for the research community. We envision that diploid genome assembly will become a cornerstone approach for personalized medicine in the near future.

Article activity feed