Telomere-to-telomere, accurate, and gapless genome assembly (TTAGGA) of the Korean Jindo dog with a single-contig Y chromosome
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Complete canine reference genomes are essential for studies of structural variation, sex-chromosome evolution, and breed-specific architecture, yet existing references remain incomplete on the Y chromosome and retain internal gaps across multiple chromosomes. Here, we introduce TTAGGA (Telomere-to-Telomere, Accurate, Gapless Genome Assembly), a stricter completeness standard requiring chromosomes assembled end-to-end with zero internal gaps and Merqury consensus QV ≥ 50, and present Jindo1-G-TTAGGA, the first canine assembly to meet this standard, achieved under a stringent QV ≥ 60 threshold. From 813 Gb (∼340× coverage) of multi-platform data (PacBio HiFi ∼150×, ONT ultra-long ∼103×, parental Illumina ∼40× per parent) generated from a male Korean Jindo dog, trio binning with hifiasm produced two haplotype-resolved assemblies of 2,441.6 Mb (Hap1, maternal, chrX-carrying) and 2,340.5 Mb (Hap2, paternal, chrY-carrying). Both haplotypes are gap-free at every internal position, with canonical telomeric repeats verified at both ends of all 39 chromosomes (tidk), Merqury consensus QV of 78.0 (Hap1) and 76.8 (Hap2), trio switch-error rates below 0.13%, BUSCO completeness of 99.3% (Hap1) and 96.4% (Hap2; the lower value reflects absent X-linked orthologues in the Y-bearing haplotype), and Genome Continuity Index values of 98.2 (Hap1) and 94.7 (Hap2). Hap2 carries a single 21,255,890 bp gap-free Y chromosome with TTAGGG telomeric repeats at the q-arm terminus and an acrocentric satellite-rich p-arm, representing a 5.4-fold increase over the 3.94 Mb chrY of ROS_Cfam_1.0 and adding approximately 14 Mb of newly resolved Y-linked sequence; this corresponds to roughly 79% of the cytogenetically estimated 27 Mb full-length canine chrY. Jindo1-G-TTAGGA provides a chromosome-scale, haplotype-resolved, gap-free canine reference for studies of canine structural variation, sex-chromosome evolution, and canid phylogenomics.