Haplotype-resolved genome of autotetraploid alfalfa ( Medicago sativa ) Regen-SY27x uncovers large scale structural variation and resistance gene dynamics

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Polyploid genome assembly presents unique challenges due to extensive heterozygosity and complex haplotype structure. We report a haplotype-resolved, chromosome-scale assembly of Regen-SY27x, a genotype of autotetraploid alfalfa ( Medicago sativa ), which is widely used for genetic modification because of its excellent regenerative capacity in tissue culture. Using PacBio HiFi long reads, Omni-C scaffolding, and linkage map guided phasing, we generated a 3.2 GB assembly comprising four haplotypes with high contiguity and completeness. Kmer-based validation confirmed accurate haplotype separation, while linkage map integration and dotplot analysis identified and corrected chimeric scaffolds. Gene annotation yielded 221,688 protein-coding genes, with more than 99% assigned to pseudochromosomes. Repetitive elements accounted for 62.7% of the genome, dominated by long terminal repeat retrotransposons and a high fraction of Helitrons. The spatial enrichment of Helitrons within gene-dense distal chromosome arms underscores their pivotal role as key drivers of genomic innovation and gene family expansion. We identified 3,696 nucleotide-binding leucine-rich repeat R genes, with Toll/interleukin-1 receptor-like and Rx-type subclasses forming large tandem clusters across haplotypes. Comparative analyses revealed strong macrosyntenic conservation among Regen-SY27x and the publicly available Chinese alfalfa genomes but extensive structural variation both within Regen-SY27x haplotypes and between Regen-SY27x and the Chinese genotypes with tens of thousands of duplications, inversions, and translocations detected. These results demonstrate that a single autotetraploid individual captures extensive structural diversity, but individuals from different populations vary greatly. The Regen-SY27x assembly provides a foundational genomic resource for investigating polyploid genome evolution and identifying genetic variation relevant to biological and agronomic improvement in alfalfa.

Article Summary

This study presents the first chromosome-scale, haplotype-resolved genome assembly of the US alfalfa germplasm, Regen-SY27x, a key alfalfa genotype used widely for genetic engineering. We integrated HiFi long reads, Omni-C TM scaffolding, and linkage map-guided phasing to reconstruct all four haplotypes of this complex autotetraploid. Our results identified 221,688 protein-coding genes and reveal immense intra-individual structural variations dominated by small duplications. This high-quality reference serves as a foundational tool for the alfalfa community, enabling researchers to link complex structural diversity with agronomic traits and further enhance the biotechnological potential of this essential forage crop.

Article activity feed