Development and Characterization of Microhaplotype Databases for Diverse Crop Species

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Microhaplotypes are short genomic segments that contain multiple tightly linked variants, providing multiallelic data that can enhance genetic resolution compared to traditional biallelic single nucleotide polymorphism (SNP) markers. Here, we present the creation and utilization of separate microhaplotype databases for eight crop species representing diverse genome sizes, ploidy levels, and breeding systems. We developed a standardized, species-agnostic pipeline for processing, filtering, and databasing microhaplotypes generated using the DArTag targeted genotyping platform. To enhance user accessibility, we developed a no-code, user-friendly application, HapApp, that uses an R Shiny front-end interface to allow breeders and researchers to add unique, standardized microhaplotype identities from raw DArTag reports and iteratively update the existing crop-specific database with the newly discovered microhaplotypes. Comparative analyses of these databases highlighted the advantages of microhaplotypes in capturing greater allelic diversity, resolving fine-scale population structures, and improving linkage map construction. This integrated framework provides a reproducible and scalable foundation for managing and exploiting microhaplotype data in plant breeding and genetic research, enabling robust cross-project comparisons and facilitating trait discovery in both simple and complex crop genomes, while enabling comparative genomics and cross-species functional transfer that accelerates genetic gains across all crop species.

Article activity feed