Genome assembly and diagnostic DNA markers for sex of the largest freshwater fish in North America, the white sturgeon ( Acipenser transmontanus )

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Sturgeon and paddlefish represent some of the most early diverging branches of ray-finned fishes and have undergone at least one global and several lineage-specific whole genome duplication events. White sturgeon ( Acipenser transmontanus ), the largest freshwater fish in North America, have experienced at least two rounds of whole genome duplication, and may exhibit both di- and multi-valent meiotic segregation. Moreover, they exhibit contemporary ploidy variants due to spontaneous autopolyploidy, particularly in aquaculture. Nonetheless, as a species with several population segments that exhibit chronic recruitment failure, conservation aquaculture is an important part of their management. To facilitate the development of genetic tools to aid white sturgeon conservation, as well as a basis to understand how ploidy changes and variation historically and contemporarily shape the evolution of this species, we present a genome assembly for a white sturgeon from the Snake River, Idaho, USA. Analysis of sequence data used for assembly indicated a haploid genome size of approximately 1.5Gbp, implying tetraploidy (4N), while analysis of heterozygous k-mers from 21 to 41 bp suggest the genome reflects both 4N and 8N variants. The final genome assembly, scaffolded using linkage maps constructed from Fraser River and Snake River F1 families, contained 6.26Gbp in 832,145 scaffolds, consistent with published genome size estimates. Conserved ortholog completeness for this genome (90.5%; 22.8% single-copy and 67.7% duplicated) was similar to the putatively diploid sterlet sturgeon, and the largest linkage map-based scaffold was 55.2Gbp, though the N50 for this assembly was only 416Kbp, indicating the assembly remains fragmented. We demonstrate the utility of this assembly by identifying genomic regions significantly associated with sex. Genetic markers, designed for inclusion in an amplicon genotyping panel, predicted sex 96.6% and 81.5% correctly in females and males, respectively, providing a strong overall association ( 2 p-value < 2.7x10 -37 ) with some variation by geographic region.

Article summary

Functional polysomes (more than two chromosomes that pair in meiosis) are rare among vertebrates. White sturgeon, the largest freshwater fish in North America, are tetraploid and occasionally hexaploid. Several populations of this species are stagnant or declining, requiring aquaculture to bolster reproduction. We analyzed whole genomic data and assembled the genome of an individual from the unique Snake River, Idaho, population. Results indicate most genetic variants are consistent with tetraploidy, and the genome assembly, while fragmented, is largely complete. We demonstrate its utility by designing genetic markers for sex for use in conservation and commercial aquaculture.

Article activity feed