Differential LTR-retrotransposon dynamics across polyploidization, speciation, domestication, and improvement of cotton (Gossypium)

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

Transposable elements are major components of plant genomes and major drivers of plant genome evolution. The cotton genus ( Gossypium ) is an excellent evolutionary model for polyploidization, speciation, domestication, and crop improvement. Here, we implement genome and pangenome analyses to study in detail the dynamics of LTR-retrotransposons during the cotton evolution.

Results

We show that some LTR-retrotransposon lineages amplified in tetraploid cotton compared to their diploid progenitors, whereas others stayed stable or amplified but were removed through solo-LTR formation . Using species-level pangenomes we show that only a few lineages (CRM, Tekay, Ivana, and Tork) remained active after polyploidization and are still transposing. Tekay and CRM elements have re-shaped the centromeric and pericentromeric regions of tetraploid cottons in a subgenome specific manner, through new insertions but also selective eliminations through solo-LTR formation. On the other hand, Ivana and Tork have actively inserted within or close to genes affecting their expression. Finally, population-level analyses using the two pangenomes and data from 283 and 223 varieties of G. hirsutum and G. barbadense reveal changes in Transposon Insertion Polymorphism frequencies accompanying domestication and improvement of both species, suggesting the possibility of selection on linked regions.

Conclusions

Our findings reveal that LTR-retrotransposon lineages followed differential dynamics during cotton evolution, displaying differences among species and the two coresident genomes of allopolyploid cotton. A handful of the LTR-retrotransposon lineages that expanded after polyploidization helped shape the genomes of both G. hirsutum and G. barbadense , impacting their centromere and pericentromeric regions as well as protein-coding genes.

Article activity feed