Genomic resources of Ascidiella aspersa and comparative analysis across tunicates reveal conserved class-level features and evolutionary diversification

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

Ascidiella aspersa is an invasive tunicate, one of the closest relatives of vertebrates. Despite its ecological nuisance, A. aspersa is anticipated to be a valuable model organism for developmental studies due to its remarkably transparent embryos. However, annotated genome assemblies and transcriptomic resources have not yet been fully established. Although several tunicate genomes have been sequenced, most lack annotations, and a comprehensive analysis across tunicates has not yet been conducted.

Results

We performed de novo genome assembly and transcriptome analysis of A. aspersa, producing a high- quality 306.5 Mb genome assembly. The transcriptome was derived from nine different organs of adults and embryos at six developmental stages. Ab initio and homology-based gene predictions identified 24,504 genes as part of the KAS25 gene model with a BUSCO score of 92.2%. Functional annotation was added for 18,636 genes in the model. To understand the relative features of this species among tunicates, we conducted genome-wide comparative analysis using publicly available data from 35 other tunicate genomes across three classes, five orders, and 12 families, constructing gene models for 27 species with BUSCO scores >80%. Overall, Phylogenetic analysis revealed a new hypothesis regarding the relationships among Phlebobranchia and Aplousobranchia families. Gene duplication analysis showed distinct contractions of gene families in some taxa, with losses of specific DNA repair related genes that were shared among Thaliacea, which may have contributed to their evolutionary diversification. Tunicate genomes exhibited a high level of variation in genomic GC content (28.0%–42.7%). A. aspersa has the highest GC content in coding regions and the third position of codons among tunicate species, with changes in codon usage bias differing from other Ascidiidae species. We also constructed an online comparative tunicate genome database (TUNOME), that provide functional annotations of gene models and ortholog analyses based on these genomic and transcriptomic data.

Conclusions

We constructed genomic resources for A. aspersa and another 35 tunicate gene models. Comparative analysis reveals a variety in tunicate species genomes and characterizes class-level features. Our resources are expected to serve as a foundation for experimental studies involving non-model tunicates and for comparative analysis among tunicate species.

Article activity feed