Cenote-Taker 3 for Fast and Accurate Virus Discovery and Annotation of the Virome
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Viruses are abundant across all Earth’s environments and infect all classes of cellular life. Despite this, viruses are something of a black box for genomics scientists. Their genetic diversity is greater than all other lifeforms combined, their genomes are often overlooked in sequencing datasets, and no function can be inferred for most of their genes. For these reasons, scientists need robust, performant, well-documented, extensible tools that can conduct sensitive and specific analyses of sequencing data to discover virus genomes - even those with high divergence from known references - and annotate their genes. Here, we present Cenote-Taker 3. This command line interface tool processes genome assemblies and/or metagenomic assemblies with modules for virus discovery, prophage extraction, and annotation of genes and other genetic features. Benchmarks show that Cenote-Taker 3 outperforms most tools for virus gene annotation in both speed (wall time) and accuracy. For virus discovery benchmarks, Cenote-Taker 3 performs well compared to geNomad, and these tools produce complementary results. Cenote-Taker 3 is freely available on Bioconda, and its open-source code is maintained on GitHub ( https://github.com/mtisza1/Cenote-Taker3 )