Detection and annotation of unique regions in mammalian genomes

Beatriz Vieira Mourato
Bernhard Haubold

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Long unique genomic regions have been reported to be highly enriched for developmental genes in mice and humans. In this paper, we identify unique genomic regions using an efficient method based on fast string matching. We quantify the resource consumption and accuracy of this method before applying it to the genomes of 18 mammals. We annotate their unique regions (URs) of at least 10 kb and find that they are strongly enriched for developmental genes across the board. We then investigated the subset of URs that lack annotations, which we call “anonymous.” The longest anonymous UR in the Tasmanian devil spanned 83 kb and contained the gene encoding inositol polyphosphate-5-phosphatase A, which is an essential part of intracellular signaling. This discovery of an essential gene in a UR implies that URs might be given priority when annotating mammalian genomes. Our documented pipeline for annotating URs in any mammalian genome is available from the repository github.com/evolbioinf/auger; the additional data for this study are available from the dataverse at doi.org/10.17617/3.4IKQAG.

Version published to 10.1093/g3journal/jkae257
Nov 6, 2024
Version published to 10.1101/2024.10.11.617789 on bioRxiv
Oct 13, 2024

Ensembl’s regulatory annotation for human, mouse, livestock, and aquaculture species

This article has 5 authors:
1. Garth R. Ilsley
2. Paulo R. Branco Lins
3. Gabriela A. Merino
4. David Urbina-Gómez
5. Peter W. Harrison
This article has no evaluationsLatest version Jan 6, 2026
META-DIFF: a k-mer-based pipeline that detects differentially abundant sequences in metagenomics whole genome sequencing

This article has 8 authors:
1. Louis-Maël Guéguen
2. Alban Mathieu
3. Simon Pelletier
4. Anthony Woo
5. Namita Misra
6. Magali Moreau
7. Olivier Perin
8. Arnaud Droit
This article has no evaluationsLatest version Jan 29, 2026
The Deep Core: Mapping the 0.91% Regulatory Backbone of the Human Proteome and Its Role in Cancer Drug Resistance

This article has 1 author:
1. Andres Pirolo
This article has no evaluationsLatest version Feb 4, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Ensembl’s regulatory annotation for human, mouse, livestock, and aquaculture species

META-DIFF: a k-mer-based pipeline that detects differentially abundant sequences in metagenomics whole genome sequencing

The Deep Core: Mapping the 0.91% Regulatory Backbone of the Human Proteome and Its Role in Cancer Drug Resistance