A phylogeny-guided framework for decoding mechanisms of human endogenous retrovirus regulation in health and disease

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Human endogenous retroviruses (HERVs) are remmants of ancient infections which make up to ∼8% of the human genome. Their activity influences development, immunity, and cancer, but studying them has been limited by a key technical challenge: short-read sequencing cannot uniquely assign reads to these highly repetitive elements. Here, we present ERVmancer, a phylogeny-informed method that resolves the read-mapping ambiguity and quantifies HERV expression across scales, from individual loci to entire retroviral clades, depending on mapping confidence. Benchmarking with sample-matched long- and short-read data generated in this study demonsrates that ERVmancer outperforms existing approaches in both sensitivity and specificity. Application of ERVmancer recapitulates known HERV expression patterns in multiple sclerosis and uncovers new biology in breast cancer, including suppression of HERVH-LTR7 by p53. By enabling accurate and scalable quantification of integrated retroviral elements, ERVmancer provides a broadly applicable resource for investigating retroviral mechanisms in health and disease.

Article activity feed