Functional CENP-B boxes are selectively conserved at centromeric regions
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Human centromeres are built over large stretches of repetitive, divergent α-satellite DNA. Within these sequences lies a conserved, defined 17-bp sequence named the CENP-B box that is bound by the DNA-binding protein CENP-B. Recent studies have proposed that CENP-B box motifs along chromosome arms exist and may represent conserved, ectopic binding sites with functional relevance. Here, we evaluate the genomic distribution, conservation, and binding capacity of different CENP-B box motifs outside canonical centromeres. Analysis of thousands of complete human centromere assemblies reveals exceptional conservation of a unique and canonical CENP-B box motif within centromeres. This is in contrast with the high sequence variability and stochastic occurrence of more degenerate CENP-B box motifs along chromosome arms. Consistently, CENP-B binds only at canonical CENP-B box motifs embedded within α-satellite sites, with no evidence of functional binding at any ectopic sites. Together, these results indicate an adaptive selection of canonical CENP-B boxes within centromeric regions, in contrast to random sequence occurrences for ectopic CENP-B box-like motifs.