Long-read sequencing reveals extensive FMR1 somatic mosaicism in Fragile-X associated tremor/ataxia syndrome in human brain
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Fragile X–associated tremor/ataxia syndrome (FXTAS) is a progressive neurodegenerative disorder caused by a CGG repeat expansion in the 5’ untranslated region of the X-linked Fragile X messenger ribonucleoprotein 1 ( FMR1) . Although the CGG repeat tract is known for instability that has been posited to contribute to clinical heterogeneity, the extent of somatic variation in human brain remains unclear, in part due to the technical limitations of sequencing long tandem repeats. Here, we quantified FMR1 somatic variation in post-mortem brain tissue from individuals with FXTAS (n = 6) and Fragile X syndrome (FXS, n = 2) by applying amplification-free, targeted, long-read sequencing. This approach uncovered remarkable somatic mosaicism in repeat size and methylation in FXTAS, including somatic expansions and contractions which were not resolvable with traditional approaches. For example, in FXTAS, we identified somatic expansions to over 6000 base pairs in length as well as contractions to below the pathogenic range. We also identified unexpected patterns of methylation mosaicism on pre and full mutations. On the other hand, we replicated prior findings suggesting differential CGG expansion on the active X-chromosome in XX females. Finally, we examined the above cohort for expansions in 19 additional disease-associated repeat loci. Remarkably, we identified additional expansions in 5 out of 8 affected individuals, in FXN and RFC1 . This work provides new insight into the extensive molecular variation underlying FXTAS in human brain and establishes a framework for studying repeat expansion disorders more broadly, highlighting the potential of long-read sequencing to advance our fundamental understanding of somatic mosaicism of these intractable regions of our genome.