Pushing the limits of single molecule transcript sequencing to uncover the largest disease-associated transcript isoforms in the human neural retina
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Sequencing technologies have long limited the comprehensive investigation of large transcripts associated with inherited retinal diseases (IRDs) like Usher syndrome, which involves 11 associated genes with transcripts up to 19.6 kb. To address this, we used PacBio long-read mRNA isoform sequencing (Iso-Seq) following standard library preparation and an optimized workflow to enrich for long transcripts in the human neural retina. While our workflow achieved sequencing of transcripts up to 15 kb, this was insufficient for Usher syndrome-associated genes USH2A and ADGRV1 , with transcripts of 18.9 kb and 19.6 kb, respectively. To overcome this, we employed the Samplix Xdrop System for indirect target enrichment of cDNA, a technique typically used for genomic DNA capture. This method facilitated the successful capture and sequencing of ADGRV1 transcripts as well as the full-length 18.9 kb USH2A transcripts. By combining algorithmic analysis with detailed manual curation of sequenced reads, we identified novel isoforms and alternative splicing events across the 11 Usher syndrome-associated genes, with implications for diagnostics and therapy development. Our findings demonstrate the Xdrop system’s adaptability for cDNA capture and the advantages of integrating computational and manual transcript analyses. The full neural retina sequencing dataset is available via EGA under identifier EGAD50000000720.