Pushing the limits of single molecule transcript sequencing to uncover the largest disease-associated transcript isoforms in the human neural retina

Merel Stemerdink
Tabea Riepe
Nick Zomer
Renee Salz
Michael Kwint
Raoul Timmermans
Barbara Ferrari
Stefano Ferrari
Alfredo Dueñas Rey
Emma Delanote
Suzanne E. de Bruijn
Hannie Kremer
Susanne Roosing
Frauke Coppieters
Alexander Hoischen
Frans P. M. Cremers
Peter A.C. ’t Hoen
Erwin van Wijk
Erik de Vrieze

Abstract

Sequencing technologies have long limited the comprehensive investigation of large transcripts associated with inherited retinal diseases (IRDs) like Usher syndrome, which involves 11 associated genes with transcripts up to 19.6 kb. To address this, we used PacBio long-read mRNA isoform sequencing (Iso-Seq) following standard library preparation and an optimized workflow to enrich for long transcripts in the human neural retina. While our workflow achieved sequencing of transcripts up to 15 kb, this was insufficient for Usher syndrome-associated genes USH2A and ADGRV1, with transcripts of 18.9 kb and 19.6 kb, respectively. To overcome this, we employed the Samplix Xdrop System for indirect target enrichment of cDNA, a technique typically used for genomic DNA capture. This method facilitated the successful capture and sequencing of ADGRV1 transcripts as well as the full-length 18.9 kb USH2A transcripts. By combining algorithmic analysis with detailed manual curation of sequenced reads, we identified novel isoforms and alternative splicing events across the 11 Usher syndrome-associated genes, with implications for diagnostics and therapy development. Our findings demonstrate the Xdrop system’s adaptability for cDNA capture and the advantages of integrating computational and manual transcript analyses. The full neural retina sequencing dataset is available via EGA under identifier EGAD50000000720.

Version published to 10.1101/2024.09.10.612265 on bioRxiv
Sep 14, 2024

