TARPON - a Telomere Analysis and Research Pipeline Optimized for Nanopore
Abstract
Long-read sequencing has transformed many areas of biology and holds significant promise for telomere research by enabling nucleotide-level resolution of chromosome arm–specific telomere length analyses in both model organisms and humans. However, the adoption of new technologies, particularly in clinical or diagnostic contexts, requires careful validation to identify potential technical and computational limitations. We present TARPON (Telomere Analysis and Research Pipeline Optimized for Nanopore), a best-practices Nextflow pipeline designed for the analysis of telomeres sequenced on the Oxford Nanopore Technologies (ONT) platform. TARPON can be executed via the command line or integrated into ONT’s EPI2ME agent, providing a user-friendly graphical interface for those without computational training. Nextflow’s container-based architecture eliminates dependency conflicts, streamlining deployment across platforms.
TARPON isolates telomeric repeat–containing reads, assigns strand specificity, and identifies enrichment probes that can be used both for demultiplexing and for confirming capture-based library preparation. To restrict the analysis to full-length telomeres, reads are excluded unless they carry a capture probe at one end and adjacent non-telomeric sequence at the opposite end. A sliding-window approach defines the subtelomere-telomere boundary, followed by quality filtering to remove low-quality or subtelomeric reads that passed earlier steps.
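The sliding-window idea can be illustrated with a minimal Python sketch. This is not TARPON's implementation: the function name, the window size, the step, and the density threshold are all hypothetical choices made for illustration. The read is assumed to be already oriented from subtelomere to telomere. Windows are scanned inward from the telomeric end until the repeat density drops below the threshold, and the boundary is then refined to the first full repeat in the innermost passing window.

```python
def find_telomere_boundary(read, motif="TTAGGG", window=60, step=6, min_density=0.7):
    """Estimate the 0-based position where the telomeric tract begins.

    Assumption (illustrative only): `read` is oriented subtelomere -> telomere,
    so the telomere sits at the 3' end. Returns None if the read end is not
    telomeric or the read is shorter than one window.
    """
    if len(read) < window:
        return None
    last_pass = None
    # Walk windows inward from the telomeric (3') end of the read.
    for start in range(len(read) - window, -1, -step):
        chunk = read[start:start + window]
        # Fraction of the window covered by non-overlapping motif copies.
        density = chunk.count(motif) * len(motif) / window
        if density < min_density:
            break  # first non-telomeric window: stop scanning
        last_pass = start
    if last_pass is None:
        return None
    # Refine to the first full repeat inside the innermost passing window.
    return read.find(motif, last_pass)


# Toy read: 120 bp of non-telomeric sequence followed by 300 bp of repeats.
subtelomere = "ACGTACGTTGCA" * 10
telomere = "TTAGGG" * 50
boundary = find_telomere_boundary(subtelomere + telomere)
telomere_length = len(subtelomere + telomere) - boundary
```

On real Nanopore reads, basecalling errors interrupt the repeats, so a practical version would need a tolerant density measure and a later quality filter, as the abstract describes.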
The pipeline outputs customizable statistics, text-based summaries, and publication-ready visualizations (HTML, PNG, PDF). While default settings are optimized for diagnostic workflows, all parameters are easily adjustable via the GUI or command line to support diverse applications. These include telomere analyses in variant-rich samples (e.g., ALT-positive tumors) and organisms with non-canonical telomeric repeats such as insects (GTTAG) and plants (GGTTTAG). TARPON is the first complete and experimentally validated pipeline for Nanopore-based telomere analysis requiring no pre-processing or prior bioinformatics expertise, while offering flexibility for advanced users.
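As a rough illustration of what motif flexibility involves, the sketch below parameterizes a strand call by the repeat unit. It is an assumption-laden toy, not TARPON code: the helper names and the `min_repeats` cutoff are hypothetical. It also shows that the repeats quoted above, GTTAG and GGTTTAG, are circular permutations of the forms in which the insect and plant repeats are often written (TTAGG and TTTAGGG), so a motif-based search finds the same tract in either frame.

```python
def rotations(motif):
    """All circular permutations of a repeat unit, e.g. TTAGG -> {TTAGG, TAGGT, ...}."""
    return {motif[i:] + motif[:i] for i in range(len(motif))}


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def classify_strand(read, motif="TTAGGG", min_repeats=10):
    """Label a read 'G' or 'C' by which orientation of the motif dominates.

    Returns None when neither orientation reaches `min_repeats` copies
    (a hypothetical cutoff chosen for illustration).
    """
    g_count = read.count(motif)
    c_count = read.count(reverse_complement(motif))
    if max(g_count, c_count) < min_repeats:
        return None
    return "G" if g_count >= c_count else "C"


# The same logic applies to a non-canonical repeat by changing one argument.
insect_read = "GTTAG" * 20
insect_strand = classify_strand(insect_read, motif="GTTAG")
insect_rc_strand = classify_strand(reverse_complement(insect_read), motif="GTTAG")
```

Keeping the repeat unit as a single parameter means the boundary and filtering steps can stay unchanged across species.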