GATHeR: Graph-based Accurate Tool for Immunoglobulin HEavy- and Light-chain Reconstruction
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Recovering full-length, paired B-cell receptor (BCR) sequences from scRNA-seq reads remains difficult, especially in naive and memory B cells where immunoglobulin transcripts are sparse. Current methods provide incomplete constant-region coverage, limiting isoform, subclass, and allele resolution. We present GATHeR, an open-source tool that assembles and annotates paired heavy- and light-chain BCR sequences and extends assembled sequences into constant regions. This enables confident subclass and allele assignment and recovery of the membrane-bound isoform, including the transmembrane segment and cytoplasmic tail, thereby distinguishing surface BCRs from secreted antibodies. GATHeR supports Smart-seq2/3 and 10x Genomics libraries and outperforms existing methods across benchmarks, with the largest gains in naive and memory B cells. Notably, in these populations the constant-region extension also revealed splice variation, including heavy-chain intron retention. By delivering high-fidelity receptor, isoform, and clonal lineage information, GATHeR broadens the analytical reach of scRNA-seq for B-cell immunology.