GATHeR: Graph-based Accurate Tool for Immunoglobulin HEavy- and Light-chain Reconstruction

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Recovering full-length, paired B-cell receptor (BCR) sequences from scRNA-seq reads remains difficult, especially in naive and memory B cells where immunoglobulin transcripts are sparse. Current methods provide incomplete constant-region coverage, limiting isoform, subclass, and allele resolution. We present GATHeR, an open-source tool that assembles and annotates paired heavy- and light-chain BCR sequences and extends assembled sequences into constant regions. This enables confident subclass and allele assignment and recovery of the membrane-bound isoform, including the transmembrane segment and cytoplasmic tail, thereby distinguishing surface BCRs from secreted antibodies. GATHeR supports Smart-seq2/3 and 10x Genomics libraries and outperforms existing methods across benchmarks, with the largest gains in naive and memory B cells. Notably, in these populations the constant-region extension also revealed splice variation, including heavy-chain intron retention. By delivering high-fidelity receptor, isoform, and clonal lineage information, GATHeR broadens the analytical reach of scRNA-seq for B-cell immunology.

Article activity feed