SIMPLseq: a high-sensitivity Plasmodium falciparum genotyping and PCR contamination tracking tool

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

Pathogen genotyping via polymerase chain reaction (PCR) amplicon sequencing (AmpSeq) is an informative disease surveillance tool. Several large AmpSeq panels containing >100 multiplexed PCR amplicons have been developed as alternatives to whole-genome sequencing (WGS) methods for the Plasmodium spp. parasites that cause malaria, especially for parasite drug resistance tracking and relatedness analysis. However, these large multiplexes often require a costly pre-amplification stage and typically yield sparse data for samples with parasitemia below 10 parasites/μl. Smaller multiplexes optimized for low-parasitemia genotyping have received insufficient methodological work but have the potential to serve multiple important applications. Managing contamination risk during PCR steps represents another key methodological gap that requires attention in the AmpSeq field.

Methods

Here we describe a new 6-locus Plasmodium falciparum AmpSeq ‘miniplex’ (SIMPLseq) optimized for high-sensitivity analyses that also integrates a contamination detection system based on well-specific inline barcodes applied during first-round PCR (PCR1; in addition to conventional indexing steps in the second-round PCR). We assess panel diversity using publicly available WGS and use mock samples to estimate sensitivity and precision relative to 4CAST, a previously described miniplex. We also create deliberate contamination events to assess sensitivity and estimate unintentional contamination rates during assay application to malaria-infected dried blood spots collected in Mali.

Results

SIMPLseq shows high haplotypic diversity in silico , distinguishing 96.0% of sample pairs drawn randomly from 12 subnational sample sets. SIMPLseq outperforms 4CAST in sensitivity analyses, achieving 100% average locus detection at ≥0.5 parasites/μl and ≥50% average locus detection at 0.25 and 0.125 parasites/μl, with zero false-positive haplotypes across 25 replicates. Inline barcoding did not significantly affect yield when using a ‘sentinel’ design, whereby one of the six multiplexed PCR1 primer pairs contains the well-specific sequence pair. Sentinel barcoding correctly identified all 24 contaminations introduced deliberately during PCR1 product handling and identified 39 unintentional contaminations in the 1420-sample Malian run.

Conclusions

SIMPLseq significantly extends the malaria genomic epidemiology toolkit, employing a simple laboratory protocol based on entirely open-source reagents that is significantly more sensitive and also less costly than most other targeted sequencing protocols. Key use cases for SIMPLseq include recurrent infection classification, polyclonality estimation, and genotypic infection endpoints in intervention efficacy trials.

Article activity feed