Pooling RT-PCR or NGS samples has the potential to cost-effectively generate estimates of COVID-19 prevalence in resource limited environments

Krishna R Narayanan
Isabel Frost
Anoosheh Heidarzadeh
Katie K Tseng
Sayantan Banerjee
Jacob John
Ramanan Laxminarayan

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (ScreenIT)

Abstract

Background

COVID-19 originated in China and has quickly spread worldwide causing a pandemic. Countries need rapid data on the prevalence of the virus in communities to enable rapid containment. However, the equipment, human and laboratory resources required for conducting individual RT-PCR is prohibitive. One technique to reduce the number of tests required is the pooling of samples for analysis by RT-PCR prior to testing.

Methods

We conducted a mathematical analysis of pooling strategies for infection rate classification using group testing and for the identification of individuals by testing pooled clusters of samples.

Findings

On the basis of the proposed pooled testing strategy we calculate the probability of false alarm, the probability of detection, and the average number of tests required as a function of the pool size. We find that when the sample size is 256, using a maximum pool size of 64, with only 7.3 tests on average, we can distinguish between prevalences of 1% and 5% with a probability of detection of 95% and probability of false alarm of 4%.

Interpretation

The pooling of RT-PCR samples is a cost-effective technique for providing much-needed course-grained data on the prevalence of COVID-19. This is a powerful tool in providing countries with information that can facilitate a response to the pandemic that is evidence-based and saves the most lives possible with the resources available.

Funding

Bill & Melinda Gates Foundation

Authors contributions

RL and KRN conceived the study. IF, KT, KRN, SB and RL all contributed to the writing of the manuscript and AH and JJ provided comments. KRN and AH conducted the analysis and designed the figures.

Research in context

Evidence before this study

The pooling of RT-PCR samples has been shown to be effective in screening for HIV, Chlamydia, Malaria, and influenza, among other pathogens in human health. In agriculture, this method has been used to assess the prevalence of many pathogens, including Dichelobacter nodosus , which causes footrot in sheep, postweaning multisystemic wasting syndrome, and antibiotic resistance in swine feces, in addition to the identification of coronaviruses in multiple bat species. In relation to the current pandemic, researchers in multiple countries have begun to employ this technique to investigate samples for COVID-19.

Added value of this study

Given recent interest in this topic, this study provides a mathematical analysis of infection rate classification using group testing and calculates the probability of false alarm, the probability of detection, and the average number of tests required as a function of the pool size. In addition the identification of individuals by pooled cluster testing is evaluated.

Implications of all the available evidence

This research suggests the pooling of RT-PCR samples for testing can provide a cheap and effective way of gathering much needed data on the prevalence of COVID-19 and identifying infected individuals in the community, where it may be infeasible to carry out a high number of tests. This will enable countries to use stretched resources in the most appropriate way possible, providing valuable data that can inform an evidence-based response to the pandemic.

ScreenIT
Jun 5, 2020
SciScore for 10.1101/2020.04.03.20051995: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
Institutional Review Board Statement not detected.
Randomization not detected.
Blinding not detected.
Power Analysis not detected.
Sex as a biological variable not detected.
Table 2: Resources
No key resources detected.
Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
One limitation of pooling multiple RT-PCR samples is that the sensitivity of testing is reduced. To address this, it has been suggested that the number of samples being pooled be kept as low as possible to reduce dilution24,25 …
SciScore for 10.1101/2020.04.03.20051995: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
Institutional Review Board Statement not detected.
Randomization not detected.
Blinding not detected.
Power Analysis not detected.
Sex as a biological variable not detected.
Table 2: Resources
No key resources detected.
Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
One limitation of pooling multiple RT-PCR samples is that the sensitivity of testing is reduced. To address this, it has been suggested that the number of samples being pooled be kept as low as possible to reduce dilution24,25 and special kits for extracting DNA from larger volumes be used.8 However, in some settings, pursuing optimal pool size may still not be possible due to fixed testing resources. Herein we recommend the use of the pooled sample method with a binary hierarchical testing strategy for the detection of SARS-CoV-2 by RT-PCR in community surveillance particularly in resource limited settings where testing kits, facilities, and personnel are scarce. Our best estimates are that for the currently estimated prevalence of COVID-19 in many low- and middle-income countries, a single cluster of 30 tests could be combined from patients exhibiting cough, fever and other mild-flu like symptoms. Clustering should be undertaken at a geographic level to prioritize neighborhoods or localities for additional containment efforts.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.
About SciScore
SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.
Read the original source
Version published to 10.1101/2020.04.03.20051995 on medRxiv
Apr 6, 2020

Assessing Mass Screening as an Effective Tool for Pandemic Management

This article has 5 authors:
1. Adil Lagmar
2. Maryem Wardi
3. Ahmed Belmouden
4. Mohamed Aghrouch
5. Zohra Lemkhente
This article has no evaluationsLatest version Dec 17, 2025
Designing and making an in-house real-time RT qPCR kit for the detection of SARS-CoV- 2 infection at high sensitivity and low cost

This article has 4 authors:
1. Abdul-Sattar AL-Saeedi
2. Ahmed Sahib Abdulamir
3. Iman M. Aufi
4. Salah Hashim shaheed
This article has no evaluationsLatest version Dec 10, 2025
Performance of automated real-time PCR-based method for bacteria detection

This article has 3 authors:
1. Vesa Mäki-Koivisto
2. Elina Aho-Laukkanen
3. Ilkka Samuel Junttila
This article has no evaluationsLatest version Dec 30, 2025

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.