Nested pool testing strategy for the reliable identification of individuals infected with SARS-CoV-2

Inés Armendáriz
Pablo A. Ferrari
Daniel Fraiman
José M. Martínez
Hugo G. Menzella
Silvina Ponce Dawson


Abstract

The progress of the SARS-CoV-2 pandemic requires the design of cost-effective testing programs at large scale. To this end, pooling multiple samples can provide a solution. Defining a cost-effective strategy requires an efficient deconvolution and re-testing procedure that eventually allows the identification of the carrier. Based on Dorfman’s algorithm, we developed an adaptive nested strategy for which we have, for a given prevalence, simple analytic expressions for the optimal number of samples in the starting pool, the number of partitioning steps (stages) in the optimal path, the pool sizes in each of these stages, and the expected average number of tests needed to identify the infected individuals. In this paper we analyze the strategy in detail, focusing on its practical implementation when there are restrictions that prevent the use of the optimum. More specifically, we analyze how to proceed when the infection prevalence is poorly known a priori or when the optimum requires starting with pool sizes that are too large for the reliable detection of an infected sample. The sensitivity of the RT-qPCR assay, the gold standard RNA detection method, is a major concern in the case of SARS-CoV-2: it is estimated that half of the infected individuals give false negative results. Recently, droplet digital PCR (ddPCR) was shown to be 10 to 100 times more sensitive than RT-qPCR, making this technology suitable for pool testing. ddPCR has the added value of providing direct quantification of the RNA content at the end of the test. In the paper we show how this feature can be used for verification purposes. The analyses and strategies presented here should be useful to those considering the adoption of a pooling approach for RNA detection, particularly for the identification of individuals infected with SARS-CoV-2.
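As background for the abstract, the classic two-stage Dorfman scheme on which the nested strategy is based can be sketched numerically. This is not the authors' nested algorithm: it is a minimal illustration of the baseline, assuming a perfectly accurate test and independent infections with a known prevalence. The function names and the `max_pool_size` cap are hypothetical choices made for this example.

```python
def dorfman_tests_per_person(prevalence: float, pool_size: int) -> float:
    """Expected tests per individual under classic two-stage Dorfman pooling.

    Assumes a perfect test and independent infections (illustrative only).
    """
    if pool_size == 1:
        # No pooling: every sample is tested once.
        return 1.0
    # One pooled test is shared by pool_size people; if the pool is positive,
    # every member is retested individually.
    prob_pool_positive = 1.0 - (1.0 - prevalence) ** pool_size
    return 1.0 / pool_size + prob_pool_positive


def best_dorfman_pool_size(prevalence: float, max_pool_size: int = 100) -> int:
    """Pool size in [1, max_pool_size] that minimizes expected tests per person."""
    return min(
        range(1, max_pool_size + 1),
        key=lambda n: dorfman_tests_per_person(prevalence, n),
    )


if __name__ == "__main__":
    p = 0.01  # assumed prevalence for the example
    n = best_dorfman_pool_size(p)
    print(n, round(dorfman_tests_per_person(p, n), 4))
```

At an assumed prevalence of 1%, the two-stage scheme already needs only about one fifth of the tests required by individual testing; the nested strategy described in the abstract adds further partitioning stages on top of this baseline.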

Author summary

The progress of the SARS-CoV-2 pandemic requires the design of cost-effective testing programs at large scale. Running tests on pooled samples can provide a solution if the test's sensitivity is high enough. In the case of SARS-CoV-2, the current gold standard test, RT-qPCR, has shown some limitations that only allow the use of pools with relatively few samples. In this regard, droplet digital PCR (ddPCR) has been shown to be 10 to 100 times more sensitive than RT-qPCR, making it suitable for test pooling. In this paper we describe a nested pool testing method in which the properties that make it optimal are simple analytic functions of the infection prevalence. We discuss how to proceed in practical implementations of the strategy, particularly when there are constraints that prevent the use of the optimum. We also show how its nested nature can be combined with the direct RNA quantification that the ddPCR test provides, both to identify the presence of unviable samples in the pools and to run self-consistency tests. The studies in this paper should be useful for those considering the adoption of test pooling for RNA detection.

SciScore for 10.1101/2021.01.03.21249170: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Experimental Models: Organisms/Strains
Sentences	Resources
In particular, the position along the row (starting from 0) of the individual sample with subscripts is $i_{k+1} + 3 i_k + \dots + 3^{k-1} i_2 + 3^k i_1$.	$i_{k+1} + 3 i_k + \dots + 3^{k-1} i_2$ suggested: None
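The position formula quoted in the table above reads as a base-3 expansion with $i_1$ as the most significant digit. A minimal sketch follows, assuming that each subscript takes a value in {0, 1, 2} (that is, each pool is split into three sub-pools at every stage); the excerpt does not state this range explicitly, and the function name `row_position` is hypothetical.

```python
from typing import Sequence


def row_position(subscripts: Sequence[int]) -> int:
    """Position (starting from 0) of a sample given subscripts (i_1, ..., i_{k+1}).

    Computes i_{k+1} + 3*i_k + ... + 3**k * i_1 by Horner's rule.
    Assumes every subscript is 0, 1 or 2 (an assumption of this sketch).
    """
    position = 0
    for digit in subscripts:
        if digit not in (0, 1, 2):
            raise ValueError(f"subscript {digit} is not a base-3 digit")
        # Shift the accumulated value one base-3 place and add the next digit.
        position = 3 * position + digit
    return position


if __name__ == "__main__":
    # Subscripts (i_1, i_2, i_3) = (1, 0, 2): 2 + 3*0 + 9*1
    print(row_position([1, 0, 2]))
```

With k + 1 subscripts the positions run from 0 to 3^(k+1) − 1, so each sample in the starting pool has a unique position.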

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

Events of this type have been shown to be very unlikely, and having a larger probability of false negatives is currently accepted as a limitation of the pooling method with no significant clinical meaning. Considering these limitations, test pooling may still serve for epidemiological purposes and for continuous validation of the method. As analyzed in this Section, however, the accurate quantification that ddPCR provides can help enlarge the truly confirmed set of negative tests. We discuss in what follows how a self-consistency check could be applied to the results obtained with our strategy when the nucleic acid content is quantified as described in S1 Appendix, Sec. 3, and how this quantification can be used to detect some of the flawed pooled samples that test negative. We also use the ddPCR quantification to determine the probability of detecting a single infected sample in a pool as a function of the pool size and the viral load. Test verification: In ddPCR the volume that goes into the reaction tube is subdivided into many (20,000) sub-volumes. At the end, the test gives (ideally) the number of sub-volumes that contained, at the beginning of the test, at least one molecule of the nucleic acid of interest (RNA in our case). As explained in S1 Appendix, Sec. 3, if the fraction of occupied sub-volumes is bounded away from 0 or 1, a range of possible values for the concentration of the RNA detected by the test can be obtained. As derived in S1 Appendix, Sec. 3, when ...
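The quantification step described in the excerpt above can be illustrated with the standard Poisson relation used in digital PCR. S1 Appendix is not reproduced here, so this is a sketch under stated assumptions rather than the authors' derivation: molecules are assumed to be distributed independently among the sub-volumes, and equal-volume pooling is assumed to dilute a single positive sample by a factor equal to the pool size. All function names are hypothetical.

```python
import math


def mean_copies_per_subvolume(occupied: int, total: int = 20_000) -> float:
    """Poisson estimate of the mean number of target molecules per sub-volume.

    If molecules land independently, P(sub-volume empty) = exp(-lam), so
    lam = -ln(1 - occupied / total). Undefined when every sub-volume is occupied.
    """
    if not 0 <= occupied < total:
        raise ValueError("occupied must satisfy 0 <= occupied < total")
    return -math.log(1.0 - occupied / total)


def detection_probability(expected_copies: float) -> float:
    """Probability that at least one target molecule is in the reaction volume."""
    return 1.0 - math.exp(-expected_copies)


def pooled_detection_probability(copies_if_unpooled: float, pool_size: int) -> float:
    """Detection probability for one infected sample diluted in a pool.

    Assumes equal-volume pooling, so the expected copy number is divided
    by pool_size (an assumption of this sketch).
    """
    return detection_probability(copies_if_unpooled / pool_size)


if __name__ == "__main__":
    print(mean_copies_per_subvolume(10_000))      # half of the sub-volumes occupied
    print(pooled_detection_probability(100, 32))  # assumed load and pool size
```

The estimate degenerates when almost no or almost all sub-volumes are occupied, which matches the excerpt's condition that the occupied fraction be bounded away from 0 or 1.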

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We found bar graphs of continuous data. We recommend replacing bar graphs with more informative graphics, as many different datasets can lead to the same bar graph. The actual data may suggest different conclusions from the summary statistics. For more information, please see Weissgerber et al (2015).

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a protocol registration statement.

Version published to 10.1101/2021.01.03.21249170 on medRxiv
Jan 4, 2021
