Dealing with differential misclassification of an outcome or a covariate in association studies with an internally validated sample. Application to the use of a serological test for the diagnosis of SARS-CoV-2 infection

Júlia Lacombe Ossó
Benjamin Glemain
Céline Dorival
Hélène Blanché
Cédric Lemogne
Jean-François Deleuze
Olivier Robineau
Mathilde Touvier
Gianluca Severi
Marie Zins
Xavier Lamballerie
Fabrice Carrat

Abstract

Background: To present an analytical framework for correcting misclassification when an imperfect test is used as an indicator of a disease in association studies, taking into account that part of the sample has joint test and disease data.

Methods: We explored two scenarios, depending on whether the disease is a covariate or the outcome. The analysis sample includes an internal validation sample in which the disease status is known in addition to the test result. We used joint likelihood models that account for classification errors and for the possibly non-random selection of the validation sample. Simulations were performed to evaluate the methods. We illustrated our framework using data from a multi-cohort COVID-19 serological study conducted in France between 2020 and 2021, with serology as the imperfect test and SARS-CoV-2 infection as the disease. The dataset included concomitant measurements of the serological test and the SARS-CoV-2 infection status in 7% of participants. We estimated 1) the association between incident persistent symptoms (outcome) and SARS-CoV-2 infection (covariate) and 2) the association between infection (outcome) and several covariates. For comparison, we also estimated naive models using serology without correction, and models based solely on the validation sample.

Results: Simulations confirmed the ability of the methods to correct for misclassification and for non-random selection of the validation sample. In the application, the estimated sensitivities and specificities of the serological test with respect to SARS-CoV-2 infection were 86.2%-87.7% and 95.8%-97.5%, respectively. With SARS-CoV-2 infection as a covariate, the corrected analysis showed a significant association between infection and persistent symptoms, whereas the other analyses did not. With SARS-CoV-2 infection as the outcome, the corrected analysis confirmed the association between infection and age, gender and active smoking, but did not recover the associations with living with at least one child at home or with previous smoking, both of which were identified in the naive analysis.

Conclusion: This methodological framework can be applied in association studies when an imperfect test is used as an indicator of a disease and the disease status has been validated in a subset of the sample. We extended previous work to deal with non-random selection of this validated sample.

Registration: NCT04392388
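To make the idea of a joint likelihood with a validation subsample concrete, the sketch below shows one possible per-participant likelihood contribution when infection is the outcome. It is an illustration under simplifying assumptions, not the authors' model: sensitivity and specificity are held constant (so misclassification is non-differential here), a single covariate enters a logistic model, and selection into the validation sample is ignored. All function and parameter names are hypothetical, and the numeric values in the usage note are illustrative only.

```python
import math


def expit(z):
    """Inverse logit: maps a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def prob_test_positive(p_infected, sensitivity, specificity):
    """P(test = 1): true positives plus false positives."""
    return sensitivity * p_infected + (1.0 - specificity) * (1.0 - p_infected)


def log_lik_contribution(test, infected, x, beta0, beta1,
                         sensitivity, specificity):
    """Log-likelihood contribution of one participant.

    test     -- observed serology result (0 or 1)
    infected -- true status (0 or 1) if validated, None otherwise
    x        -- a single covariate (assumption: one covariate only)
    """
    # Assumed outcome model: logistic regression of infection on x.
    p = expit(beta0 + beta1 * x)

    if infected is None:
        # Not validated: the true status is summed out of the likelihood.
        q = prob_test_positive(p, sensitivity, specificity)
        return math.log(q if test == 1 else 1.0 - q)

    # Validated: joint probability of the true status and the test result.
    p_status = p if infected == 1 else 1.0 - p
    p_test_pos = sensitivity if infected == 1 else 1.0 - specificity
    p_test = p_test_pos if test == 1 else 1.0 - p_test_pos
    return math.log(p_status) + math.log(p_test)
```

Summing `log_lik_contribution` over all participants gives a log-likelihood that could be maximised over `beta0`, `beta1`, `sensitivity` and `specificity` with a generic optimiser. For example, with an assumed infection probability of 0.10, sensitivity 0.87 and specificity 0.97, `prob_test_positive` returns 0.114, so the apparent positivity overstates the true prevalence.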

Version published to 10.21203/rs.3.rs-6751937/v1 on Research Square
Jun 5, 2025

Related articles

Development and Validation of a Multidimensional Indicator-Based Risk Prediction Model for Gestational Diabetes Mellitus: A Nested Case-Control Study

This article has 11 authors:
1. Jiajia Chen
2. Shanshan Yin
3. Shuling Wang
4. Shu Li
5. Ru Feng
6. Xianqi Wang
7. Xiao Hao
8. Xia Zhang
9. Qing Zhang
10. Guijuan Zhang
11. Linlin Hua
Latest version Jul 2, 2025

Evaluating the impact of unadjusted confounding and study design on estimated pathogen-attributable incidence of diarrhoea among children in low and middle-income countries: a sensitivity analysis of an attribution algorithm in the MAL-ED cohort

This article has 5 authors:
1. Charlotte Doran
2. Elizabeth Rogawski McQuade
3. Eric Houpt
4. Jie Liu
5. James Platts-Mills
Latest version Jun 25, 2025

Factors influencing SARS-CoV-2 IgG test sensitivity: A Bayesian analysis of seroconversion and reversion by time since infection, test, age and disease severity

This article has 3 authors:
1. Toon Braeye
2. Steven Abrams
3. Niel Hens
Latest version Jun 30, 2025
