Modeling COVID-19 disease processes by remote elicitation of causal Bayesian networks from medical experts
This article has been Reviewed by the following groups
Listed in
- Evaluated articles (ScreenIT)
Abstract
Background
COVID-19 is a new multi-organ disease, caused by the SARS-CoV-2 virus, resulting in considerable worldwide morbidity and mortality. While many recognized pathophysiological mechanisms are involved, their exact causal relationships remain opaque. A better understanding is needed for predicting their progression, targeting therapeutic approaches, and improving patient outcomes. While many mathematical causal models describe COVID-19 epidemiology, none have been developed for its pathophysiology. The virus’s rapid and extensive spread and therapeutic responses made this particularly difficult. Initially, no large patient datasets were publicly available, and their data remains limited. The medical literature was flooded with unfiltered, technical and sometimes conflicting pre-review reports. Clinicians in many countries had little time for academic consultations, and in-person meetings were unsafe.
Methods and Findings
In early 2020, we began a major project to develop causal models of the pathophysiological processes underlying the disease’s clinical manifestations. We used Bayesian network (BN) models, because they provide both powerful tools for calculation and clear maps of probabilistic causal influence between semantically meaningful variables, as directed acyclic graphs (DAGs). Hence, they can incorporate expert opinion and numerical data, and produce explainable results. Dynamic causal BNs, which represent successive “time slices” of the system, can capture feedback loops and long-term disease progression.
To obtain the likely causal structures, we used extensive elicitation of expert opinion in structured online sessions. Centered in Australia, with its exceptionally low COVID-19 burden, we managed to obtain many consultation hours. Groups of clinical and other subject matter specialists, all independent volunteers, were enlisted to filter, interpret and discuss the literature and develop a current consensus. We aimed to capture the experts’ understanding, so we encouraged discussion and inclusion of theoretically salient latent (i.e., unobservable) variables, documented supporting literature while noting controversies, and allowed experts to propose mechanisms by extrapolation from other diseases. Intermediary experts with some combined expertise facilitated the exchange of knowledge to BN modelers and vice versa. Our method was iterative and incremental: we systematically refined and checked the group output with one-on-one follow-up meetings with the original and new experts to validate previous results. In total, 35 experts contributed 126 face-to-face hours, and could review our products.
Conclusions
Our method demonstrates and describes an improved procedure for developing BNs via expert elicitation, which can be implemented rapidly by other teams modeling emergent complex phenomena. The results presented are two key models, for the initial infection of the respiratory tract and the possible progression to complications, as causal DAGs and BNs with corresponding verbal descriptions, dictionaries and sources. These are the first published causal models of COVID-19 pathophysiology, with three anticipated applications: (i) making expert knowledge freely available in a readily understandable and updatable form; (ii) guiding design and analysis of observational and clinical studies, by identifying potential mediators, confounders, and modifiers of treatment effects; (iii) developing and validating parameterized automated tools for causal reasoning and decision support, in clinical and policy settings. We are currently developing such tools for the initial diagnosis, resource management, and prognosis of COVID-19, parameterized using the ISARIC and LEOSS databases.
Article activity feed
-
SciScore for 10.1101/2022.02.14.22270925: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.Table 2: Resources
Software and Algorithms Sentences Resources The primary outcome of the workshop was a causal DAG (in the GeNIe BN software) that a high proportion of experts agreed captured the most important underlying pathophysiological processes. 6. GeNIesuggested: (GENIE, RRID:SCR_009197)Results from OddPub: Thank you for sharing your data.
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:Limitations of our models: While there are still unknowns and competing hypotheses in the literature, the differences are not profound enough to require presentation as competing DAGs; our list of …
SciScore for 10.1101/2022.02.14.22270925: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.Table 2: Resources
Software and Algorithms Sentences Resources The primary outcome of the workshop was a causal DAG (in the GeNIe BN software) that a high proportion of experts agreed captured the most important underlying pathophysiological processes. 6. GeNIesuggested: (GENIE, RRID:SCR_009197)Results from OddPub: Thank you for sharing your data.
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:Limitations of our models: While there are still unknowns and competing hypotheses in the literature, the differences are not profound enough to require presentation as competing DAGs; our list of issues is sufficient to indicate which structural components would differ. Some disagreements only concern the degree of impact of alternative pathways, which corresponds to differing numerical parameterizations of the BNs, and no parameterizations are included here. Both our BNs do not include important background factors such as age, comorbidities and vaccination status, that strongly influence the probability of more serious COVID-19 outcomes (and each other). Knowledge about their role in the COVID-19 process was too limited: although they are known to directly influence some of our variables (e.g., vaccination reduces the chance of initial infection), not all of their direct influences are clear (e.g., vaccination also decreases the chance of infection developing into severe COVID-19, probably by influencing multiple variables along these pathways). Fortunately, these theoretical models are useful and valid without them. According to our experts’ assessment, the distinctive COVID-19 causal structure depicted is unlikely to change for any particular specification of background factors. Rather, such factors will affect the parameters, e.g., how strongly some variables influence others. When adequate data is available to adjust these parameters depending on background factors, the...
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:- Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
- Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
- No protocol registration statement was detected.
Results from scite Reference Check: We found no unreliable references.
-