Disentangling the relationship between cancer mortality and COVID-19

Curation statements for this article:
  • Curated by eLife

    eLife logo

    eLife assessment

    This valuable work explores death coding data to understand the impact of COVID-19 on cancer mortality. The work provides solid evidence that deaths with cancer as a contributing cause were not above what would be expected during pandemic waves, suggesting that cancer did not strongly increase the risk of dying of COVID-19. These results are an interesting exploration into the coding of causes of death that can be used to make sense of how deaths are coded during a pandemic in the presence of other underlying diseases, such as cancer.

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

Several countries have reported that deaths with a primary code of cancer did not rise during COVID-19 pandemic waves compared to baseline pre-pandemic levels. This is in apparent conflict with findings from cohort studies where cancer has been identified as a risk factor for COVID-19 mortality. Here we further elucidate the relationship between cancer mortality and COVID-19 on a population level in the US by testing the impact of death certificate coding changes during the pandemic and leveraging heterogeneity in pandemic intensity across US states. We computed excess mortality from weekly deaths during 2014-2020 nationally and for three states with distinct COVID-19 wave timing (NY, TX, and CA). We compared pandemic-related mortality patterns from underlying and multiple causes (MC) death data for six types of cancer and high-risk chronic conditions such as diabetes and Alzheimer’s. Any coding change should be captured in MC data.Nationally in 2020, we found only modest excess MC cancer mortality (∼12,000 deaths), representing a 2% elevation over baseline. Mortality elevation was measurably higher for less deadly cancers (breast, colorectal, and hematologic, 2-5%) than cancers with a poor 5-year survival (lung and pancreatic, 0-1%). In comparison, there was substantial elevation in MC deaths from diabetes (39%) and Alzheimer’s (31%). Homing in on the intense spring 2020 COVID-19 wave in NY, mortality elevation was 2-15% for cancer and 126% and 55% for diabetes and Alzheimer’s, respectively. Simulations based on a demographic model indicate that differences in life expectancy for these conditions, along with the age and size of the at-risk populations, largely explain the observed differences in excess mortality during the COVID-19 pandemic.In conclusion, we found limited elevation in cancer mortality during COVID-19 waves, even after considering coding changes. Our demographic model predicted low expected excess mortality in populations living with certain types of cancer, even if cancer is a risk factor for COVID-19 fatality risk, due to competing mortality risk. We also find a moderate increase in excess mortality from blood cancers, aligned with other types of observational studies. While our study concentrates on the immediate consequences of the COVID-19 pandemic on cancer mortality, further research should consider the pandemic impact on hospitalizations, delayed diagnosis/treatment and risk of Long COVID in cancer patients.

Article activity feed

  1. eLife assessment

    This valuable work explores death coding data to understand the impact of COVID-19 on cancer mortality. The work provides solid evidence that deaths with cancer as a contributing cause were not above what would be expected during pandemic waves, suggesting that cancer did not strongly increase the risk of dying of COVID-19. These results are an interesting exploration into the coding of causes of death that can be used to make sense of how deaths are coded during a pandemic in the presence of other underlying diseases, such as cancer.

  2. Reviewer #1 (Public Review):

    Summary:

    In the paper "Disentangling the relationship between cancer mortality and COVID-19", the authors study whether the number of deaths in cancer patients in the USA went up or down during the first year (2020) of the COVID-19 pandemic. They found that the number of deaths with cancer mentioned on the death certificate went up, but only moderately. In fact, the excess with-cancer mortality was smaller than expected if cancer had no influence on the COVID mortality rate and all cancer patients got COVID with the same frequency as in the general population. The authors conclude that the data show no evidence of cancer being a risk factor for COVID and that the cancer patients were likely actively shielding themselves from COVID infections.

    Strengths:

    The paper studies an important topic and uses sound statistical and modeling methodology. It analyzes both, deaths with cancer listed as the primary cause of death, as well as deaths with cancer listed as one of the contributing causes. The authors argue, correctly, that the latter is a more important and reliable indicator to study relationships between cancer and COVID. The authors supplement their US-wide analysis by analysing three states separately.

    Weaknesses:

    The main findings of the paper can be summarized as six numbers. Nationally, in 2022, multiple-cause cancer deaths went up by 2%, Alzheimer's deaths by 31%, and diabetes deaths by 39%. At the same time, assuming no relationship between these diseases and either Covid infection risk or Covid mortality risk, the deaths should have gone up by 7%, 46%, and 28%. The authors focus on cancer deaths and as 2% < 7%, conclude that cancer is not a risk factor for COVID and that cancer patients must have "shielded" themselves against Covid infections.

    However, I did not find any discussion of the other two diseases. For diabetes, the observed excess was 39% instead of "predicted by the null model" 28%. I assume this should be interpreted as diabetes being a risk factor for Covid deaths. I think this should be spelled out, and also compared to existing estimates of increased Covid IFR associated with diabetes.

    And what about Alzheimer's? Why was the observed excess 31% vs the predicted 46%? Is this also a shielding effect? Does the spring wave in NY provide some evidence here? Why/how would Alzheimer's patients be shielded? In any case, this needs to be discussed and currently, it is not.

  3. Reviewer #2 (Public Review):

    The article is very well written, and the approach is quite novel. I have two major methodological comments, that if addressed will add to the robustness of the results.

    (1) Model for estimating expected mortality. There is a large literature using a different model to predict expected mortality during the pandemic. Different models come with different caveats, see the example of the WHO estimates in Germany and the performance of splines (Msemburi et al Nature 2023 and Ferenci BMC Medical Research Methodology 2023). In addition, it is a common practice to include covariates to help the predictions (e.g., temperature and national holidays, see Kontis et al Nature Medicine 2020). Last, fitting the model-independent for each region, neglects potential correlation patterns in the neighbouring regions, see Blangiardo et al 2020 PlosONE.

    Based on the above:
    a. I believe that the authors need to run a cross-validation to justify model performance. I would suggest training the data leaving out the last year for which they have mortality and assessing how the model predicts forward. Important metrics for the prediction performance include mean square error and coverage probability, see Konstantinoudis et al Nature Communications 2023. The authors need to provide metrics for all regions and health outcomes.

    b. In the context of validating the estimates, I think the authors need to carefully address the Alzheimer case, see Figure 2. It seems that the long-term trends pick an inverse U-shape relationship which could be an overfit. In general, polynomials tend to overfit (in this case the authors use a polynomial of second degree). It would be interesting to see how the results change if they also include a cubic term in a sensitivity analysis.

    c. The authors can help with the predictions using temperature and national holidays, but if they show in the cross-validation that the model performs adequately, this would be fine.

    d. It would be nice to see a model across the US, accounting for geography and spatial correlation. If the authors don't want to fit conditional autoregressive models in the Bayesian framework, they could just use a random intercept per region.

    (2) I think the demographic model needs further elaboration. It would be nice to show more details, the mathematical formula of this model in the supplement, and explain the assumptions.