Unveiling Population Heterogeneity in Health Risks Posed by Environmental Hazards Using Regression- Guided Neural Network

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

As environmental hazards become more frequent, it is critically important to understand their health impacts and identify individuals at disproportionately higher risk. Moderated Multiple Regression (MMR) provides a straightforward approach for investigating population heterogeneity by incorporating interaction terms between hazard exposure and population characteristics into a regression model. However, when vulnerabilities are embedded within complex, high-dimensional covariate spaces, MMR often fails to adequately model complex population heterogeneity. Here, we introduce a hybrid method, Regression-Guided Neural Networks (ReGNN), which integrates the flexibility of artificial neural networks (ANNs) within the structural form of a regression model. Briefly, ReGNN embeds an ANN inside a regression equation to generate a latent representation that nonlinearly combines potential sources of heterogeneity and moderates the effect of an environmental hazard. Because the outer layer maintains a regression structure, the interpretability of standard regression analysis is preserved. Through extensive simulation studies, we demonstrate ReGNN’s effectiveness in modeling complex heterogeneous effects. We further illustrate its utility by applying it to investigate population heterogeneity in the health impacts of air pollution (PM2.5) on cognitive functioning scores. By comparing ReGNN’s results with those from traditional MMR models, we show that ReGNN can uncover patterns of heterogeneity that would otherwise remain hidden.

Article activity feed