GRASS-NB: Group-structured variable selection for spatial negative binomial data with applications to cancer registry and spatial omics

Chloe Mattila
Brian Neelon
Kalyani Sonawane
Sha Cao
Peggi Angel
Elizabeth Hill
Souvik Seal

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Spatially structured, overdispersed count data with high-dimensional predictors are increasingly observed across studies from population-level epidemiology to cellular-level spatial omics. Feature selection is critical to identify influential predictors, such as key risk factors or biomarkers. Few Bayesian studies have assessed negative binomial regression (NBR) models with standard variable selection priors, like the mixture spike-and-slab (SS) or continuous horseshoe (HS), but mostly under aspatial settings. Features often form groups; for instance, in population surveys, caloric intake and physical activity may fall under “Diet & Exercise”, while cigarette use and smoking laws belong to “Smoking”. We propose a flexible NBR model that accommodates spatial autocorrelation and introduces a novel group-structured prior by hybridizing SS and HS shrinkage. The model’s performance with different priors is evaluated in terms of specificity, precision, and computational cost under challenging scenarios, including “large p , small n ” cases. We further apply the model to CDC state-level cancer data, comprising demographic, screening, and behavioral covariates, to identify key drivers and population-level risk factors, and to a melanoma spatial omics dataset for predictive modeling expression of gene. An efficient R package is provided on GitHub .

Version published to 10.1101/2025.10.24.684446 on bioRxiv
Oct 25, 2025

Climate, Spatial Clustering and Hotspots of Non-Communicable Disease Mortality in Sub-Saharan Africa: A Bayesian Spatial Epidemiology Study, 2000–2019

This article has 2 authors:
1. Tsikai Solomon Chinembiri¹
2. Godfrey Pachavo
This article has no evaluationsLatest version Dec 18, 2025
A Robust Improved GTWR Framework for Spatiotemporal Heterogeneity and Outlier Effects: Evidence from Simulation and Applied Case Studies

This article has 3 authors:
1. Luri Zahara
2. I GEDE NYOMAN MINDRA JAYA
3. DEFI YUSTI FAIDAH
This article has no evaluationsLatest version Jan 13, 2026
Bayesian Hierarchical Spatiotemporal Models Outperform Spatial Autoregressive Models in Predictive Stability: Evidence from Mississippi County-Level Premature Mortality, 2015–2025 County Health Rankings Releases

This article has 3 authors:
1. Jae Eun Lee
2. JungHye Sung
3. Ji-Young Lee
This article has no evaluationsLatest version Jan 26, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Climate, Spatial Clustering and Hotspots of Non-Communicable Disease Mortality in Sub-Saharan Africa: A Bayesian Spatial Epidemiology Study, 2000–2019

A Robust Improved GTWR Framework for Spatiotemporal Heterogeneity and Outlier Effects: Evidence from Simulation and Applied Case Studies

Bayesian Hierarchical Spatiotemporal Models Outperform Spatial Autoregressive Models in Predictive Stability: Evidence from Mississippi County-Level Premature Mortality, 2015–2025 County Health Rankings Releases