Multi-Domain Counterfactual Causal Graphs for Spurious Pathway Detection and Functional Risk Estimation

Emma L. Prescott
Yichen Zhou
Marcus D. Houghton
Rania El-Masri

Abstract

Deep neural networks (DNNs) are prone to exploiting spurious correlations, especially when trained on multi-source datasets, where pseudo-causal paths can form across domains and interfere with generalization. This work introduces a method for detecting such paths and quantifying their influence through a counterfactual causal graph framework. We assemble cross-domain causal graphs from datasets such as DomainNet (six domains, 18,500 samples) and OfficeHome (four domains, 17,000 samples), incorporating both statistical correlations (based on Pearson coefficients) and expert-defined priors. A causal reasoning model is built on top of a ResNet-18 backbone and trained with a learning rate of 0.0008 and a batch size of 64. After 40 epochs, the model achieves an AUC of 0.92 in distinguishing true from pseudo-causal signals. To quantify shortcut interference, we propose the Functional Shortcut Risk Index (FSRI), which combines path scoring and intervention-based accuracy gain, weighted at 0.65 and 0.35, respectively. Using FSRI to guide path weight adjustment improves Top-1 transfer consistency in target domains from 65.2% to 83.5%. On average, accuracy on DomainNet and OfficeHome increases by 13.8% and 15.6%, respectively, with statistical significance confirmed by paired t-tests (p < 0.001). Further analysis of layer-wise activations shows that when true causal paths are used, activation in the fourth residual block and the final classifier reaches 0.82 and 0.88, respectively, while pseudo-paths yield much lower values (0.28 and 0.36). These results highlight the potential of causal graph diagnostics for mitigating shortcut learning and improving model robustness across domains.
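
The abstract gives the two FSRI weights (0.65 for path scoring, 0.35 for intervention-based accuracy gain) but not the exact formula. The sketch below is one plausible reading, not the authors' implementation: it assumes FSRI is a plain weighted sum and that both inputs are already normalized to [0, 1]. The names `PathEvidence`, `fsri`, and `rank_by_risk` are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass

# Weights as stated in the abstract.
PATH_SCORE_WEIGHT = 0.65
INTERVENTION_GAIN_WEIGHT = 0.35


@dataclass
class PathEvidence:
    """Hypothetical container for the two quantities FSRI combines."""
    name: str
    path_score: float         # assumed to be normalized to [0, 1]
    intervention_gain: float  # assumed accuracy gain after intervention, in [0, 1]


def fsri(evidence: PathEvidence) -> float:
    """Assumed form of FSRI: a weighted sum of the two components."""
    for value in (evidence.path_score, evidence.intervention_gain):
        if not 0.0 <= value <= 1.0:
            raise ValueError("inputs are assumed to be normalized to [0, 1]")
    return (PATH_SCORE_WEIGHT * evidence.path_score
            + INTERVENTION_GAIN_WEIGHT * evidence.intervention_gain)


def rank_by_risk(paths):
    """Return paths sorted from highest to lowest assumed FSRI."""
    return sorted(paths, key=fsri, reverse=True)


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the paper.
    candidates = [
        PathEvidence("background -> label", path_score=0.8, intervention_gain=0.2),
        PathEvidence("shape -> label", path_score=0.1, intervention_gain=0.05),
    ]
    for path in rank_by_risk(candidates):
        print(f"{path.name}: FSRI = {fsri(path):.3f}")
```

Under this reading, a path with a high score that also yields a large accuracy gain when intervened on ranks as the riskiest shortcut. How the score is then used to adjust path weights is not specified in the abstract and is left out here.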

Version published to 10.21203/rs.3.rs-7455888/v1 on Research Square
Aug 26, 2025

DeepCEF: A Deep Causal Estimation Framework for Complex Biological Systems Integrating Local Scores, Independence Tests, and Relation Attributes

This article has 3 authors:
1. Zhenjiang Fan
2. Mengrui Zhang
3. Summer Han
This article has no evaluations. Latest version: Oct 11, 2025

Causal Attention Graph Knowledge Tracing

This article has 3 authors:
1. Mengran Tian
2. Zhihao Wang
3. Xiaohui Zhao
This article has no evaluations. Latest version: Oct 8, 2025

Interpretable gene network inference with nonlinear causality

This article has 2 authors:
1. Madison S. Krieger
2. William Gilpin
This article has no evaluations. Latest version: Sep 29, 2025
