TACO: TabPFN Augmented Causal Outcomes for Early Detection of Long COVID

Sindy Piñero
Xiaomei Li
Lin Liu
Jiuyong Li
Sang Hong Lee
Marnie Winter
Thin Nguyen
Junpeng Zhang
Thuc Duy Le

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Long COVID affects 10-40% of COVID-19 survivors, yet early detection remains challenging. We present TACO (TabPFN Augmented Causal Outcomes), a framework that uniquely combines causal inference with foundation models for presymptomatic Long COVID detection. TACO employs Differential Causal Effect (DCE) analysis to identify causally relevant genes, then utilizes TabPFN, a foundation model that does not require hyperparameter adjustment, to achieve consistent performance. In comprehensive benchmarking, TACO achieved superior precision using 18% fewer features than conventional approaches. Critically, TACO maintains consistent performance without any hyperparameter optimization, while benchmark models show variable results depending on the tuning. The causal genes of the framework provide biological interpretability, with 23.6% validated in the Long COVID literature (4.72-fold enrichment, p = 5.04 × 10 ⁻³⁹ ), including regulators of viral entry ( AR, TMPRSS2 ), immune response ( TP53, CDKN1A ), and tissue remodeling ( SMAD2/3 ). By prioritizing causal mechanisms over statistical associations and eliminating the need for hyperparameter search, TACO offers a practical, interpretable solution for clinical deployment, transforming Long COVID management from reactive diagnosis to proactive prevention.

Version published to 10.1101/2025.10.02.25337138 on medRxiv
Oct 5, 2025

Screen-VarCal: An Interpretable Probabilistic Framework for Recalibrating ACMG Rule-Based Variant Classification in Preventive Medicine

This article has 5 authors:
1. Divya Mishra
2. Alok Tiwari
3. Shivani Srivastava
4. Minal Tripathi
5. Anmol Kapoor
This article has no evaluationsLatest version Sep 25, 2025
CoxMDS: Multiple Data Splitting for High-dimensional Mediation Analysis with Survival Outcomes in Epigenome-wide Studies

This article has 13 authors:
1. Minhao Yao
2. Peixin Tian
3. Xihao Li
4. Shijia Bian
5. Gao Wang
6. Yian Gu
7. Ana Navas-Acien
8. Badri N. Vardarajan
9. Daniel W. Belsky
10. Gary W. Miller
11. Andrea A. Baccarelli
12. Zhonghua Liu
13. the Alzheimer’s Disease Neuroimaging Initiative
This article has no evaluationsLatest version Oct 13, 2025
A Bayesian Informative Shrinkage Approach for Large-scale Multiple Hypothesis Testing (BISHOT): with Applications in Differential Analysis of Omics Data

This article has 3 authors:
1. Ya Su
2. Mary Eunice Joy Z. Clark
3. Chi Wang
This article has no evaluationsLatest version Sep 16, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Screen-VarCal: An Interpretable Probabilistic Framework for Recalibrating ACMG Rule-Based Variant Classification in Preventive Medicine

CoxMDS: Multiple Data Splitting for High-dimensional Mediation Analysis with Survival Outcomes in Epigenome-wide Studies

A Bayesian Informative Shrinkage Approach for Large-scale Multiple Hypothesis Testing (BISHOT): with Applications in Differential Analysis of Omics Data