TACO: TabPFN Augmented Causal Outcomes for Early Detection of Long COVID
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Long COVID affects 10-40% of COVID-19 survivors, yet early detection remains challenging. We present TACO (TabPFN Augmented Causal Outcomes), a framework that uniquely combines causal inference with foundation models for presymptomatic Long COVID detection. TACO employs Differential Causal Effect (DCE) analysis to identify causally relevant genes, then utilizes TabPFN, a foundation model that does not require hyperparameter adjustment, to achieve consistent performance. In comprehensive benchmarking, TACO achieved superior precision using 18% fewer features than conventional approaches. Critically, TACO maintains consistent performance without any hyperparameter optimization, while benchmark models show variable results depending on the tuning. The causal genes of the framework provide biological interpretability, with 23.6% validated in the Long COVID literature (4.72-fold enrichment, p = 5.04 × 10 −39 ), including regulators of viral entry ( AR, TMPRSS2 ), immune response ( TP53, CDKN1A ), and tissue remodeling ( SMAD2/3 ). By prioritizing causal mechanisms over statistical associations and eliminating the need for hyperparameter search, TACO offers a practical, interpretable solution for clinical deployment, transforming Long COVID management from reactive diagnosis to proactive prevention.