FlowTransOP: Distributional Translation of Omics Signatures via Constrained Deep Flow Matching
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Observations from pre-clinical models rarely generalize to human patients, leading to many failures in clinical trials. Most existing methods cannot handle domains with non-overlapping features and no paired samples. Here, we developed FlowTransOP to translate biological observations across such domains without requiring 1-to-1 feature mappings and paired data, while providing a guideline for model selection across four translational regimes. We use flow matching to align full domain distributions in a pre-aligned latent space, with a structural regularization term that keeps similar conditions proximate after transformation. FlowTransOP remains competitive with gold-standard approaches requiring paired samples, but outperforms them when pairs become scarce (<35 pairs) or when cross-domain features are only moderately correlated (r<=0.58). Overall, FlowTransOP can translate perturbations between pre-clinical models and patients when direct correspondences are unavailable, enabling reliable therapeutic inference. As a proof-of-concept, we trained a foundational mouse-human transcriptomic map on ARCHS4 and applied it to liver disease predictions.