BABAPPAΩ: Diagnosing the Identifiability of Episodic Selection under Branch–Site Evolution Using Likelihood-Free Neural Inference

Krishnendu Sinha

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Episodic positive selection acting on specific evolutionary lineages is a longstanding yet intrinsically difficult target of molecular inference. Classical branch–site methods formulate this problem as hypothesis testing under explicit codon substitution models, implicitly assuming that episodic selection is statistically identifiable from finite alignments. Under biologically realistic conditions—including recombination, epistasis, transient fitness shifts, and alignment uncertainty—this assumption may fail, leading to unstable or uninterpretable results. BABAPPAΩ reframes branch–site analysis as a problem of statistical measurement rather than binary detection. Instead of estimating dN/dS or conducting likelihood ratio tests, the method produces continuous, scale-preserving summaries that quantify the measurability of lineage-specific evolutionary deviation under observed data conditions. Inference is likelihood-free and performed using a frozen neural model trained on forward-time mutation– selection simulations, without estimating substitution rates or codon model parameters. Simulation-based calibration shows that under strict neutrality (ω = 1), outputs remain diffuse, bounded, and structurally uninformative across phylogenies ranging from 8 to 64 taxa, with decreasing variance and no reproducible high-ranking branches or sites. In addition, a tree-conditional Monte Carlo calibration procedure provides a gene-level Episodic Identifiability Index (EII), standardized relative to neutral expectations and accompanied by an empirical p-value. Imposed episodic structure produces monotonic but saturating responses, consistent with continuous measurement rather than threshold behavior. Permutation tests eliminate inferred structure, whereas bootstrap and taxon jackknife analyses demonstrate stability under realistic perturbations. These results establish BABAPPAΩ as a conservative diagnostic framework for assessing when episodic selection is statistically resolvable, at what scale, and with what uncertainty, complementing rather than replacing likelihood-based branch–site methods.

Version published to 10.32942/x2r073
Feb 24, 2026

PSMC-FAC: A Statistical Framework for Correcting Loss of Heterozygosity in Low-Coverage Genomic Demographic Inference

This article has 5 authors:
1. Francisco Iglesias-Santos
2. Alba Nieto
3. Sònia Casillas
4. Antonio Barbadilla
5. Carlos Sarabia
This article has no evaluationsLatest version Mar 9, 2026
R-package agentBayes: likelihood-based statistical methods for agent-based models

This article has 8 authors:
1. Niklas Moser
2. Dmitri Finkelshtein
3. Georgy Chargaziya
4. Stephen Cornell
5. Sara Hamis
6. Jacob Scott
7. Dagim Tadele
8. Otso Ovaskainen
This article has no evaluationsLatest version Mar 20, 2026
Learning Neural Evolution Operators: From Decoding to Identifiable Causal State-Space Models

This article has 1 author:
1. Armin Hakkak Moghadam Torbati
This article has no evaluationsLatest version Mar 5, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

PSMC-FAC: A Statistical Framework for Correcting Loss of Heterozygosity in Low-Coverage Genomic Demographic Inference

R-package agentBayes: likelihood-based statistical methods for agent-based models

Learning Neural Evolution Operators: From Decoding to Identifiable Causal State-Space Models