On the utility of Deep Learning for model classification and parameter estimation on complex diversification scenarios

P.G. Peña
G. Iglesias
E. Talavera
AS. Meseguer
I. Sanmartín

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (Arcadia Science)

Abstract

Birth-Death models applied to dated phylogenies are a useful tool to study past diversification dynamics. Parameters in these stochastic models are typically inferred using likelihood-based methods such as Maximum Likelihood Estimation (MLE) or Bayesian Inference. However, these approaches exhibit computational tractability issues in the case of models of moderate to high complexity. One approach to increase model complexity while remaining computationally tractable in the context of birth-death modelling is machine learning. So far, these techniques have been explored in the context of serially-sampled phylogenies (phylodynamics) and trait-dependent birth-death models. Here, we explored the power of Convolutional Neural Networks (CNNs), a type of Deep Learning (DL) method, to solve classification and regression (parameter estimation) tasks under constant-rate and time-homogeneous, rate-variable birth-death models. In particular, we compared six diversification scenarios: Constant Birth-Death, High-Extinction, Mass-Extinction, Diversity-Dependent, Stasis-and-Radiate, and Waxing-and-Waning. We simulated 10, 000 phylogenetic trees under each diversification scenario, which were encoded using a vectorization procedure that captures the topology and branch length information. The encoded trees were used to train or test a set of CNNs models that were designed to tailor three empirical case studies differing in the number of tips. We compared CNNs performance with MLE inference. Our results show that CNNs exhibited classification accuracy levels of 93-78%, whereas maximum likelihood estimation achieved levels of 74-70%. The most difficult scenarios to predict for the CNNs were the high-extinction and mass-extinction scenarios, which were often misidentified as one another. For the regression tasks, mean average errors were comparable between CNNs models and MLE inference, and they also coincided in their difficulty estimating ratio parameters such as mass extinction survival and turnover. Finally, we applied our CNNs to three empirical studies (eucalypts, conifers and cetaceans) and discussed potential shortcomings and future avenues for improvement in the application of deep-learning birth–death modelling approaches.

Arcadia Science
Sep 5, 2025

A tentative first step was taken by Lajaaiti et al. (2023), who combined a graph-based representation of phylogenetic trees with Graphical Neural Networks (GNN). This combination, however, resulted in a poor performance due to “over-smoothing” and “hop neighbourhood” problems.

But see Leroy et al., 2025 (https://doi.org/10.1101/2025.08.14.670341) that addresses this issue using an improved pooling operator. However, as they discuss in their preprint, the performance they achieve (exceeding that of the MLE) still likely does not represent a ceiling on their performance here, as the architecture is quite simple. Use of more sophisticated graph-based architectures including graph transformers (which combat oversmoothing and can more readily account for both local and global patterns) will likely increase this performance further.

Read the original source
Version published to 10.1101/2025.08.27.671290 on bioRxiv
Aug 27, 2025

Graph Neural Networks for Likelihood-Free Inference in Diversification Models

This article has 8 authors:
1. Amélie Leroy
2. Ismaël Lajaaiti
3. Sophia Lambert
4. Jakub Voznica
5. Maximilian Pichler
6. Hélène Morlon
7. Florian Hartig
8. Laurent Jacob
Reviewed by Arcadia Science

This article has 8 evaluationsAppears in 1 listLatest version Aug 19, 2025Latest activity Aug 29, 2025
Deep Learning Models for Financial Data Analysis: A Focused Review of Recent Advances

This article has 3 authors:
1. Jackson Duane
2. Alicia Ren
3. Wei Zhang
This article has no evaluationsLatest version Jul 23, 2025
Evaluating the Effectiveness of Parameter-Efficient Fine-Tuning in Genomic Classification Tasks

This article has 6 authors:
1. Daniel Berman
2. Daniel Jimenez
3. Stanley Ta
4. Brian Merritt
5. Jeremy Ratcliff
6. Vijay Narayan
This article has no evaluationsLatest version Aug 26, 2025

This article has been Reviewed by the following groups

Listed in

Abstract

Article activity feed

Related articles

Graph Neural Networks for Likelihood-Free Inference in Diversification Models

Deep Learning Models for Financial Data Analysis: A Focused Review of Recent Advances

Evaluating the Effectiveness of Parameter-Efficient Fine-Tuning in Genomic Classification Tasks