Learning the syntax of plant assemblages

César Leblanc
Pierre Bonnet
Maximilien Servajean
Wilfried Thuiller
Milan Chytrý
Svetlana Aćić
Olivier Argagnon
Idoia Biurrun
Gianmaria Bonari
Helge Bruelheide
Juan Antonio Campos
Andraž Čarni
Renata Ćušterevska
Michele De Sanctis
Jürgen Dengler
Tetiana Dziuba
Emmanuel Garbolino
Ute Jandt
Florian Jansen
Jonathan Lenoir
Jesper Erenskjold Moeslund
Aaron Pérez-Haase
Remigiusz Pielech
Jozef Sibik
Zvjezdana Stančić
Domas Uogintas
Thomas Wohlgemuth
Alexis Joly

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

To address the urgent biodiversity crisis, it is crucial to understand the nature of plant assemblages. The distribution of plant species is shaped not only by their broad environmental requirements but also by micro-environmental conditions, dispersal limitations, and direct and indirect species interactions. While predicting species composition and habitat type is essential for conservation and restoration purposes, it remains challenging. In this study, we propose an approach inspired by advances in large language models to learn the ‘syntax’ of abundance-ordered plant species sequences in communities. Our method, which captures latent associations between species across diverse ecosystems, can be fine-tuned for diverse tasks. In particular, we show that our methodology is able to outperform other approaches to (1) predict species that might occur in an assemblage given the other listed species, despite being originally missing in the species list (16.53% higher accuracy in retrieving a plant species removed from an assemblage than co-occurrence matrices and 6.56% higher than neural networks), and (2) classify habitat types from species assemblages (5.54% higher accuracy in assigning a habitat type to an assemblage than expert system classifiers and 1.14% higher than tabular deep learning). The proposed application has a vocabulary that covers over 10,000 plant species from Europe and adjacent countries and provides a powerful methodology for improving biodiversity mapping, restoration and conservation biology. As ecologists begin to explore the use of artificial intelligence, such approaches open opportunities for rethinking how we model, monitor and understand nature.

Version published to 10.1038/s41477-025-02105-7
Oct 13, 2025
Version published to 10.21203/rs.3.rs-6304381/v1 on Research Square
Apr 7, 2025

The shifting balance between habitat and climate drivers of boreal biodiversity across space and taxa

This article has 19 authors:
1. Emy Guilbault
2. Laura Antão
3. Andrea Santangeli
4. Mirkka Jones
5. Janne Heliölä
6. Heikki Henttonen
7. Ida-Maria Huikkonen
8. Otso Huitu
9. Erkki KorpimÃÂ¤ki
10. Mikko Kuussaari
11. Aleksi Lehikoinen
12. Andreas Lindén
13. Hannu Pietiäinen
14. Juha Pöyry
15. Pasi Sihvonen
16. Anna-Liisa Laine
17. Tomas Roslin
18. Marjo Saastamoinen
19. Jarno Vanhatalo
This article has no evaluationsLatest version Jan 28, 2026
Scaling from Metawebs to Realised Webs: A Hierarchical Approach to Network Ecology

This article has 5 authors:
1. Tanya Strydom
2. Alexander Dunhill
3. Jennifer Dunne
4. Timothée Poisot
5. Andrew Beckerman
This article has no evaluationsLatest version Jan 21, 2026
Emergent functions in the chemodiversity landscape

This article has 28 authors:
1. Maximilian Hanusch
2. Thomas Dussarrat
3. Xue Xiao
4. Dominik Ziaja
5. Kruthika Sen Aragam
6. James Blande
7. Andrea Bräutigam
8. Nicole van Dam
9. Benjamin Delory
10. Selina Gaar
11. Marvin Hildebrandt
12. Ruth Jakobs
13. Robert Junker
14. Caroline Müller
15. Thomas Nägele
16. Moritz Popp
17. Riikka Rinnan
18. Jörg-Peter Schnitzler
19. Hannah Schneider
20. Judit Valeria Mendoza Servín
21. Anke Steppuhn
22. Dorothea Tholl
23. Yonca Seymen
24. Elikplim Setordjie
25. Sybille Unsicker
26. Sara Weirauch
27. Wolfgang Weisser
28. Robin Heinen
This article has no evaluationsLatest version Jan 23, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

The shifting balance between habitat and climate drivers of boreal biodiversity across space and taxa

Scaling from Metawebs to Realised Webs: A Hierarchical Approach to Network Ecology

Emergent functions in the chemodiversity landscape