From video to behaviour: an LSTM-based approach for automated nest behaviour recognition in the wild
Abstract
Studies of animal behaviour usually rely on direct observation or manual annotation of video recordings. However, these methods are time-consuming and error-prone, often leading to sub-optimal sample sizes. Recent advances in deep learning show great potential to overcome these limitations; nevertheless, most currently available behavioural-recognition solutions remain focused on captive settings.
Here, we present a deployment-focused framework to guide researchers in building behavioural recognition systems from video data, using Long Short-Term Memory (LSTM) networks to classify behavioural sequences across consecutive frames.
LSTMs allowed us to: 1) monitor nest activity by detecting the birds’ presence while simultaneously classifying the trajectory type, i.e., nest-chamber entrance or exit; and 2) identify the behaviour performed: building, aggression, or sanitation. Using our framework, we outperformed human annotators when error and speed were considered jointly. Model performance improved with challenging training instances and remained robust even with modest sample sizes. The LSTM also outperformed YOLO (“You Only Look Once”), highlighting the critical role of temporal sequence information in behavioural analysis.
We demonstrate that our approach is replicable across three bird species and applicable to deployment videos, highlighting its value as a generalizable and transferable tool for long-term studies in the wild.
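To make the core idea concrete, the sketch below shows how an LSTM classifies a behavioural sequence from per-frame feature vectors, in contrast to frame-by-frame detectors such as YOLO. This is a minimal NumPy illustration only: the feature dimension, hidden size, random weights, and function names (`lstm_step`, `classify_sequence`) are assumptions for demonstration and do not reproduce the paper's trained models.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    """One LSTM time step; gate weights are stacked as [input; forget; cell; output]."""
    z = W @ x + U @ h + b
    H = h.size
    i = sigmoid(z[:H])          # input gate
    f = sigmoid(z[H:2*H])       # forget gate
    g = np.tanh(z[2*H:3*H])     # candidate cell state
    o = sigmoid(z[3*H:])        # output gate
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)
    return h_new, c_new

def classify_sequence(frames, W, U, b, W_out, b_out):
    """Run the LSTM over consecutive frame features; classify from the final hidden state."""
    H = b.size // 4
    h, c = np.zeros(H), np.zeros(H)
    for x in frames:            # temporal information accumulates across frames
        h, c = lstm_step(x, h, c, W, U, b)
    logits = W_out @ h + b_out
    e = np.exp(logits - logits.max())
    return e / e.sum()          # softmax over, e.g., building / aggression / sanitation

# Hypothetical dimensions and random weights, for illustration only.
rng = np.random.default_rng(0)
D, H, C = 8, 16, 3              # per-frame feature dim, hidden size, 3 behaviour classes
W = rng.normal(0, 0.1, (4*H, D))
U = rng.normal(0, 0.1, (4*H, H))
b = np.zeros(4*H)
W_out = rng.normal(0, 0.1, (C, H))
b_out = np.zeros(C)

frames = rng.normal(size=(30, D))   # 30 consecutive frames of extracted features
probs = classify_sequence(frames, W, U, b, W_out, b_out)
print(probs.shape, float(probs.sum()))
```

Because the hidden state carries information forward across frames, the final classification can depend on motion patterns (e.g., a trajectory into or out of the nest chamber) that no single frame reveals.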
DATA AVAILABILITY
Scripts, models, and data required to reproduce this work are available on Zenodo (DOIs: 10.5281/zenodo.18681623 and 10.5281/zenodo.18695178).