DLC2Action: A Deep Learning-based Toolbox for Automated Behavior Segmentation
Abstract
While expert biologists can annotate complex behaviors from video data, the process remains tedious and time-consuming, creating a bottleneck for efficient behavioral analysis. Here, we present DLC2Action, an open-source Python toolbox that enables automatic behavior annotation from video or estimated 2D/3D pose tracking data. DLC2Action integrates multiple state-of-the-art deep learning architectures optimized for action segmentation and supports self-supervised learning (SSL) to leverage unlabeled data, boosting performance with limited labeled datasets. Its robust implementation enables efficient hyperparameter optimization, customizable feature extraction, and flexible data handling. We also standardized eight benchmarks and evaluated DLC2Action on five animal behavior datasets, comprising common behavioral tests in neuroscience, and four human datasets. Overall, these datasets span a wide range of contexts, from standard laboratory studies to naturalistic cooking. DLC2Action achieved strong performance across these benchmarks. To further showcase the tool's versatility, we applied it to Atari gameplay data and found that in certain games players' eye movements consistently predict their button presses across different subjects. Furthermore, DLC2Action features an intuitive graphical user interface (GUI) for streamlined behavior annotation, active learning, and assessment of model predictions. Diverse pose, video, and annotation formats are supported. Lastly, DLC2Action is modular and designed for extensibility, allowing users to integrate new models, dataset features, and methods. The code and benchmarks are available at: https://github.com/amathislab/DLC2action