Reconstruction of Postures Using Partial Body Information Through a Self-supervised Transformer in Mice

Abstract

The comprehensive interpretation of behavior from incomplete data represents a fundamental challenge in computational ethology. Here we present Masked Autoencoder for Transformer-based Estimation and Reconstruction (MATER), a self-supervised learning framework that extracts behaviorally relevant representations from unlabeled rodent pose data by reconstructing strategically masked body keypoints. This approach captures fundamental movement patterns without requiring extensive manual annotation, addressing common experimental challenges including occlusions during social interactions and tracking errors. We evaluate MATER across two-dimensional and three-dimensional rodent pose datasets, demonstrating its robustness under high levels of keypoint masking. The framework achieves high-fidelity reconstructions under these challenging conditions and produces latent representations that support accurate behavioral classification with minimal supervision. Our analyses further reveal that rodent movement exhibits intrinsic spatiotemporal structure, which can be computationally inferred without explicit labeling. Reconstruction performance is tightly linked to the temporal coherence of movement, highlighting the importance of temporal dynamics in behavioral representation. These findings reinforce the emerging view that animal behavior is hierarchically organized and governed by natural temporal dependencies. MATER offers a robust, scalable tool for neuroscientists seeking to analyze complex, naturalistic behaviors across diverse experimental contexts, ultimately advancing our understanding of behavioral architecture and its neural underpinnings.
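The abstract describes the core self-supervised objective: randomly mask a subset of body keypoints, pass the partial pose through a transformer, and train the model to reconstruct the hidden coordinates. The sketch below illustrates that general masked-autoencoding recipe in PyTorch with toy shapes and hyperparameters; it is an assumption-laden illustration of the technique, not the authors' MATER implementation (the model class, keypoint count, and masking ratio here are all hypothetical).

```python
# Illustrative masked-keypoint reconstruction, NOT the MATER codebase.
# All shapes/hyperparameters (8 keypoints, 2D, d_model=64, 50% masking)
# are toy assumptions for the sketch.
import torch
import torch.nn as nn

torch.manual_seed(0)

class MaskedPoseAutoencoder(nn.Module):
    def __init__(self, n_keypoints=8, dim=2, d_model=64, n_heads=4, n_layers=2):
        super().__init__()
        self.embed = nn.Linear(dim, d_model)                  # per-keypoint embedding
        self.mask_token = nn.Parameter(torch.zeros(d_model))  # learned "missing" token
        self.pos = nn.Parameter(torch.zeros(n_keypoints, d_model))  # keypoint identity
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, dim)                   # decode back to coordinates

    def forward(self, pose, mask):
        # pose: (B, K, dim) keypoint coordinates; mask: (B, K) bool, True = occluded
        x = self.embed(pose)
        # Replace embeddings of masked keypoints with the shared mask token
        x = torch.where(mask.unsqueeze(-1), self.mask_token.expand_as(x), x)
        x = x + self.pos
        return self.head(self.encoder(x))

# One self-supervised training step: the loss is computed only on masked keypoints,
# so the model must infer them from the visible body parts.
model = MaskedPoseAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
pose = torch.randn(16, 8, 2)            # a batch of synthetic 2D poses
mask = torch.rand(16, 8) < 0.5          # hide roughly half the keypoints
recon = model(pose, mask)
loss = ((recon - pose)[mask] ** 2).mean()
loss.backward()
opt.step()
```

Because supervision comes from the data itself (the hidden coordinates), no behavioral labels are needed, which is what allows the abstract's claim of learning from unlabeled pose data; in practice the same encoder's latent representations could then be reused for downstream behavioral classification.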
