Advancing Wildlife Image Analysis: A Graph Attention Contrastive Learning Approach for Region-Specific Mammal Classification
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
1. Camera traps have become a cornerstone of wildlife ecological research, yet the manual analysis of the millions of images they generate requires substantial time and resources. Deep learning-based automation has emerged as a promising solution, existing global general-purpose models exhibit limitations in precisely recognizing local endemic species and adapting to unique local ecosystems.
2. This study developed a high-performance classification model optimized for native species. A large-scale “Korean Wildlife Dataset” was constructed from data collected across diverse domestic habitats, and a novel architecture was proposed to overcome limitations of conventional CNNs. The proposed Graph Attention Contrastive Learning (GACL) model is structured as a two-stage pipeline. Stage one employs YOLOv5 and MegaDetector to detect animals, humans, and vehicles, filtering valid images. Stage two performs fine-grained species classification. GACL captures structural relationships among object parts using a Graph Attention Transformer (GAT) and aligns semantic correspondence between images and textual descriptions via Parallel Contrastive Learning, enabling deeper understanding beyond simple visual features.
3. Evaluation on an independent test set demonstrated that the proposed model robust classification performance with an overall accuracy of 96.83% across four classes (Wildboar, Goral, Deers, and Other). Notably, in a comparative analysis against a global general-purpose model, our model showed distinct advantages in the precise recognition of endemic species. Furthermore, it exhibited a lower false positive rate in identifying animals in empty images, confirming its potential to enhance the efficiency of the data cleaning process.
4. Beyond technical accuracy, this study highlights that ’region-specific AI models’ that reflect local ecological characteristics can provide substantial practical value for wildlife monitoring and biodiversity conservation. Future work will require continuous efforts in data diversification and model lightweighting to further improve model robustness and practicality.