A Novel Approach for Fish Species Classification Combining Convolutional Neural Networks and Vision Transformers
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Fish species classification plays an essential role in aquaculture management, marine biodiversity conservation and fisheries monitoring. Traditional methods rely heavily on manual identification, which is time-consuming, prone to human error and inefficient on a large scale. This paper proposes a new approach entitled DeepLIFT-ViT, which combines the Visual Geometry Group 16 (VGG16) and Vision Transformer (ViT) architectures to improve the accuracy and efficiency of image-based fish species classification. Unlike existing methods that rely solely on CNN-based or transformer-based models, our approach introduces a novel hybrid architecture that integrates interpretability-based saliency features with transformer-based attention mechanisms. The process of our approach begins with the VGG16 model pre-trained on the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) 2012 dataset, to extract deep visual features. The DeepLIFT interpretability technique is then used to generate heat maps highlighting salient areas of the image contributing to model predictions. These maps are then divided into patches. The patches are concatenated with those extracted from the original images to form combined vectors, which are fed into a ViT model containing an MLP (Multi-Layer Perceptron) head for final classification. The model was trained and evaluated on public datasets containing various fish species from different aquatic environments. Experimental results show that DeepLIFT-ViT outperforms existing state-of-the-art models in terms of classification accuracy, noise robustness and computational efficiency. With classification accuracy of up to 99%, this approach enhances the capabilities of automatic fish species recognition systems, offering a scalable solution for fisheries management and aquatic research.