Multi-view Learning for Camouflaged Object Detection with PVTv2

Pu Yan
Kang Ruan
Lili Wang
Yang Zhao
Xu Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Recently, with the continuous development in the field of camouflage object detection(COD), effectively separating objects highly similar to the background has become a focal point of research. Due to the high similarity between camouflage objects and backgrounds, traditional single visual branch often perform poorly in such scenarios. To address this issue, We propose a multi-view learning detection network based on the Pyramid Vision Transformer, named Multi-view Learning for Camouflaged Object Detection with PVTv2(MVLNet). By utilizing the information from RGB and noise views, our method can provide a more comprehensive description of the relationship between objects and backgrounds to improve the accuracy and robustness for COD. Inspired by human visual attention during observation, we design a Global Context Aggregation Module by using a U-shaped structure and progressively increasing dilation rates to simulate the human behavior of zooming in and out. Extensive experiments demonstrate that the proposed MVLNet outperforms 22 other representative models on three public datasets.

Version published to 10.21203/rs.3.rs-4520181/v1 on Research Square
Jun 17, 2024

SAC-YOLO: Efficient Multi-Scale Feature Fusion for Transmission Line Defect Detection

This article has 6 authors:
1. Haotian Yin
2. Fanghua Liu
3. Jiankang Yuan
4. Juntao Fan
5. Chaojie Xu
6. Ruibo Tan
This article has no evaluationsLatest version Apr 17, 2026
Multi-Scale Contextual Attention for Robust Crop and Pest Image Classification

This article has 4 authors:
1. Muhammad Majid
2. Hassan Tariq
3. Imran Mumtaz
4. Hanan Aljuaid
This article has no evaluationsLatest version Apr 28, 2026
An Interpretable 3D Bag-Of-Visual-Words Pipeline for Volumetric Microscopy Classification

This article has 4 authors:
1. Anna E. Pittman
2. Kirby R. Campbell
3. Christophe Laumonnerie
4. David J. Solecki
This article has no evaluationsLatest version Apr 22, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

SAC-YOLO: Efficient Multi-Scale Feature Fusion for Transmission Line Defect Detection

Multi-Scale Contextual Attention for Robust Crop and Pest Image Classification

An Interpretable 3D Bag-Of-Visual-Words Pipeline for Volumetric Microscopy Classification