CVNet: Lightweight Cross-View Vehicle ReID with Multi-Scale Localization

Wenji Yin
Baixuan Han
Yueping Peng
Hexiang Hao
Zecong Ye
Yu Shen
Yanjun Cai
Wenchao Kang

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Cross-view vehicle re-identification (ReID) between aerial and ground perspectives is challenging due to limited computational resources on edge devices and significant scale variations. We propose CVNet, a lightweight network with two key modules: the multi-scale localization (MSL) module and the deep–shallow filtrate collaboration (DFC) module. The MSL module employs multi-scale depthwise separable convolutions and a localization attention mechanism to extract multi-scale features and localize salient regions, addressing viewpoint variations. DFC employs a dual-branch design comprising deep and shallow branches, integrating a filtration module optimized via neural architecture search, a collaboration module, and lightweight convolutions. This design effectively captures both unique and shared cross-view features, ensuring efficient and robust feature representation. We also release a new CVPair v1.0 dataset, the first benchmark for cross-view ReID, containing 14,969 images of 894 vehicle identities, offering results of traditional and lightweight methods. CVNet achieves state-of-the-art performance on CVPair v1.0, VehicleID, and VeRi776, advancing cross-view vehicle ReID. Code, dataset, and models will be released publicly.

Version published to 10.20944/preprints202503.1718.v1
Mar 24, 2025

Refining Small Object Detection in Aerial Images with PF-DETR: A Progressive Fusion Approach

This article has 6 authors:
1. Jing Liu
2. Yanyan Cao
3. Chunyu Dong
4. Xin Zhang
5. Yong Liang
6. Pan Li
This article has no evaluationsLatest version Mar 17, 2025
Enhancing Cross-View Geo-Localization Through Global-Local Quadrant Interaction Network

This article has 4 authors:
1. Jin Xu
2. Junping Yin
3. Juan Zhang
4. Tianyan Gao
This article has no evaluationsLatest version Mar 17, 2025
Coarse-to-Fine Multi-View 3D Reconstruction with SLAM Optimization and Transformer-Based Matching

This article has 1 author:
1. Xiangqin Chen
This article has no evaluationsLatest version Apr 11, 2025

Listed in

Abstract

Article activity feed

Related articles

Refining Small Object Detection in Aerial Images with PF-DETR: A Progressive Fusion Approach

Enhancing Cross-View Geo-Localization Through Global-Local Quadrant Interaction Network

Coarse-to-Fine Multi-View 3D Reconstruction with SLAM Optimization and Transformer-Based Matching