CVNet: Lightweight Cross-View Vehicle ReID with Multi-Scale Localization
Abstract
Cross-view vehicle re-identification (ReID) between aerial and ground perspectives is challenging because of significant scale variations and the limited computational resources of edge devices. We propose CVNet, a lightweight network with two key modules: the multi-scale localization (MSL) module and the deep–shallow filtrate collaboration (DFC) module. The MSL module employs multi-scale depthwise separable convolutions and a localization attention mechanism to extract multi-scale features and localize salient regions, addressing viewpoint variations. The DFC module uses a dual-branch design with deep and shallow branches, integrating a filtration module optimized via neural architecture search, a collaboration module, and lightweight convolutions. This design captures both unique and shared cross-view features, yielding an efficient and robust feature representation. We also release CVPair v1.0, the first benchmark dataset for cross-view ReID, which contains 14,969 images of 894 vehicle identities and comes with baseline results for traditional and lightweight methods. CVNet achieves state-of-the-art performance on CVPair v1.0, VehicleID, and VeRi776. Code, dataset, and models will be released publicly.
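To illustrate why multi-scale depthwise separable convolutions suit a lightweight design, the following minimal sketch compares weight counts for standard and depthwise separable convolutions across several kernel sizes. It is not CVNet's implementation: the kernel sizes (3, 5, 7), the channel widths (64 in, 64 out), and all function names are assumptions chosen for illustration, and bias terms are ignored.

```python
# Illustrative parameter-count comparison; not the authors' code.
# Kernel sizes and channel widths below are assumed, not taken from the paper.

def standard_conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in, c_out, k):
    """Weights in a depthwise k x k conv followed by a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


def multi_scale_params(c_in, c_out, kernel_sizes, param_fn):
    """Total weights for parallel branches, one branch per kernel size."""
    return sum(param_fn(c_in, c_out, k) for k in kernel_sizes)


KERNEL_SIZES = (3, 5, 7)  # hypothetical multi-scale branches
C_IN, C_OUT = 64, 64      # hypothetical channel widths

standard_total = multi_scale_params(
    C_IN, C_OUT, KERNEL_SIZES, standard_conv_params
)
separable_total = multi_scale_params(
    C_IN, C_OUT, KERNEL_SIZES, depthwise_separable_params
)

print(f"standard branches:  {standard_total} weights")
print(f"separable branches: {separable_total} weights")
print(f"reduction factor:   {standard_total / separable_total:.1f}x")
```

Under these assumed settings the separable branches need roughly a nineteenth of the weights. The saving grows with kernel size, because the depthwise cost scales with k² · C_in rather than k² · C_in · C_out.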