LVC2-DViT: Landview Creation for Landview Classification

Kai Wang
Siyi Chen
Weicong Pang
Ziru Chen
Cheng Li

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Remote sensing land-cover classification is impeded by limited annotated data and pronounced geometric distortion, hindering its value for environmental monitoring and land planning. We introduce LVC2‑DViT (Landview Creation for Landview Classification with Deformable Vision Transformer), an end‑to‑end framework evaluated on five Aerial Image Dataset (AID) scene types, including Beach, Bridge, Pond, Port and River. LVC2‑DViT fuses two modules: (i) a data creation pipeline that converts ChatGPT-4o-generated textual scene descriptions into class‑balanced, high-fidelity images via Stable Diffusion, and (ii) DViT, a deformation‑aware Vision Transformer dedicated to land‑use classification whose adaptive receptive fields more faithfully model irregular landform geometries. Without increasing model size, LVC2‑DViT improves Overall Accuracy by 2.13 percentage points and Cohen’s Kappa by 2.66 percentage points over a strong vanilla ViT baseline, and also surpasses FlashAttention variant. These results confirm the effectiveness of combining generative augmentation with deformable attention for robust land‑use mapping. The project is available at here.

Version published to 10.20944/preprints202507.1001.v1
Jul 11, 2025

LGD-DeepLabV3+: An Enhanced Framework for Remote Sensing Semantic Segmentation via Multi-Level Feature Fusion and Global Modeling

This article has 5 authors:
1. Xin Wang
2. Xu Liu
3. Adnan Mahmood
4. Yaxin Yang
5. Xipeng Li
This article has no evaluationsLatest version Jan 21, 2026
An effective framework for accurate semantic segmentation of high-resolution remote sensing images.

This article has 6 authors:
1. Wambugu Naftaly
2. Ruisheng Wang
3. Abubakar Sani-Mohammed
4. Bo Guo
5. Xinchang Zhang
6. Zhijun Wang
This article has no evaluationsLatest version Jan 20, 2026
Research on improved SegFormer with multi-module fusion for landslide remote sensing image recognition

This article has 8 authors:
1. Minghua Luo
2. Canming Yuan
3. Rui Ma
4. Bibo Dai
5. Jinxin Huang
6. Xin Pan
7. Xu Wu
8. Zhixin Zhang
This article has no evaluationsLatest version Feb 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

LGD-DeepLabV3+: An Enhanced Framework for Remote Sensing Semantic Segmentation via Multi-Level Feature Fusion and Global Modeling

An effective framework for accurate semantic segmentation of high-resolution remote sensing images.

Research on improved SegFormer with multi-module fusion for landslide remote sensing image recognition