Few-shot Remote Sensing Scene Image Classification Method Based On Cross-Scale Efficient Hybrid Encoder

Di Wu
ShengLong Gan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The field of few-shot remote sensing image classification often suffer from heavy data dependency and neglected inter / intra class relationships due to limited samples, leading to sub-optimal accuracy in existing methods. To address this critical issue, in this paper, we propose a novel framework based on a novel cross-scale efficient hybrid encoder and an adaptive complementary metric distance classifier, aiming to enhance the model's feature extraction capabilities and its ability to differentiate between inter-class and intra-class relationships. First, we propose a novel Cross-Scale Efficient Hybrid Encoder (EHE) transforms multi-scale features into image feature sequences through intra-scale interaction and cross-scale fusion, efficiently integrating fine-grained geometric details with high-level semantics into unified discriminative representations. .Secondly, a dynamic adaptive complementary metric distance classifier (ACMDC) is designed, which utilizing an improved adaptive cosine classifier for query matching and then updating the adaptive activation function, while the Euclidean distance and cosine similarity are used as complementary distance metrics, This adaptive fusion optimizes metric space by enforcing tighter intra-class clusters and separable inter-class boundaries.Experimental results on the NWPU-RESISC45, WHU-RS19, and UC Merced remote sensing datasets demonstrate that our method achieves accuracies of 72.45%, 83.69%, and 61.57% respectively under the 5-way 1-shot setting, outperforming the second-best model by 0.32%, 1.50%, and 1.39%. In the 5-way 5-shot configuration, our approach yields accuracy improvements of 2.57%, 0.86%, and 2.12% over the sub-optimal model. These findings confirm that the proposed method enables models to learn richer intra-class and inter-class relationships, significantly enhancing the discriminative capability of few-shot remote sensing scene classification models.

Version published to 10.21203/rs.3.rs-8528895/v1 on Research Square
Mar 9, 2026

Few-Shot Remote Sensing Scene Classification Based on Diffusion Augmentation and Multimodal Feature Fusion

This article has 3 authors:
1. Zhou Yang
2. Siming Han
3. Ming Wu
This article has no evaluationsLatest version Mar 5, 2026
Phase Congruency-Guided Cross-Scale Contextual Fusion Network for Salient Object Detection in Optical Remote Sensing Images

This article has 6 authors:
1. Junfang Jiang
2. Wanjin Wang
3. Xiaohui Lin
4. Pingping Miao
5. Lina Gao
6. Mingzhu Xu
This article has no evaluationsLatest version Mar 27, 2026
Vision Transformer-based Change Detection in optical and SAR Remote Sensing Images

This article has 1 author:
1. Emna Brahim
This article has no evaluationsLatest version Apr 15, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Few-Shot Remote Sensing Scene Classification Based on Diffusion Augmentation and Multimodal Feature Fusion

Phase Congruency-Guided Cross-Scale Contextual Fusion Network for Salient Object Detection in Optical Remote Sensing Images

Vision Transformer-based Change Detection in optical and SAR Remote Sensing Images