Lightweight Super-Resolution Reconstruction Architecture of Remote Sensing Images Using a Residual Hierarchical Transformer Network

Abstract

Remote sensing image super-resolution technology aims to enhance spatial details and is of great significance for the high-quality interpretation of satellite imagery. Recently, Transformer-based models have shown competitive performance in single image super-resolution (SISR). However, current Transformer-based SR approaches often employ window self-attention with fixed small window sizes, which limits the receptive field to a single scale and prevents the network from gathering multi-scale information such as local textures and repetitive patterns, impeding the model’s ability to reconstruct remote sensing images. Moreover, global self-attention incurs quadratic computational complexity, rendering it inefficient for remote sensing image super-resolution (RSISR) tasks, which involve processing high-resolution images. To address these issues, we propose a vision Transformer architecture called the residual hierarchical transformer network (RHTN). Specifically, we develop a residual hierarchical transformer block (RHTB) as the building block of RHTN. In the RHTB, we introduce a novel spatial-channel self-attention mechanism with linear complexity relative to the window size. This design harvests both spatial structural information and channel-wise features from the hierarchical window framework while remaining computationally tractable. We then adopt a spatial-gate feed-forward network to model additional non-linear spatial information. Comprehensive experiments on multiple benchmark datasets demonstrate that our proposed RHTN surpasses state-of-the-art methods in both quantitative metrics and visual quality.
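The abstract does not detail how the spatial-channel self-attention achieves linear complexity in the window size. A common way to obtain this property is to transpose the attention computation so that the attention matrix is formed over channels rather than spatial positions, making its size independent of the number of pixels. The sketch below is an illustrative NumPy toy (not the authors' implementation; the function name, projection matrices `wq`/`wk`/`wv`, and shapes are assumptions) showing why channel-wise attention scales linearly with the spatial extent:

```python
import numpy as np

def channel_self_attention(x, wq, wk, wv):
    """Toy channel-wise self-attention (illustrative sketch only).

    x: array of shape (C, N), a feature map flattened over N = H * W
    spatial positions. Attention is computed across the C channels, so
    the attention matrix is C x C -- its size does not depend on N,
    and the overall cost grows linearly with the window area.
    """
    q, k, v = wq @ x, wk @ x, wv @ x            # projections, each (C, N)
    attn = (q @ k.T) / np.sqrt(q.shape[1])      # (C, C), not (N, N)
    attn = np.exp(attn - attn.max(axis=-1, keepdims=True))
    attn /= attn.sum(axis=-1, keepdims=True)    # row-wise softmax
    return attn @ v                             # (C, N) output features

# Hypothetical example: a 4-channel feature map over an 8x8 window.
C, H, W = 4, 8, 8
rng = np.random.default_rng(0)
x = rng.standard_normal((C, H * W))
wq, wk, wv = (0.1 * rng.standard_normal((C, C)) for _ in range(3))
y = channel_self_attention(x, wq, wk, wv)
print(y.shape)  # (4, 64)
```

Doubling H and W here doubles only the cost of the matrix products involving N, whereas standard spatial self-attention would quadruple the attention matrix itself.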
