Remote Sensing Multi-View Stereo using ConvLSTM Guided Iterative Depth Refinement
Abstract
Multi-view 3D reconstruction from remote sensing imagery has emerged as a critical research direction in both computer vision and remote sensing. While deep learning-based methods have demonstrated remarkable success in close-range reconstruction, remote sensing scenarios continue to pose significant challenges. These include edge blurring due to varying viewpoints, noise artifacts in shadowed regions, and discontinuous depth estimates in areas with smoothly varying elevation, all of which hinder accurate matching and degrade reconstruction quality. To address these issues, we propose a novel end-to-end network, termed CGIDR-Net, which enhances remote sensing multi-view stereo (MVS) reconstruction through ConvLSTM-guided iterative depth refinement. Specifically, we design a Deformable Channel Transformation Module (DCTM) to alleviate edge blurring across views by adaptively capturing spatial and channel-wise variations. Furthermore, we introduce a Differentiable 3D Masked Warping (MW) mechanism that leverages learnable masks to construct a more reliable cost volume, effectively suppressing occlusions and geometric distortions. Finally, we incorporate an Iterative Depth Refinement Module (IDRM) based on ConvLSTM, which progressively integrates spatial and contextual cues to refine depth predictions. Extensive experiments on several public datasets, including WHU, LuoJia-MVS, WHU-OMVS, and DTU, demonstrate that CGIDR-Net achieves superior performance in both accuracy and robustness compared to existing state-of-the-art methods.
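To make the ConvLSTM-guided refinement idea concrete, the sketch below shows one plausible way an IDRM-style module could be wired up in PyTorch: a ConvLSTM cell whose gates are 2D convolutions carries a spatial hidden state across iterations, and at each iteration the cell consumes the current depth map plus reference-image features and emits a residual depth update. This is only a minimal illustration under assumed details; the class names (`ConvLSTMCell`, `IterativeDepthRefinement`), channel widths, iteration count, and residual-update formulation are hypothetical and not taken from the paper.

```python
import torch
import torch.nn as nn


class ConvLSTMCell(nn.Module):
    """ConvLSTM cell: LSTM gates computed with 2D convolutions so the
    recurrent state keeps its spatial layout (B x C x H x W)."""

    def __init__(self, in_ch, hidden_ch, kernel_size=3):
        super().__init__()
        self.hidden_ch = hidden_ch
        # One convolution produces all four gates (input, forget, output, candidate).
        self.gates = nn.Conv2d(in_ch + hidden_ch, 4 * hidden_ch,
                               kernel_size, padding=kernel_size // 2)

    def forward(self, x, state):
        h, c = state
        i, f, o, g = torch.chunk(self.gates(torch.cat([x, h], dim=1)), 4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        c = f * c + i * torch.tanh(g)
        h = o * torch.tanh(c)
        return h, c


class IterativeDepthRefinement(nn.Module):
    """Hypothetical IDRM-style refinement: each iteration predicts a residual
    correction to the current depth estimate from the ConvLSTM hidden state."""

    def __init__(self, feat_ch=32, hidden_ch=32, num_iters=3):
        super().__init__()
        self.num_iters = num_iters
        self.cell = ConvLSTMCell(feat_ch + 1, hidden_ch)  # features + 1-channel depth
        self.to_residual = nn.Conv2d(hidden_ch, 1, 3, padding=1)

    def forward(self, init_depth, feats):
        # init_depth: (B, 1, H, W) coarse depth; feats: (B, feat_ch, H, W) reference features.
        b, _, h, w = init_depth.shape
        state = (feats.new_zeros(b, self.cell.hidden_ch, h, w),
                 feats.new_zeros(b, self.cell.hidden_ch, h, w))
        depth = init_depth
        for _ in range(self.num_iters):
            hidden, cell = self.cell(torch.cat([depth, feats], dim=1), state)
            state = (hidden, cell)
            depth = depth + self.to_residual(hidden)  # residual depth update
        return depth


if __name__ == "__main__":
    refiner = IterativeDepthRefinement()
    coarse = torch.rand(2, 1, 64, 64)        # coarse depth from the cost volume
    features = torch.rand(2, 32, 64, 64)     # reference-image features
    refined = refiner(coarse, features)
    print(refined.shape)  # torch.Size([2, 1, 64, 64])
```

The residual formulation is a common design choice for iterative refinement (the network only learns corrections, so early iterations cannot destroy the coarse estimate); whether CGIDR-Net uses residual or direct depth regression is not specified in the abstract.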