PRDE: Progressive Representation Reconstruction for Single Image Depth Estimation with Diffusion Priors and Detail-Amplification Decoding

Zhu Tianshui
Wang Xiaohong
Zhang Ningning
Cong Bingge
Chen Kaihang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Single-image depth estimation (SIDE) is challenging, particularly in complex scenes that involve fine-grained structures, occlusions, and non-uniform textures. Although diffusion-based methods effectively model global semantic structures, their denoising process often suppresses high-frequency components, consequently resulting in the loss of local details and degraded edge quality. This paper proposes a progressive single-image depth estimation framework, termed PRDE (Progressive Representation Reconstruction with Diffusion Priors and Detail Enhancement), which integrates diffusion-generated global features with a dedicated Detail Feature Refinement Module (DFRM). The DFRM leverages frequency-domain attention and a representation alignment and integration module to bolster structural integrity and recapture local details. Experimental results on two standard benchmarks, NYU Depth v2 and KITTI, demonstrate that the proposed model outperforms existing models across multiple metrics. Notably, on the NYU dataset, our model achieves a 3.96%, 8.07%, and 2.33% improvement in Log10, SqRel, and RMSELog, respectively. Furthermore, it attains the best δ3 accuracy among all compared methods.

Version published to 10.21203/rs.3.rs-7818869/v1 on Research Square
Oct 28, 2025

Dynamic Nonlinear Networks for Adaptive Low-Light Image Enhancement

This article has 3 authors:
1. Minglong Xue
2. Kaiwen Chen
3. Senming Zhong
This article has no evaluationsLatest version Nov 14, 2025
Efficient Document Image Dewarping via Hybrid Deep Learning and Cubic Polynomial Geometry Restoration

This article has 3 authors:
1. Valery Istomin
2. Oleg Pereziabov
3. Ilya Afanasyev
This article has no evaluationsLatest version Dec 3, 2025
DIPLI: Deep Image Prior Lucky Imaging for BlindAstronomical Image Restoration

This article has 4 authors:
1. Suraj Singh
2. Anastasiia Batsheva
3. Oleg Rogov
4. Ahmed Bouridane
This article has no evaluationsLatest version Nov 25, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Dynamic Nonlinear Networks for Adaptive Low-Light Image Enhancement

Efficient Document Image Dewarping via Hybrid Deep Learning and Cubic Polynomial Geometry Restoration

DIPLI: Deep Image Prior Lucky Imaging for BlindAstronomical Image Restoration