MSU-Mamba: Multi-Scale Defocus Blur Detection Using Cross-Scale Fusion and State Space Models
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Defocus blur detection (DBD) plays a pivotal role in computer vision, serving as a fundamental step to enhance the performance of various downstream applications, such as image refocusing, depth estimation, and saliency detection. Despite recent advancements, existing methods often struggle in complex scenes with homogeneous regions, subtle blur transitions, and cluttered backgrounds. In this paper, we develop a novel approach (MSU-Mamba) to combine a multi-scale feature extraction with state-space modeling for boosting defocus blur detection. To do so, we develop a Multi-Scale Fusion (MSF) Block to integrate long-range dependency features across multiple scales to enhance feature representation. Moreover, in our MSF block, we devise a cross-scale token scanning mechanism into the original Mamba to better distinguish blurred and sharp regions. Comprehensive experiments conducted on benchmark datasets show that our MSU-Mamba outperforms state-of-the-art methods in terms of F-measure and MAE. The results validate our approach as a promising solution to the challenges of defocus blur detection and its application to downstream tasks.