Deep Learning Integration of CNN-Transformer and UNet for Bi-Temporal SAR Flash Flood Detection
Abstract
This research proposes a new deep learning approach for bi-temporal flash flood detection in synthetic aperture radar (SAR) imagery. It combines a U-Net convolutional network with a Transformer model that uses a Compact Convolutional Tokenizer (CCT) to learn long-range dependencies more efficiently. The resulting hybrid model, CCT-U-ViT, naturally combines the spatial feature extraction of U-Net with the global context modeling of the Transformer. Because it uses the CCT tokenizer instead of conventional Vision Transformer tokenization, the model requires substantially fewer basic blocks, making it well suited to small flood detection datasets. By capturing both local spatial patterns and global contextual relations, the model improves flood boundary delineation. The method is based on Sentinel-1 SAR images and focuses on Erbil, Iraq, which experienced an extreme flash flood in December 2021. Experimental comparisons show that the proposed CCT-U-ViT outperforms several baseline models, including conventional CNNs, U-Net, and Vision Transformer, achieving an overall accuracy of 91.24%. Furthermore, the model attains higher precision and recall, with an F1-score of 91.21% and an mIoU of 83.83%. Qualitative results demonstrate that CCT-U-ViT preserves flood boundaries more precisely and with less salt-and-pepper noise than state-of-the-art approaches. This study underscores the significance of hybrid deep learning models in enhancing the precision of flood detection with SAR data, providing valuable insights for the advancement of real-time flood monitoring and crisis management systems.
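To illustrate the tokenization difference the abstract highlights, the sketch below contrasts standard ViT patch tokenization with a CCT-style convolutional tokenizer on a bi-temporal SAR input (pre- and post-event backscatter stacked as two channels). This is a minimal NumPy illustration, not the authors' implementation; all shapes, kernel sizes, and strides are illustrative assumptions.

```python
import numpy as np

def vit_patch_tokenize(x, patch=16):
    """ViT-style tokenization: split an (H, W, C) image into
    flattened non-overlapping patches, one token per patch."""
    H, W, C = x.shape
    tokens = []
    for i in range(0, H - patch + 1, patch):
        for j in range(0, W - patch + 1, patch):
            tokens.append(x[i:i + patch, j:j + patch, :].ravel())
    return np.stack(tokens)  # (num_patches, patch * patch * C)

def cct_conv_tokenize(x, kernels, stride=2):
    """CCT-style tokenizer: a strided convolution with ReLU, then
    flatten the spatial grid into a token sequence (one token per
    output location). Learned filters would replace `kernels`."""
    H, W, C = x.shape
    k = kernels.shape[1]  # kernels: (num_filters, k, k, C)
    out_h = (H - k) // stride + 1
    out_w = (W - k) // stride + 1
    feat = np.zeros((out_h, out_w, kernels.shape[0]))
    for i in range(out_h):
        for j in range(out_w):
            window = x[i * stride:i * stride + k, j * stride:j * stride + k, :]
            # contract the (k, k, C) axes of each filter with the window
            feat[i, j] = np.maximum(0.0, np.tensordot(kernels, window, axes=3))
    return feat.reshape(out_h * out_w, -1)  # (seq_len, num_filters)

# Hypothetical bi-temporal SAR tile: 2 channels = pre/post-event backscatter.
rng = np.random.default_rng(0)
x = rng.standard_normal((64, 64, 2))

vit_tokens = vit_patch_tokenize(x, patch=16)
cct_tokens = cct_conv_tokenize(x, kernels=rng.standard_normal((32, 3, 3, 2)))
print(vit_tokens.shape)  # (16, 512)  -- few, high-dimensional patch tokens
print(cct_tokens.shape)  # (961, 32)  -- dense, low-dimensional conv tokens
```

The convolutional tokenizer yields a denser sequence of low-dimensional tokens with built-in local inductive bias, which is one commonly cited reason CCT-style models train well on small datasets.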