SEFormer:A new method for Medical Image Segmentation

Chen Ge
Haoze Pan
Yihua Song
Xia Zhang
Zuojian Zhou

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Medical image segmentation is a fundamental task in medical image analysis, playing a crucial role in disease diagnosis, treatment planning, and clinical decision-making. Accurate segmentation of anatomical structures, such as blood vessels, organs, and lesions, is essential for reliable medical interpretations. However, existing segmentation models often face challenges in effectively capturing both local and global features within medical images. Traditional convolutional neural networks (CNNs) excel at extracting fine-grained local features but struggle to model long-range dependencies due to their limited receptive fields. Conversely, transformer-based models can capture global contextual relationships but often fail to preserve detailed local structures, leading to suboptimal segmentation performance. The purpose of this study is to develop SEFormer, a novel hybrid network that integrates SENet, ResNet, and Transformer to enhance medical image segmentation. SEFormer aims to effectively capture both local and global features, addressing the limitations of traditional CNNs and transformer-based models.SENet is incorporated to recalibrate feature maps by adaptively emphasizing informative features, thereby compensating for the global representation limitations of CNNs. Meanwhile, the ResNet backbone ensures deep feature extraction while maintaining computational efficiency. Additionally, the Transformer module is introduced to capture long-range dependencies and enhance contextual awareness, complementing the local feature extraction capabilities of CNNs. This hybrid approach allows SEFormer to balance fine-grained spatial details with broader contextual information, leading to more precise segmentation results.Furthermore, to mitigate information loss during feature extraction and ensure a more complete and hierarchical representation of image information, we introduce a feature pyramid structure inspired by multi-scale image pyramid models. By progressively increasing the receptive field across different scales and employing skip connections, SEFormer effectively fuses multi-scale local and global features at each stage of the pyramid. This hierarchical fusion mechanism ensures a richer and more robust feature representation, which is particularly beneficial for segmenting complex medical structures such as blood vessels. We evaluate SEFormer on the CHASE_DB1 dataset, a widely used benchmark for retinal vessel segmentation. Experimental results demonstrate that SEFormer outperforms existing state-of-the-art segmentation methods, achieving a 3.25% improvement in segmentation accuracy. Additionally, we conduct ablation studies to verify the contributions of different components within SEFormer. The results show that incorporating SENet enhances feature recalibration and channel attention, while the Transformer module significantly improves global context awareness. The feature pyramid structure further contributes to performance gains by ensuring a multi-scale representation of vascular structures. Compared to conventional CNN-based methods, our approach achieves better segmentation accuracy with improved robustness to variations in vessel thickness, noise, and image contrast. Furthermore, SEFormer maintains computational efficiency, making it suitable for real-world clinical applications where both accuracy and processing speed are critical. SEFormer provides an efficient and accurate approach for medical image segmentation by leveraging the complementary strengths of CNNs, SENet, and Transformers while integrating a feature pyramid structure to maximize feature representation. Future research directions include extending SEFormer to other medical imaging modalities, such as CT and MRI scans, as well as exploring further optimization techniques to reduce computational costs while maintaining high segmentation performance. Additionally, integrating SEFormer with active learning frameworks could further improve segmentation performance by leveraging expert-annotated data more efficiently.

Version published to 10.21203/rs.3.rs-6535669/v1 on Research Square
Jun 9, 2025

Hybrid Vision Transformers for Accurate Recognition of Lung Lesions and Anatomical Structures in Bronchoscopic Imaging

This article has 3 authors:
1. Rolyph Erwan NTOUTOUME NGUEMA
2. Mohamad Forouzanfar
3. Ali Traore
This article has no evaluationsLatest version Jun 25, 2025
Region-Adaptive Attention and Edge-Guided Alignment for Medical Image Registration

This article has 3 authors:
1. Linghui Liu
2. Yue Zeng
3. Weiqiang Wang
This article has no evaluationsLatest version Jul 16, 2025
Explainable Deep Learning Framework for Brain Tumor Segmentation Using Vision Transformer and Conditional Random Fields

This article has 6 authors:
1. Homayoun Safarpour
2. Soroush Sadeghi
3. Payam Zarbakhsh
4. Mohammadreza Kamsari
5. Marjan Kia
6. Ramin Ranjbarzadeh
This article has no evaluationsLatest version Jun 9, 2025

Listed in

Abstract

Article activity feed

Related articles

Hybrid Vision Transformers for Accurate Recognition of Lung Lesions and Anatomical Structures in Bronchoscopic Imaging

Region-Adaptive Attention and Edge-Guided Alignment for Medical Image Registration

Explainable Deep Learning Framework for Brain Tumor Segmentation Using Vision Transformer and Conditional Random Fields