A Systematic Review of Deep Neural Network Architectures Training Methods and Applications for Image Segmentation

Mojtaba Najafi
Vafa Maihami
Um Kulthum shahryari

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Image segmentation, a critical task in computer vision, has witnessed transformative advancements driven by deep neural networks (DNNs), enabling unprecedented precision across diverse applications, including medical imaging, industrial automation, and autonomous navigation. This systematic review elucidates the evolving architecture of segmentations with a focus on state-of-the-art developments from the year 2020 onwards. The classical architectures, such as U-Net, Fully Convolutional Networks, and DeepLab, are discussed in terms of their foundational methodologies, while the very recent Innovations-Vision Transformers, Swin Transformers, and SegFormer-are underlined for their long-range dependency modelling capability and efficient capture of hierarchical multi-scale features. Advanced training paradigms are explained, including transfer learning, data augmentation with Albumentations, and modern optimization algorithms such as AdamP, in order to demonstrate the degree to which these concepts further enhance model performance, robustness, and generalizability. Real-world implementations critically discuss medical applications, such as tumor segmentation and robotic surgery guidance, and industrial applications, such as defect detection and object tracking in dynamic environments. It addresses the rising complexity of deployment by focusing on key challenges: high computation demands, real-time inference constraints, and domain generalization, among others, discussing mitigation strategies such as model quantization, domain adaptation, and hybrid architectures. This review integrates the theoretical development with insights into practice through comparative analyses, performance benchmarking, and code implementation, thereby serving as a guide for designing and deploying advanced segmentation systems. The contribution bridges the gap between the innovation-application cycle in laying the foundation for future research towards the optimization of the segmentation model for accuracy, efficiency, and scalability.

Version published to 10.21203/rs.3.rs-7664425/v1 on Research Square
Nov 24, 2025

A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks

This article has 3 authors:
1. Fahim Al Islam
2. Saif Hossain
3. Monir Hosen
This article has no evaluationsLatest version Feb 3, 2026
A Controlled Multi-dataset Evaluation of Custom CNNs, Pretrained Feature Extractors, and Transfer Learning

This article has 2 authors:
1. Tanzeem Maliat
2. Asfin Jannat Shamsi
This article has no evaluationsLatest version Jan 9, 2026
A Resource-Efficient Hybrid ViT–CNN Framework with CutMix Regularization for Cardiac MRI Image Classification

This article has 3 authors:
1. Amirreza Khayyat assadi
2. Babak Nouri-Moghaddam
3. Abbas Mirzaei
This article has no evaluationsLatest version Feb 25, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks

A Controlled Multi-dataset Evaluation of Custom CNNs, Pretrained Feature Extractors, and Transfer Learning

A Resource-Efficient Hybrid ViT–CNN Framework with CutMix Regularization for Cardiac MRI Image Classification