Smooth Attention: Improving Image Semantic Segmentation

Boris Kriuk
Fedor Kriuk
Karthik Praveen

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Attention mechanisms have become a fundamental component of deep learning, including the field of computer vision. The key idea behind attention in computer vision is to help the model focus on the relevant spatial regions of the input image, rather than treating all regions equally. The traditional approaches to attention mechanisms in computer vision often suffer from distribution inconsistencies in the attention maps, resulting in sharp transitions that negatively affect model’s focus and lead to poor generalization on complex shapes. The problem of spatial incoherence is particularly pronounced in the task of semantic segmentation, where accurate pixel-level predictions require a detailed understanding of the spatial relationships within the image. In this paper, we propose an attention mechanism called Smooth Attention designed for convolutional neural networks to address the problem of spatial inconsistency in attention maps through multidimensional spatial smoothing. We conduct a series of experiments to evaluate the effectiveness of the proposed mechanism and demonstrate its superior performance compared to traditional methods.

Version published to 10.20944/preprints202409.1283.v1
Sep 17, 2024

A Hybrid YOLOv5s-Faster R-CNN Architecture for Object Detection in Complex Road Scenes

This article has 3 authors:
1. Lenard Nkalubo Byenkya
2. Rose Nakibuule
3. Danison Taremwa
This article has no evaluationsLatest version Jan 21, 2026
Lite-FARNet: A Light-weight Feedback Attention Residual Network for Efficient Multi-Class Segmentation in Complex Urban Scenes

This article has 3 authors:
1. Jiaxi Yang
2. Jiaquan Shen
3. Shitong Wang
This article has no evaluationsLatest version Dec 23, 2025
An effective framework for accurate semantic segmentation of high-resolution remote sensing images.

This article has 6 authors:
1. Wambugu Naftaly
2. Ruisheng Wang
3. Abubakar Sani-Mohammed
4. Bo Guo
5. Xinchang Zhang
6. Zhijun Wang
This article has no evaluationsLatest version Jan 20, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Hybrid YOLOv5s-Faster R-CNN Architecture for Object Detection in Complex Road Scenes

Lite-FARNet: A Light-weight Feedback Attention Residual Network for Efficient Multi-Class Segmentation in Complex Urban Scenes

An effective framework for accurate semantic segmentation of high-resolution remote sensing images.