Attention-Weighted Hierarchical Decoding for Few-Shot Semantic Segmentation: A Case Study on Batik Cultural Heritage Patterns

Yuzhou Ma
Haolong Qian
Wei Li

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Few-shot semantic segmentation aims to learn accurate pixel-level classification from limited annotated samples, a critical capability for real-world applications where data acquisition is expensive or impractical. However, existing methods often struggle with fine-grained texture details and complex boundaries under data-scarce conditions, particularly when applied to domains with intricate visual patterns (such as batik patterns). To address this few-shot learning challenge, we constructed a few-shot batik pattern dataset and proposed a novel network architecture centered on attention weighting and hierarchical decoding. Our method leverages a pre-trained ResNet101 backbone for transfer learning to establish a strong feature foundation. It incorporates a dual-attention module that combines spatial and channel attention to dynamically highlight semantically rich regions and intricate texture boundaries specific to batik. For multi-scale context aggregation, a lightweight module utilizing parallel dilated convolutions is introduced to efficiently capture features from varying receptive fields. Finally, a hierarchical decoder progressively integrates these enhanced, multi-scale features with high-resolution shallow features to reconstruct precise segmentation maps. Comprehensive evaluations on a dedicated batik dataset show that our model achieves state-of-the-art performance, with a mean Intersection over Union (mIoU) of 79.22% and a pixel accuracy (PA) of 92.47%. It notably improves over the strong DeepLabV3+ baseline by 3.3% in mIoU and 0.95% in PA, demonstrating its effectiveness for the task of batik pattern segmentation under data-scarce conditions.

Version published to 10.3390/electronics15061242
Mar 17, 2026
Version published to 10.20944/preprints202602.1167.v1
Feb 14, 2026

DAFSDet: Dual-Attention Guided Few-Shot Object Detection in Remote Sensing Images

This article has 6 authors:
1. Guangshuai Gao
2. Zhilin Zhang
3. Wei Zhang
4. Yunqi Shang
5. Yan Dong
6. Jiangtao Xi
This article has no evaluationsLatest version Feb 12, 2026
MSA-MVSNet: A Cross-Scale Collaborative Attention-Based Multi-View Reconstruction Network for Orchard Tree 3D Reconstruction with Instance Segmentation for Fruit Counting

This article has 6 authors:
1. Hui Li
2. Jun Zhang
3. Jianhua Hong
4. Fuzhi Ke
5. Chu Zhang
6. Zhixin Li
This article has no evaluationsLatest version Mar 5, 2026
Dynamic4D: Enhancing Self-Supervised Learning for Robust and Fine-Grained 4D Point Cloud Video Understanding

This article has 2 authors:
1. Mingxuan Du
2. Yutian Zeng
This article has no evaluationsLatest version Mar 17, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

DAFSDet: Dual-Attention Guided Few-Shot Object Detection in Remote Sensing Images

MSA-MVSNet: A Cross-Scale Collaborative Attention-Based Multi-View Reconstruction Network for Orchard Tree 3D Reconstruction with Instance Segmentation for Fruit Counting

Dynamic4D: Enhancing Self-Supervised Learning for Robust and Fine-Grained 4D Point Cloud Video Understanding