Scene Text Detection Using Attention with Depthwise Separable Convolutions for Mobile Applications

Abstract

Text detection in images and videos benefits many applications, since deep-learned features can effectively capture textual cues. However, many existing methods perform only moderately when detecting arbitrary-shaped text, mainly because of the constraints of their text representations: horizontal boxes, rotated rectangles, and quadrangles. This paper proposes a Deep-Learned Fusion Attention Network (DLFANet) that learns the prominent features of arbitrary-shaped text using a lightweight shared network, which is further fine-tuned by the proposed Feature Attention Module Enhancement (FAME). In addition, a Final Feature Module (FFM) with an Attention Detection Head (ADH) and a Geometry Aware Pixel Network (GAPN) is used to localize text effectively. Performance analysis on the standard datasets Total-Text, CTW1500, and ICDAR 2015 shows that the proposed method outperforms other state-of-the-art algorithms.
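The title attributes the network's mobile-friendliness to depthwise separable convolutions. As a rough illustration of why they suit lightweight backbones (this sketch is not from the paper; the layer sizes below are hypothetical), a standard convolution can be factored into a per-channel depthwise convolution followed by a 1×1 pointwise convolution, which cuts the parameter count sharply:

```python
def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Parameters of a standard 2D convolution (bias omitted):
    one k x k filter per (input channel, output channel) pair."""
    return c_in * c_out * k * k

def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Parameters of a depthwise separable convolution (bias omitted):
    a k x k depthwise filter per input channel, then a 1x1 pointwise
    convolution that mixes channels."""
    depthwise = c_in * k * k      # spatial filtering, channel by channel
    pointwise = c_in * c_out      # 1x1 cross-channel mixing
    return depthwise + pointwise

# Hypothetical layer: 256 -> 256 channels with 3x3 kernels.
standard = conv_params(256, 256, 3)                 # 589,824 parameters
separable = depthwise_separable_params(256, 256, 3)  # 67,840 parameters
print(standard, separable, round(standard / separable, 1))
```

For this example layer the factorization is roughly 8.7× smaller, which is the kind of saving that makes a shared backbone viable on mobile hardware.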