Random Rolling Attention Augmentation for Efficient Agricultural Disease and Pest Detection
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Accurate detection of agricultural diseases and pests is essential for crop protection and food security worldwide. Real-world field applications face challenges including small objects, complex backgrounds, high class similarity, and limited computational resources. This work proposes a Random Rolling Transformer (RRT), which introduces random circular shifts along channel and sequence dimensions into multi-head self-attention to enrich feature interactions without increasing parameters or computation. Integrated into YOLOv12, the proposed RRT-YOLO is evaluated on the IP102 pest dataset and a tomato leaf disease dataset. Results show that RRT-YOLO improves mAP@50 by 2.5% and mAP@50–95 by 3.9% on IP102, and by 8.8% and 4.2% on the tomato disease dataset, while maintaining identical model size and complexity. This attention perturbation strategy offers an effective and efficient solution for lightweight agricultural vision detection and can be extended to other visual computing tasks. The code and detailed descriptions can be accessed via the following repository: \href{https://github.com/glorioustory/Random-Rolling-Attention-Augmentation.git}{https://github.com/glorioustory/Random-Rolling-Attention-Augmentation.git}.