Vision-Based Pick and Place Robots Using Faster R-CNN and EfficientNet for Real-Time Object Detection and Classification

Santhoshkumar Sivakumar
Jayanth Subramaniam A
Senthil Kumar K

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This paper describes a vision-based pick-and-place robotic system that uses Faster R-CNN for object detection and EfficientNetB0 for classification. The system employs an eye-in-hand 2D camera on a UR5 robotic arm to collect real-time RGB images, which are processed through the dual-model architecture to detect and classify objects from 36 categories. Training and validation were conducted using a publicly available fruit and vegetable dataset to simulate an industrial sorting application. The combined classification accuracy reaches 83%, with high F1-scores for most classes. This architecture provides visual recognition capabilities and real-time processing suitable for automated industrial settings.

Version published to 10.21203/rs.3.rs-8651114/v1 on Research Square
Feb 16, 2026

A Hybrid YOLOv5s-Faster R-CNN Architecture for Object Detection in Complex Road Scenes

This article has 3 authors:
1. Lenard Nkalubo Byenkya
2. Rose Nakibuule
3. Danison Taremwa
This article has no evaluationsLatest version Jan 21, 2026
Performance Evaluation of a Frugal Open-Source Humanoid Robotic Platform

This article has 9 authors:
1. Harish V. Mekali
2. Girish K
3. Ajaykumar D
4. Sushma S S
5. Sudarshan S. Harithas
6. Shivaraj K. M
7. Heethesh Vhavle
8. Karun Warrior
9. Shrenik Muralidhar
This article has no evaluationsLatest version Feb 18, 2026
A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks

This article has 3 authors:
1. Fahim Al Islam
2. Saif Hossain
3. Monir Hosen
This article has no evaluationsLatest version Feb 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Hybrid YOLOv5s-Faster R-CNN Architecture for Object Detection in Complex Road Scenes

Performance Evaluation of a Frugal Open-Source Humanoid Robotic Platform

A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks