Visual Localisation Using Deep Learning and Graph Neural Networks: Approaches and Evaluation
Abstract
This paper addresses the visual localization problem: estimating camera position and orientation from images of a known scene. Traditional localization methods based on local feature matching struggle to generalize to new scenarios. In contrast, this study explores state-of-the-art techniques, including deep learning models and graph neural networks, to enhance feature extraction and matching. We implemented five models: SIFT, a CNN-based baseline, Hierarchical Localisation, an ImageSimilarity-Autoencoder, and the SuperGlue feature matching model. Evaluated on a dataset from the Getty Center in Los Angeles for a Kaggle competition, the SuperGlue model significantly outperformed the others, achieving a mean absolute error (MAE) of 6.37266. The findings suggest that leveraging advanced architectures and attention mechanisms can substantially improve visual localization performance, even under challenging conditions. This research highlights the potential of integrating deep learning and graph neural networks in practical localization tasks.
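To make the evaluation metric concrete, the sketch below shows one plausible way a mean absolute error (MAE) over predicted camera poses could be computed. The pose layout (x, y, z plus three rotation angles) and all numeric values are illustrative assumptions, not the competition's actual scoring script.

```python
import numpy as np

def pose_mae(predicted: np.ndarray, ground_truth: np.ndarray) -> float:
    """Mean absolute error averaged over all pose components and all images.

    Both arrays have shape (n_images, 6): hypothetically
    (x, y, z, yaw, pitch, roll) per image.
    """
    return float(np.mean(np.abs(predicted - ground_truth)))

# Illustrative predictions vs. ground truth for two images.
pred = np.array([[1.0, 2.0, 0.5, 0.1, 0.0, 0.0],
                 [3.0, 1.0, 0.4, 0.2, 0.1, 0.0]])
gt   = np.array([[1.2, 1.8, 0.5, 0.1, 0.0, 0.1],
                 [2.8, 1.1, 0.5, 0.2, 0.0, 0.0]])

print(round(pose_mae(pred, gt), 4))  # → 0.0833
```

A lower MAE means the predicted poses are, on average, closer to the ground truth across every component, which is how the SuperGlue model's score of 6.37266 compares against the other four models.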