Viewpoint-Aware Pose Estimation Framework for Cooperative UAVs

Abstract

Pose estimation from monocular vision is essential for UAV applications, yet existing methods often struggle in real-world settings. Traditional techniques based on markers or hand-crafted features are computationally efficient but unreliable in cluttered or unstructured environments. Learning-based approaches, while powerful, typically demand extensive target-specific annotation and retraining, limiting their generalizability. This paper proposes a robust, keypoint training-free pose estimation framework that leverages pre-trained visual correspondence models (SuperPoint and LightGlue) to eliminate the need for task-specific keypoint detection or pose regression. Target localization is performed using an off-the-shelf YOLO detector, followed by viewpoint-aware template matching to discretize target appearance under varying views. Within the detected region, SuperPoint features are matched via LightGlue, and a novel Coverage Score evaluates the spatial distribution of correspondences to reject degenerate configurations prior to PnP-RANSAC pose recovery. An Unscented Kalman Filter integrates asynchronous measurements for temporally stable yet responsive estimates. The framework requires no target-specific retraining, enabling seamless deployment across diverse targets and environments. Extensive evaluations, including indoor, outdoor, and visually degraded scenarios, demonstrate robust, consistent performance and significant reductions in annotation effort and system integration complexity, outperforming state-of-the-art learning-based methods under various conditions.
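To make the pipeline described above concrete, the sketch below shows one way the matching and geometric-verification stage could fit together, using the public cvg/LightGlue package and OpenCV. The abstract does not give the Coverage Score formula, so the grid-occupancy score here is an illustrative assumption, not the paper's definition; likewise, `template_3d_lookup` (mapping 2D template keypoints to 3D model points) and the 0.3 rejection threshold are hypothetical placeholders.

```python
# Minimal sketch: SuperPoint + LightGlue matching inside a detected box,
# an illustrative grid-based coverage check, then PnP-RANSAC pose recovery.
import cv2
import numpy as np
import torch
from lightglue import LightGlue, SuperPoint
from lightglue.utils import load_image, rbd

extractor = SuperPoint(max_num_keypoints=1024).eval()
matcher = LightGlue(features="superpoint").eval()

def coverage_score(points, box, grid=4):
    """Fraction of grid cells inside the box that contain at least one
    correspondence. Illustrative stand-in for the paper's Coverage Score."""
    x0, y0, x1, y1 = box
    cols = np.clip(((points[:, 0] - x0) / (x1 - x0) * grid).astype(int), 0, grid - 1)
    rows = np.clip(((points[:, 1] - y0) / (y1 - y0) * grid).astype(int), 0, grid - 1)
    return len(set(zip(rows.tolist(), cols.tolist()))) / (grid * grid)

@torch.no_grad()
def estimate_pose(template_path, query_path, box, template_3d_lookup, K):
    # Extract SuperPoint features from template and query images.
    f0 = extractor.extract(load_image(template_path))
    f1 = extractor.extract(load_image(query_path))
    # Match with LightGlue; rbd() removes the batch dimension.
    m = matcher({"image0": f0, "image1": f1})
    f0, f1, m = rbd(f0), rbd(f1), rbd(m)
    idx = m["matches"]                         # (M, 2) index pairs
    pts_t = f0["keypoints"][idx[:, 0]].numpy() # template keypoints
    pts_q = f1["keypoints"][idx[:, 1]].numpy() # query keypoints
    # Reject spatially degenerate correspondence layouts before PnP.
    if coverage_score(pts_q, box) < 0.3:       # hypothetical threshold
        return None
    obj_pts = template_3d_lookup(pts_t)        # 2D template -> 3D model points
    ok, rvec, tvec, _ = cv2.solvePnPRansac(
        obj_pts.astype(np.float32), pts_q.astype(np.float32), K, None)
    return (rvec, tvec) if ok else None
```

Gating on correspondence coverage before PnP-RANSAC is the key design point: RANSAC can return a numerically valid but unstable pose when all matches cluster in one corner of the detection box, and a cheap spatial-distribution check filters such frames before they reach the (Unscented Kalman) filter.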
