Deep-DSO: Improving Mapping of Direct Sparse Odometry Using CNN-based Single Image Depth Estimation

Abstract

In recent years, SLAM, visual odometry, and structure-from-motion approaches have widely addressed the problems of 3D reconstruction and ego-motion estimation. Of the many input modalities that can be used to solve these ill-posed problems, the purely visual alternative using a single monocular RGB camera has attracted the attention of many researchers due to its low cost and widespread availability in handheld devices. One of the best proposals currently available is the Direct Sparse Odometry (DSO) system, which has demonstrated its ability to accurately recover trajectories and depth maps using monocular sequences as the only source of information. Given the impressive advances in single-image depth estimation using neural networks, this work proposes an extension of the DSO system, named DeepDSO. DeepDSO integrates the state-of-the-art NeW CRFs CNN as a depth estimation module, providing a depth prior for each candidate point that narrows the point search interval along the epipolar line. This integration improves DSO's depth initialization and allows each candidate point to converge faster to its true depth. Experiments carried out on the TUM-Mono dataset demonstrated that adding the CNN depth estimation module to the DSO pipeline significantly reduced rotation, translation, scale, start-segment alignment, end-segment alignment, and RMSE errors. The DeepDSO code is publicly available at https://github.com/erickherreraresearch/Deep-DSO.
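
The following is a minimal C++ sketch of the idea described in the abstract: using a per-pixel CNN depth prediction to tighten a candidate point's inverse-depth search interval before the epipolar line search. The structure, function, and parameter names (CandidatePoint, applyDepthPrior, relSigma) are hypothetical and are not taken from the DeepDSO code; the sketch only assumes the network provides a depth estimate with a roughly known relative uncertainty.

```cpp
#include <algorithm>

// Hypothetical candidate point, holding the inverse-depth search bounds
// that a DSO-style epipolar search would sweep over.
struct CandidatePoint {
    float u, v;        // pixel coordinates in the host frame
    float idepth_min;  // lower bound of the inverse-depth search interval
    float idepth_max;  // upper bound of the inverse-depth search interval
};

// cnnDepth: depth predicted by the single-image network at (u, v).
// relSigma: assumed relative 1-sigma uncertainty of the prediction (e.g. 0.2 = 20%).
void applyDepthPrior(CandidatePoint& p, float cnnDepth, float relSigma = 0.2f) {
    if (cnnDepth <= 0.0f) return;  // no valid prior: keep the full interval

    // Bracket the prior with roughly a 2-sigma band in depth.
    float dMin = std::max(cnnDepth * (1.0f - 2.0f * relSigma), 1e-3f);
    float dMax = cnnDepth * (1.0f + 2.0f * relSigma);

    // Convert to inverse depth and intersect with the current interval.
    float priorIdMin = 1.0f / dMax;
    float priorIdMax = 1.0f / dMin;
    p.idepth_min = std::max(p.idepth_min, priorIdMin);
    p.idepth_max = std::min(p.idepth_max, priorIdMax);

    // If the prior disagrees with the current bracket, fall back to the prior alone.
    if (p.idepth_min > p.idepth_max) {
        p.idepth_min = priorIdMin;
        p.idepth_max = priorIdMax;
    }
}
```

In a full pipeline, bounds narrowed in this way would feed into DSO's existing immature-point tracing, which refines the inverse depth through its usual discrete search and optimization along the epipolar line.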
