Efficient Document Image Dewarping via Hybrid Deep Learning and Cubic Polynomial Geometry Restoration

Valery Istomin
Oleg Pereziabov
Ilya Afanasyev

Abstract

Camera-captured document images often suffer from geometric distortions caused by paper deformation, perspective distortion, and lens aberrations, significantly reducing OCR accuracy. This study develops an efficient automated method for document image dewarping that balances accuracy with computational efficiency.

We propose a hybrid approach combining deep learning for document detection with classical computer vision for geometry restoration. YOLOv8 performs initial document segmentation and mask generation. Subsequently, classical CV techniques construct a topological 2D grid through cubic polynomial interpolation of document boundaries, followed by image remapping to correct nonlinear distortions. A new annotated dataset and open-source framework are provided to facilitate reproducibility and further research.

Experimental evaluation against state-of-the-art methods (RectiNet, DocGeoNet, DocTr++) and mobile applications (DocScan, CamScanner, TapScanner) demonstrates superior performance. Our method achieves the lowest median Character Error Rate (CER=0.0235), Levenshtein Distance (LD=27.8), and highest Jaro-Winkler similarity (JW=0.902), approaching the quality of scanned originals. The approach requires significantly fewer computational resources and memory compared to pure deep learning solutions while delivering better OCR readability and geometry restoration quality.

The proposed hybrid methodology effectively restores document geometry with computational efficiency superior to existing deep learning approaches, making it suitable for resource-constrained applications while maintaining high-quality document digitization.

Project page: https://github.com/HorizonParadox/DRCCBI
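The grid-construction step named in the abstract (cubic polynomial interpolation of the document boundaries, then remapping) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `fit_cubic` and `build_remap_grid` are hypothetical, only the top and bottom edges are modeled as cubics, the side edges are assumed straight, and sampling is uniform in x rather than along arc length. The boundary points are assumed to come from the segmentation mask.

```python
import numpy as np


def fit_cubic(xs, ys):
    """Least-squares cubic y = a*x^3 + b*x^2 + c*x + d through boundary points."""
    return np.polyfit(xs, ys, deg=3)


def build_remap_grid(top_pts, bottom_pts, out_w, out_h):
    """Build source-coordinate maps for a flattened out_h x out_w page.

    top_pts, bottom_pts: (N, 2) arrays of (x, y) points sampled along the
    top and bottom document edges (assumed input, e.g. from a mask contour).
    Returns (map_x, map_y), float32 arrays of shape (out_h, out_w).
    """
    top_pts = np.asarray(top_pts, dtype=np.float64)
    bottom_pts = np.asarray(bottom_pts, dtype=np.float64)

    # Fit one cubic per curved edge.
    top_coef = fit_cubic(top_pts[:, 0], top_pts[:, 1])
    bot_coef = fit_cubic(bottom_pts[:, 0], bottom_pts[:, 1])

    # u runs left to right, v runs top to bottom, both in [0, 1].
    u = np.linspace(0.0, 1.0, out_w)
    v = np.linspace(0.0, 1.0, out_h)[:, None]

    # Sample each edge uniformly in x between its leftmost and rightmost point.
    top_x = top_pts[:, 0].min() + u * (top_pts[:, 0].max() - top_pts[:, 0].min())
    bot_x = bottom_pts[:, 0].min() + u * (bottom_pts[:, 0].max() - bottom_pts[:, 0].min())
    top_y = np.polyval(top_coef, top_x)
    bot_y = np.polyval(bot_coef, bot_x)

    # Blend linearly between the two curves to fill the interior of the grid.
    map_x = (1.0 - v) * top_x + v * bot_x
    map_y = (1.0 - v) * top_y + v * bot_y
    return map_x.astype(np.float32), map_y.astype(np.float32)
```

The two maps have the layout that OpenCV's `cv2.remap(src, map_x, map_y, cv2.INTER_LINEAR)` expects, so they could be passed to it directly to resample the photographed page into a flat rectangle.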

Version published to 10.21203/rs.3.rs-8036823/v1 on Research Square
Dec 3, 2025

