Tablecert: YOLO and TATR Enhanced Models to Boost Table Detection and Recognition in Legacy Documents
Abstract
The digital transformation of legacy documents remains challenging: these documents are often unstructured and contain complex table layouts (e.g., watermarks, spaced headers, closely spaced tables, nested structures, and double borders) that degrade the performance of conventional table detection and recognition systems. We propose a modular, plug-and-play adaptation framework for YOLO-based table detection and Table Transformer (TATR)-based structure recognition, combining parameter-efficient LoRA fine-tuning with lightweight architectural modules (e.g., frequency-domain filtering and structural refinements). We evaluate the framework on a dataset of calibration certificates under a controlled training and evaluation protocol with standard detection and structure metrics. The adapted models outperform their respective baselines, mitigating layout-related challenges and achieving F1-scores of 0.9999 (YOLO) and 0.9640 (TATR), alongside reduced validation loss. The best YOLO adaptation improves robustness to challenging visual artifacts in table detection, whereas TATR-V6 yields stronger structural recognition. Finally, we show that the proposed FreqFilter2D module is a promising drop-in component for other computer vision architectures.
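The abstract describes FreqFilter2D only at a high level. As a hedged illustration of what a frequency-domain filtering step might look like, the NumPy sketch below applies a fixed circular low-pass mask to a 2D feature map in the Fourier domain, suppressing high-frequency content such as watermark texture or double-border ringing. The function name, the low-pass design, and the `cutoff` parameter are assumptions for illustration; the paper's actual module may use a learned or differently shaped filter.

```python
import numpy as np

def freq_filter_2d(feature_map: np.ndarray, cutoff: float = 0.25) -> np.ndarray:
    """Hypothetical sketch of a frequency-domain filter.

    Keeps only spatial frequencies whose normalized radius is below
    `cutoff`, attenuating high-frequency artifacts (e.g., watermarks,
    double borders) while preserving the coarse table layout.
    """
    h, w = feature_map.shape
    # Forward FFT, shifted so the zero-frequency component sits at the center.
    spectrum = np.fft.fftshift(np.fft.fft2(feature_map))
    # Circular low-pass mask centered on the zero-frequency component.
    yy, xx = np.ogrid[:h, :w]
    radius = np.sqrt(((yy - h / 2) / h) ** 2 + ((xx - w / 2) / w) ** 2)
    mask = radius <= cutoff
    # Inverse transform; the input is real, so discard the tiny imaginary part.
    filtered = np.fft.ifft2(np.fft.ifftshift(spectrum * mask))
    return np.real(filtered)

# Toy usage: a flat image corrupted by high-frequency noise.
img = np.ones((32, 32)) + 0.1 * np.random.default_rng(0).standard_normal((32, 32))
smoothed = freq_filter_2d(img, cutoff=0.2)
```

Because the mask always retains the zero-frequency (DC) component, the filter preserves the mean intensity of the input while reducing its high-frequency variance, which is the behavior a drop-in denoising module needs to avoid shifting downstream feature statistics.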