ArtScale: Autoregressive Super-Resolution for Art Paintings via Multi-Scale Vision-Language Guidance

Ji Zhou
Qiong Tian
Jiaxing Hu
Zhuting Tan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Heritage digitization can require extreme super-resolution for inspecting brushwork, craquelure, and pigment aging beyond native capture limits. Most SR models are trained for fixed scale factors and degrade when extrapolated, while training directly for extreme scales is expensive. We present ArtScale, a scale-space autoregressive framework that reaches large magnifications by chaining intermediate steps while reusing a frozen SR backbone. To limit semantic drift at high magnification, ArtScale adds multi-scale vision--language guidance: a VLM generates art-aware prompts conditioned on the current and previous scale states. We fine-tune the prompt extractor with GRPO-based preference alignment to reduce repetitive or generic prompts. Experiments improve 4× restoration and show more stable behavior under recursive zooming.

Version published to 10.21203/rs.3.rs-8717845/v1 on Research Square
Feb 6, 2026

Adaptive Hierarchical Edge Detection: Enhancing Real-Time Artistic Stylization in Computer Graphics

This article has 1 author:
1. Lino Roshaan M.K.
This article has no evaluationsLatest version Mar 5, 2026
Adaptive Hierarchical Edge Detection: Enhancing Real-Time Artistic Stylization in Computer Graphics

This article has 1 author:
1. Lino Roshaan M.K.
This article has no evaluationsLatest version Mar 5, 2026
StyleMamba: Efficient Image Style Transfer with Bidirectional Selective Scan Vision Mamba

This article has 5 authors:
1. Jian Liu
2. Jun Yang
3. DiWei Wu
4. Hewen Liu
5. Jun Liu
This article has no evaluationsLatest version Feb 27, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Adaptive Hierarchical Edge Detection: Enhancing Real-Time Artistic Stylization in Computer Graphics

Adaptive Hierarchical Edge Detection: Enhancing Real-Time Artistic Stylization in Computer Graphics

StyleMamba: Efficient Image Style Transfer with Bidirectional Selective Scan Vision Mamba