Enhancing Quantum Diffusion Models for Complex Image Generation
Abstract
Quantum generative models offer a novel approach to exploring high-dimensional Hilbert spaces but face significant challenges in scalability and expressibility, particularly when applied to multi-modal distributions. In this study, we propose a Hybrid Quantum-Classical U-Net architecture enhanced by Adaptive Non-Local Observables (ANO) and an Ancilla-based Global Feature Extractor. By compressing classical data into a dense quantum latent space and utilizing trainable observables, our model extracts rich non-local features that complement classical processing. Furthermore, we integrate a Hadamard Test module to capture global structural information, fusing it with dense local features. We also investigate the role of skip connections in preserving semantic information during the reverse diffusion process. Experimental results on the full MNIST dataset (digits 0-9) demonstrate that the proposed architecture generates structurally coherent and recognizable images for all digit classes, overcoming the mode collapse observed in prior quantum diffusion models. While hardware constraints necessitate resolution downscaling, our findings suggest that hybrid architectures with adaptive measurements provide a feasible pathway for enhancing generative capabilities in the NISQ era.
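To illustrate the Hadamard Test primitive mentioned above for extracting global information: an ancilla qubit prepared in superposition controls an application of a unitary U to the data register, and the ancilla's measurement statistics reveal the real part of the expectation value ⟨ψ|U|ψ⟩. The following is a minimal exact-simulation sketch in NumPy; the function name and setup are illustrative only and do not reflect the paper's actual implementation or circuit parameters.

```python
import numpy as np

def hadamard_test_real(U, psi):
    """Exactly simulate a Hadamard test to compute Re<psi|U|psi>.

    Circuit: ancilla |0> -> H -> controlled-U on |psi> -> H -> measure.
    Then P(ancilla = 0) = (1 + Re<psi|U|psi>) / 2.
    """
    n = len(psi)
    # Joint state: ancilla (first factor) tensor system, ancilla in |0>
    state = np.kron(np.array([1.0, 0.0]), psi).astype(complex)
    H = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2)
    I = np.eye(n)
    # Controlled-U: identity on the |0> ancilla branch, U on the |1> branch
    CU = np.block([[I, np.zeros((n, n))],
                   [np.zeros((n, n)), U]])
    state = np.kron(H, I) @ state   # put ancilla in superposition
    state = CU @ state              # entangle ancilla with U|psi>
    state = np.kron(H, I) @ state   # interfere the two branches
    p0 = np.sum(np.abs(state[:n]) ** 2)  # probability ancilla reads 0
    return 2 * p0 - 1               # recovers Re<psi|U|psi>

# Example: for |psi> = |+> and U = Z, Re<+|Z|+> = 0
psi = np.array([1.0, 1.0]) / np.sqrt(2)
Z = np.diag([1.0, -1.0])
print(hadamard_test_real(Z, psi))  # ≈ 0.0
```

On hardware the probability P(ancilla = 0) would be estimated from repeated shots rather than computed exactly, and the trainable observables described in the abstract would parameterize which U (or measurement basis) is applied.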