Analog Diffusion Models
Abstract
As generative artificial intelligence (GenAI) drives computational demands to unprecedented scales, digital hardware is approaching fundamental limits. Analog and optical systems promise orders-of-magnitude efficiency gains, but translating these into application-level gains is challenging because of the mismatch between hardware primitives and algorithmic requirements. Here, we introduce Analog Diffusion Models (ADMs), which implement diffusion inference with an implicit integration scheme, formulating each diffusion step as a fixed-point problem amenable to acceleration on efficient analog hardware. At the same time, training remains identical to that of conventional diffusion models, so established scalable training approaches can be adopted with no additional overhead. We validate ADMs on analog hardware using three-dimensional optics with 2,304 programmable weights. On hardware, we generate two-dimensional distributions and latent-space distributions for MNIST, FashionMNIST, and ExtendedMNIST, demonstrating the feasibility of executing multi-layer diffusion processes entirely on noisy, non-traditional hardware. The current prototype reaches fixed-point convergence in 10–15 µs per diffusion step, with projections of nanosecond-scale convergence after miniaturization. In simulation, across multiple datasets, backbone architectures, and model sizes ranging from 32 million to 13 billion parameters, ADMs match the sample quality of standard methods with up to 16× fewer diffusion steps. Most importantly, ADMs could achieve application-level efficiency gains of more than 100× without sacrificing generation quality: 100× from hardware acceleration and an additional 1–2× from algorithmic improvement, highlighting the multiplicative benefit of hardware–algorithm co-design. Together, these results establish ADMs as a scalable, general, hardware-aligned framework for low-latency, energy-efficient generative modeling on analog computing platforms.
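To make the fixed-point formulation concrete, the sketch below illustrates one way an implicit diffusion step can be posed as a fixed-point equation and iterated to convergence, the computation the abstract reports the analog hardware solving in 10–15 µs per step. This is a minimal illustration under stated assumptions, not the paper's exact scheme: it assumes a DDIM-style update made implicit in a backward-Euler fashion, and the names `implicit_diffusion_step`, `eps_theta`, `alpha_bar`, `n_iters`, and `tol` are all hypothetical.

```python
# Illustrative sketch only: an implicit (backward-Euler-style) variant of a
# DDIM update, posed as a fixed-point problem. The exact ADM formulation may
# differ; all function and variable names here are hypothetical.
import torch

def implicit_diffusion_step(x_t, t, eps_theta, alpha_bar, n_iters=20, tol=1e-4):
    """Solve one implicit diffusion step by fixed-point iteration.

    An explicit DDIM step evaluates the network at the known state x_t.
    The implicit variant instead evaluates it at the unknown next state,
    giving the fixed-point equation
        x_{t-1} = g(x_t, eps_theta(x_{t-1}, t-1)),
    which is iterated to convergence here (in ADMs, this convergence is
    what the analog hardware is reported to reach physically).
    """
    a_t, a_prev = alpha_bar[t], alpha_bar[t - 1]
    x = x_t.clone()  # warm-start the iteration from the current state
    for _ in range(n_iters):
        eps = eps_theta(x, t - 1)  # network evaluated at the unknown point
        # Predicted clean sample, then the DDIM-style reconstruction of x_{t-1}
        x0 = (x_t - (1 - a_t).sqrt() * eps) / a_t.sqrt()
        x_next = a_prev.sqrt() * x0 + (1 - a_prev).sqrt() * eps
        if (x_next - x).abs().max() < tol:  # fixed-point convergence test
            return x_next
        x = x_next
    return x
```

Under this reading, training is untouched (the network `eps_theta` is trained exactly as in a conventional diffusion model), and only the sampler changes from an explicit update to a fixed-point solve, which is consistent with the abstract's claim that ADM training is identical to standard diffusion training.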