Overcoming the Semantic Bottleneck for Deterministic Structural Control in Text-to-Image Synthesis

Muhammad Bilal Khan
Muhammad Sabih ul Hassan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Latent diffusion models are limited by text encoders, restrict geometric control, and prevent deterministic structural steering.We propose Procedural Latent Prompt Injection as a Zero-shot steerability framework for generative modelling, modelled as a steerable Stochastic differential equation (SDE). This avoids the need for either auxiliary training or linguistics-based conditioning. A key component of the approach is the use of normalized tensor operators to embed geometric priors directly into the latent space. Empirical analysis of the diffusion trajectory indicates a critical plasticity window (timesteps 10–20) where the noise patterns embedded during diffusion are most amenable to structural steering. Compared with baselines, improvements of 19.6% in structural alignment (CLIP) and 12.3% in diversity control are achieved. These results represent an improvement over prior learned control approaches (prompt-to-prompt & controlnet) and provide a new deterministic paradigm for generative modelling control with applications in procedural synthesis of medical images, structural biology models, and physics simulations.

Version published to 10.21203/rs.3.rs-9033885/v1 on Research Square
Apr 5, 2026

MuseDrift: Navigating Protein Evolutionary Manifolds with Conditional Discrete Diffusion

This article has 2 authors:
1. Chaoyang Wang
2. Yiquan Wang
This article has no evaluationsLatest version May 12, 2026
Deep Computational Anatomy via Latent-Aligned Multiview Normalizing Flows

This article has 5 authors:
1. Nicholas J. Tustison
2. Brian B. Avants
3. Philip A. Cook
4. James C. Gee
5. James R. Stone
This article has no evaluationsLatest version May 10, 2026
NPMCL: A Theoretical Framework for Non-Parametric Continual Learning through Meta-Ability Cultivation

This article has 1 author:
1. Zhiqiang Gan
This article has no evaluationsLatest version Apr 27, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

MuseDrift: Navigating Protein Evolutionary Manifolds with Conditional Discrete Diffusion

Deep Computational Anatomy via Latent-Aligned Multiview Normalizing Flows

NPMCL: A Theoretical Framework for Non-Parametric Continual Learning through Meta-Ability Cultivation