From One Prompt to Ten Platforms: A Comparative Study Revealing the Limits of AI in Architecture
Abstract
Recent progress in generative artificial intelligence has produced powerful tools for creating architectural visualizations, yet little is known about how different AI platforms perform when confronted with complex architectural prompts. This study addresses that gap by testing a single, unified prompt across ten widely used platforms, each instructed to generate a 3D conceptual design inspired by the stacking of wooden sticks and reimagined as a modern administrative headquarters in Egypt’s New Administrative Capital. A baseline image was first created to provide a visual reference, and each platform was then tested under identical conditions. The resulting outputs were evaluated on architectural coherence, stylistic alignment with Jean Nouvel’s abstract geometric forms, material authenticity, spatial organization, and overall visual quality. The findings reveal considerable variation in image fidelity, interpretative depth, and contextual integration: some platforms excelled in photorealistic rendering and spatial detailing, while others struggled to maintain consistency in massing, proportions, or material application. This comparative assessment highlights both the potential and the current limitations of AI-generated architectural imagery and offers insights into how architects and designers can strategically incorporate AI tools during the early stages of design exploration and visualization. It also paves the way for future applications in sustainable and safe architectural practices.