Generative AI Shanshui Animation Enhancement using Perlin Noise and Diffusion Models
Abstract
Deep learning models have achieved remarkable advancements in image generation but continue to face persistent challenges in synthesizing traditional Shanshui (mountain-water) landscape paintings, owing to limited domain-specific training data and the complexity of the underlying aesthetic principles. This study integrates Perlin Noise, Stable Diffusion, ControlNet, and AnimateDiff to enhance Shanshui landscape generation and animation. Perlin Noise constructs naturalistic skeletal structures, which ControlNet then refines for precise structural control. Advanced prompt engineering with GPT-4 and Textual Inversion improves prompt descriptiveness and mitigates low-quality outputs, while LoRA fine-tuning improves the model's adaptability to the Shanshui landscape domain. Integrating I2V Encoders with AnimateDiff enables the seamless transformation of static landscape images into dynamic animations, preserving artistic authenticity while introducing motion consistency. Experimental results demonstrate significant improvements in realism, stylistic fidelity, and diversity, addressing key limitations of existing generative approaches. This framework not only advances generative AI in digital art but also opens new opportunities for multimedia content creation and cultural preservation through the synthesis of computational Shanshui animation.
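To make the first stage of the pipeline concrete, the following is a minimal illustrative sketch (not the authors' implementation) of how Perlin Noise can supply a skeletal ridgeline structure that conditions Stable Diffusion through a ControlNet. The specific checkpoints ("lllyasviel/sd-controlnet-scribble", "runwayml/stable-diffusion-v1-5"), the level-set extraction, the resolution, and the prompt are all assumptions chosen for demonstration; the study's actual models, thresholds, and prompts may differ.

```python
# Illustrative sketch: Perlin-noise ridgelines as a scribble-style ControlNet
# conditioning image for Shanshui generation. Checkpoints and parameters are
# assumptions for demonstration, not the paper's reported configuration.
import numpy as np
import torch
from noise import pnoise2  # pip install noise
from PIL import Image
from diffusers import StableDiffusionControlNetPipeline, ControlNetModel

H, W = 512, 512

# 1) Perlin-noise height field: smooth, naturalistic variation across the canvas.
field = np.array(
    [[pnoise2(x / 128.0, y / 128.0, octaves=4) for x in range(W)] for y in range(H)]
)

# 2) Skeletal structure: a few level sets of the field serve as ridgeline strokes
#    (white lines on a black background, the convention the scribble ControlNet expects).
skeleton = np.zeros((H, W), dtype=np.uint8)
for level in (-0.15, 0.0, 0.15):
    skeleton[np.abs(field - level) < 0.004] = 255
control_image = Image.fromarray(skeleton).convert("RGB")

# 3) Structural control: the ridgeline sketch guides the diffusion model so the
#    generated composition follows the Perlin-derived layout.
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/sd-controlnet-scribble", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")

result = pipe(
    prompt="traditional Chinese Shanshui ink painting, misty mountains, rivers",
    image=control_image,
    num_inference_steps=30,
).images[0]
result.save("shanshui_perlin_controlnet.png")
```

The resulting static image could then be passed to an image-to-video stage (e.g., AnimateDiff with an I2V encoder, as described in the abstract) to produce the animated output.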