StycoGAN for Feature-Level Temporal Regularization in Perceptually Stable Sequential Image Synthesis

Abstract

Style-based generative adversarial networks achieve high spatial fidelity in image synthesis, yet their extension to sequential generation remains challenging due to temporal instability and style inconsistency across frames. Most existing approaches emphasize motion modeling or pixel-level temporal constraints, which often fail to preserve stylistic coherence. This paper proposes StycoGAN, a style-consistent spatial–temporal generative framework that enforces temporal regularization directly in the feature space of a style-based generator. The model integrates a ConvLSTM-based temporal consistency module into an intermediate layer of the StyleGAN2-ADA backbone, allowing it to capture temporal dependencies while retaining high-quality style modulation. In addition, a Styco-Consistency loss is introduced to suppress undesired stylistic drift across consecutive frames without imposing explicit motion constraints. Experiments on curated sequential image data demonstrate that StycoGAN improves temporal stability while maintaining competitive spatial realism. Quantitative evaluations show enhanced perceptual quality and temporal coherence compared with frame-independent and temporal baseline models, and qualitative results reveal reduced style flickering across frames. These findings indicate that feature-level temporal regularization offers an effective and flexible solution for perceptually stable sequential image synthesis.
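
To make the two core ideas concrete, the following is a minimal PyTorch sketch of (1) a ConvLSTM cell of the kind that could be inserted at an intermediate feature level of a style-based generator, and (2) a feature-level style-drift penalty that matches channel-wise feature statistics between consecutive frames. All names, shapes, and the exact statistics-matching form of the loss are illustrative assumptions for this sketch; the paper's actual formulation and released code may differ.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvLSTMCell(nn.Module):
    """Standard ConvLSTM cell: all four gates computed by a single convolution
    over the concatenated input and hidden state."""
    def __init__(self, channels: int, kernel_size: int = 3):
        super().__init__()
        self.gates = nn.Conv2d(2 * channels, 4 * channels,
                               kernel_size, padding=kernel_size // 2)

    def forward(self, x, state):
        h, c = state
        i, f, o, g = self.gates(torch.cat([x, h], dim=1)).chunk(4, dim=1)
        c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
        h = torch.sigmoid(o) * torch.tanh(c)
        return h, (h, c)

def styco_consistency_loss(feats):
    """Hypothetical style-drift penalty: match per-channel feature means and
    standard deviations (statistics commonly tied to style) between consecutive
    frames, without constraining spatial content or motion directly."""
    loss = feats[0].new_zeros(())
    for prev, cur in zip(feats[:-1], feats[1:]):
        loss = loss + F.mse_loss(cur.mean(dim=(2, 3)), prev.mean(dim=(2, 3)))
        loss = loss + F.mse_loss(cur.std(dim=(2, 3)), prev.std(dim=(2, 3)))
    return loss / (len(feats) - 1)

# Toy usage: run a short feature sequence through the temporal cell, then
# penalize style drift on the resulting hidden states. The random tensors
# stand in for a generator's mid-layer feature maps.
T, B, C, H, W = 4, 2, 64, 16, 16
cell = ConvLSTMCell(C)
state = (torch.zeros(B, C, H, W), torch.zeros(B, C, H, W))
feats = []
for t in range(T):
    h, state = cell(torch.randn(B, C, H, W), state)
    feats.append(h)
print(styco_consistency_loss(feats))

Because the penalty acts on feature statistics rather than pixels, it leaves the generator free to change spatial content across frames while keeping its style modulation stable, which is the behavior the abstract attributes to feature-level temporal regularization.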
