ViVerse-A1: Ascending Generating Vietnamese Poetry with Semantic and Cultural Fidelity

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Despite their increasing sophistication in content generation, contemporary Large Language Models (LLMs) consistently fail to master the nuanced artistry of Vietnamese poetry, particularly its intricate semantic layers, figurative language, phonaesthetics, and deep-seated cultural resonance. This research confronts this challenge with the introduction of ViVerse-A1, a bespoke neural architecture optimized for Vietnamese poetic expression. ViVerse-A1's novelty lies in its hybrid training methodology, which synergizes targeted fine-tuning with advanced auxiliary frameworks, including a Masked Language Model and our proposed Masked-Token Reasoning Language Model. This design empowers the model to develop a granular understanding of the structural, prosodic, lexical, and diverse conventions of Vietnamese literature. Rigorous comparative analyses confirm that ViVerse-A1's performance surpasses that of leading state-of-the-art models, demonstrating a unique ability to generate poetry that is both technically proficient and expressively profound. The success of ViVerse-A1 signals a promising trajectory for AI-driven systems designed for deep semantic and cultural intelligence. To facilitate further research, the source code and all related materials are provided publicly at https://github.com/tph-kds/ViVerse_A1

Article activity feed