A Modular Prototype of Emotion-Aware Proactive Voice Agent with Live2D Embodiment

Jae Young Suh
Mingyu Jeon

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

We present a voice-based conversational agent that advances beyond reactive dialogue by integrating speech-to-text transcription with Whisper, emotion recognition, simple policy mechanisms, and Live2D embodiment. The system delivers supportive guidance either as inline prompts or card-style recommendations, while empathetic dialogue and expressive avatar cues enhance both transparency and user engagement. A log-based evaluation across ten sessions showed consistent stability, with an average latency of 7.1 seconds. This prototype illustrates a practical foundation for developing emotion-aware, proactive companions aligned with the vision of human-centered dialogue systems.

Version published to 10.31224/5993
Dec 15, 2025

A Character-Based Korean Tourist Dialogue System with Proactive Recommendations and Live2D Embodiment

This article has 2 authors:
1. Jae Young Suh
2. Mingyu Jeon
This article has no evaluationsLatest version Dec 15, 2025
Exploring Korean AI Companion Possibilities with Live2D and Empathic Voice Interaction

This article has 2 authors:
1. Jae Young Suh
2. Mingyu Jeon
This article has no evaluationsLatest version Dec 15, 2025
Efficient and Responsible Transformer Based Conversational Agents for Emotionally Supportive Dialogue

This article has 8 authors:
1. DIVYA SALEELA
2. Akhil Mathew Philip
3. Reji R
4. Rincy Merlin Mathew
5. Teena Joseph
6. Sujith Kumar P S
7. Supriya L P
8. Chinchu M S
This article has no evaluationsLatest version Feb 2, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Character-Based Korean Tourist Dialogue System with Proactive Recommendations and Live2D Embodiment

Exploring Korean AI Companion Possibilities with Live2D and Empathic Voice Interaction

Efficient and Responsible Transformer Based Conversational Agents for Emotionally Supportive Dialogue