Exploring Korean AI Companion Possibilities with Live2D and Empathic Voice Interaction

Abstract

This study explores the growing trend of fostering emotional connections with AI through natural voice interactions. It presents a web-based Korean voice interaction system that integrates high-quality speech synthesis with a Live2D character. Research suggests that voice-based communication enhances emotional engagement more effectively than text-based methods, particularly in gaming and virtual reality. The system leverages OpenAI's gpt-4o-audio-preview model to generate expressive speech synchronized with visual elements. Designed to enhance the synergy between auditory and visual channels, it delivers a well-balanced multimodal experience. Quantitative and qualitative evaluations confirm enhanced user immersion and emotional satisfaction, supporting deeper human-AI interactions. This study provides a practical framework for advancing multimodal AI interactions, offering valuable insights for both research and real-world applications in immersive AI communication.
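The abstract describes generating expressive speech with OpenAI's gpt-4o-audio-preview model to drive a Live2D character. A minimal sketch of how such a request might be structured is shown below; the voice, output format, and prompt contents are illustrative assumptions, not details taken from the paper.

```python
def build_audio_request(user_text: str) -> dict:
    """Build a Chat Completions-style payload requesting both text and speech.

    All parameter values below (voice, format, system prompt) are assumed
    for illustration; the paper does not specify them.
    """
    return {
        "model": "gpt-4o-audio-preview",
        "modalities": ["text", "audio"],  # ask for spoken output alongside text
        "audio": {"voice": "alloy", "format": "wav"},  # assumed voice/format
        "messages": [
            {
                "role": "system",
                "content": "You are a friendly Korean-speaking companion.",
            },
            {"role": "user", "content": user_text},
        ],
    }


payload = build_audio_request("안녕하세요! 오늘 기분이 어때요?")
print(payload["model"])
```

The returned audio (e.g. a base64-encoded WAV in the response) would then be played back while the Live2D character's mouth parameters are animated in sync, which is the auditory-visual pairing the abstract emphasizes.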