Masked Prediction of Onomatopoeic Expressions in Depressive Tweets Using LLMs

Abstract

Onomatopoeia is a key linguistic feature for expressing emotions and psychological states. As large language models (LLMs) are increasingly applied in mental health contexts, it is important to examine whether they can interpret onomatopoeia appropriately. This study evaluates the capabilities of various LLMs through a masked prediction task using 319 depressive tweets in Japanese, where onomatopoeic expressions were masked and models were prompted to predict them. We compared the performance of three LLMs—GPT-4o mini, GPT-4 Turbo, and o1-preview—against a fine-tuned BERT model and human annotators, employing several prompt strategies including ReAct-style reasoning and dictionary-based in-context learning. Our results show that o1-preview achieved prediction accuracy comparable to human annotators. In contrast, the fine-tuned BERT model performed significantly worse, highlighting the limitations of conventional masked language models in handling emotionally nuanced expressions and adapting to prompt-based tasks. These findings suggest that LLMs, despite lacking physical embodiment, can effectively predict emotionally charged language such as onomatopoeia through large-scale pretraining and inference capabilities.
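To make the task setup concrete, below is a minimal sketch of the masked prediction prompt described in the abstract, assuming the OpenAI chat completions API. The prompt wording, the [MASK] placeholder, and the example tweet are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch of the masked-onomatopoeia prediction task.
# Assumptions (not from the paper): the exact prompt wording, the [MASK]
# placeholder, and the example tweet are illustrative only.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def predict_masked_onomatopoeia(masked_tweet: str, model: str = "gpt-4o-mini") -> str:
    """Ask an LLM to fill in the masked onomatopoeic expression in a Japanese tweet."""
    prompt = (
        "次のツイートの [MASK] に入るオノマトペを1語だけ答えてください。\n"
        f"ツイート: {masked_tweet}"
    )
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content.strip()


# Hypothetical example: the original onomatopoeia (e.g. 「ずきずき」) has been masked out.
print(predict_masked_onomatopoeia("頭が [MASK] して何も手につかない。"))
```

A fine-tuned BERT baseline would instead treat this as a standard fill-mask problem over its own [MASK] token; the LLM variants differ mainly in how the prediction is elicited through prompting (for example, ReAct-style reasoning or dictionary-based in-context examples).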
