Reading Between the Lines: LLMs Match or Exceed Human Empathic Accuracy Using Text Alone
Abstract
Empathy plays a central role in human emotional relationships. Empathic accuracy, the ability to accurately infer another person’s emotional state, varies by informational modality and, in humans, is often intertwined with emotional and motivational processes. This study examines whether state-of-the-art Large Language Models (LLMs), specifically GPT-4, Claude, and Gemini, demonstrate empathic accuracy, and how their accuracy compares to that of humans when presented with only the semantic content (transcripts of recorded videos) of ecologically valid, complex autobiographical emotional narratives. We compared the empathic accuracy of LLMs to that of human participants (N = 127; randomly sampled students, tested both in-lab and online) who either read the same transcripts or watched the original videos, which allowed them to draw on facial and bodily expressions, as well as paralinguistic cues, in addition to semantics. The LLMs inferred emotional states from semantic content alone with an accuracy equal to or exceeding human performance, both overall and when positive and negative emotions were analyzed separately. Theoretically, these findings suggest that semantic information alone can support high empathic accuracy, though humans may not fully leverage this potential. Practical implications are discussed regarding the use of LLMs in introspective and emotional contexts, while raising critical concerns about privacy, ethical risks, and the potential reshaping of emotional understanding, intimacy, and human connection in an increasingly AI-mediated world.