Do Large Language Models know who did what to whom?
Abstract
Large Language Models (LLMs) are commonly criticized for not “understanding” language. However, many critiques target cognitive abilities that, in humans, are distinct from language processing. Here, we instead study a kind of understanding tightly linked to language: inferring “who did what to whom” (thematic roles) in a sentence. Does word prediction, the common training objective of LLMs, result in sentence representations that capture thematic roles? In two experiments, we characterized sentence representations in four LLMs. Unlike human similarity judgments, the overall representational similarity of sentence pairs in LLMs reflected syntactic similarity but not whether their agent and patient assignments were identical or reversed. Furthermore, we found little evidence that thematic role information was available in any subset of hidden units. However, some attention heads robustly captured thematic roles, independently of syntax. Therefore, LLMs can extract thematic roles, but, unlike in humans, this information only weakly influences their representations.
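To make the representational-similarity comparison concrete, here is a minimal sketch (not the authors' code): it compares the hidden-state similarity of a sentence to a role-reversed version (same syntax, swapped agent and patient) and to a passive paraphrase (same roles, different syntax). The choice of GPT-2, mean pooling over the final layer, and the example sentences are illustrative assumptions, not the paper's exact setup.

```python
# Sketch: does hidden-state similarity track thematic roles or syntax?
# Assumes a HuggingFace causal LM (GPT-2 here) and mean-pooled hidden states
# as a simple sentence representation; both choices are illustrative.
import torch
from transformers import AutoTokenizer, AutoModel

model_name = "gpt2"  # the paper tests four LLMs; GPT-2 stands in here
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_hidden_states=True)
model.eval()

def sentence_embedding(sentence: str, layer: int = -1) -> torch.Tensor:
    """Mean-pool the hidden states of one layer as a sentence representation."""
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    hidden = outputs.hidden_states[layer]   # shape: (1, seq_len, dim)
    return hidden.mean(dim=1).squeeze(0)    # shape: (dim,)

def cosine(a: torch.Tensor, b: torch.Tensor) -> float:
    return torch.nn.functional.cosine_similarity(a, b, dim=0).item()

base     = "The dog chased the cat."
reversed = "The cat chased the dog."          # roles swapped, syntax identical
passive  = "The cat was chased by the dog."   # roles identical, syntax changed

e_base, e_rev, e_pas = map(sentence_embedding, (base, reversed, passive))
print("role-reversed pair :", cosine(e_base, e_rev))
print("syntactic variant  :", cosine(e_base, e_pas))
```

On the pattern reported in the abstract, the role-reversed pair would score as more similar than the syntactic variant, even though its meaning differs in exactly the "who did what to whom" sense; human judgments show the opposite ordering.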