Gold student meets star model: Predicting the interpretational diversity of novel compounds in an exploratory-confirmatory approach

Fritz Guenther
Melanie J. Bell
Martin Schäfer

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Almost all linguistic expressions are ambiguous to some extent, and can be interpreted in various different ways. This is especially the case for novel expressions a speaker has never encountered before, in particular combined concepts expressed via compounds such as /gold student/ or /monkey ring/. Although previous studies have shown that word embeddings (meaning representations derived from text-based language models), can encode the interpretational diversity of such expressions, these previous studies have been limited to a small, rigid and high-level closed set of relational interpretations (e.g., `student MADE OF gold', `student ABOUT gold'). In contrast, the present study uses more ecologically-valid open-format interpretations provided by human participants, which are afterwards classified in a bottom-up manner in order to compute quantitative estimates of interpretational diversity. In an exploratory study on pre-existing data, we first investigate what measures derived from word embeddings capture interpretational diversity, with the vector norm of the embeddings emerging as the best predictor. In a subsequent high-powered confirmatory study, we then systematically select new items for maximal variation of this vector norm, and replicate the same pattern. This is the first study to show that text-based language models encode the unconstrained interpretational diversity of linguistic expressions, even within a single vector representation, and even for novel expressions that have never been observed in their training data.

Version published to 10.31234/osf.io/2ypfs_v1 on OSF Preprints
Jul 8, 2025

Emergent Numeric Bias in Large Language Models: An Empirical Study on the Anomalous Recurrence of the Number 27 Across Independent Sessions

This article has 1 author:
1. Som Subhro Nath
This article has no evaluationsLatest version Jul 16, 2025
AI is Misled by GenAI: Stylistic Bias in Automated Assessment of Creativity in Large Language Models

This article has 3 authors:
1. Marek Urban
2. Petra Kmoníčková
3. Kamila Urban
This article has no evaluationsLatest version Jul 28, 2025
Six fallacies in substituting large language models for human participants

This article has 1 author:
1. Zhicheng Lin
This article has no evaluationsLatest version Aug 21, 2025

Listed in

Abstract

Article activity feed

Related articles

Emergent Numeric Bias in Large Language Models: An Empirical Study on the Anomalous Recurrence of the Number 27 Across Independent Sessions

AI is Misled by GenAI: Stylistic Bias in Automated Assessment of Creativity in Large Language Models

Six fallacies in substituting large language models for human participants