The Road Not Taken: Unsaid Word Alternatives are Represented in the Brain

Daria Lioubashevski
Daniel Friedman
Adeen Flinker
Ariel Goldstein

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Human language allows multiple ways to express the same thought, implying that several lexical alternatives may exist in parallel before a single word is spoken or heard. We test the multiple alternatives-co-activation hypothesis by combining high-density ECoG during spontaneous dialogue with behavioral paradigms and ranked next-word predictions from large language models (LLMs). Behaviorally, words that LLMs rank as more likely continuations are recognized faster in a preregistered lexical decision task and produced with shorter inter-word intervals in free speech, indicating graded anticipatory activation of alternatives. Neurally, encoding models reveal that activity in classical language regions (IFG, STG) prior to word onset is predicted by embeddings of multiple top-ranked alternatives, not only by the word ultimately used; critically, mean embeddings that pool the top candidates outperform single-candidate embeddings, and the effect persists with arbitrary (non-semantic) embeddings that control for distributional similarity. Extending beyond a handful of options, encoding strength increases as embeddings are averaged across larger top-k sets, implying that a broad cohort of lexical candidates is simultaneously represented. Finally, models trained in comprehension generalize to production (and vice versa), preserving rank order and suggesting a shared neural code for candidate sets across modalities. Together, these findings provide direct evidence that the brain co-activates unsaid alternatives during natural language use and identify parallel candidate activation as a computational principle common to human comprehension, human production, and artificial language modeling.

Version published to 10.1101/2025.11.09.687421 on bioRxiv
Nov 10, 2025

Contextual Assembly of Lexical Functions in Large Language Models

This article has 3 authors:
1. Chris Kello
2. Polyphony Bruna
3. Kanly Thao
This article has no evaluationsLatest version Sep 23, 2025
Testing the Limits of Cumulative Semantic Interference in Word Production: Implications for Adaptive Models

This article has 2 authors:
1. Stefan Wöhner
2. Jörg D. Jescheniak
This article has no evaluationsLatest version Oct 20, 2025
Core vocabulary reveals differences between human word prediction and large language models

This article has 4 authors:
1. Andrew Wang
2. Simon De Deyne
3. Meredith McKague
4. Andrew Perfors
This article has no evaluationsLatest version Oct 27, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Contextual Assembly of Lexical Functions in Large Language Models

Testing the Limits of Cumulative Semantic Interference in Word Production: Implications for Adaptive Models

Core vocabulary reveals differences between human word prediction and large language models