Munich Sentence (MuSe) Database – Completion norms and audio recordings for 619 German sentences

Elisabeth Friederike Sterner
Maximilian Stadler
Franziska Knolle

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Prediction is a core feature of language, which is widely studied across research domains. The Munich Sentence (MuSe) database enhances reproducibility by providing sentence completion norms for 619 German sentences, including cloze probabilities and entropy estimates from up to 232 participants. Sentence completions were collected in two online studies in which participants completed sentence beginnings with a single-word response after either hearing (auditory sample, N = 133) or reading (visual sample, N = 98) the sentence beginning. All responses were manually preprocessed to correct typos and spelling mistakes and to label grammatical errors, proper nouns, and singular and plural variants of the same response. In addition to the sentence norms, we provide trial-level data with participant-level demographic information and subclinical autistic and schizotypal trait measures. Together with open access R-Scripts or our webtool, this allows for tailoring the cleaning and norming steps to integrate individual difference measures. For a subset of 479 sentence beginnings, the database also includes professional audio recordings of sentence beginnings which can be flexibly combined with 531 recordings of unique sentence-final words and implemented in auditory language paradigms. All material is freely accessible via the Open Science Framework (https://osf.io/ktnze/overview) and the MuSe webtool (https://munichsentencedatabase.franziskaknolle.com/).

Version published to 10.31234/osf.io/evr24_v1 on OSF Preprints
Oct 26, 2025

What did you said? Differential impacts of acoustic challenge on semantic, syntactic, and prediction-related ERPs during speech processing

This article has 5 authors:
1. Jack Silcox
2. Karen Bennett
3. David Strayer
4. Sarah Hargus Ferguson
5. Brennan Payne
This article has no evaluationsLatest version Jan 28, 2026
The Author Recognition Test for Rioplatense Spanish

This article has 3 authors:
1. Eleonora Achugar
2. Catalina Etcheverry
3. Álvaro Cabana
This article has no evaluationsLatest version Jan 29, 2026
Cross-linguistic zero-shot communication via ad-hoc pseudowords

This article has 3 authors:
1. Fritz Guenther
2. Aliona Petrenco
3. Daniele Gatti
This article has no evaluationsLatest version Jan 5, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

What did you said? Differential impacts of acoustic challenge on semantic, syntactic, and prediction-related ERPs during speech processing

The Author Recognition Test for Rioplatense Spanish

Cross-linguistic zero-shot communication via ad-hoc pseudowords