Munich Sentence (MuSe) Database – Completion norms and audio recordings for 619 German sentences

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Prediction is a core feature of language, which is widely studied across research domains. The Munich Sentence (MuSe) database enhances reproducibility by providing sentence completion norms for 619 German sentences, including cloze probabilities and entropy estimates from up to 232 participants. Sentence completions were collected in two online studies in which participants completed sentence beginnings with a single-word response after either hearing (auditory sample, N = 133) or reading (visual sample, N = 98) the sentence beginning. All responses were manually preprocessed to correct typos and spelling mistakes and to label grammatical errors, proper nouns, and singular and plural variants of the same response. In addition to the sentence norms, we provide trial-level data with participant-level demographic information and subclinical autistic and schizotypal trait measures. Together with open access R-Scripts or our webtool, this allows for tailoring the cleaning and norming steps to integrate individual difference measures. For a subset of 479 sentence beginnings, the database also includes professional audio recordings of sentence beginnings which can be flexibly combined with 531 recordings of unique sentence-final words and implemented in auditory language paradigms. All material is freely accessible via the Open Science Framework (https://osf.io/ktnze/overview) and the MuSe webtool (https://munichsentencedatabase.franziskaknolle.com/).

Article activity feed