OneStop: A 360-Participant English Eye Tracking Dataset with Different Reading Regimes

Abstract

We present OneStop Eye Movements, a large-scale corpus of eye movements in reading, in which native speakers read newswire texts in English and answer reading comprehension questions. OneStop has 152 hours of passage-reading eye movement recordings from 360 participants for 2.6 million word tokens, more data than all existing public broad-coverage native English eye tracking datasets combined. The eye movement data were collected over extensively piloted reading comprehension materials comprising 486 reading comprehension questions and auxiliary text annotations geared towards behavioral analyses of reading comprehension. OneStop includes multiple reading regimes: ordinary reading, information seeking, repeated reading of the same text, and reading simplified text. The combination of unprecedented size, high-quality reading comprehension materials, and multiple reading regimes aims to enable new research avenues in the study of reading and human language processing. It further aims to facilitate the integration of eye tracking data in Natural Language Processing (NLP), Artificial Intelligence (AI), Human-Computer Interaction (HCI), and educational applications.
