Creating and Validating a Corpus and Dataset of Government-Issued Travel Advisories From the Internet Archives
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Government-issued travel advisories are used by citizens to get information about destination countries for tourism and other purposes such as temporary work stays or permanent relocation plans. However, qualitative evidence suggests that travel advisories may be influenced by considerations beyond current security situations. Systematic and rigorous quantitative analyses of advisories are scarce because relevant data are not readily available and official government websites often provide practical obstacles. We present a pipeline to generate a time-series cross-sectional dataset of government-issued travel advisories for three issuing countries in their native languages based on the Internet Archive's Wayback Machine. We validate our approach with official government data sources that are prohibited to be scraped and used for research to illustrate that our approach provides (near-)complete coverage.