A Geospatial Approach to Measuring Economic Activity
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
We introduce a new methodology to detect and measure economic activity using geospatial data and apply it to steel production, a major industrial pollution source worldwide. Combining plant output data with geospatial data, such as ambient air pollutants, nighttime lights, and temperature, we train machine learning models to predict plant locations and output. We identify about 40% (70%) of plants missing from the training sample within a 1 km (5 km) radius and achieve R^2 above 0.8 for output prediction at a 1 km grid and at the plant level, as well as for both regional and time series validations. We apply the model trained on Chinese steel data to North Korea and use it to identify steel locations and output absent from official statistics but consistent with independent open-source and defector-based reports. Our approach can be adapted to other industries and regions, and used by policymakers and researchers to track and measure industrial activity in near real time.