Machine Learning for Urban Air Quality Prediction Using Google AlphaEarth Foundations Satellite Embeddings: A Case Study of Quito, Ecuador

Cesar Ivan Alvarez
Carlos Andrés Ulloa Vaca
Neptali Armando Echeverria Llumipanta

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Many Global-South cities lack dense monitoring and suffer persistent cloud cover, hampering fine-scale trend detection. This study evaluates the potential of annual multi-sensor satellite embeddings from the AlphaEarth Foundations model in Google Earth Engine to predict and map major air pollutants in Quito, Ecuador, between 2017 and 2024. The 64-dimensional embeddings integrate Sentinel-1 radar, Sentinel-2 optical imagery, Landsat surface reflectance, ERA5-Land climate variables, GRACE terrestrial water storage, and GEDI canopy structure into a compact representation of surface and climatic conditions. Annual median concentrations of NO2, SO2, PM2.5, CO, and O3 from the Red Metropolitana de Monitoreo Atmosférico de Quito (REEMAQ) were paired with collocated embeddings and modeled using five machine learning algorithms. Support Vector Regression achieved the highest accuracy for NO2 and SO2 (R2 = 0.71 for both), capturing fine-scale spatial patterns and multi-year changes, including COVID-19 lockdown-related reductions. PM2.5 and CO were predicted with moderate accuracy, while O3 remained challenging due to its short-term photochemical and meteorological drivers and the mismatch with annual aggregation. SHAP analysis revealed that a small subset of embedding bands dominated predictions for NO2 and SO2. The approach provides a scalable and transferable framework for high-resolution urban air quality mapping in data-scarce environments, supporting long-term monitoring, hotspot detection, and evidence-based policy interventions.

Version published to 10.3390/rs17203472
Oct 17, 2025
Version published to 10.20944/preprints202508.1202.v1
Aug 18, 2025

Machine Learning Driven Land Surface Temperature Prediction and Urban Heat Risk Assessment in The Gambia

This article has 5 authors:
1. Rodrigue Samb
2. Adyasha Jena
3. S. Manavvi
4. Uttam Kumar Roy
5. Basant Yadav
This article has no evaluationsLatest version Dec 18, 2025
Interpretable Multi-Horizon Machine Learning Framework for PM₂.₅ Forecasting in Tashkent: Toward Early-Warning Air Quality Management

This article has 2 authors:
1. Moulay Rachid Babaa
2. Otabek Atabaev
This article has no evaluationsLatest version Dec 18, 2025
Machine Learning-Based Modeling of Land Surface Temperature in Lagos, Nigeria: Integrating Canopy Structure, Built Environment, and Surface Reflectance Variables

This article has 4 authors:
1. Peter Edemewe Ugege
2. Ayodotun Olufemi Bobadoye
3. Peace Amarachi Ukoha
4. John Oseyomon Aigbokhan
This article has no evaluationsLatest version Dec 23, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Machine Learning Driven Land Surface Temperature Prediction and Urban Heat Risk Assessment in The Gambia

Interpretable Multi-Horizon Machine Learning Framework for PM₂.₅ Forecasting in Tashkent: Toward Early-Warning Air Quality Management

Machine Learning-Based Modeling of Land Surface Temperature in Lagos, Nigeria: Integrating Canopy Structure, Built Environment, and Surface Reflectance Variables