DeepAQI: A Vision-Based EfficientNet Framework for Air Quality Index Prediction from Environmental Metadata

Yash Mishra
Kedarnath senapati

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This research presents a deep learning–based framework for predicting the Air Quality Index (AQI) using outdoor webcam images from U.S. National Parks. Traditional AQI measurement relies on ground‑based air sensors, which require continuous calibration and are often geographically sparse. Recent advances in computer vision have motivated the exploration of image-based AQI estimation, enabling scalable, low‑cost, and real‑time monitoring in remote regions. Leveraging the publicly available NPS_AQI_DB, which includes more than 120,000 webcam images paired with corresponding pollutant data (O₃, SO₂, RH, temperature), we investigate both image‑only and multi‑modal (image + tabular) models for AQI regression. Our primary architecture is EfficientNet‑B0, chosen for its strong performance–efficiency trade‑off. The model is trained on augmented 224×224 images using AdamW optimization, mixed‑precision training, and robust preprocessing to handle missing and corrupted data. To enhance interpretability, Grad‑CAM visualizations highlight regions influencing AQI prediction, often corresponding to sky visibility, haze thickness, and lighting conditions. Additionally, we evaluate a two‑tower fusion model combining CNN features with meteorological variables, demonstrating improved stability across pollution categories. Experimental results show that the best-performing model achieves MAE ≈ 11.2 and RMSE ≈ 15.3 on the test set, reflecting competitive performance given the inherent noise of visual air estimation. Comprehensive evaluation includes residual analysis, calibration curves, AQI‑bin MAE breakdown, per‑park performance, and temporal error trends. These findings confirm that vision-based AQI prediction is feasible and can supplement traditional monitoring networks, especially in visually accessible but sensor-limited environments. Future work will explore temporal modeling, domain adaptation, and deployment on low-power edge devices.

Version published to 10.21203/rs.3.rs-9062054/v1 on Research Square
Mar 12, 2026

FusionAttNet: Hierarchical Attention-Driven Sentinel- 1/Sentinel-2 Fusion for Semi-Arid Land Cover Classification in Far North Cameroon

This article has 5 authors:
1. Pountianus Berinyuy Wirba
2. Mvogo Joseph Ngono
3. Noumsi Auguste Vigny Woguia
4. Emile Tatinyuy Verdzekov
5. ELE Pierre
This article has no evaluationsLatest version Mar 10, 2026
An Efficient Joint Evolutionary Algorithm-based Neural Network Model for Air Quality Prediction

This article has 6 authors:
1. Peiyang Wei
2. Mingsheng Shang
3. Xi Chen
4. Yuyan Wang
5. Jianhong Gan
6. Shuai Li
This article has no evaluationsLatest version Mar 31, 2026
A Dual Dynamic Feature-based Deep Learning and Computer Vision–Based Model for Multi-Object Classification Using Geospatial Satellite Imagery

This article has 1 author:
1. Doaa Mohey Eldin
This article has no evaluationsLatest version Mar 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

FusionAttNet: Hierarchical Attention-Driven Sentinel- 1/Sentinel-2 Fusion for Semi-Arid Land Cover Classification in Far North Cameroon

An Efficient Joint Evolutionary Algorithm-based Neural Network Model for Air Quality Prediction

A Dual Dynamic Feature-based Deep Learning and Computer Vision–Based Model for Multi-Object Classification Using Geospatial Satellite Imagery