Transforming Urban Planning through Machine Learning: A Study on Planning Application Classification using Natural Language Processing

Yang Lin
William Thackway
Balamurugan Soundararaj
Serryn Eagleson
Hoon Han
Christopher Pettit

Abstract

Planning for sustainable urban growth is a pressing challenge facing many cities. Investigating proposed changes to the built environment can give planners and policymakers the information they need to understand future urban development trends and related infrastructure requirements. In this context, we have developed a novel urban analytics approach that uses planning application (PA) data and Natural Language Processing (NLP) techniques to forecast the housing supply pipeline in Australia. First, we implement a data processing pipeline that scrapes, geocodes, and filters PA data from council websites and planning portals, providing the first nationally available daily dataset of PAs currently under consideration. Second, we classify the collected PAs into four distinct urban development categories, selected based on infrastructure planning provisioning requirements. Of the five model architectures tested, the fine-tuned DeBERTa-v3 model performs best, with an accuracy and F1-score of 0.944. This demonstrates the suitability of fine-tuned Pre-trained Language Models (PLMs) for planning text classification tasks. Finally, the model is applied to classify and map urban development trends in Australia’s two largest cities, Sydney and Melbourne, for 2021-2022 and 2023-2024. The resulting maps serve as a face-validation test of the classification model and demonstrate the utility of PA insights for planners. Overall, the paper demonstrates the potential for NLP to enrich urban analytics by integrating previously inaccessible planning text data into planning analysis and decisions.
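To make the two reported metrics concrete, the following is a minimal sketch of how accuracy and a support-weighted F1-score can be computed for a four-class, single-label classifier. It is not the authors' evaluation code. The abstract does not name the four categories or state how the F1-score was averaged, so the labels "A" to "D", the toy predictions, and the choice of weighted averaging are all assumptions made for illustration.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_f1(y_true, y_pred):
    """F1 per label, using F1 = 2*TP / (2*TP + FP + FN)."""
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def weighted_f1(y_true, y_pred):
    """Per-class F1 averaged with weights equal to each class's true count.

    Weighted averaging is an assumption; the abstract does not say which
    averaging scheme produced the reported score.
    """
    support = Counter(y_true)
    scores = per_class_f1(y_true, y_pred)
    return sum(scores[label] * n for label, n in support.items()) / len(y_true)


# Hypothetical placeholder labels for four development categories.
y_true = ["A", "A", "B", "B", "C", "C", "D", "D"]
y_pred = ["A", "A", "B", "C", "C", "C", "D", "A"]

print(f"accuracy    = {accuracy(y_true, y_pred):.3f}")
print(f"weighted F1 = {weighted_f1(y_true, y_pred):.3f}")
```

In single-label multi-class problems, micro-averaged F1 is always equal to accuracy, while weighted or macro-averaged F1 can differ from it, as the toy data here shows.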

Version published to 10.31219/osf.io/fs76e on OSF Preprints, Oct 25, 2024

Related articles

From Hazards to Settlement Planning in Nepal: Using Machine learning and GIS to assess impact of critical Flood and Landslide Overlays

This article has 4 authors:
1. Narayan Thapa
2. Kabir Uddin
3. Rajesh Bahadur Thapa
4. Erica Udas
This article has no evaluations. Latest version: Jan 20, 2026

Machine Learning Driven Land Surface Temperature Prediction and Urban Heat Risk Assessment in The Gambia

This article has 5 authors:
1. Rodrigue Samb
2. Adyasha Jena
3. S. Manavvi
4. Uttam Kumar Roy
5. Basant Yadav
This article has no evaluations. Latest version: Dec 18, 2025

Planning for Low-Carbon Urban Futures in Greater Cairo Region

This article has 1 author:
1. Taher Osman
This article has no evaluations. Latest version: Jan 9, 2026
