Travel Time Prediction from Sparse Open Data

Geoff Boeing
Yuquan Zhou

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Travel time prediction is central to transport geography and planning's accessibility analyses, sustainable transportation infrastructure provision, and active transportation interventions. However, calculating accurate travel times, especially for driving, requires either extensive technical capacity and bespoke data, or resources like the Google Maps API that quickly become prohibitively expensive to analyze thousands or millions of trips necessary for metropolitan-scale analyses. Such obstacles particularly challenge less-resourced researchers, practitioners, and community advocates. This article argues that a middle-ground is needed to provide reasonably accurate travel time predictions without extensive data or computing requirements. It introduces a free, open-source minimally-congested driving time prediction model with minimal cost, data, and computational requirements. It trains and tests this model using the Los Angeles, California urban area as a case study by calculating naïve travel times from open data then developing a random forest model to predict travel times as a function of those naïve times plus open data on turns and traffic controls. Validation shows that this interpretable machine learning method offers a superior middle-ground technique that balances reasonable accuracy with minimal resource requirements.

Version published to 10.31235/osf.io/qepc6_v1 on OSF Preprints
Feb 15, 2026

A Transportation System Performance and Safety Analysis of Juba, South Sudan

This article has 2 authors:
1. William F. Lyons
2. Moses Tefe
This article has no evaluationsLatest version Apr 17, 2026
Rainfall–Road Synergy and Landslide Risk Mapping in the Nepal Himalayas: A GIS–MCDA Framework with Level-4 Citizen Science Validation

This article has 4 authors:
1. Narayan Thapa
2. Sushant Sharma
3. Reshma Shrestha
4. Mukesh Thapa
This article has no evaluationsLatest version Apr 17, 2026
Geospatial Machine Learning for Predicting Flash Flood Response at Ungauged Appalachian Watersheds: Terrain, Soil, and Land Cover Controls

This article has 1 author:
1. Sujan Bhattarai
This article has no evaluationsLatest version Apr 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Transportation System Performance and Safety Analysis of Juba, South Sudan

Rainfall–Road Synergy and Landslide Risk Mapping in the Nepal Himalayas: A GIS–MCDA Framework with Level-4 Citizen Science Validation

Geospatial Machine Learning for Predicting Flash Flood Response at Ungauged Appalachian Watersheds: Terrain, Soil, and Land Cover Controls