Spatio‐Temporal Forecasting of Divvy Bike‐Share Demand and Trip Duration Using Gradient‐Boosted Decision Trees

Fatemeh Noorizadehsalout

Abstract

This study develops and validates a transparent, high-precision framework for forecasting monthly trip volumes and average trip durations in Chicago’s Divvy bike-share system, giving urban planners reliable, data-driven insights without relying on complex deep-learning architectures.

We assembled data for January–December 2024, including Divvy trip records, daily weather (temperature, precipitation, snowfall), and community-area covariates (median income, population, and transit-stop densities). We engineered nine predictors for each community area, including a weekend indicator and one- and seven-month lags of the response (trip count or duration), to capture both demand inertia and seasonal fluctuations. A gradient-boosted decision-tree model (LightGBM) was trained in R, with hyperparameter tuning via grid search. Performance was evaluated using two complementary strategies: (1) a 10-fold "leave-one-community-area-out" spatial cross-validation to prevent spatial leakage and assess generalizability across distinct geographic contexts; and (2) a 10% stratified hold-out of community areas, sampled across low, medium, and high ridership (or duration) tiers, to balance the bias-variance trade-off and support early stopping.

Hyperparameter tuning reduced cross-validation RMSE for trip counts from 3,314 to 2,341 rides/month (37.5% of the mean). On the stratified hold-out, the final count model achieved an RMSE of 274 rides/month (8.4% of the hold-out mean). Applying the same pipeline to average trip duration yielded a hold-out RMSE of 0.36 minutes (2% of mean duration). Feature-importance analysis revealed that the one-month lag accounts for about 96% of predictive gain, with weather and spatial context each contributing less than 2%.

A simple LightGBM framework, anchored by lagged demand and enriched with contextual covariates, delivers ≤ 8% error for trip counts and ≤ 2% for durations, offering a practical and interpretable forecasting tool for urban mobility planning without the need for deep-learning architectures.
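
The mechanics described in the abstract (lagged predictors, folds that hold out whole community areas, and RMSE reported as a share of the mean) can be illustrated with a minimal Python sketch. It is not the author's R pipeline: the function names, the toy ride counts, the area ids 1 to 77, and the choice to skip months that lack a full set of lags are all assumptions made for illustration, and a persistence baseline (predict this month with last month's value) stands in for LightGBM.

```python
import math


def add_lags(series, lags=(1, 7)):
    """Build one row per month that has every requested lag available."""
    first = max(lags)  # earlier months lack a full set of lags and are skipped
    rows = []
    for t in range(first, len(series)):
        row = {"month_index": t, "y": series[t]}
        for k in lags:
            row[f"lag_{k}"] = series[t - k]
        rows.append(row)
    return rows


def grouped_folds(area_ids, n_folds=10):
    """Split area ids into folds so each area sits in exactly one fold."""
    unique = sorted(set(area_ids))
    folds = [[] for _ in range(min(n_folds, len(unique)))]
    for i, area in enumerate(unique):
        folds[i % len(folds)].append(area)
    return folds


def rmse(y_true, y_pred):
    """Root-mean-square error between observed and predicted values."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))


def relative_rmse(y_true, y_pred):
    """RMSE divided by the mean of the observed values."""
    return rmse(y_true, y_pred) / (sum(y_true) / len(y_true))


# Made-up monthly ride counts for one hypothetical community area (Jan-Dec).
toy_counts = [100, 110, 130, 160, 200, 240, 260, 250, 210, 170, 120, 100]
rows = add_lags(toy_counts)

# Persistence baseline (an assumption, not the paper's model):
# predict this month's count with last month's count.
y_true = [r["y"] for r in rows]
y_pred = [r["lag_1"] for r in rows]
print(len(rows), rmse(y_true, y_pred), relative_rmse(y_true, y_pred))

# Hypothetical area ids 1..77; each fold would be held out in turn as the test set.
folds = grouped_folds(range(1, 78))
print([len(f) for f in folds])
```

Under these assumptions, twelve months of data and a seven-month lag leave only five months per area with both lags available. The abstract does not say how months with missing lags were handled, so the actual training set may differ.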

Version published to 10.21203/rs.3.rs-6709649/v1 on Research Square
May 29, 2025

Related articles

Material Flow Prediction Task Based On TCN-GRU Deep Fusion Model

This article has 3 authors:
1. Pingmei Fan
2. Keyan Liu
3. Ziang Qi
This article has no evaluations. Latest version: May 28, 2025
Interpretable Slow-Moving Inventory Forecasting: A Hybrid Neural Network Approach with Interactive Visualization

This article has 1 author:
1. Ruolin Qi
This article has no evaluations. Latest version: May 19, 2025
Zero-Shot Traffic Flow Prediction with Large Language Models: A Comparison with Deep Learning Approaches

This article has 3 authors:
1. Yue Li
2. Qunshan Zhao
3. Mingshu Wang
This article has no evaluations. Latest version: May 8, 2025
