Airbnb Pricing Prediction Using Machine Learning: A Case Study on Seattle Listings

Aishwaryaa Vasudevan

Abstract

The growth of peer-to-peer accommodation platforms has transformed the tourism and hospitality industry by introducing decentralized, host-driven pricing systems. However, many Airbnb hosts rely on intuition or limited platform recommendations to set nightly rates, often resulting in inconsistent pricing strategies. This study develops and evaluates a machine-learning model for predicting Airbnb prices using publicly available data from Inside Airbnb for Seattle, Washington. The analysis integrates listing, review, and calendar data to identify key determinants of nightly rates. Following extensive data cleaning and feature engineering, three predictive models were tested: Linear Regression, Ridge Regression, and Random Forest Regression. The Random Forest model achieved the best performance, with an R² of 0.726 and a mean absolute error (MAE) of approximately $51 per night. Cross-validation and multi-seed testing confirmed model stability and reproducibility. Feature-importance analysis revealed that property capacity and amenity richness were the strongest predictors of price, while neighborhood tier and host activity contributed moderately. These findings reinforce hedonic pricing theory by demonstrating that tangible property characteristics explain most pricing variation in peer-to-peer rentals. The study contributes a reproducible and interpretable framework for short-term rental analytics, offering practical guidance for hosts, policymakers, and researchers seeking to understand data-driven pricing in the sharing economy.
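The two evaluation metrics named in the abstract, R² and mean absolute error, can be illustrated with a minimal sketch. It uses only the Python standard library and the usual textbook definitions. The function names and the four example prices are hypothetical values chosen for illustration. They are not taken from the study's data or code, and the outputs do not reproduce the reported R² of 0.726 or MAE of about $51.

```python
from statistics import mean


def mean_absolute_error(y_true, y_pred):
    """Average absolute gap between actual and predicted nightly prices."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def r_squared(y_true, y_pred):
    """Share of price variance explained: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all true values are equal")
    return 1 - ss_res / ss_tot


# Hypothetical nightly prices in USD (illustration only, not study data).
actual = [100, 150, 200, 250]
predicted = [110, 140, 210, 230]

print(mean_absolute_error(actual, predicted))  # 12.5
print(round(r_squared(actual, predicted), 3))  # 0.944
```

Read this way, an MAE of roughly $51 means predictions miss the listed nightly rate by about $51 on average, and an R² of 0.726 means the model accounts for about 73% of the variance in price.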

Version published to 10.20944/preprints202510.0449.v1
Oct 6, 2025

Related articles

Detecting financial misstatements in emerging markets: a machine learning approach

This article has 3 authors:
1. Hoa Thi Thanh Tieu
2. Thanh Hien Hoang
3. Hung Ngoc Tran
This article has no evaluations. Latest version: Oct 1, 2025

Comparative Analysis of Machine Learning Models for House Price Prediction: From Linear Regression to Boosted Trees

This article has 2 authors:
1. Mahim Al Muntashir Billah
2. Tasrifa Sarker
This article has no evaluations. Latest version: Oct 14, 2025

Quantifying the Housing Price Premium of Walkable Urban Nature in Shanghai: Evidence from Explainable Boosting Models

This article has 2 authors:
1. Shixu Zhang
2. Yunlong Song
This article has no evaluations. Latest version: Oct 14, 2025
