Detecting Illicit Investment in Real Estate: A Machine‑Learning Approach to Rare‑Event AML Risk
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Illicit investment in residential real estate poses persistent challenges for anti–money‑laundering (AML) enforcement due to ownership opacity, fragmented data, and the scarcity of confirmed criminal cases. This study evaluates supervised machine‑learning and neural‑network models to determine their ability to detect properties linked to criminal activity and to identify the most influential predictors of illicit investment. Using a rare‑event dataset and cross‑validated modelling framework, the analysis shows that meaningful patterns can be detected despite extreme class imbalance. Across logistic regression, Random Forest, XGBoost, CART oversampling experiments, and an artificial neural network, consistent predictors—market_value, owner_legal_person, owner_owns_multiple, land_acres, and out_of_state_owner—emerge as central risk indicators. The findings support Rational Choice Theory by illustrating how offenders exploit structural vulnerabilities to maximize utility. Policy implications include property‑level risk scoring, early‑warning systems, and enhanced support for gatekeepers. Machine‑learning approaches show strong potential to strengthen real‑estate AML frameworks.