Low-complexity pedestrian intent prediction using contextual stacked ensemble learning
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Walking, as a form of active and sustainable mobility, plays a critical role in future smart transportation systems. Accurate prediction of pedestrian crossing intentions is essential for preventing collisions, particularly with the increasing deployment of autonomous vehicles.Existing approaches to near-miss prevention typically rely on computationally intensive computer vision and deep learning techniques. In contrast, this work proposes CSE, a lightweight contextual stacked ensemble-learning framework to efficiently predict pedestrian crossing intent. Pedestrians are first detected and their visual representation is compressed through skeletonization, and complementary pose, trajectory, and contextual cues are fused using a stacked ensemble model.Experimental results on multiple datasets demonstrate that the proposed approach achieves performance comparable to state-of-the-art pedestrian intent prediction methods while reducing computational complexity by at least $25$ times. This reduction translates directly into a $25\times$ decrease in inference time, enabling deployment on resource-constrained edge devices without compromising accuracy and while avoiding the latency associated with cloud-based processing.