Multi-modal Ensemble Approach for Decoding Player Intentions in Table Tennis

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

This study aims to predict human intentions during intense sports activities, specifically in table tennis. Using a publicly available Real World Table Tennis dataset containing simultaneous EEG and video recordings, we developed a series of participant-specific classifiers for nine players (7 males and 2 females; age range 18–30), based on pose features and EEG signals. The pose-based classifier used a stochastic gradient descent model with logistic loss, whereas the EEG-based classifier employed a modified convolutional neural network architecture (EEGNet). Both classifiers successfully predicted left-right attack intentions from the time windows preceding racket-ball impact, with optimal decoding occurring at −100 ms for pose features and −500 ms for EEG signals. EEG-based decoding achieved higher performance than pose-based decoding, and a multi-modal ensemble further improved prediction, reaching a mean macro F1 score of 0.563 (bootstrapped 95% CI: 0.523–0.603), corresponding to gains of +0.03 over pose-only and +0.02 over EEG-only classifiers. Because each classifier is trained independently, the ensemble can be feasibly extended to incorporate additional modalities in the future. These results suggest potential applications in neural prosthetic systems and neurofeedback tools for sports training.

Article activity feed