Sport Classification from Multi-Player Trajectories with Set-over-Time Aggregation

Sawet Somnugpong

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Sport classification is commonly studied from RGB video, but multi-player trajectories also provide a compact representation of movement, spacing, and interaction patterns. This paper investigates sport classification in a trajectory-only setting and introduces a set-over-time formulation for multi-player inputs. Each clip is represented as a sequence of frames, where each frame contains an unordered set of visible player trajectories, and the method is evaluated on a four-sport classification task built from MultiSports actor tubes. The proposed model first encodes the player set in each frame with permutation-invariant aggregation and then performs temporal aggregation for clip-level prediction. This design preserves frame-level multi-player structure before temporal summarization. Experiments compare the proposed approach with three compact trajectory baselines: mean pooling, mean-plus-standard-deviation pooling, and GRU-based aggregation. The proposed model achieves the best validation macro-F1 of 0.8793 and test macro-F1 of 0.8614, outperforming the strongest baseline by 0.0386 test macro-F1. Ablation results further show that entity weighting, temporal variability modeling, time attention, and the joint use of position and motion cues all contribute to performance. Error analysis indicates that trajectory-only recognition is effective, but confusion remains among team sports with partially similar collective motion patterns. Overall, the results show that set-over-time aggregation is an effective approach for sport classification from multi-player trajectories.

Version published to 10.21203/rs.3.rs-9214484/v1 on Research Square
Apr 14, 2026

Multi-modal Ensemble Approach for Decoding Player Intentions in Table Tennis

This article has 4 authors:
1. Trung Quang Pham
2. Shotaro Shiba Funai
3. Ryota Kanai
4. Junichi Chikazoe
This article has no evaluationsLatest version May 7, 2026
Smartphone Placement Recognition during Walking: Performance Determinants and Real-World Generalizability

This article has 9 authors:
1. P. Tasca
2. G. Trentadue
3. E. Buckley
4. S. Sun
5. M. Long
6. N. Ireson
7. F. Ciravegna
8. V. Lanfranchi
9. A. Cereatti
This article has no evaluationsLatest version May 14, 2026
Reducing Video Verification Burden: Machine Learning Classification of Head Acceleration Events in Youth Football

This article has 7 authors:
1. Giovanny A. Romero A.
2. Josh Cherian
3. N. Stewart Pritchard
4. Joel D. Stitzel
5. Lyndia C. Wu
6. Ryan S. McGinnis
7. Jillian E. Urban
This article has no evaluationsLatest version May 20, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Multi-modal Ensemble Approach for Decoding Player Intentions in Table Tennis

Smartphone Placement Recognition during Walking: Performance Determinants and Real-World Generalizability

Reducing Video Verification Burden: Machine Learning Classification of Head Acceleration Events in Youth Football