Diving Performance Analysis with 3D Motion Knowledge Hypergraphs
Abstract
Diving actions involve complex temporal dynamics, rapid pose transitions, and strict requirements on entry angle and splash control, making quality assessment a challenging task in computer vision. Existing methods still face limitations in motion structure modeling, depth perception, and multimodal fusion. This paper proposes a multimodal scoring framework that integrates 3D pose reconstruction with a hypergraph neural network to improve the modeling and evaluation of diving actions. We are the first to introduce 3D pose reconstruction into diving score assessment, compensating for the depth-perception limitations of 2D vision by reconstructing complete 3D motion trajectories. To mitigate keypoint errors caused by rapid movement or occlusion, we propose a hypergraph-based spatiotemporal pose fusion model. This model leverages three types of hyperedges (temporal, skeletal, and joint) to build high-order spatiotemporal representations, and incorporates an attention mechanism to adaptively adjust their weights. To capture visual cues such as entry angle and splash patterns, we further design a multimodal fusion module that combines skeletal features with appearance features, significantly enhancing the model's ability to perceive fine details. To address the lack of structured, fine-grained annotations in existing datasets, we also construct the Individual-Diving dataset, which contains 1,023 diving video clips covering 27 action classes and 26 sub-actions, along with frame-wise 3D pose annotations and official scores. Experimental results on the FineDiving and Individual-Diving datasets show that our method consistently outperforms previous approaches such as USDL and CoRe, demonstrating strong performance in diving action modeling and quality assessment.
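To make the two core components of the abstract concrete, the following is a minimal PyTorch sketch, not the paper's implementation: a hypergraph fusion layer that combines three hyperedge types (temporal, skeletal, joint) with learned attention weights, and a multimodal head that concatenates pooled skeletal features with appearance features before regressing a score. All class names, shapes, and the simple HGNN-style update are illustrative assumptions.

```python
import torch
import torch.nn as nn


class HypergraphPoseFusion(nn.Module):
    """Illustrative sketch: attention-weighted fusion over three hyperedge types.

    Assumptions (not from the paper): node features are flattened frame-joint
    nodes, each hyperedge type is given as a binary incidence matrix, and the
    per-type attention is a learned softmax over scalar logits.
    """

    def __init__(self, in_dim: int, out_dim: int, num_edge_types: int = 3):
        super().__init__()
        self.proj = nn.Linear(in_dim, out_dim)
        # One attention logit per hyperedge type (temporal, skeletal, joint).
        self.edge_type_logits = nn.Parameter(torch.zeros(num_edge_types))

    def forward(self, x: torch.Tensor, incidences: list[torch.Tensor]) -> torch.Tensor:
        # x: (N, in_dim) node features, N = num_frames * num_joints
        # incidences: list of (N, E_k) binary incidence matrices, one per type
        alpha = torch.softmax(self.edge_type_logits, dim=0)  # adaptive type weights
        h = self.proj(x)
        out = torch.zeros_like(h)
        for a, H in zip(alpha, incidences):
            deg_v = H.sum(dim=1, keepdim=True).clamp(min=1.0)  # node degrees
            deg_e = H.sum(dim=0, keepdim=True).clamp(min=1.0)  # hyperedge degrees
            # Node -> hyperedge -> node message passing (simple HGNN-style update).
            edge_feat = (H / deg_e).t() @ h          # (E_k, out_dim)
            out = out + a * ((H / deg_v) @ edge_feat)  # (N, out_dim)
        return torch.relu(out)


class MultimodalScorer(nn.Module):
    """Illustrative sketch: fuse skeletal and appearance features for scoring.

    The appearance vector is assumed to come from a video backbone capturing
    cues such as entry angle and splash; the fusion here is plain concatenation
    followed by an MLP regressor.
    """

    def __init__(self, skel_dim: int, app_dim: int, hidden: int = 256):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(skel_dim + app_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, skel_feat: torch.Tensor, app_feat: torch.Tensor) -> torch.Tensor:
        return self.mlp(torch.cat([skel_feat, app_feat], dim=-1))
```

In this sketch the three incidence matrices encode which nodes each hyperedge groups (consecutive frames of one joint, joints along a limb, or all joints in one frame), and the softmax weights let the model emphasize whichever structure is most reliable when rapid motion or occlusion corrupts individual keypoints.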