Highly Accurate 3D Estimation for Occluded Human Pose

Bojun Xie
Xuyang Hao
Junfen Chen
Wushan Feng

Abstract

3D human pose estimation is a crucial task in computer vision. Its main aim is to estimate the spatial coordinates of key human body joints from monocular images. However, occlusion remains a challenging problem that hinders practical application. To address it, a novel method based on an attention mechanism and a distillation learning framework is proposed for unsupervised 3D human pose estimation. Because occluded joints often have multiple motion solutions, a pose filling network based on an attention mechanism is proposed to accurately predict the coordinates of the occluded joints. To better establish the relationship between the 3D pose and occluded 2D poses, a distillation learning framework is employed. More precisely, the teacher network takes the complete 2D pose as input and can therefore produce a more accurate 3D pose. The student network is trained on occluded 2D poses, with the teacher network's output as the target, to establish a more robust dependency. Moreover, a simple yet effective data augmentation method is incorporated, which improves performance by increasing data diversity. Without occlusion, the proposed method achieves PA-MPJPE values 2.3 and 5.2 lower than those of the benchmark method on the Human3.6M and MPI-INF-3DHP datasets, respectively. Moreover, when 5 out of 17 joints are occluded, the PA-MPJPE of our method is reduced to 30.3, compared with 72.7 for a competitor. Extensive experiments demonstrate the efficiency of the proposed method.
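For readers unfamiliar with the metric reported above, PA-MPJPE is commonly computed by rigidly aligning the predicted pose to the ground truth (translation, rotation, and scale via a Procrustes fit) and then averaging the per-joint Euclidean distances. The following is a minimal NumPy sketch of that standard definition, not the authors' evaluation code; the function name `pa_mpjpe` and the `(J, 3)` array layout are assumptions made for illustration.

```python
import numpy as np

def pa_mpjpe(pred, gt):
    """Procrustes-aligned mean per-joint position error.

    pred, gt: arrays of shape (J, 3) holding J joint positions.
    (Hypothetical helper; layout and name are assumptions.)
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)

    # Remove translation by centring both poses on their mean joint.
    mu_pred, mu_gt = pred.mean(axis=0), gt.mean(axis=0)
    p, g = pred - mu_pred, gt - mu_gt

    # Optimal rotation from the SVD of the cross-covariance matrix.
    U, S, Vt = np.linalg.svd(p.T @ g)
    # Flip the last axis if needed so the result is a proper rotation
    # rather than a reflection.
    d = -1.0 if np.linalg.det(Vt.T @ U.T) < 0 else 1.0
    D = np.diag([1.0, 1.0, d])
    R = Vt.T @ D @ U.T

    # Least-squares optimal isotropic scale.
    scale = (S * np.diag(D)).sum() / (p ** 2).sum()

    # Apply the similarity transform, then average the joint errors.
    aligned = scale * p @ R.T + mu_gt
    return np.linalg.norm(aligned - gt, axis=1).mean()
```

Because the alignment absorbs global rotation, translation, and scale, a prediction that differs from the ground truth only by such a transform scores approximately zero; the metric therefore measures errors in the articulated pose itself.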

Version published to 10.21203/rs.3.rs-5260195/v1 on Research Square
Oct 16, 2024

Related articles

Vision-Based Human Pose Estimation for Intelligent Sports Training and Teaching Assistance

This article has 1 author:
1. Chen Lu
This article has no evaluations. Latest version Jan 22, 2026

SPARK: Sparse-Perception Action Recognition with Keyframes for Quadruped Robots

This article has 2 authors:
1. Sehun Park
2. Andrew Jaeyong Choi
This article has no evaluations. Latest version Dec 10, 2025

Experimental Verification of a Monocular CNN-Based Pose Estimation Algorithm for the SROC Mission

This article has 5 authors:
1. Antonio D'Ortona
2. Lucrezia Lovaglio
3. Marco Visca
4. Fabrizio Stesina
5. Sabrina Corpino
This article has no evaluations. Latest version Jan 21, 2026
