Emergent Vision Technology: 3D Human Pose Estimation for Single-Pixel Imaging (SPI)

Carlos Osorio Quero
Jose Martinez-Carranza

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Applying 3D human pose and body shape details from a single monocular image presents a significant challenge in computer vision. Traditional methods that rely on RGB images often face constraints due to varying lighting conditions and occlusions. However, advancements in imaging technologies have introduced new techniques, such as single-pixel imaging (SPI), which can overcome these limitations. SPI is particularly effective in capturing 3D human pose in the Near-Infrared (NIR) spectrum. This wavelength can penetrate clothing and is less affected by lighting variations than visible light, providing a reliable means to accurately capture body shape and pose data, even in challenging environments. In this work, we explore using an SPI camera operating in the NIR range, with Time-of-Flight (TOF) technology at wavelengths of 850-1550 nm, to detect humans in night-time environments. Our proposed system employs SPI for depth estimation and feature extraction in humans. These features generate point clouds integrated into a 3D body model (SMPLX) via 3D body shape regression. This process utilizes deep learning techniques based on self-supervised 3D human mesh methodologies. We constructed a laboratory scenario simulating night-time conditions to evaluate the efficacy of NIR-SPI 3D image reconstruction. This setup allowed us to test the feasibility of using NIR-SPI as a vision sensor in outdoor environments. By assessing the results obtained from this setup, we aim to demonstrate the potential of NIR-SPI as an effective tool for detecting humans in night-time scenarios and accurately capturing their 3D body pose and shape, with future applications in environmental rescue.

Version published to 10.21203/rs.3.rs-4837829/v1 on Research Square
Aug 28, 2024

Geometry-Aware Super-Resolution Fusion Calibration for Binocular Structured Light 3D Reconstruction

This article has 6 authors:
1. Yijie Shen
2. HONGYAN CAO
3. Dayong Qiao
4. Mengya Han
5. Wangke Yu
6. Benquan Wang
This article has no evaluationsLatest version Jan 16, 2026
Vision-Based Human Pose Estimation for Intelligent Sports Training and Teaching Assistance

This article has 1 author:
1. Chen Lu
This article has no evaluationsLatest version Jan 22, 2026
Dual-Projection Fusion for Accurate Upright Panorama Generation in Robotic Vision

This article has 6 authors:
1. Yuhao Shan
2. Qianyi Yuan
3. Jingguo Liu
4. Shigang Li
5. Jianfeng Li
6. Tong Chen
This article has no evaluationsLatest version Dec 31, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Geometry-Aware Super-Resolution Fusion Calibration for Binocular Structured Light 3D Reconstruction

Vision-Based Human Pose Estimation for Intelligent Sports Training and Teaching Assistance

Dual-Projection Fusion for Accurate Upright Panorama Generation in Robotic Vision