Skip to content

MayurDeshmukh10/E-3DPSM

Repository files navigation

E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation

Mayur Deshmukh1,2   Hiroyasu Akada1   Helge Rhodin1,3   Christian Theobalt1   Vladislav Golyanik1

1 Max Planck Institute for Informatics, SIC   2 Saarland University   3 Bielefeld University

CVPR 2026

Paper | Video | Project Page

E-3DPSM teaser

Rethinking event-based egocentric 3D human pose estimation. E-3DPSM models motion as a continuous event-driven state evolution, fusing delta and direct 3D pose updates to achieve real-time and temporally stable 3D reconstruction.

Abstract

Event cameras offer multiple advantages in monocular egocentric 3D human pose estimation from head-mounted devices, such as millisecond temporal resolution, high dynamic range, and negligible motion blur. Existing methods effectively leverage these properties, but suffer from low 3D estimation accuracy that remains insufficient for applications such as immersive VR/AR. This is due to designs not being fully tailored to event streams and their asynchronous, continuous nature, leading to high sensitivity to self-occlusions and temporal jitter. E-3DPSM rethinks this setting with an event-driven continuous pose state machine for event-based egocentric 3D human pose estimation. It aligns continuous human motion with fine-grained event dynamics, evolves latent states, and predicts continuous 3D joint updates that are fused with direct 3D pose predictions, producing stable and drift-free reconstructions. E-3DPSM runs in real time at 80 Hz on a single workstation and sets a new state of the art on two benchmarks, improving MPJPE by up to 19% and temporal stability by up to 2.7x.

Method

E-3DPSM method overview

Hardware Setup

Head-mounted hardware setup

The head-mounted device uses a single fisheye egocentric event camera for input, an NVIDIA Jetson Orin Nano for onboard processing, and a portable power source for standalone operation.

Quantitative Results

Quantitative results

E-3DPSM improves both 3D pose accuracy and temporal stability compared with prior event-based egocentric pose estimation methods.

Repository Structure

  • run.py: Lightning CLI entrypoint for training, validation, and testing
  • configs/: experiment configurations for pretraining, finetuning, and evaluation
  • EventEgoPoseEstimation/: core model, dataset, loss, and training code
  • scripts/: Slurm shell scripts
  • images/: README and project assets

Setup

Create the conda environment:

conda env create -f environment.yml
conda activate e3dpsm

Install ocam support if it is not already available in your environment:

pip install git+https://github.com/Chris10M/ocam_python.git

Training and Evaluation

This repository uses pytorch_lightning via LightningCLI, so experiments are launched through run.py with a config file.

Example training:

python run.py fit --config configs/pretrain_deform_attention_kf_lnes.yaml

Example finetuning:

python run.py fit --config configs/finetune_deform_attention_kf_lnes.yaml

Example evaluation:

python run.py test --config configs/evaluate_deform_attention_kf_lnes.yaml

Please update dataset paths, checkpoint paths, and runtime settings inside the selected config before launching an experiment.

Citation

If you find this repository useful, please cite:

@inproceedings{deshmukh2026e3dpsm,
  title = {E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation},
  author = {Deshmukh, Mayur and Akada, Hiroyasu and Rhodin, Helge and Theobalt, Christian and Golyanik, Vladislav},
  booktitle = {Computer Vision and Pattern Recognition (CVPR)},
  year = {2026}
}

Acknowledgments

This codebase borrows dataloaders from EventEgo3D++.

License

This project is released under the license provided in LICENSE.

About

The official implementation of our CVPR 2026 paper, "E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation"

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors