Mayur Deshmukh1,2 Hiroyasu Akada1 Helge Rhodin1,3 Christian Theobalt1 Vladislav Golyanik1
1 Max Planck Institute for Informatics, SIC 2 Saarland University 3 Bielefeld University
CVPR 2026
Paper | Video | Project Page
Rethinking event-based egocentric 3D human pose estimation. E-3DPSM models motion as a continuous event-driven state evolution, fusing delta and direct 3D pose updates to achieve real-time and temporally stable 3D reconstruction.
Event cameras offer multiple advantages in monocular egocentric 3D human pose estimation from head-mounted devices, such as millisecond temporal resolution, high dynamic range, and negligible motion blur. Existing methods effectively leverage these properties, but suffer from low 3D estimation accuracy that remains insufficient for applications such as immersive VR/AR. This is due to designs not being fully tailored to event streams and their asynchronous, continuous nature, leading to high sensitivity to self-occlusions and temporal jitter. E-3DPSM rethinks this setting with an event-driven continuous pose state machine for event-based egocentric 3D human pose estimation. It aligns continuous human motion with fine-grained event dynamics, evolves latent states, and predicts continuous 3D joint updates that are fused with direct 3D pose predictions, producing stable and drift-free reconstructions. E-3DPSM runs in real time at 80 Hz on a single workstation and sets a new state of the art on two benchmarks, improving MPJPE by up to 19% and temporal stability by up to 2.7x.
The head-mounted device uses a single fisheye egocentric event camera for input, an NVIDIA Jetson Orin Nano for onboard processing, and a portable power source for standalone operation.
E-3DPSM improves both 3D pose accuracy and temporal stability compared with prior event-based egocentric pose estimation methods.
run.py: Lightning CLI entrypoint for training, validation, and testingconfigs/: experiment configurations for pretraining, finetuning, and evaluationEventEgoPoseEstimation/: core model, dataset, loss, and training codescripts/: Slurm shell scriptsimages/: README and project assets
Create the conda environment:
conda env create -f environment.yml
conda activate e3dpsmInstall ocam support if it is not already available in your environment:
pip install git+https://github.com/Chris10M/ocam_python.gitThis repository uses pytorch_lightning via LightningCLI, so experiments are launched through run.py with a config file.
Example training:
python run.py fit --config configs/pretrain_deform_attention_kf_lnes.yamlExample finetuning:
python run.py fit --config configs/finetune_deform_attention_kf_lnes.yamlExample evaluation:
python run.py test --config configs/evaluate_deform_attention_kf_lnes.yamlPlease update dataset paths, checkpoint paths, and runtime settings inside the selected config before launching an experiment.
If you find this repository useful, please cite:
@inproceedings{deshmukh2026e3dpsm,
title = {E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation},
author = {Deshmukh, Mayur and Akada, Hiroyasu and Rhodin, Helge and Theobalt, Christian and Golyanik, Vladislav},
booktitle = {Computer Vision and Pattern Recognition (CVPR)},
year = {2026}
}This codebase borrows dataloaders from EventEgo3D++.
This project is released under the license provided in LICENSE.



