Abstract
We investigate the reconstruction of 3D human-object interactions from images, encompassing 3D human shape and pose estimation as well as object shape and pose estimation. To address this task, we introduce an autoregressive transformer-based variational autoencoder capable of learning a robust shape prior from extensive 3D shape datasets. Additionally, we leverage the reconstructed 3D human body as supplementary features for object shape and pose estimation. In contrast, prior methods only predict object pose and rely on shape templates for shape prediction. Experimental findings on the BEHAVE dataset underscore the effectiveness of our proposed approach, achieving a 40.7cm Chamfer distance and demonstrating the advantages of learning a shape prior.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 2133-2139 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798350307443 |
| DOIs | |
| State | Published - 2023 |
| Event | 19th IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 - Paris, France Duration: 02 10 2023 → 06 10 2023 |
Publication series
| Name | Proceedings - 2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 |
|---|
Conference
| Conference | 19th IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 |
|---|---|
| Country/Territory | France |
| City | Paris |
| Period | 02/10/23 → 06/10/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- 3D reconstruction
- human object interaction
Fingerprint
Dive into the research topics of 'Reconstruction of 3D Interaction Models from Images using Shape Prior'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver