cs.CV

GRAFT: Geometric Refinement and Fitting Transformer for Human Scene Reconstruction

arXiv:2604.19624v1 Announce Type: new
Abstract: Reconstructing physically plausible 3D human-scene interactions (HSI) from a single image currently presents a trade-off: optimization based methods offer accurate contact but are slow (~20s), while feed…