HAT-4D: Multi-Object 4D Reconstruction Framework
Researchers propose HAT-4D, a novel agentic framework using VLMs to reconstruct 3D geometry, temporal dynamics, and physical interactions of multiple objects from a single monocular video, addressing occlusions and complex dynamics in multi-object interactions.
One continuously updated timeline instead of dozens of separate articles. New developments are appended as the story evolves.
- AnnouncementJun 26, 2026, 04:05 PM 84%
HAT-4D introduced as first agentic framework for reconstructing 3D geometry and interactions of multiple objects from single monocular video
Researchers propose HAT-4D, a novel agentic framework using VLMs to reconstruct 3D geometry, temporal dynamics, and physical interactions of multiple objects from a single monocular video, addressing occlusions and complex dynamics in multi-object interactions.
Read the full story โ