Visual scene understanding is crucial for many practical applications in the real world. A lot of work has focused on 3D scene analysis. Considering the scene also in time, i.e. 4D visual understanding, is key for holistic understanding of the surrounding world. 3D scene analysis, 3D reconstruction, SLAM and others focus on the scene primarily without dynamic objects. At the same time video analysis has provided very advanced methods for understanding visual information in time.
Recent research e.g. in perception for autonomous driving or embodied vision, have also started to consider both aspects of scene analysis. Gleaning insights from scene understanding, video analysis, and 3D modeling and sensing in the realms of both dynamic and static scenes, we hope to shed more light on the topic of 4D vision for scene understanding.