This paper introduces U4D, a framework for 4D LiDAR scene synthesis that uses spatial uncertainty to order the generation process. U4D derives per-point uncertainty maps using Shannon entropy from a pretrained segmentor, then synthesizes scenes in a "hard-to-easy" manner: high-entropy regions are generated first, and the remaining regions are completed with those structures as priors. A Mixture of Spatio-Temporal (MoST) block maintains temporal coherence across frames. Experiments on nuScenes and SemanticKITTI show state-of-the-art scene fidelity and temporal consistency.
U4D shows that explicitly leveraging spatial uncertainty improves both the fidelity and the temporal coherence of LiDAR scene synthesis.
Constructing faithful 4D worlds from LiDAR-acquired sequences is crucial for embodied AI, yet current generative frameworks apply uniform modeling capacity across all spatial regions. This ignores that perceptual difficulty varies dramatically within a single scan: distant surfaces, occluded boundaries, and small-scale objects carry far higher uncertainty than well-observed structures. We present U4D, a new framework that explicitly leverages spatial uncertainty to guide LiDAR scene generation in a "hard-to-easy" schedule. U4D derives per-point uncertainty maps via Shannon Entropy from a pretrained segmentor, then applies an unconditional diffusion stage to synthesize high-entropy areas with precise geometry, followed by a conditional completion stage that fills in the remaining regions using these structures as priors. A MoST (Mixture of Spatio-Temporal) block further maintains cross-frame coherence by dynamically balancing spatial detail and temporal continuity. Extensive experiments on nuScenes and SemanticKITTI demonstrate state-of-the-art scene fidelity, temporal consistency, and downstream performance.
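To make the uncertainty step concrete, the following is a minimal sketch of how a per-point uncertainty map could be computed from a segmentor's class probabilities and used to separate "hard" from "easy" points. It is an illustration under stated assumptions, not the authors' implementation: the function names, the use of natural-log entropy, and the top-fraction split rule (`hard_fraction`) are all hypothetical, since the abstract does not say how high-entropy regions are selected.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy (in nats) of one point's class-probability vector."""
    # Zero-probability classes contribute nothing, so they are skipped.
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def uncertainty_map(point_probs):
    """One entropy value per point; point_probs holds one softmax vector per point."""
    return [shannon_entropy(p) for p in point_probs]

def split_hard_easy(entropies, hard_fraction=0.25):
    """Return (hard, easy) index lists.

    Assumption: the "hard" set is the top `hard_fraction` of points by entropy.
    The paper may use a different rule (for example, a fixed threshold).
    """
    n_hard = max(1, round(len(entropies) * hard_fraction))
    order = sorted(range(len(entropies)), key=lambda i: entropies[i], reverse=True)
    return sorted(order[:n_hard]), sorted(order[n_hard:])

if __name__ == "__main__":
    # Hypothetical softmax outputs for four points over three classes.
    point_probs = [
        [0.98, 0.01, 0.01],  # confident prediction, low entropy
        [0.34, 0.33, 0.33],  # near-uniform prediction, high entropy
        [0.60, 0.30, 0.10],
        [1.00, 0.00, 0.00],  # fully certain, zero entropy
    ]
    hard, easy = split_hard_easy(uncertainty_map(point_probs), hard_fraction=0.5)
    print("hard:", hard, "easy:", easy)
```

In a two-stage pipeline like the one described above, the `hard` indices would correspond to the regions synthesized first, and the `easy` indices to the regions completed afterwards with the first-stage output as a prior.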