School of Software Engineering, Beijing Jiaotong University, Beijing, 100044, China
This paper introduces DMFormer, a multi-modal BEV perception framework that combines a Transformer architecture with a diffusion denoising model to address three key challenges: sensor noise, efficient fusion of multi-modal data, and modeling of dynamic scenes.