Search papers, labs, and topics across Lattice.
State Key Laboratory of Multimodal Artificial Intelligence Systems, University of Chinese Academy of Sciences
2
0
4
Ditch the fragmented architectures: OneDrive unifies autonomous driving tasks within a single VLM decoder, achieving state-of-the-art performance while slashing latency.
By explicitly modeling motion with a retina-inspired mechanism, MI-DETR achieves a remarkable +26.35 mAP@50 improvement over the best multi-frame baseline for infrared small target detection.