Search papers, labs, and topics across Lattice.
1
0
3
Ditch the complex modules: a minimalist, end-to-end vision-language-action model for UAV navigation achieves 3x better generalization than leading baselines by directly mapping visual inputs and language to continuous control.