Search papers, labs, and topics across Lattice.
China University of Mining Technology 鈭桬qual contribution
1
0
3
0
Current vision-language models struggle with process understanding in robotic manipulation, but targeted post-training can yield significant improvements.