Nankai University
TARPO integrates discrete and continuous reasoning approaches for policy exploration in LLMs, and is reported to outperform traditional reasoning methods.