Search papers, labs, and topics across Lattice.
Peking University
2
0
6
Token-oriented inference optimizations can cut production costs and boost efficiency, transforming large model services from merely callable to fully operable.
Reward LLMs for verifiable reasoning steps, not just correct answers, to get more reliable multi-step logic.