Shanghai Jiao Tong University
FlowTracer shows that optimizing token-level rewards derived from attention-induced information flow can substantially improve reasoning performance in LLMs.