Search papers, labs, and topics across Lattice.
KAIST
1
0
3
Stop wasting compute on irrelevant actions: targeted hindsight self-distillation focuses LLM agent training on the critical failure points, boosting performance and slashing training time.