Search papers, labs, and topics across Lattice.
3
0
7
0
Forget outcome-based filtering: TopoCurate uses interaction topology to surface informative tool-use trajectories and tasks, boosting SFT and RL performance by up to 6.9%.
Current memory systems like RAG and long-context LLMs stumble in AMemGym's interactive long-horizon conversations, revealing critical performance gaps in maintaining consistent user state.
LLM reasoning gets a serious upgrade with MASPO, a new RLVR method that smartly balances gradient use, probability mass, and signal reliability for faster, more robust learning.