Search papers, labs, and topics across Lattice.
2
0
5
Bridging the perception-reasoning gap in visual planning, MGSD boosts model performance by over 19% while relying solely on visual inference during deployment.
Even frontier models with high reasoning budgets fail to effectively navigate densely interlinked knowledge bases and complex policies in realistic fintech customer support scenarios, achieving only ~25.5% pass rate.