Search papers, labs, and topics across Lattice.
4
0
9
6
RFT's Achilles heel? This benchmark reveals how fragile reinforcement fine-tuning is, and introduces an automated system to catch and fix training failures before they tank your LLM.
LLM agents can now autonomously generate complex skills with multi-file dependencies, rivaling human-authored skills, thanks to a co-evolutionary verification process that doesn't need ground truth labels.
Even state-of-the-art LLMs struggle to adapt to mid-task changes in long-horizon web navigation, highlighting a critical gap in their ability to handle realistic user interactions.
The first comprehensive survey of Visual Document Retrieval reveals how MLLMs are reshaping the field, highlighting the shift towards RAG and agentic systems for complex document understanding.