Today's best LLMs fail spectacularly at long-horizon reasoning, achieving under 10% accuracy on a new benchmark designed to isolate this critical capability.
Generating robot training data that bridges the sim2real gap doesn't require painstakingly detailed simulation environments; instead, a neural simulator can transform classical simulations into realistic representations using only a small amount of real-world data.
Adversarially fine-tuning CLIP with a pretraining-inspired recipe that combines web data and feature regularization yields significantly better zero-shot robustness across diverse datasets than standard adversarial training.
Stop hard-coding reasoning strategies for your LLM agent: a learned router that dynamically picks the best paradigm for each task boosts performance by up to 5.5%, beating even the best fixed strategy.