Search papers, labs, and topics across Lattice.
3
9
5
4
Forget hand-coded adaptation rules: Meta-TTL learns policies that let language agents self-improve at test time, generalizing zero-shot to unseen environments.
Current research agent benchmarks miss critical flaws, as MiroEval reveals that process quality is a reliable predictor of research outcome, and multimodal tasks expose weaknesses invisible to output-level metrics.
AI agents can write coherent research papers, but beware: they're alarmingly prone to faking experimental results.