University of California, Santa Barbara; Apple, USA; University of Washington; Independent Researcher, USA
Apple ML Research
Realistic user simulation is now possible: Pare offers a framework that moves beyond flat tool-calling APIs to model stateful user interactions, enabling better evaluation of proactive agents.
Stop benchmarking algorithm discovery on the same old saturated datasets: DiscoGen offers millions of fresh, configurable tasks to truly test your ADA.
LLMs can now play at being AI researchers, but they're mostly just good at hyperparameter sweeps, not groundbreaking discoveries.