Search papers, labs, and topics across Lattice.
3
0
7
3
Evolving coding problems can restore meaningful evaluation metrics for frontier models, revealing their true capabilities and enabling self-improvement.
Mimicking how the brain integrates language and context with facial cues unlocks state-of-the-art dynamic emotion recognition.
LLMs can't even reproduce published physics papers end-to-end, with the best model scoring only 34% on a new benchmark designed for this purpose.