Search papers, labs, and topics across Lattice.
2
0
3
17
LLMs possess domain-specific "gut feelings" about their factual knowledge accuracy that external observers can't see, but this advantage vanishes for math reasoning.
General-purpose agents can match the performance of specialized agents across diverse environments without any environment-specific tuning, challenging the need for task-specific engineering.