Search papers, labs, and topics across Lattice.
3
0
5
15
LLMs can generate syntactically correct tests, but their ability to *reason* about code faults is surprisingly poor, hindering autonomous debugging.
SkillOrchestra slashes the learning costs of AI agent orchestration by up to 700x while improving performance by explicitly modeling agent skills and costs, offering a more scalable and interpretable alternative to RL-based methods.
Reference-guided LLM evaluators can boost alignment in non-verifiable domains, enabling self-improvement to rival reward model training.