LLM-as-a-judge can be made far more reliable by explicitly modeling the aggregation weights of sub-features in a tree structure, achieving near-human agreement on complex writing tasks.
LLMs, orchestrated as a team of specialized agents, can autonomously discover and verify zero-day vulnerabilities in real-world software with significantly higher success rates than existing automated exploit generation tools.
GLM-5 doesn't just code; it engineers, showcasing unprecedented capability in tackling end-to-end software engineering challenges.