Beijing Big Data Centre, Beijing, China
LLMs struggle with basic accuracy, reliability, and security in government affairs tasks, with some models failing on minor input variations and exhibiting task avoidance.