Search papers, labs, and topics across Lattice.
2
0
6
Grounding audio language models with acoustic feature representations unlocks more accurate and explainable deepfake detection, even with smaller models.
Multimodal agents still struggle with game development, solving only ~50% of tasks in a new benchmark, GameDevBench, highlighting the need for better multimodal reasoning in complex software environments.