Cisco Research
Integrating semantic, acoustic, and engagement signals in music recommendations can boost performance by nearly 95%, challenging the status quo of opaque token-based systems.
Training on D3-Gym, a new dataset of real-world scientific tasks with verifiable environments, narrows the gap between open-source and proprietary models on ScienceAgentBench by 7.8 points.
LLMs are revolutionizing conversational AI research, and this survey offers a structured guide to navigating the rapidly evolving landscape of LLM-powered user simulation.
A unified benchmark reveals the fragmented landscape of RAG security, highlighting vulnerabilities to knowledge-extraction attacks and paving the way for robust defense strategies.
LLM judges exhibit a surprising "blindness" to human-written summaries, increasingly preferring machine-generated content as the similarity to human references decreases, challenging their reliability in summarization tasks.