Cisco Research
Training on D3-Gym, a new dataset of real-world scientific environments, boosts Qwen3-32B performance on ScienceAgentBench by 7.8 points, rivaling proprietary models.
Strategically injecting context tailored to common failure modes can improve the tool-use performance of open-source LLM agents by 27%.