This paper introduces Evolving Idea Graphs (EIG), a graph-based framework for multi-agent scientific ideation where nodes represent scientific claims and edges represent relations between them. A learned controller guides the ideation process by selecting graph edits for agents to execute and deciding when to commit the graph as a final proposal. Experiments on AI Idea Bench 2025 and LiveIdeaBench demonstrate that EIG outperforms existing systems in terms of novelty, feasibility, and clarity, as evaluated by both automatic metrics and expert ratings.
Coordinating LLM agents through an evolving idea graph, rather than through text alone, improves scientific ideation: EIG outperforms all compared systems on AI Idea Bench 2025 and LiveIdeaBench.
LLM-empowered multi-agent systems offer new potential to accelerate scientific discovery by generating novel research ideas. However, existing methods typically coordinate agents through transient text, such as drafts or chat logs, which makes it difficult to pinpoint the weaknesses in the generated ideas and to trace how the agents refine them. To address this, we introduce Evolving Idea Graphs (EIG), a graph-based multi-agent scientific ideation framework that generates research ideas scoring well on benchmark-native metrics such as novelty, feasibility, and clarity. Instead of coordinating solely through text, EIG represents a partially formed proposal as an evolving idea graph, where nodes capture scientific claims and edges encode relations (e.g., support and conflict), so that unresolved weaknesses remain identifiable as the idea evolves. Specifically, a learned two-head controller operates over the evolving graph to guide the ideation: one head selects graph edits for agents to execute, while the other decides when the graph is ready to be committed and synthesized into a final proposal. On AI Idea Bench 2025 and LiveIdeaBench, EIG outperforms all compared systems on both automatic benchmark scores and blind expert ratings. Ablations further show that the explicit graph state provides the main performance gains, and that learned edit-and-commit control adds consistent improvements.
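The control loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. The names `IdeaGraph`, `edit_head`, `commit_head`, `toy_agent`, and `run_ideation` are hypothetical, both heads are hand-written rules standing in for the learned controller, and the definition of an "unresolved" claim (the target of a conflict edge with no support edge) is an assumption made only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class IdeaGraph:
    """Toy idea graph: nodes are claims, edges are typed relations."""
    claims: dict = field(default_factory=dict)     # node id -> claim text
    relations: list = field(default_factory=list)  # (src, dst, kind) triples

    def add_claim(self, node_id, text):
        self.claims[node_id] = text

    def relate(self, src, dst, kind):
        if kind not in ("support", "conflict"):
            raise ValueError(f"unknown relation kind: {kind}")
        if src not in self.claims or dst not in self.claims:
            raise KeyError("both endpoints must be existing claims")
        self.relations.append((src, dst, kind))

    def unresolved(self):
        # Assumption for this sketch: a claim is an unresolved weakness if
        # some claim conflicts with it and no claim supports it.
        conflicted = {dst for _, dst, kind in self.relations if kind == "conflict"}
        supported = {dst for _, dst, kind in self.relations if kind == "support"}
        return sorted(conflicted - supported)


def edit_head(graph):
    """Rule-based stand-in for the edit-selection head."""
    weak = graph.unresolved()
    return ("add_support", weak[0]) if weak else None


def commit_head(graph):
    """Rule-based stand-in for the commit head: commit when nothing is unresolved."""
    return not graph.unresolved()


def toy_agent(graph, edit):
    """Hypothetical agent that executes an edit by attaching a supporting claim."""
    _, target = edit
    new_id = f"evidence_for_{target}"
    graph.add_claim(new_id, f"Evidence addressing the objection to {target}")
    graph.relate(new_id, target, "support")


def run_ideation(graph, agent, max_steps=10):
    """Alternate edit selection and agent execution until the commit head fires."""
    steps = 0
    while steps < max_steps and not commit_head(graph):
        agent(graph, edit_head(graph))
        steps += 1
    return steps


g = IdeaGraph()
g.add_claim("c1", "Explicit graph state improves ideation quality")
g.add_claim("c2", "Maintaining the graph adds coordination overhead")
g.relate("c2", "c1", "conflict")
steps = run_ideation(g, toy_agent)
```

In this sketch the conflict edge from `c2` to `c1` keeps `c1` flagged as unresolved until an agent attaches a supporting claim, at which point the commit rule fires. A learned controller would replace both rule-based heads with trained policies over the graph state.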