Mar 9, 2026, arXiv:2603.08322

Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design

Hai Xia, Carla P. Gomes, Bart Selman, Stefan Szeider

AI Summary

This paper presents a neurosymbolic approach to mathematical discovery, combining an LLM agent, symbolic computation tools, and human guidance to derive a new result in combinatorial design theory. The agent, coupled with symbolic tools such as constraint solvers and computer algebra systems, discovered a tight lower bound on the imbalance of Latin squares for the case $n \equiv 1 \pmod{3}$. The result emerged from the interplay of the agent's hypothesis generation, the symbolic tools' rigorous verification, and human strategic direction, yielding a bound of $4n(n{-}1)/9$ that was formally verified in Lean 4.

Key Contribution

A human-AI collaboration combining an LLM agent with symbolic solvers established a tight lower bound of $4n(n{-}1)/9$ on Latin square imbalance for the notoriously difficult case $n \equiv 1 \pmod{3}$.

Abstract

We study mathematical discovery through the lens of neurosymbolic reasoning, where an AI agent powered by a large language model (LLM), symbolic computation tools, and human strategic direction jointly produced a new result in combinatorial design theory. The main result of this human-AI collaboration is a tight lower bound on the imbalance of Latin squares for the notoriously difficult case $n \equiv 1 \pmod{3}$. We reconstruct the discovery process from detailed interaction logs spanning multiple sessions over several days and identify the distinct cognitive contributions of each component. The AI agent proved effective at uncovering hidden structure and generating hypotheses. The symbolic component consists of computer algebra, constraint solvers, and simulated annealing, which provide rigorous verification and exhaustive enumeration. Human steering supplied the critical research pivot that transformed a dead end into a productive inquiry. Our analysis reveals that multi-model deliberation among frontier LLMs proved reliable for criticism and error detection but unreliable for constructive claims. The resulting human-AI mathematical contribution, a tight lower bound of $4n(n{-}1)/9$, is achieved via a novel class of near-perfect permutations. The bound was formally verified in Lean 4. Our experiments show that neurosymbolic systems can indeed produce genuine discoveries in pure mathematics.
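To make the closed form concrete, the following minimal sketch evaluates $4n(n{-}1)/9$ exactly for a few orders with $n \equiv 1 \pmod{3}$. It only tabulates the expression quoted in the abstract; the definition of imbalance and the near-perfect permutation construction are not given on this page and are not modeled here. The function name `imbalance_lower_bound` is a hypothetical helper chosen for illustration.

```python
from fractions import Fraction


def imbalance_lower_bound(n: int) -> Fraction:
    """Evaluate the closed form 4n(n-1)/9 exactly.

    Assumption: only orders with n ≡ 1 (mod 3) are accepted, since that is
    the case the abstract states the bound for.
    """
    if n < 1 or n % 3 != 1:
        raise ValueError("the stated bound applies to n ≡ 1 (mod 3)")
    return Fraction(4 * n * (n - 1), 9)


if __name__ == "__main__":
    # Tabulate the expression for the first few admissible orders.
    for n in (4, 7, 10, 13):
        print(n, imbalance_lower_bound(n))
```

Exact rational arithmetic is used because the expression is not always an integer for admissible orders (for example, it equals $16/3$ at $n = 4$ and $40$ at $n = 10$).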

Reasoning & Chain-of-Thought, Scientific Discovery & Drug Design, Tool Use & Agents

