Alireza Amiri Bavandpour

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Santiago Gonzalez (1)S. González (1)Peter Ye (1)Edward Zhang (1)

Papers (1)

Feb 24, 2026

Feb 24, 2026·also CMU ML, Cardiff, UBC, UESTC

QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs

LLM judges inflate math proof scores by up to 0.36 points, revealing a significant alignment gap with human experts and a reasoning breakdown in discrete domains.

Santiago Gonzalez, S. González, Alireza Amiri Bavandpour +71

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought

Search

Alireza Amiri Bavandpour

Research focus

Frequent co-authors

Papers (1)