Daniel Dahlmeier

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (3)Natural Language Processing (3)Tool Use & Agents (1)Multimodal Models (1)

Frequent co-authors

Yifan Mai (3)Nicholas Sadjoli (1)Tim Siefken (1)Atin Ghosh (1)

Papers (3)

Apr 30, 2026

Stanford HAIApr 30, 2026

Optimization before Evaluation: Evaluation with Unoptimized Prompts Can be Misleading

Model rankings on standard benchmarks can flip entirely when you optimize prompts for each LLM, so your "best" model might actually be the worst.

Nicholas Sadjoli, Tim Siefken, Atin Ghosh +2

Eval Frameworks & Benchmarks Natural Language Processing

Mar 16, 2026

Stanford HAIMar 16, 2026

Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis

Stop evaluating agents in a vacuum: TED reveals how user expertise impacts agent performance and pinpoints actionable error remedies, boosting performance by 8-10%.

Penny Chong, Harshavardhan Abichandani, Jiyuan Shen +3

Eval Frameworks & Benchmarks Natural Language Processing Tool Use & Agents

Mar 3, 2026

Stanford HAIMar 3, 2026

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

Forget OCR? Powerful MLLMs can extract information from business documents just as well from images alone, challenging the necessity of traditional OCR pipelines.

Jiyuan Shen, Peiyue Yuan, A. Ghosh +2

Eval Frameworks & Benchmarks Multimodal Models Natural Language Processing

Search

Daniel Dahlmeier

Research focus

Frequent co-authors

Papers (3)