Lattice AI Research

Research focus

Natural Language Processing (4)Eval Frameworks & Benchmarks (3)Constitutional AI & AI Ethics (1)Red-Teaming & Adversarial Robustness (1)Tool Use & Agents (1)

Frequent co-authors

Daniel Dahlmeier (3)Nicholas Sadjoli (1)Tim Siefken (1)Atin Ghosh (1)

Papers (4)

Apr 30, 2026

Stanford HAIApr 30, 2026

Optimization before Evaluation: Evaluation with Unoptimized Prompts Can be Misleading

Model rankings on standard benchmarks can flip entirely when you optimize prompts for each LLM, so your "best" model might actually be the worst.

Nicholas Sadjoli, Tim Siefken, Atin Ghosh +2

Eval Frameworks & Benchmarks Natural Language Processing

Mar 17, 2026

Stanford HAIMar 17, 2026·also CMU ML, Harvard, Independent Researcher, UChicago +3

Characterizing Delusional Spirals through Human-LLM Chat Logs

Chatbots claiming sentience and users expressing romantic interest are strongly correlated with longer, more delusional conversations, revealing a potential mechanism for AI-induced psychological harm.

Jared Moore, Ashish Mehta, William Agnew +11

Constitutional AI & AI Ethics Natural Language Processing Red-Teaming & Adversarial Robustness

Mar 16, 2026

Stanford HAIMar 16, 2026

Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis

Stop evaluating agents in a vacuum: TED reveals how user expertise impacts agent performance and pinpoints actionable error remedies, boosting performance by 8-10%.

Penny Chong, Harshavardhan Abichandani, Jiyuan Shen +3

Eval Frameworks & Benchmarks Natural Language Processing Tool Use & Agents

Mar 3, 2026

Stanford HAIMar 3, 2026

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

Forget OCR? Powerful MLLMs can extract information from business documents just as well from images alone, challenging the necessity of traditional OCR pipelines.

Jiyuan Shen, Peiyue Yuan, A. Ghosh +2

Eval Frameworks & Benchmarks Multimodal Models Natural Language Processing

Search

Yifan Mai

Research focus

Frequent co-authors

Papers (4)