Mann Talati

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)Interpretability & Mechanistic Interp (1)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Han Wang (1)Yifan Sun (1)Brian Ko (1)Jiawen Gong (1)

Papers (1)

Mar 30, 2026

Han Wang +12Mar 30, 2026·also RUC, State Key Laboratory for Novel Software

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

LLMs' chain-of-thought explanations often fail to reflect the true drivers of their decisions, and this benchmark reveals that closed-source models are particularly opaque, with monitorability dropping by up to 30% under stress.

Han Wang, Yifan Sun, Brian Ko +10

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought

Search

Mann Talati

Research focus

Frequent co-authors

Papers (1)