Kyle Mahowald

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Interpretability & Mechanistic Interp (3)Eval Frameworks & Benchmarks (2)Natural Language Processing (2)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Daniel Drucker (1)Sasha Boguraev (1)Jonathan Nemitz (1)Carsten Eickhoff (1)

Papers (4)

May 5, 2026

Daniel Drucker +1May 5, 2026

The Counterexample Game: Iterated Conceptual Analysis and Repair in Language Models

Language models can play the counterexample game, but their philosophical reasoning hits diminishing returns fast, and they're far more lenient judges than humans.

Daniel Drucker, Kyle Mahowald

Eval Frameworks & Benchmarks Natural Language Processing Reasoning & Chain-of-Thought

Apr 15, 2026

Causal Drawbridges: Characterizing Gradient Blocking of Syntactic Islands in Transformer LMs

Transformer LMs' ability to replicate subtle human judgments on syntactic islands hinges on a "blocking" mechanism that differentially engages filler-gap dependencies based on the relational vs. conjunctive usage of "and".

Sasha Boguraev, Kyle Mahowald

Architecture Design (Transformers, SSMs, MoE)Interpretability & Mechanistic Interp Natural Language Processing

Apr 7, 2026

Apr 7, 2026·also UT Austin

When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don't

VLMs may ace the color coverage test, but they flunk the "do as I say, not as I do" test, routinely ignoring their own stated reasoning rules in ways that humans don't.

Jonathan Nemitz, Carsten Eickhoff, Junyi Jessy Li +3

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Multimodal Models

Mar 5, 2026

Harvey Lederman +1Mar 5, 2026

Dissociating Direct Access from Inference in AI Introspection

AI models can detect injected thoughts, but they often have no idea *what* those thoughts are, relying on content-agnostic anomaly detection and then guessing common concepts.

Harvey Lederman, Kyle Mahowald

Interpretability & Mechanistic Interp Open-Source Models & Weights

Search

Kyle Mahowald

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)