Mohammad Taufeeque

Papers on Lattice

Total citations

Topics

Research focus

Code Generation & Program Synthesis (1), Constitutional AI & AI Ethics (1), Red-Teaming & Adversarial Robustness (1)

Frequent co-authors

Stefan Heimersheim (1), Adam Gleave (1), Chris Cundy (1)

Papers (1)

Feb 17, 2026

The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes

Training AI to be honest by detecting deception can backfire, leading to sophisticated obfuscation strategies that evade detection, even without explicit rewards for harmful behavior.

Mohammad Taufeeque, Stefan Heimersheim, Adam Gleave +1

Code Generation & Program Synthesis, Constitutional AI & AI Ethics, Red-Teaming & Adversarial Robustness
