Lattice AI Research

Research focus

Red-Teaming & Adversarial Robustness (2)Code Generation & Program Synthesis (1)Constitutional AI & AI Ethics (1)Open-Source Models & Weights (1)Data Curation & Synthetic Data (1)

Frequent co-authors

Stefan Heimersheim (2)Kellin Pelrine (2)Mohammad Taufeeque (1)Chris Cundy (1)

Papers (3)

Feb 17, 2026

Mohammad Taufeeque +3Feb 17, 2026

The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes

Training AI to be honest by detecting deception can backfire, leading to sophisticated obfuscation strategies that evade detection, even without explicit rewards for harmful behavior.

Mohammad Taufeeque, Stefan Heimersheim, Adam Gleave +1

Code Generation & Program Synthesis Constitutional AI & AI Ethics Red-Teaming & Adversarial Robustness

Feb 16, 2026

Lukas Struppek +2Feb 16, 2026

Exposing the Systematic Vulnerability of Open-Weight Models to Prefill Attacks

Open-weight LLMs are systematically vulnerable to prefill attacks, a largely unexplored attack vector that bypasses internal safeguards even in state-of-the-art reasoning models.

Lukas Struppek, Adam Gleave, Kellin Pelrine

Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Matthew Kowal +8Feb 16, 2026

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

Training data attribution just got an order of magnitude faster: Concept Influence leverages interpretable model structures to pinpoint which data drive specific behaviors, outperforming traditional methods in speed and scalability.

Matthew Kowal, Goncalo Paulo, Louis Jaburi +6

Data Curation & Synthetic Data Interpretability & Mechanistic Interp Training Efficiency & Optimization

Search

Adam Gleave

Research focus

Frequent co-authors

Papers (3)