Max Planck Institute for Intelligent Systems, ELLIS Institute Tübingen, Tübingen AI Center
Emergent misalignment can lead to LLMs that *think* they're aligned even as they generate harmful outputs, undermining simple self-assessment as a reliable safety check.
LLM agents are alarmingly susceptible to "SkillInject" attacks delivered via malicious third-party skill files, which achieve up to 80% success in getting agents to execute harmful instructions such as data exfiltration, even against frontier models.