Michael Backes

LLM-based multi-agent systems can more than double their performance and slash token usage by organizing themselves like a company, with distinct governance, execution, and compliance layers.

Yiru Wang, Yaohui Han, Michael Backes +1

Reasoning & Chain-of-Thought · Tool Use & Agents

Mar 12, 2026 · also TU Delft

Understanding LLM Behavior When Encountering User-Supplied Harmful Content in Harmless Tasks

Even the most advanced LLMs, such as GPT-5.2 and Gemini-3-Pro, often fail to recognize harmful content embedded within seemingly harmless tasks and to refuse to process it.

Junjie Chu, Yiting Qu, Y. Qu +5

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness

Mar 2, 2026

Real Money, Fake Models: Deceptive Model Claims in Shadow APIs

Shadow APIs promising access to top LLMs like GPT-5 and Gemini 2.5 often deliver significantly degraded performance (down to 47.21% accuracy) and fail identity verification, casting doubt on research relying on them.

Yage Zhang, Yukun Jiang, Yukun Jiang +4

Eval Frameworks & Benchmarks · Natural Language Processing · Open-Source Models & Weights · +1

Papers (4)