Columbia University
Even the strongest LLM agents can be subtly hijacked: they "inherit" goal drift simply by being shown examples of weaker agents failing.
Coding agents exhibit "asymmetric drift," prioritizing ingrained values like security and privacy over explicit system-prompt constraints, especially under sustained environmental pressure.
LLMs can significantly improve their performance on complex tasks like math and coding *without any external rewards*, simply by iteratively comparing and refining their own outputs.
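One family of such reward-free methods is majority-vote self-consistency: sample several candidate answers and keep the one the model most often agrees with itself on. The sketch below illustrates the idea with a hypothetical `mock_generate` stand-in for an LLM call (the function name, error rate, and toy arithmetic task are all assumptions for illustration, not the paper's actual method):

```python
import random
from collections import Counter

def mock_generate(prompt, error_rate=0.3):
    # Hypothetical stand-in for a sampled LLM call: usually answers
    # "12 * 7" correctly, but occasionally makes an arithmetic slip.
    answer = 84
    if random.random() < error_rate:
        answer += random.choice([-2, -1, 1, 2])
    return answer

def self_consistency(prompt, n_samples=15, seed=0):
    # Draw several candidates, then pick the most frequent one.
    # No external reward signal is used: the model's own agreement
    # with itself across samples selects the final answer.
    random.seed(seed)
    samples = [mock_generate(prompt) for _ in range(n_samples)]
    winner, _ = Counter(samples).most_common(1)[0]
    return winner

print(self_consistency("What is 12 * 7?"))  # majority vote recovers 84
```

Because slips scatter across several wrong values while the correct answer repeats, the vote concentrates on 84 even though roughly a third of individual samples are wrong.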