Seth Karten

Papers on Lattice

Total citations

Topics

h-index

Research focus

Tool Use & Agents (4)RLHF & Preference Learning (2)Eval Frameworks & Benchmarks (2)Code Generation & Program Synthesis (2)Multimodal Models (1)

Frequent co-authors

Chi Jin (2)Chengshuai Shi (1)Wenzhe Li (1)Xin Liang (1)

Papers (4)

May 1, 2026

Chengshuai Shi +12May 1, 2026·also Princeton

Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning

Forget short-horizon RL: Odysseus proves VLMs can master 100+ turn decision-making in complex games, outperforming state-of-the-art models by 3x.

Chengshuai Shi, Wenzhe Li, Xin Liang +10

Multimodal Models RLHF & Preference Learning Tool Use & Agents

Mar 16, 2026

Seth Karten +33Mar 16, 2026·also Gwangju Institute of Science and Technology

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

Pokemon, not just a childhood game, emerges as a surprisingly effective benchmark for AI, revealing critical gaps in LLMs and RL agents that existing benchmarks miss.

Seth Karten, Jake Grigsby, Tersoo Upaa +31

Eval Frameworks & Benchmarks Tool Use & Agents World Models & Planning

Mar 12, 2026

Mar 12, 2026·also Independent Researcher

Automatic Generation of High-Performance RL Environments

Automating RL environment engineering slashes costs and unlocks massive speedups (up to 22,320x!) using a recipe of prompt engineering, verification, and agent-assisted repair.

Seth Karten, Rahul Dev Appapogu, Chi Jin

Code Generation & Program Synthesis RLHF & Preference Learning Tool Use & Agents+1

Feb 11, 2026

CMU MLFeb 11, 2026·also Princeton

GameDevBench: Evaluating Agentic Capabilities Through Game Development

Multimodal agents still struggle with game development, solving only ~50% of tasks in a new benchmark, GameDevBench, highlighting the need for better multimodal reasoning in complex software environments.

Wayne Chi, Wayne Chi, Yixiong Fang +15

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Search

Seth Karten

Research focus

Frequent co-authors

Papers (4)