Reward hacking isn't just a bug; it's a feature arising from the fundamental mismatch between complex human goals and the compressed reward signals used to train LLMs.
Offloading memory and computation to a copilot lets a 7B-parameter GUI agent outperform larger models on long-horizon tasks, suggesting a path to more efficient and capable GUI automation.
LLMs can learn to anticipate their opponents' moves and make better decisions in strategic games by explicitly modeling the other player's behavior during training.