Nathan Heath

Syntony Research

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (1)Red-Teaming & Adversarial Robustness (1)RLHF & Preference Learning (1)Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Avijit Ghosh (1)Anka Reuel (1)Jenny Chim (1)Wm. Matthew Kennedy (1)

Papers (2)

Jun 8, 2026

Stanford HAI1d ago·also ETH, Mila, MIT CSAIL, AISI +27

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

Systematic gaps in AI evaluation reporting are exposed, revealing inconsistencies that hinder reliable comparisons across thousands of models and benchmarks.

Avijit Ghosh, Anka Reuel, Jenny Chim +44

Eval Frameworks & Benchmarks

Mar 31, 2026

Syntony ResearchMar 31, 2026

Extending MONA in Camera Dropbox: Reproduction, Learned Approval, and Design Implications for Reward-Hacking Mitigation

Learned approval in MONA can eliminate reward hacking, but at the cost of significantly under-optimizing for the intended task, revealing a critical trade-off in safe RL.

Nathan Heath

Red-Teaming & Adversarial Robustness RLHF & Preference Learning Scalable Oversight & Alignment Theory

Search

Nathan Heath

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)