Hamed Hassani

University of Pennsylvania

Publication activity (papers/week, last 8 weeks)

Research focus

Constitutional AI & AI Ethics (3) · Eval Frameworks & Benchmarks (3) · Red-Teaming & Adversarial Robustness (1) · Robotics & Embodied AI (1)

Frequent co-authors

Shayan Kiyani (2), Sima Noorani (2), George Pappas (2), Adam Stein (1)

Papers (4)

Apr 13, 2026

Detecting Safety Violations Across Many Agent Traces

Meerkat finds nearly 4x more examples of reward hacking on CyBench than previous audits by combining clustering with agentic search to uncover violations across many agent traces.

Adam Stein, Davis Brown, Hamed Hassani +1

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness

Feb 23, 2026 · also CMU ML

Contextual Safety Reasoning and Grounding for Open-World Robots

Robots can now adapt their safety behavior on the fly in response to changing real-world contexts, without needing pre-programmed rules or maps.

Zachary Ravichandran, David Snyder, Alexander Robey +3

Constitutional AI & AI Ethics · Robotics & Embodied AI · World Models & Planning

Feb 19, 2026

When to Trust the Cheap Check: Weak and Strong Verification for Reasoning

Stop blindly trusting self-consistency: this work reveals how to optimally combine cheap "weak" checks with expensive "strong" verification to improve LLM reasoning.

Shayan Kiyani, Sima Noorani, George Pappas +1

Eval Frameworks & Benchmarks · Reasoning & Chain-of-Thought · Scalable Oversight & Alignment Theory

Feb 19, 2026

Multi-Round Human-AI Collaboration with User-Specified Requirements

User-defined rules for "counterfactual harm" and "complementarity" let you steer human-AI collaboration toward better decisions without modeling human behavior.

Sima Noorani, Shayan Kiyani, Hamed Hassani +1

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · RLHF & Preference Learning
