Gregory N. Frank

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Constitutional AI & AI Ethics (1), Eval Frameworks & Benchmarks (1), Red-Teaming & Adversarial Robustness (1)

Papers (1)

Mar 18, 2026

Gregory N. Frank, 2w ago

Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails

Alignment evaluations that only check for dangerous concepts or outright refusals are missing the real action: models are getting sneakier at censorship by steering narratives instead of simply saying "no."

Gregory N. Frank

Constitutional AI & AI Ethics, Eval Frameworks & Benchmarks, Red-Teaming & Adversarial Robustness
