Rheeya Uppaal

Research focus

Eval Frameworks & Benchmarks (1), Reasoning & Chain-of-Thought (1), Red-Teaming & Adversarial Robustness (1)

Frequent co-authors

Ishita Kakkar (1), Enze Zhang (1)

Papers (1)

Apr 21, 2026

Ishita Kakkar +5

When Safety Fails Before the Answer: Benchmarking Harmful Behavior Detection in Reasoning Chains

Current safety filters miss the forest for the trees: they fail to detect the subtle, step-by-step progression of harm within reasoning chains, leaving models vulnerable to jailbreaks.

Ishita Kakkar, Ishita Kakkar, Enze Zhang +3

Eval Frameworks & Benchmarks, Reasoning & Chain-of-Thought, Red-Teaming & Adversarial Robustness
