LLMs' vulnerability to adversarial prefixes stems not merely from a lack of safety training data, but from a deeper problem of "semantic representation decay", one that a causal approach can address.