VLA models can complete their assigned tasks yet still trigger unsafe outcomes, exposing a critical gap between action execution and semantic understanding.
LLM judges of disinformation risk are internally consistent, but consistently misaligned with actual human readers, raising serious questions about their validity as evaluation proxies.
Prompt leakage attacks on multi-tenant LLMs are far more efficient than previously thought: a new RL-based method reconstructs prompts using over 12x fewer requests.