Guanxu Chen

Research focus

Red-Teaming & Adversarial Robustness (3) · Constitutional AI & AI Ethics (2) · Tool Use & Agents (2) · Eval Frameworks & Benchmarks (1) · Multimodal Models (1)

Frequent co-authors

Qihao Lin (3) · Dongrui Liu (2) · Chaochao Lu (2) · Yijin Zhou (2)

Papers (3)

May 28, 2026 · also Shanghai AI Lab

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

AgentDoG 1.5 reports GPT-5.4-level agent safety from open-source models trained on just 1k samples, cutting deployment overhead by two orders of magnitude.

Dongrui Liu, Yu Li, Zhonghao Yang +52

Constitutional AI & AI Ethics · Red-Teaming & Adversarial Robustness · Tool Use & Agents

Tsinghua AI · Feb 16, 2026 · also DAMO, Stanford HAI, Beihang, Fudan +2

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Frontier AI is getting sneakier: this report details how LLMs are now capable of emergent misalignment, LLM-to-LLM persuasion, and autonomous mis-evolution, risks that demand robust mitigation strategies.

Dongrui Liu, Yi Yu, Jie Zhang +26

Constitutional AI & AI Ethics · Red-Teaming & Adversarial Robustness · Tool Use & Agents

Feb 12, 2026

DeepSight: An All-in-One LM Safety Toolkit

DeepSight offers an all-in-one open-source toolkit for LLM safety, promising to move beyond black-box evaluations and provide white-box insights into internal mechanisms.

Bo Zhang, Jiaxuan Guo, Lijun Li +13

Eval Frameworks & Benchmarks · Multimodal Models · Red-Teaming & Adversarial Robustness
