LLMs can move beyond simple refusals to actively guide vulnerable users towards safe outcomes, achieving state-of-the-art safety and robustness against jailbreaks.
RealSafe-R1 achieves safety alignment of DeepSeek-R1 without sacrificing reasoning performance, a common trade-off in prior safety alignment efforts.