Yan Teng

Research focus

Constitutional AI & AI Ethics (1) · Interpretability & Mechanistic Interp (1) · Red-Teaming & Adversarial Robustness (1) · RLHF & Preference Learning (1)

Frequent co-authors

Lingyu Li (1), Yingchun Wang (1), Xin Wang (1), Yunhao Chen (1)

Papers (2)

Mar 16, 2026 · also AI Laboratory

Mechanistic Origin of Moral Indifference in Language Models

LLMs exhibit a surprising degree of moral indifference, compressing distinct moral concepts into uniform probability distributions, a problem that persists across model scales, architectures, and alignment techniques.

Lingyu Li, Yan Teng, Yingchun Wang

Constitutional AI & AI Ethics · Interpretability & Mechanistic Interp · Red-Teaming & Adversarial Robustness +1

Jan 4, 2026 · also ZJU

OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs

Even state-of-the-art multimodal LLMs like GPT-5.2 and Claude 4.5 can be jailbroken nearly half the time using OpenRT's diverse suite of attacks, revealing a critical lack of generalization across attack paradigms.

Xin Wang, Yunhao Chen, Juncheng Li +8
