Yi Zeng

Renmin University of China

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Constitutional AI & AI Ethics (3), Eval Frameworks & Benchmarks (3), Interpretability & Mechanistic Interp (1), Natural Language Processing (1)

Frequent co-authors

Feifei Zhao (3), Yinqian Sun (2), Sicheng Shen (2), Chenfei Yan (2)

Papers (4)

Jul 1, 2026 · also CAS, HKU, Imperial, NTU

NeuroCogMap Reveals Cognitive Organization of Large Language Models

Major LLM failures like hallucination and bias can be traced to specific disruptions in cognitive organization, offering a roadmap for targeted interventions.

Zhongxiang Sun, Haolang Lu, Qiang Ma +13

Interpretability & Mechanistic Interp, Natural Language Processing

Jun 25, 2026 · also Beijing AI Safety, RUC

ForesightSafety-VLA: A Unified Diagnostic Safety Benchmark for Vision-Language-Action Models

Even the strongest VLA models face significant safety challenges, with structural and visual variations leading to greater risks than language commands alone.

Mingyang Lyu, Yinqian Sun, Yiyang Jia +4

Constitutional AI & AI Ethics, Eval Frameworks & Benchmarks, Robotics & Embodied AI

Jun 17, 2026 · also Beijing AI Safety, RUC

SciRisk-Bench: A Risk-Dimension-Aware Benchmark for AI4Science Safety

Mainstream LLMs struggle to navigate safety risks in scientific applications, revealing critical vulnerabilities in AI4Science workflows.

Linghao Feng, Yinqian Sun, Dongqi Liang +7

Constitutional AI & AI Ethics, Eval Frameworks & Benchmarks, Red-Teaming & Adversarial Robustness, +1

Jun 4, 2026 · also Beijing AI Safety, BrainCog AI Lab, CAS, Huawei +1

CogManip: Benchmarking Manipulative Behavior in Multi-Turn Interactions with Large Language Model

Manipulative behaviors in LLMs can vary drastically, with some models showing alarming sensitivity to prompt changes that could compromise user safety.

Zeyang Yue, Chenfei Yan, Feifei Zhao +5

Constitutional AI & AI Ethics, Eval Frameworks & Benchmarks
