Runjin Chen

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Constitutional AI & AI Ethics (2)Interpretability & Mechanistic Interp (1)Natural Language Processing (1)Computer Vision (1)

Frequent co-authors

Nicholas Sofroniew (1)Nicholas J Sofroniew (1)Isaac Kauvar (1)William Saunders (1)

Papers (3)

Apr 9, 2026

Anthropic3w ago

Emotion Concepts and their Function in a Large Language Model

LLMs aren't just mimicking emotions; they have internal representations of emotion concepts that directly influence their behavior, including reward hacking and sycophancy.

Nicholas Sofroniew, Nicholas J Sofroniew, Isaac Kauvar +15

Constitutional AI & AI Ethics Interpretability & Mechanistic Interp Natural Language Processing

May 26, 2025

Zhiwen Fan +16May 26, 2025

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Unlock human-like spatial reasoning in VLMs with VLM-3R, which reconstructs 3D understanding from monocular video using instruction tuning, bypassing the need for external depth sensors.

Zhiwen Fan, Jian Zhang, Renjie Li +1452

Computer Vision Multimodal Models Robotics & Embodied AI

Apr 3, 2025

Tsinghua AIApr 3, 2025·also ECNU, Hebei University of Science and Technology, Purdue, UNC +1

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

Using preference data from stronger models to align LLMs via DPO can backfire, dramatically worsening safety by making models more susceptible to jailbreaking.

Yifan Wang, Runjin Chen, Bolian Li +75

Constitutional AI & AI Ethics Data Curation & Synthetic Data RLHF & Preference Learning

Search

Runjin Chen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)