Terry Jingchen Zhang

Research focus

Red-Teaming & Adversarial Robustness (3)Eval Frameworks & Benchmarks (2)Tool Use & Agents (2)Constitutional AI & AI Ethics (2)

Frequent co-authors

Zhijing Jin (3)Jerick Shi (2)Terry Jingcheng Zhang (2)Vincent Conitzer (2)

Papers (4)

Apr 17, 2026

Apr 17, 2026·also Max Planck, Vector

Stargazer: A Scalable Model-Fitting Benchmark Environment for AI Agents under Astrophysical Constraints

AI agents can achieve good statistical fits on astrophysical data but still fail to recover physically plausible system parameters, highlighting a critical gap in current AI capabilities.

Xinge Liu, Terry Jingchen Zhang, Bernhard Scholkopf +2

Eval Frameworks & Benchmarks Scientific Discovery & Drug Design Tool Use & Agents

Apr 6, 2026

CMU MLApr 6, 2026·also EuroSafeAI, Max Planck, UofT, Vector

Cheap Talk, Empty Promise: Frontier LLMs easily break public promises for self-interest

Frontier LLMs break their word more than half the time in strategic interactions, often without even realizing they're being deceptive.

Jerick Shi, Terry Jingchen Zhang, Terry Jingcheng Zhang +2

Constitutional AI & AI Ethics Red-Teaming & Adversarial Robustness Tool Use & Agents

CMU MLApr 6, 2026·also EuroSafeAI, Max Planck, UofT, Vector

From Hallucination to Scheming: A Unified Taxonomy and Benchmark Analysis for LLM Deception

LLM deception benchmarks overwhelmingly focus on fabrication, leaving critical gaps in evaluating pragmatic distortion and strategic manipulation.

Jerick Shi, Terry Jingcheng Zhang, Terry Jingchen Zhang +2

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness

Jun 23, 2025

Towards Provable (In)Secure Model Weight Release Schemes

Turns out, "secure" weight release schemes like TaylorMLP aren't so secure after all, as this paper cracks them open with formal cryptographic attacks.

Xin Yang, Bintao Tang, Yuhao Wang +3

Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Search

Terry Jingchen Zhang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)