Chen Bo Calvin Zhang

Research focus

Eval Frameworks & Benchmarks (2) · Scientific Discovery & Drug Design (2) · Reasoning & Chain-of-Thought (1) · Constitutional AI & AI Ethics (1)

Frequent co-authors

Udari Madhushani Sehwag (1) · Elaine Lau (1) · Haniyeh Ehsani Oskouie (1) · Shayan Shabihi (1)

Papers (2)

Apr 12, 2026 · also McGill, Princeton, UMD

SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?

LLMs can't reliably predict the outcomes of scientific experiments and, more worryingly, have no idea when they're wrong, unlike human experts, whose accuracy rises sharply when they feel confident.

Udari Madhushani Sehwag, Elaine Lau, Haniyeh Ehsani Oskouie +12

Eval Frameworks & Benchmarks · Reasoning & Chain-of-Thought · Scientific Discovery & Drug Design

Feb 26, 2026 · also B trainable parameters, Mondo Robotics

LLM Novice Uplift on Dual-Use, In Silico Biology Tasks

LLMs can boost novice performance on complex biosecurity tasks to surpass even expert-level benchmarks, but users struggle to fully leverage the models' capabilities.

Chen Bo Calvin Zhang, Christina Q. Knight +31

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Scientific Discovery & Drug Design
