Chaoyi Wu

Shanghai Jiao Tong University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (1)Natural Language Processing (1)Tool Use & Agents (1)

Frequent co-authors

Cheng Liang (1)Pengcheng Qiu (1)Ya Zhang (1)Yanfeng Wang (1)

Papers (1)

Jun 3, 2026

1w ago·also Artificial Intelligence Laboratory, Shanghai AI Lab

Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

Static benchmarks fail to predict LLM performance in dynamic clinical settings, with top models only achieving 60.4% of expert criteria in real-world simulations.

Cheng Liang, Pengcheng Qiu, Ya Zhang +3

Eval Frameworks & Benchmarks Natural Language Processing Tool Use & Agents

Search

Chaoyi Wu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)