Zeyu Wang

Alibaba Group

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (2)Multimodal Models (1)Scientific Discovery & Drug Design (1)Data Curation & Synthetic Data (1)

Frequent co-authors

Peiyao Xiao (2)Xiaogang Li (2)Chengliang Xu (2)Ben Wang (2)

Papers (2)

Feb 26, 2026

1w ago·also DAMO, Skylenage

SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy

LLMs still struggle with PhD-level scanning probe microscopy tasks, but SPM-Bench offers a new automated pipeline to generate challenging scientific benchmarks and quantify model "personalities" like "Conservative" or "Gambler."

Peiyao Xiao, P. Xiao, Xiaogang Li +14

Eval Frameworks & Benchmarks Multimodal Models Scientific Discovery & Drug Design

Feb 15, 2026

DAMO3w ago·also Fudan, UB

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

LLM benchmark accuracy jumps 10% when evaluated on a cleaned-up version of Humanity's Last Exam, highlighting the significant impact of dataset noise on performance metrics.

Weiqi Zhai, Weiqi Zhai, Zhihai Wang +55

Data Curation & Synthetic Data Eval Frameworks & Benchmarks Natural Language Processing

Search

Zeyu Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)