Kai Tian

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (2)Scientific Discovery & Drug Design (1)Tool Use & Agents (1)

Frequent co-authors

Che Jiang (2)Junlin Yang (2)Zhenzhao Yuan (2)Jincheng Zhong (2)

Papers (2)

Jun 23, 2026

Yuru Wang +165d ago·also Department of Radiology & Biomedical

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

AI coding agents excel at translating scientific tasks into familiar formats but struggle to achieve true scientific discovery, with only 17.8% surpassing state-of-the-art benchmarks.

Yuru Wang, Lejun Cheng, Yuxin Zuo +14

Eval Frameworks & Benchmarks Scientific Discovery & Drug Design

Jun 22, 2026

Jincheng Zhong +76d ago·also Department of Radiology & Biomedical, Horizon Research

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Enterprise agents struggle to achieve high performance in real-world tasks, with the best benchmark score only reaching 0.663, highlighting significant evaluation gaps.

Jincheng Zhong, Weizhi Wang, Che Jiang +5

Eval Frameworks & Benchmarks Tool Use & Agents

Search

Kai Tian

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)