Lattice AI Research

Research focus

Eval Frameworks & Benchmarks (2) · Multimodal Models (1) · Scientific Discovery & Drug Design (1) · Architecture Design (Transformers, SSMs, MoE) (1) · Open-Source Models & Weights (1)

Frequent co-authors

Xiaogang Li (2) · Chengliang Xu (2) · Zichao Chen (2) · P. Xiao (1)

Papers (3)

Feb 26, 2026 · also DAMO, Skylenage

SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy

LLMs still struggle with PhD-level scanning probe microscopy tasks, but SPM-Bench offers a new automated pipeline to generate challenging scientific benchmarks and quantify model "personalities" like "Conservative" or "Gambler."

Peiyao Xiao, P. Xiao, Xiaogang Li +12

Eval Frameworks & Benchmarks · Multimodal Models · Scientific Discovery & Drug Design

Amazon Science · Feb 16, 2026 · also UB

DeepMTL2R: A Library for Deep Multi-task Learning to Rank

Stop hand-rolling your multi-task learning to rank models: DeepMTL2R provides a ready-to-use framework with 21 SOTA algorithms and Pareto-optimal optimization.

Chaosheng Dong, Peiyao Xiao, Yijia Wang +1

Architecture Design (Transformers, SSMs, MoE) · Open-Source Models & Weights · Recommendation & Information Retrieval

DAMO · Feb 15, 2026 · also MIT CSAIL, Tsinghua AI, Fudan, Huawei +1

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

LLM benchmark accuracy jumps 10% when evaluated on a cleaned-up version of Humanity's Last Exam, highlighting the significant impact of dataset noise on performance metrics.

Weiqi Zhai, Zhihai Wang +44

Data Curation & Synthetic Data · Eval Frameworks & Benchmarks · Natural Language Processing

Peiyao Xiao
