University of California, San Diego
Forget GPU-centric designs: AMMA slashes attention latency by 15x and energy consumption by 7x with a memory-centric architecture for long-context LLMs.
LLMs still struggle to reason in context when cultural and linguistic nuances are involved, achieving only 44% accuracy on a new grounded benchmark spanning 14 languages.
LLMs can leapfrog state-of-the-art scientific algorithms and human-designed solutions, but only if you scale the evaluation loop, not just the model.
Ditch the training overhead and still get up to 4.79x faster diffusion sampling with Spectrum, a training-free feature forecasting method that actually maintains image quality.
LLMs still have a long way to go in AI-aided chip design: even the best models achieve surprisingly low scores on the new ChipBench benchmark for Verilog generation and reference model creation.