Letian Ruan

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Open-Source Models & Weights (1)Tool Use & Agents (1)Distributed Systems & Hardware (1)

Frequent co-authors

MiniMax (1)Aili Chen (1)Aonian Li (1)Baichuan Zhou (1)

Papers (2)

May 26, 2026

MiniMax +1972w ago·also Columbia, Eastern Institute of Technology, HIT, Institute of Artificial Intelligence (TeleAI) +11

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

MiniMax-M2 proves that massive parameter counts don't always translate to better agentic performance; strategic activation of a smaller subset can unlock frontier-level intelligence.

MiniMax, Aili Chen, Aonian Li +195

Architecture Design (Transformers, SSMs, MoE)Open-Source Models & Weights Tool Use & Agents

Apr 8, 2026

Apr 8, 2026·also NUS, ByteDance, Hamburg, HKUST +2

InfiniLoRA: Disaggregated Multi-LoRA Serving for Large Language Models

Serving LoRA adapters at scale doesn't have to crush your latency SLOs: InfiniLoRA disaggregates LoRA execution to achieve 3x higher throughput and dramatically improved tail latency.

Hongyu Chen, Letian Ruan, Zilin Xu +5

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Search

Letian Ruan

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)