Pengyu Zhao

MiniMax

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Architecture Design (Transformers, SSMs, MoE) (2) · Scaling Laws & Emergent Abilities (2) · Tool Use & Agents (2) · Reasoning & Chain-of-Thought (1)

Frequent co-authors

Zhengmao Zhu (3), Jiayuan Song (3), Jingyang Li (2), Binyang Jiang (2)

Papers (5)

Jun 11, 2026

1w ago · also NVIDIA, HIT, HUST, PKU +1

MiniMax Sparse Attention

MSA cuts per-token attention compute by more than 28x while maintaining competitive performance, making ultra-long contexts more tractable for LLMs.

Xunhao Lai, Weiqi Xu, Yufeng Yang +9

Architecture Design (Transformers, SSMs, MoE) · Scaling Laws & Emergent Abilities

1w ago · also Tsinghua AI, Ant Group, CUHK, Fudan +1

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

MaxProof's innovative test-time scaling enables an AI to outperform human champions in mathematical proof competitions.

Jiacheng Chen, Xinyu Zhang, Shunkai Zhang +24

Reasoning & Chain-of-Thought · Scaling Laws & Emergent Abilities

May 26, 2026

MiniMax +176 · 3w ago · also Ant Group, Columbia, CUHK, Fudan +18

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

MiniMax-M2 proves that massive parameter counts don't always translate to better agentic performance; strategic activation of a smaller subset can unlock frontier-level intelligence.

MiniMax, Aili Chen, Aonian Li +174

Architecture Design (Transformers, SSMs, MoE) · Open-Source Models & Weights · Tool Use & Agents

Mar 10, 2026

Mar 10, 2026 · also MiniMax

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Forget data quantity; diversity is the secret sauce. Scaling the variety of tool-use patterns in training data improves LLM generalization by 22 points on OOD benchmarks, even with 4x less data.

Aili Chen, Chi Zhang, Junteng Liu +11

Code Generation & Program Synthesis · Eval Frameworks & Benchmarks · Tool Use & Agents

Mar 1, 2026

Mar 1, 2026 · also Honda RI, OUC, WHU

Transferring Policy of Offline Reinforcement Learning From Hybrid Dataset to Real World via Progressive Neural Network

Policies learned offline from a mix of real and simulated data can be transferred effectively to real-world robot tasks using progressive neural networks, enabling safer and more efficient online adaptation.

Pengyu Zhao, Pengyu Zhao, Zheng Fang +5

RLHF & Preference Learning · Robotics & Embodied AI
