Nemotron 3 Super shows that combining Mamba, Attention, and Mixture-of-Experts can match the accuracy of existing 120B models while delivering significantly higher inference throughput.
Real-time, lightweight image compression is now possible with diffusion models, thanks to a novel architecture that swaps transformers for convolutions and prioritizes compression-focused pre-training.
LLMs that ace math and physics still struggle with general reasoning, scoring only 63% accuracy on a new K-12-level benchmark.