Zechun Liu

Papers on Lattice

Total citations

Topics

h-index

Research focus

Inference & Quantization (2)Architecture Design (Transformers, SSMs, MoE) (2)Computer Vision (1)Multimodal Models (1)Tool Use & Agents (1)

Frequent co-authors

Vikas Chandra (5)Raghuraman Krishnamoorthi (4)Wei Wen (3)Changsheng Zhao (3)

Papers (5)

Apr 9, 2026

Junjie Fei +15Apr 9, 2026

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Forget brute-force context windows: a small vision-language model can compress hour-long videos below theoretical limits by intelligently prioritizing relevant content.

Junjie Fei, Jun Chen, Zechun Liu +13

Computer Vision Inference & Quantization Multimodal Models

Apr 7, 2026

Mingchen Zhuge +16Apr 7, 2026·also L1)

Neural Computers

Forget agents and world models – the future of computing could be learned directly from I/O traces, turning the model itself into the computer.

Mingchen Zhuge, Changsheng Zhao, Haozhe Liu +14

Architecture Design (Transformers, SSMs, MoE)Tool Use & Agents World Models & Planning

Mar 19, 2026

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Scale up offline policy training for diffusion LLMs without breaking the bank: dTRPO slashes trajectory computation costs while boosting performance up to 9.6% on STEM tasks.

Wenxuan Zhang, Lemeng Wu, Changsheng Zhao +11

Natural Language Processing RLHF & Preference Learning Training Efficiency & Optimization

Mar 16, 2026

Meta AIMar 16, 2026·also Mila

MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale

Forget exotic attention mechanisms – MobileLLM-Flash achieves up to 1.8x faster LLM prefill on mobile CPUs by smartly pruning and adapting existing architectures for on-device use.

Igor Fedorov, Andrey Gromov, B. Beckerman +12

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Sep 29, 2025

Meta AISep 29, 2025

MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes

Forget scaling laws: this work shows you can get SOTA reasoning from sub-billion parameter models with *less* data, if you're smart about curation and resampling.

Changsheng Zhao, Ernie Chang, Zechun Liu +8

Open-Source Models & Weights Reasoning & Chain-of-Thought Scaling Laws & Emergent Abilities

Search

Zechun Liu

Research focus

Frequent co-authors

Papers (5)