Forget scaling laws: pre-training LLMs on just 164M tokens of synthetic, non-linguistic data can outperform pre-training on 1.6B tokens of Common Crawl, opening a new path to efficient model training.