Forget more data: pre-training on just 164M tokens of synthetic data from Neural Cellular Automata can outperform pre-training on 1.6B tokens of natural language for downstream LLM tasks.
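
To make the idea concrete, here is a minimal, hypothetical sketch of how one might roll out a Neural Cellular Automaton and serialize its states into a synthetic token stream for pre-training. The teaser does not specify the actual generator architecture, serialization scheme, or hyperparameters, so every name and size below (GRID, CHANNELS, VOCAB, the step count, the quantization rule) is an illustrative assumption, not the paper's method.

```python
# Hypothetical sketch: generate synthetic pre-training tokens from an NCA rollout.
# All sizes and the update rule are assumptions for illustration only.
import numpy as np

rng = np.random.default_rng(0)

GRID, CHANNELS, VOCAB = 64, 8, 256  # assumed grid width, state dim, vocab size

# Fixed random "neural" update rule: perceive neighbors, then a tiny 2-layer MLP.
W1 = rng.standard_normal((3 * CHANNELS, 32)) * 0.3
W2 = rng.standard_normal((32, CHANNELS)) * 0.3

def nca_step(state):
    """One NCA update: each cell perceives (left, self, right) and applies the MLP."""
    left = np.roll(state, 1, axis=0)
    right = np.roll(state, -1, axis=0)
    perception = np.concatenate([left, state, right], axis=1)  # (GRID, 3*CHANNELS)
    hidden = np.tanh(perception @ W1)
    return state + 0.1 * np.tanh(hidden @ W2)  # residual update keeps dynamics bounded

def tokens_from_rollout(steps=32):
    """Run the NCA and quantize channel 0 of each cell into discrete token ids."""
    state = rng.standard_normal((GRID, CHANNELS)) * 0.1
    tokens = []
    for _ in range(steps):
        state = nca_step(state)
        # Bucket channel 0 into VOCAB tokens per cell (assumed serialization).
        ids = np.clip(((state[:, 0] + 3) / 6 * VOCAB).astype(int), 0, VOCAB - 1)
        tokens.extend(ids.tolist())
    return tokens

corpus = tokens_from_rollout()
print(len(corpus), corpus[:10])  # 2048 tokens per rollout under these assumptions
```

The point of such a generator is that the token stream is cheap to produce in unlimited quantity yet carries structured, spatially correlated dynamics rather than noise, which is presumably what lets a small synthetic corpus (164M tokens) compete with a much larger natural-language one (1.6B tokens) during pre-training.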