Craig W. Schmidt

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Natural Language Processing (2)Inference & Quantization (1)Training Efficiency & Optimization (1)

Frequent co-authors

Michael Krumdick (1)Adam Wiemerslage (1)Seth Ebner (1)Varshini Reddy (1)

Papers (2)

May 21, 2026

MIT CSAIL3w ago·also Ben-Gurion University, Kensho Technologies

Tokenization with Split Trees

Subword tokenization just got a whole lot more efficient: ToaST slashes token counts by 11% and boosts language model performance by up to 7.6% compared to standard methods.

Craig W. Schmidt, Michael Krumdick, Adam Wiemerslage +4

Inference & Quantization Natural Language Processing

ETH3w ago

Tokenisation via Convex Relaxations

Escape the greedy trap: Convex optimization yields tokenizers that compress better and come with optimality guarantees.

Jan Tempus, Philip Whittington, Craig W. Schmidt +2

Natural Language Processing Training Efficiency & Optimization

Search

Craig W. Schmidt

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)