8 papers published by 1 lab.
Quantum chemistry's density matrix approach reveals interpretable early warning signals of phase transitions in deep learning, from grokking to emergent misalignment.
LLMs spontaneously organize into brain-like functional units where the whole is greater than the sum of its parts, and destroying these synergistic cores cripples reasoning.
Training language models on individual children's language reveals that distributional and interactional linguistic features, not just dataset size, are key to efficient learning, mirroring factors that drive child language acquisition.
Forget full automation – the sweet spot for AI deployment is often partial automation, where humans and AI collaborate to minimize costs.
Forget painstaking hyperparameter tuning: this hypersphere parameterization lets you transfer a single learning rate across model sizes, depths, and even MoE architectures, cutting compute costs by a factor of 1.58x.
Scaling laws work so well because they capture the essence of computation, not the specifics of implementation, leading to a persistent efficiency arms race.
Scientific reasoning gains from prompt engineering are often mirages, driven by model-specific hacks that don't generalize.
LLMs exhibit polarity illusions without rational inference, suggesting that "good enough" processing and partial grammaticalization may suffice to explain these phenomena in both machines and humans.