May 11 – May 18, 2026

Training Efficiency & Optimization - Weekly Roundup

4 papers published across 2 labs.

1725% acceleration

Selected Labs publishing this week

Mila (1) · DAMO (1)

Top Papers

May 18, 2026

Mila · 1w ago · also CIFAR, McGill, ServiceNow

Forecasting Downstream Performance of LLMs With Proxy Metrics

Forget expensive downstream evaluations: token-level statistics from expert-written solutions can reliably forecast LLM performance with 10,000x less compute.

Arkil Patel, Siva Reddy, Marius Mosbach +1

Eval Frameworks & Benchmarks · Scaling Laws & Emergent Abilities · Training Efficiency & Optimization

1w ago · also DeepAuto.ai

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Stop wasting compute on irrelevant actions: targeted hindsight self-distillation focuses LLM agent training on the critical failure points, boosting performance and slashing training time.

Woongyeng Yeo, Yumin Choi, Taekyung Ki +1

RLHF & Preference Learning · Tool Use & Agents · Training Efficiency & Optimization

May 16, 2026

DAMO · 1w ago · also NJU

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Full-attention LLMs are intrinsically sparse and can be transformed into highly efficient sparse models with minimal training, sidestepping the need for expensive sparse pre-training.

Architecture Design (Transformers, SSMs, MoE) · Inference & Quantization · Training Efficiency & Optimization

May 13, 2026

Beomjin Ahn +3 · 1w ago

LoREnc: Low-Rank Encryption for Securing Foundation Models and LoRA Adapters

Stop IP thieves cold: LoREnc lets you lock down your foundation models and LoRA adapters without retraining, crushing model recovery attacks while keeping performance intact for authorized users.

Beomjin Ahn, Jungmin Kwon, Chanyong Jung +1

Inference & Quantization · Open-Source Models & Weights · Training Efficiency & Optimization
