Search papers, labs, and topics across Lattice.
4 papers from Microsoft Research on Training Efficiency & Optimization
Autonomous driving gets a boost: CRAFT cleverly combines the best of both worlds – dense counterfactual supervision and grounded closed-loop feedback – to significantly improve driving policies.
Optimizing for runtime in multimodal training can be energy-inefficient, as data movement and overlap on Grace Hopper chips dominate energy consumption, not raw compute.
TurboQuant's "novel" quantization method is actually a special case of a prior technique (EDEN) with a crucial parameter stuck at a suboptimal value, leading to demonstrably worse performance.
By explicitly detecting and escaping "Forbidden Zones" during training, AMD unlocks significant gains in sample fidelity and training robustness for few-step generative models like SDXL.