Olatunji Ruwase

Snowflake

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Distributed Systems & Hardware (2) · Training Efficiency & Optimization (2) · Multimodal Models (1) · Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Mahmoud Ahmed (1), Sameh Abdulah (1), Sam Ade Jacobs (1), Mathis Bode (1)

Papers (2)

May 3, 2026 · also Microsoft Research, Forschungszentrum Jülich GmbH, Snowflake

Cross-Layer Energy Analysis of Multimodal Training on Grace Hopper Superchips

Optimizing for runtime in multimodal training can be energy-inefficient, as data movement and overlap on Grace Hopper chips dominate energy consumption, not raw compute.

Mahmoud Ahmed, Sameh Abdulah, Olatunji Ruwase +4

Distributed Systems & Hardware · Multimodal Models · Training Efficiency & Optimization

Apr 29, 2026 · also Snowflake

AutoSP: Unlocking Long-Context LLM Training Via Compiler-Based Sequence Parallelism

AutoSP simplifies training LLMs on ultra-long contexts by automating sequence parallelism and activation checkpointing, boosting context length by up to 2.7x with negligible throughput cost.

Ahan Gupta, Zhihao Wang, Neel Dani +2

Architecture Design (Transformers, SSMs, MoE) · Distributed Systems & Hardware · Training Efficiency & Optimization
