H. Babak

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Training Efficiency & Optimization (1)

Frequent co-authors

Melanie Schaller (1)

Papers (1)

Apr 28, 2026

H. Babak +13w ago

CUDA Kernel Optimization and Counter-Free Performance Analysis for Depthwise Convolution in Cloud Environments

Unlock significant speedups in depthwise convolutions (up to 3.26x) with optimized CUDA kernels, even in restricted cloud environments lacking hardware performance counters.

H. Babak, Melanie Schaller

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Search

H. Babak

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)