Ahson Saiyed

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Interpretability & Mechanistic Interp (1), Red-Teaming & Adversarial Robustness (1)

Frequent co-authors

Sabrina Sadiekh (1), Chirag Agarwal (1)

Papers (1)

Apr 20, 2026

Towards Understanding the Robustness of Sparse Autoencoders

Integrating sparse autoencoders into transformer models can reduce jailbreak success rates by up to 5x, reshaping our understanding of model robustness against adversarial attacks.

Ahson Saiyed, Sabrina Sadiekh, Chirag Agarwal

Interpretability & Mechanistic Interp, Red-Teaming & Adversarial Robustness
