AIRI, ISP RAS Research Center for Trusted AI
Unlearning is much easier on supervised fine-tuned models than on pretrained ones; applying unlearning directly to pretrained models often leads to catastrophic forgetting.
Sparse autoencoders, hyped as a key interpretability tool, may not learn much more than random feature sets do, casting doubt on their ability to decompose model internals.