Aleksandr Panov

Research focus

Tool Use & Agents (2)RLHF & Preference Learning (1)World Models & Planning (1)Architecture Design (Transformers, SSMs, MoE) (1)Robotics & Embodied AI (1)

Frequent co-authors

Alexey Skrynnik (2)Zoya Volovikova (1)Nikita Sorokin (1)Dmitriy Lukashevskiy (1)

Papers (2)

Apr 22, 2026

MIRAIApr 22, 2026·also CogAI Lab

Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning

Forget meticulously annotating subtasks – SuperIgor lets language models self-learn to generate and refine instruction-following plans through RL feedback.

Zoya Volovikova, Nikita Sorokin, Dmitriy Lukashevskiy +2

RLHF & Preference Learning Tool Use & Agents World Models & Planning

Apr 7, 2026

Maria Nesterova +8Apr 7, 2026·also CogAI Lab, MIRAI

MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning

A single transformer can master StarCraft, Football, and POGEMA, suggesting we can unify MARL under one foundation model.

Maria Nesterova, Mikhail Kolosov, Anton Andreychuk +6

Architecture Design (Transformers, SSMs, MoE)Robotics & Embodied AI Tool Use & Agents

Search

Aleksandr Panov

Research focus

Frequent co-authors

Papers (2)