Tinkoff ResearchMar 3, 2026arXiv:2603.02765

Next Embedding Prediction Makes World Models Stronger

George Bredis, Nikita Balagansky, Daniil Gavrilov, Ruslan Rakhimov

AI Summary

NE-Dreamer, a novel MBRL agent, predicts next-step encoder embeddings using a temporal transformer to optimize temporal predictive alignment in the latent space, eliminating the need for a decoder. This decoder-free approach allows the agent to learn coherent state representations without reconstruction losses or auxiliary supervision. Experiments show NE-Dreamer matches or exceeds DreamerV3 on DeepMind Control Suite and achieves significant performance gains on challenging DMLab tasks involving memory and spatial reasoning.

Key Contribution

Ditch the decoder: Directly predicting future latent embeddings with a temporal transformer yields state-of-the-art MBRL performance, especially in complex, partially observable environments.

Abstract

Capturing temporal dependencies is critical for model-based reinforcement learning (MBRL) in partially observable, high-dimensional domains. We introduce NE-Dreamer, a decoder-free MBRL agent that leverages a temporal transformer to predict next-step encoder embeddings from latent state sequences, directly optimizing temporal predictive alignment in representation space. This approach enables NE-Dreamer to learn coherent, predictive state representations without reconstruction losses or auxiliary supervision. On the DeepMind Control Suite, NE-Dreamer matches or exceeds the performance of DreamerV3 and leading decoder-free agents. On a challenging subset of DMLab tasks involving memory and spatial reasoning, NE-Dreamer achieves substantial gains. These results establish next-embedding prediction with temporal transformers as an effective, scalable framework for MBRL in complex, partially observable environments.

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization World Models & Planning

Citation Metrics

Citations0

Influential citations0

References29

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Next Embedding Prediction Makes World Models Stronger

Related Papers