NE-Dreamer is a decoder-free model-based reinforcement learning (MBRL) agent that uses a temporal transformer to predict next-step encoder embeddings from latent state sequences, directly optimizing temporal predictive alignment in representation space. Because it has no decoder, the agent learns coherent, predictive state representations without reconstruction losses or auxiliary supervision. On the DeepMind Control Suite, NE-Dreamer matches or exceeds DreamerV3 and leading decoder-free agents; on a challenging subset of DMLab tasks involving memory and spatial reasoning, it achieves substantial gains.
Ditch the decoder: directly predicting next-step encoder embeddings with a temporal transformer matches or exceeds DreamerV3 on the DeepMind Control Suite and delivers substantial gains in complex, partially observable environments.
Capturing temporal dependencies is critical for model-based reinforcement learning (MBRL) in partially observable, high-dimensional domains. We introduce NE-Dreamer, a decoder-free MBRL agent that leverages a temporal transformer to predict next-step encoder embeddings from latent state sequences, directly optimizing temporal predictive alignment in representation space. This approach enables NE-Dreamer to learn coherent, predictive state representations without reconstruction losses or auxiliary supervision. On the DeepMind Control Suite, NE-Dreamer matches or exceeds the performance of DreamerV3 and leading decoder-free agents. On a challenging subset of DMLab tasks involving memory and spatial reasoning, NE-Dreamer achieves substantial gains. These results establish next-embedding prediction with temporal transformers as an effective, scalable framework for MBRL in complex, partially observable environments.
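To make the next-embedding objective concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single causal self-attention layer with identity projections as a stand-in for the temporal transformer, and a cosine-distance loss as a stand-in for the alignment objective, which the abstract does not specify. The names `causal_attention`, `cosine`, and `next_embedding_loss` are hypothetical, and a real agent would use learned weights and typically treat the target embeddings as fixed targets.

```python
import math


def causal_attention(seq):
    """Single-head causal self-attention with identity projections (assumption).

    seq is a list of T vectors. Position t attends only to positions 0..t,
    so the output at step t never depends on future latents.
    """
    dim = len(seq[0])
    out = []
    for t, query in enumerate(seq):
        # Scaled dot-product scores against the current and earlier steps.
        scores = [sum(q * k for q, k in zip(query, seq[s])) / math.sqrt(dim)
                  for s in range(t + 1)]
        peak = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(score - peak) for score in scores]
        total = sum(weights)
        out.append([sum(w / total * seq[s][d] for s, w in enumerate(weights))
                    for d in range(dim)])
    return out


def cosine(a, b):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b + 1e-8)


def next_embedding_loss(latents, embeddings):
    """Mean cosine distance between the prediction at t and the embedding at t+1.

    The cosine distance is an assumed choice of alignment loss.
    """
    preds = causal_attention(latents)
    pairs = list(zip(preds[:-1], embeddings[1:]))
    return sum(1.0 - cosine(p, e) for p, e in pairs) / len(pairs)


# Toy data: T steps of D-dimensional latent states and encoder embeddings.
T, D = 6, 4
latents = [[math.sin(t + d) for d in range(D)] for t in range(T)]
embeddings = [[math.cos(0.5 * t + d) for d in range(D)] for t in range(T)]
loss = next_embedding_loss(latents, embeddings)
```

Note that no observation is ever reconstructed: the only training signal in this sketch is the agreement between the predicted and actual next-step embeddings, which is the sense in which the approach is decoder-free.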