Google ResearchUMassMar 2, 2026arXiv:2603.02008

Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards

Faisal Mohamed, Catherine Ji, Benjamin Eysenbach, Glen Berseth

AI Summary

This paper introduces a novel intrinsic exploration method that leverages temporal contrastive learning to guide agents towards states with unpredictable future outcomes. By learning temporal representations that capture information relevant to downstream tasks, the agent actively explores states that enhance its understanding of the environment. The authors demonstrate that this approach enables the learning of complex exploratory behaviors in locomotion, manipulation, and embodied AI tasks, achieving performance comparable to reward-engineered methods.

Key Contribution

Forget hand-engineered reward functions: this method learns complex exploratory behaviors by simply predicting which states lead to unpredictable futures.

Abstract

Effective exploration in reinforcement learning requires not only tracking where an agent has been, but also understanding how the agent perceives and represents the world. To learn powerful representations, an agent should actively explore states that contribute to its knowledge of the environment. Temporal representations can capture the information necessary to solve a wide range of potential tasks while avoiding the computational cost associated with full state reconstruction. In this paper, we propose an exploration method that leverages temporal contrastive representations to guide exploration, prioritizing states with unpredictable future outcomes. We demonstrate that such representations can enable the learning of complex exploratory x in locomotion, manipulation, and embodied-AI tasks, revealing capabilities and behaviors that traditionally require extrinsic rewards. Unlike approaches that rely on explicit distance learning or episodic memory mechanisms (e.g., quasimetric-based methods), our method builds directly on temporal similarities, yielding a simpler yet effective strategy for exploration.

Robotics & Embodied AI World Models & Planning

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards

Related Papers