This paper introduces a fast-slow recurrent architecture for sequential modeling that interleaves rapid latent state updates with slower, self-organizing observation updates. This approach promotes the formation of stable, clustered internal representations, enabling the model to maintain coherence over extended sequences. The proposed method outperforms LSTMs, state space models, and Transformers on out-of-distribution generalization tasks in reinforcement learning and algorithmic domains.
Forget Transformers; this new recurrent architecture learns more stable representations and generalizes better out-of-distribution by interleaving fast latent updates with slower, self-organizing observation processing.
We extend recent latent recurrent modeling to sequential input streams. By interleaving fast, self-organizing recurrent latent updates between slow observation updates, our method facilitates the learning of stable internal structures that evolve alongside the input. This mechanism allows the model to maintain coherent, clustered representations over long horizons, improving out-of-distribution generalization in reinforcement learning and algorithmic tasks compared to sequential baselines such as LSTMs, state space models, and Transformer variants.
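To make the interleaving concrete, below is a minimal sketch of the fast-slow update pattern described above: each incoming observation triggers one slow state update, followed by several fast latent-only updates. The dimensions, number of fast steps, and update rules are illustrative assumptions, not the paper's actual architecture.

```python
# Hedged sketch of a fast-slow recurrent loop (hypothetical shapes and update rules).
import numpy as np

rng = np.random.default_rng(0)

D_OBS, D_LATENT, K_FAST = 8, 16, 4  # assumed observation size, latent size, fast steps

# Randomly initialised weights stand in for learned parameters.
W_slow = rng.normal(scale=0.1, size=(D_LATENT, D_OBS))    # slow observation projection
W_fast = rng.normal(scale=0.1, size=(D_LATENT, D_LATENT)) # fast latent transition

def step(z, x):
    """One slow step: fold in the observation, then run K_FAST latent-only updates."""
    z = np.tanh(z + W_slow @ x)      # slow update: incorporate observation x
    for _ in range(K_FAST):          # fast updates: evolve the latent state on its own
        z = np.tanh(W_fast @ z)
    return z

z = np.zeros(D_LATENT)                        # initial latent state
for x in rng.normal(size=(10, D_OBS)):        # toy sequence of 10 observations
    z = step(z, x)
print(z.shape)                                # (16,)
```

In this toy loop the fast inner updates give the latent state room to settle between observations, which is the intuition behind the stable, clustered representations claimed in the abstract; the paper's actual self-organizing mechanism is not reproduced here.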