Mar 5, 2026arXiv:2603.04761

Adaptive Policy Switching of Two-Wheeled Differential Robots for Traversing over Diverse Terrains

Haru Izawa, Haruki Izawa, Takeshi Takai, Shingo Kitano, Mikita Miyaguchi, Hiroaki Kawashima

AI Summary

This paper explores adaptive policy switching for two-wheeled robots navigating diverse terrains, specifically within simulated lunar lava tubes. They fine-tuned a pre-trained policy using PPO and then analyzed the robot's 3D orientation data to differentiate between flat and rough terrains. By using the standard deviation of the robot's pitch and Gaussian Mixture Models (GMM), they achieved over 98% accuracy in terrain classification with a 70-step window, demonstrating the feasibility of using short-term orientation data for reliable terrain estimation.

Key Contribution

Robots can reliably distinguish between flat and rough terrain in lava tubes with just 70 steps of orientation data, paving the way for adaptive navigation in unstructured environments.

Abstract

Exploring lunar lava tubes requires robots to traverse without human intervention. Because pre-trained policies cannot fully cover all possible terrain conditions, our goal is to enable adaptive policy switching, where the robot selects an appropriate terrain-specialized model based on its current terrain features. This study investigates whether terrain types can be estimated effectively using posture-related observations collected during navigation. We fine-tuned a pre-trained policy using Proximal Policy Optimization (PPO), and then collected the robot's 3D orientation data as it moved across flat and rough terrain in a simulated lava-tube environment. Our analysis revealed that the standard deviation of the robot's pitch data shows a clear difference between these two terrain types. Using Gaussian mixture models (GMM), we evaluated terrain classification across various window sizes. An accuracy of more than 98% was achieved when using a 70-step window. The result suggests that short-term orientation data are sufficient for reliable terrain estimation, providing a foundation for adaptive policy switching.

Robotics & Embodied AI World Models & Planning

Citation Metrics

Citations0

Influential citations0

References8

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Adaptive Policy Switching of Two-Wheeled Differential Robots for Traversing over Diverse Terrains

Related Papers