FastStair: Humanoid Robots Learn to Sprint Up Stairs with Planner-Guided Reinforcement Learning
Key Takeaways
- ▸FastStair combines model-based planning with model-free reinforcement learning to achieve both dynamic agility and strict stability in stair climbing—a previously difficult balance in robotics
- ▸The framework uses parallel foothold planning during RL training to bias exploration toward feasible contacts and pretrain a safety-focused base policy, reducing unsafe behaviors
- ▸Speed-specialized expert policies coordinated via Low-Rank Adaptation enable smooth performance across the full speed range, mitigating conservatism and distribution mismatch issues
Summary
Researchers have developed FastStair, a novel framework that enables humanoid robots to climb stairs at unprecedented speeds by combining model-free reinforcement learning with model-based foothold planning. The approach addresses a fundamental challenge in robotics: balancing the dynamic agility needed for fast movement with the stability constraints required for safe stair traversal. Traditional RL methods can generate dynamic motion but lack safety guarantees, while conventional planners are overly conservative. FastStair reconciles these competing demands through a multi-stage learning process that integrates a parallel foothold planner into the RL training loop, allowing the robot to explore dynamically feasible movements while maintaining stability.
The framework further refines performance through speed-specialized expert policies that are coordinated using Low-Rank Adaptation (LoRA), enabling smooth operation across all commanded speeds. When deployed on the Oli humanoid robot, FastStair achieved stable stair ascent at speeds up to 1.65 m/s and successfully traversed a 33-step spiral staircase with 17 cm rise per step in just 12 seconds. The research team demonstrated the real-world effectiveness of their approach by winning the Champion solution at the Canton Tower Robot Run Up Competition, showcasing robust performance on long staircases at high speeds.
- The Oli humanoid robot achieved record stair-climbing performance, ascending at up to 1.65 m/s and completing a 33-step spiral staircase in 12 seconds, winning the Canton Tower Robot Run Up Competition
Editorial Opinion
FastStair represents a meaningful step forward in humanoid robotics by demonstrating how hybrid learning approaches can overcome fundamental tradeoffs between safety and performance. The integration of model-based constraints into model-free learning via LoRA-based expert coordination is an elegant solution that other roboticists will likely adopt. While the work is impressive on stairs specifically, the underlying framework—balancing conservative planning with dynamic exploration—could have broad implications for other high-stakes robotic locomotion tasks where both speed and safety are critical.



