Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior

Gabriel B. Margolis     Pulkit Agrawal
Improbable AI Lab
Computer Science and Artificial Intelligence Laboratory (CSAIL)
Massachusetts Institute of Technology

Conference on Robot Learning 2022 (Oral)


Paper GitHub

Video



Abstract


Learned locomotion policies can rapidly adapt to diverse environments similar to those experienced during training but lack a mechanism for fast tuning when they fail in an out-of-distribution test environment. This necessitates a slow and iterative cycle of reward and environment redesign to achieve good performance on a new task. As an alternative, we propose learning a single policy that encodes a structured family of locomotion strategies that solve training tasks in different ways, resulting in Multiplicity of Behavior (MoB). Different strategies generalize differently and can be chosen in real-time for new tasks or environments, bypassing the need for time-consuming retraining. We release a fast, robust open-source MoB locomotion controller, Walk These Ways, that can execute diverse gaits with variable footswing, posture, and speed, unlocking diverse downstream tasks: crouching, hopping, high-speed running, stair traversal, bracing against shoves, rhythmic dance, and more.


Paper


Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior
Gabriel B. Margolis, Pulkit Agrawal
Conference on Robot Learning (CoRL), 2022
paper / project page / code / bibtex


Tuning for Diverse Environments


All behaviors are achieved by a single neural network policy with its gait parameters modulated by a human operator or procedurally generated by a script.



Tuning for Diverse Tasks


All behaviors are achieved by a single neural network policy with its gait parameters modulated by a human operator or procedurally generated by a script.



Demonstrating Gait Parameters


Showcasing different locomotion styles available in Walk These Ways.





Rapid Locomotion via Reinforcement Learning
Gabriel B. Margolis, Ge Yang, Kartik Paigwar, Tao Chen, Pulkit Agrawal
Robotics: Science and Systems (RSS), 2022
paper / project page / code / bibtex
 
Learning to Jump from Pixels
Gabriel B. Margolis, Tao Chen, Kartik Paigwar, Xiang Fu, Donghyun Kim, Sangbae Kim, Pulkit Agrawal
Conference on Robot Learning (CoRL), 2021
paper / project page / bibtex


Website made using this template.