Optimizing Bipedal Maneuvers of Single Rigid-Body Models for Reinforcement Learning

Previous works on Cassie:

combined reference trajectories and DRL

→ based off only a single manipulated reference and thus were limited in application [18]
using SLIP models to create libraries of reference trajectories to guide DRL [19]

In this paper:

proposes a method to generate reduced-order model reference trajectories, Single Rigid-Body Models (SRBM), for general classes of highly dynamic maneuvers for bipedal robots

→ SRBM have been applied to quadruped robots to great success (Mini cheetah [11])

→ also bipeds (Atlas [15] ~ [17])
SRBM’s simplicity allows for fast iteration and refinement of behaviors
robustness of learning-based controllers allows for highly dynamic motions to be transferred to hardware
by using TO, creates libraries of optimal trajectories offline that utilize the SRBM to plan a variety of highly dynamic behaviours
These trajectories are then incorporated as part of a policies reward function and trained using PPO to develop controllers

Contents:

SRBM Formulation

스크린샷 2022-08-02 오전 10.22.06.png

where

SRBM are actuated by ground reaction forces.

Optimization method: Direct collocation in COALESCE

Solver: IPOPT