Previous works on Cassie:
combined reference trajectories and DRL
→ based off only a single manipulated reference and thus were limited in application [18]
using SLIP models to create libraries of reference trajectories to guide DRL [19]
In this paper:
proposes a method to generate reduced-order model reference trajectories, Single Rigid-Body Models (SRBM), for general classes of highly dynamic maneuvers for bipedal robots
→ SRBM have been applied to quadruped robots to great success (Mini cheetah [11])
→ also bipeds (Atlas [15] ~ [17])
SRBM’s simplicity allows for fast iteration and refinement of behaviors
robustness of learning-based controllers allows for highly dynamic motions to be transferred to hardware
by using TO, creates libraries of optimal trajectories offline that utilize the SRBM to plan a variety of highly dynamic behaviours
These trajectories are then incorporated as part of a policies reward function and trained using PPO to develop controllers
Contents:

where
SRBM are actuated by ground reaction forces.
Optimization method: Direct collocation in COALESCE
Solver: IPOPT