Approaches for learning legged locomotion and their challenges:
Using reference motions to guide learning
Using reference-free reward functions
In this paper:
The gait reward should be
Contributions of this paper:
a framework for designing reward functions for gaits that are characterized by periodic swing and stance phases.
Swing phase: foot forces are penalized while velocities are allowed
→ learn to lift the foot
demonstrate this framework for sim-to-real RL of all common bipedal gaits, including walking, running, galloping, skipping, and hopping, without using a motion capture dataset or reference trajectories
Contents: