ppo drama

how to use ppo for gpu-parallelised robot manipulation

PPO drama

there are lots of tricks beyond what was introduced.

resources:

here I keep a list of them that I have found relevant for robotic manipulation continuous control

PPO update

Scaling

Optimisers

Learning rate schedulers

Networks