Learning to walk in minutes using massively parallel deep reinforcement learning.