Anchor policies could be a total for complex control systems like flight. Love seeing applications of reinforcement learning! https://www.reddit.com/user/9138NOMS