Actor Critic, Policy Gradient, DQN, VFA, SARSA, Q-learning, Model-based and Model-free Monte Carlo, Dynamic Programming — Reinforcement learning has gained tremendous popularity in the last decade with a series of successful real-world applications in robotics, games and many other fields. In this article, I will provide a high-level structural overview of classic reinforcement learning algorithms. …