Deep Rienforcement Learning: A course on the subject


Reinforcemen learning is large and accelerating area of research. The recent advances in combining RL method with Deep learning have given way to solutions to challenging problems Like playing Atari and Robotic Manipulation. These advances have been wonderful but as many practitioners might have relized getting these methods to work together is less than trivial. There is a great book on RL by Richard Sutton that covers the general method in RL. There is also another book on Deep learning but in my opinon there my never be enoug recources on how to combine these two methods. In these articles I hope to put together a colection of material that will guide the reader along a path understanding and implimenting a method that could solve Go or learn to walk.

A large goal of these articles is to try and explain the concepts in relatively simple terms. When broken down into small parts RL can be very understandable. However, There can be much confusion in the area for a number of reasons, many definitions have multiple names that are all the same thing, the discourse assumes some Machine Learning (ML) prior knowledge, the explination is too breif, etc... I learned these concepts coming from a computer animation background, they were all very new to me. I however like to visualize and animation things to help me understand what is happening. I hope these additional aids assist in the learning process.

Try and explain things with integrals and finite sums...

List of topics

  1. Intro to RL (first few chapters of Suttons book
  2. Markov Decision Process
  3. Future Discounted Reward (and trajectories)
  4. Policies and Value functions
  5. Optimal Policies and Value functions
  6. Policy Improvement (policy iteration and value iteration)