Skip to content

Python implementation of common RL algorithms using OpenAI gym environments

Notifications You must be signed in to change notification settings

jayeshk7/RL-Algorithms

Repository files navigation

Reinforcement Learning

All the algorithms I implemented (using Python3 and NumPy) while reading Introduction to Reinforcement Learning by Sutton and Barto.
There's a separate ReadMe for each topic

High Level structure of the repo :

  1. Bandits
    • Epsilon-greedy
    • Optimistic initial value
    • Softmax exploration
  2. Dynamic Programming methods
    • Policy iteration
    • Value iteration
  3. Model free methods

Resources

CS234 and David silver often use different notations, it would be better to follow just one of them in the beginning (I prefer David Silver's lectures)

  1. Introduction to Reinforcement Learning by Sutton and Barto
  2. David Silver's lectures
  3. Stanford CS234

Check this out for more resources!

About

Python implementation of common RL algorithms using OpenAI gym environments

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages