Just a bunch of exercises created during my thesis work on Reinforcement Learning.
Updated Dec 8, 2022 - Python
CS234 Coursework
Optimising the blackjack game
TD, a model of second/higher order conditioning
Temporal Difference methods - A simple implementation of the SARSA algorithm applied to OpenAI Gym's "CliffWalking" environment.
DiceUp is a collection of backgammon-playing AIs.
A minimal Rust library for solving finite deterministic Markov decision processes
🧨 Interactive temporal difference algorithm simulator in which an agent has to find the optimal path to a given destination.
Implementation of TD policy evaluation and Q-learning on a grid world.
Implementation of several RL algorithms based on Prof. Sutton's book
Monte Carlo and Temporal Difference implementations from Chapters 5 and 6 of the book Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.
Exploration of deep reinforcement learning and various state-of-the-art techniques to create a truly autonomous agent.
Multi-Armed Bandit Simulation, MDP GridWorld Example, Random Walk Problem by TD and MC
Reinforcement learning agents in Python (dynamic programming, temporal-difference, deep Q-learning, stochastic/deterministic policy gradients)
Temporal difference learning for ultimate tic-tac-toe.
A collection of Python notebooks using RL agents to play Atari games in OpenAI Gym environments
My implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python
Examples and tutorials that implement various algorithms in Deep Reinforcement Learning.
NCTU (NYCU) Deep Learning and Practice, Spring 2021