temporal-difference

Monte Carlo and Temporal Difference implementations from Chapters 5 and 6 of Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.

reinforcement-learning monte-carlo temporal-difference

Updated Sep 8, 2019
Python

VEXLife / Accelerated-TD

My implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

reinforcement-learning reinforcement-learning-algorithms td atd random-walk temporal-differencing-learning temporal-difference temporal-difference-algorithms temporal-difference-learning accelerated-td

Updated Jan 25, 2024
Python

devspaceship / madepro

A minimal Rust library for solving finite deterministic Markov decision processes

rust reinforcement-learning q-learning mdp sarsa markov-decision-processes temporal-difference

Updated Jan 13, 2024
Rust

mweglowski / pathfinding_simulator

🧨 Interactive temporal difference algorithm simulator in which an agent has to find the optimal path to reach a certain destination.

javascript css html reinforcement-learning reactjs q-learning tailwindcss temporal-difference

Updated May 22, 2024
JavaScript

WinDerek / reinforce-py

Reinforcement learning agents in Python (dynamic programming, temporal-difference, deep Q-learning, stochastic/deterministic policy gradients)

visualization python reinforcement-learning full-stack artificial-intelligence dynamic-programming temporal-difference

Updated Jan 6, 2023
Jupyter Notebook

rdadrl / DiceUp

DiceUp is a collection of backgammon-playing AIs.

java ai td minimax monte-carlo-tree-search backgammon temporal-difference

Updated Jan 23, 2020
Java

victor-iyi / deep-RL

Exploration of deep reinforcement learning and various state-of-the-art techniques to create a truly autonomous agent.

machine-learning reinforcement-learning ai deep-reinforcement-learning artificial-intelligence policy-gradient a3c actor-critic deep-rl proximal-policy-optimization ppo policy-network temporal-difference

Updated Feb 19, 2019
Python

dksifoua / Reinforcement-Learning

reinforcement-learning monte-carlo q-learning policy-gradient sarsa dynamic-programming reinforce markov-decision-processes actor-critic asynchronous-advantage-actor-critic proximal-policy-optimization advantage-actor-critic dyna-q temporal-difference

Updated May 3, 2024
Jupyter Notebook

keeeal / temporal-ut3

Temporal difference learning for ultimate tic-tac-toe.

reinforcement-learning deep-learning neural-network pytorch artificial-intelligence temporal ultimate-tic-tac-toe self-play temporal-difference

Updated Sep 7, 2019
Python

alizindari / Reinforcement-Learning

Implementation of several RL algorithms based on Prof. Sutton's book

reinforcement-learning deep-reinforcement-learning policy-iteration value-iteration bellman-equation k-armed-bandit temporal-difference policy-improvement montecarlo-methods

Updated Aug 20, 2021
Jupyter Notebook

Here are 27 public repositories matching this topic...

i2a-k / Reinforcement-Learning

PieroMacaluso / reinforcement-learning-stuff

ahlusar1989 / CS234ReinforcementLearning

willyfh / grid-world-reinforcement-learning-2

dhikshitha29 / Playing-the-game-of-twenty-one-and-pontoon

ken-power / DRLND_DeepReinforcementLearning_Examples

TanushGoel / Atari-Games-RL

steven112163 / Deep-Learning-and-Practice

qihongl / demo-td

antonio-f / TD-methods-SARSA

KaleabTessera / Monte-Carlo-and-Temporal-Difference

VEXLife / Accelerated-TD

devspaceship / madepro

mweglowski / pathfinding_simulator

WinDerek / reinforce-py

rdadrl / DiceUp

victor-iyi / deep-RL

dksifoua / Reinforcement-Learning

keeeal / temporal-ut3

alizindari / Reinforcement-Learning
