temporal-difference

Monte Carlo and Temporal Difference implementation from Chapter 5 and Chapter 6 of Reinforcement Learning: An Introduction Book by Andrew Barto and Richard S. Sutton.

reinforcement-learning monte-carlo temporal-difference

Updated Sep 8, 2019
Python

victor-iyi / deep-RL

Star

Exploration of deep reinforcement learning and various state-of-the-art techniques to create a turely autonomous agent.

machine-learning reinforcement-learning ai deep-reinforcement-learning artificial-intelligence policy-gradient a3c actor-critic deep-rl proximal-policy-optimization ppo policy-network temporal-difference

Updated Feb 19, 2019
Python

dksifoua / Reinforcement-Learning

Star

reinforcement-learning monte-carlo q-learning policy-gradient sarsa dynamic-programming reinforce markov-decision-processes actor-critic asynchronous-advantage-actor-critic proximal-policy-optimization advantage-actor-critic dyna-q temporal-difference

Updated May 3, 2024
Jupyter Notebook

i2a-k / Reinforcement-Learning

Star

Multi-Armed Bandit Simulation, MDP GridWorld Example, Random Walk Problem by TD and MC

reinforcement-learning monte-carlo rl gridworld markov-decision-processes multi-armed-bandit random-walk n-armed-bandit-problem temporal-difference incremental-monte-carlo

Updated Sep 14, 2020
Jupyter Notebook

WinDerek / reinforce-py

Star

Reinforcement learning agents in Python (dynamic programming, temporal-difference, deep Q-learning, stochastic/deterministic policy gradients)

visualization python reinforcement-learning full-stack artificial-intelligence dynamic-programming temporal-difference

Updated Jan 6, 2023
Jupyter Notebook

keeeal / temporal-ut3

Star

Temporal difference learning for ultimate tic-tac-toe.

reinforcement-learning deep-learning neural-network pytorch artificial-intelligence temporal ultimate-tic-tac-toe self-play temporal-difference

Updated Sep 7, 2019
Python

TanushGoel / Atari-Games-RL

Star

a collection of python notebooks using RL agents to play Atari games in OpenAI gym environments

reinforcement-learning monte-carlo q-learning policy-gradient atari-games actor-critic-methods temporal-difference state-value-function policy-based-method

Updated Jun 4, 2020
Jupyter Notebook

VEXLife / Accelerated-TD

Star

My Implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

reinforcement-learning reinforcement-learning-algorithms td atd random-walk temporal-differencing-learning temporal-difference temporal-difference-algorithms temporal-difference-learning accelerated-td

Updated Jan 25, 2024
Python

ken-power / DRLND_DeepReinforcementLearning_Examples

Star

Examples and tutorials that implement various algorithms in Deep Reinforcement Learning.

monte-carlo deep-reinforcement-learning openai-gym pytorch dqn dynamic-programming monte-carlo-tree-search cross-entropy temporal-difference

Updated Jan 23, 2022
Jupyter Notebook

steven112163 / Deep-Learning-and-Practice

Star

NCTU(NYCU) Deep Learning and Practice Spring 2021

reinforcement-learning deep-learning cpp python3 pytorch dqn resnet ddpg deep-convolutional-networks cvae conditional-gan eegnet temporal-difference conditional-normalizing-flows

Updated Jun 21, 2022
Python

Improve this page

Add a description, image, and links to the temporal-difference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the temporal-difference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

temporal-difference

Here are 27 public repositories matching this topic...

PieroMacaluso / reinforcement-learning-stuff

ahlusar1989 / CS234ReinforcementLearning

dhikshitha29 / Playing-the-game-of-twenty-one-and-pontoon

qihongl / demo-td

antonio-f / TD-methods-SARSA

rdadrl / DiceUp

devspaceship / madepro

mweglowski / pathfinding_simulator

willyfh / grid-world-reinforcement-learning-2

alizindari / Reinforcement-Learning

KaleabTessera / Monte-Carlo-and-Temporal-Difference

victor-iyi / deep-RL

dksifoua / Reinforcement-Learning

i2a-k / Reinforcement-Learning

WinDerek / reinforce-py

keeeal / temporal-ut3

TanushGoel / Atari-Games-RL

VEXLife / Accelerated-TD

ken-power / DRLND_DeepReinforcementLearning_Examples

steven112163 / Deep-Learning-and-Practice

Improve this page

Add this topic to your repo