Skip to content

A gallery for reinforcement learning, including frameworks, tutorials, papers, implementations, applications, etc.

License

Notifications You must be signed in to change notification settings

IamWangYunKai/RL-Gallery

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

73 Commits
 
 
 
 
 
 

Repository files navigation

RL-Gallery


RL Framework

RL Tutorials

RL Platform

RL Implementation

RL Papers

RL Applications

RL Technique

Technique Benefit Mentioned Key Algorithm
Target network Stabilize the training process DQN, 2015
Memory buffer Breaking data relevance DQN, 2015
KL-constrained update Optimize update step size TRPO, 2015
Advantage function Stabilize learning A3C, 2015
Importance sampling Data efficient PER,2016
Entropy-regularized Better exploration Soft Q-Learning, 2018
Boltzmann policy Richer mathematical meaning Soft Q-Learning, 2018
Target policy smoothing Avert Q-function incorrect sharp peak TD3, 2018
Clipped double-Q learning Fend off overestimation in the Q-function TD3, 2018
Reparameterize the policy Lower variance estimate SAC, 2018

PS: "Mentioned Key Algorithm" may not be the first algorithm that uses this technique, but makes a detailed explanation

About

A gallery for reinforcement learning, including frameworks, tutorials, papers, implementations, applications, etc.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published