sritee/Deterministic-Policy-Gradient-Methods

Deterministic Policy Gradient

This is a C++ implementation of a Deterministic Policy Gradient algorithm proposed by Silver et al. [1]. The critic is a linear function approximator over tile-coded features, following the tile coding scheme described by Richard Sutton. Note that this algorithm differs from Deep Deterministic Policy Gradient: because we use linear function approximation, convergence guarantees are available. We test the algorithm on the Continuous Action Mountain Car domain, implemented similarly to the OpenAI Gym environment.
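To make the role of tile coding concrete, here is a minimal sketch of a tile coder for a two-dimensional state (position, velocity) together with a linear value estimate over the active tiles. It is not taken from this repository: the names `TileCoder`, `active_tiles`, and `linear_value` are hypothetical, as are the uniform offset scheme and the way out-of-range states are clamped. The actual implementation may lay out its tilings differently.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical tile coder for a 2-D state (position, velocity).
// All names and the offset scheme are illustrative assumptions.
struct TileCoder {
    int tilings;        // number of overlapping grids
    int tiles_per_dim;  // tiles along each dimension in one grid
    double lo[2];       // lower bound of each state dimension
    double hi[2];       // upper bound of each state dimension

    TileCoder(int n_tilings, int n_tiles,
              double pos_lo, double pos_hi,
              double vel_lo, double vel_hi)
        : tilings(n_tilings), tiles_per_dim(n_tiles),
          lo{pos_lo, vel_lo}, hi{pos_hi, vel_hi} {
        assert(n_tilings > 0 && n_tiles > 0);
        assert(pos_hi > pos_lo && vel_hi > vel_lo);
    }

    // Each tiling has (tiles_per_dim + 1)^2 cells; the extra row and
    // column absorb the shift applied to the offset grids.
    std::size_t num_features() const {
        const std::size_t side = static_cast<std::size_t>(tiles_per_dim) + 1;
        return static_cast<std::size_t>(tilings) * side * side;
    }

    // Returns exactly one active feature index per tiling.
    std::vector<std::size_t> active_tiles(double pos, double vel) const {
        const double s[2] = {pos, vel};
        const std::size_t side = static_cast<std::size_t>(tiles_per_dim) + 1;
        std::vector<std::size_t> active;
        active.reserve(static_cast<std::size_t>(tilings));
        for (int t = 0; t < tilings; ++t) {
            std::size_t cell[2];
            for (int d = 0; d < 2; ++d) {
                // Normalise to [0, 1], clamping states outside the bounds.
                double x = (s[d] - lo[d]) / (hi[d] - lo[d]);
                x = std::min(1.0, std::max(0.0, x));
                // Shift tiling t by t / tilings of one tile width.
                const double shifted =
                    x * tiles_per_dim + static_cast<double>(t) / tilings;
                cell[d] = static_cast<std::size_t>(std::floor(shifted));
            }
            active.push_back(static_cast<std::size_t>(t) * side * side +
                             cell[0] * side + cell[1]);
        }
        return active;
    }
};

// With binary tile features, a linear estimate is the sum of the
// weights at the active indices.
double linear_value(const std::vector<double>& weights,
                    const std::vector<std::size_t>& active) {
    double v = 0.0;
    for (std::size_t i : active) v += weights[i];
    return v;
}
```

As a usage example, `TileCoder tc(8, 8, -1.2, 0.6, -0.07, 0.07);` covers the position and velocity ranges of the Gym mountain car (an assumption about this repository's environment), and `linear_value(w, tc.active_tiles(pos, vel))` evaluates a weight vector `w` of length `tc.num_features()`. Because only one tile per tiling is active, evaluating or updating the estimate touches `tilings` weights rather than the whole vector.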

For a detailed discussion, please visit my blog post [2].

References

About

C++ implementation of Deterministic Policy Gradient algorithms (ICML 2014, Silver et al.) using tile coding
