Skip to content
/ NES-HT Public

Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning

Notifications You must be signed in to change notification settings

cangcn/NES-HT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Official Implementation for NESHT(Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning)

This project implements an evolution algorithm that integrates the hard-thresholding (HT) operator into the well-known natural evolution strategies (NES) algorithm. For the paper, please check this arixv link.

Details

  • Parallelism. We employ the simple joblib package, finding it much faster than using ray on a single node cluster. (Our single node machine has 512 cores.)
  • Policy Network. We apply the NESHT algorithm to a single linear layer agent, as it's well established that the capacity of a single linear layer suffices for Mujoco tasks.
  • Gaussian Noise. Our comparisons are conducted on noisy Mujoco environments, where the state is a combination of Gaussian noise and environment-provided observations.
  • For any questions about the implementation, please don't hesitate to contact me.

About

Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published