Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief

Code to reproduce the experiments in Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief.

Installation

Install MuJoCo 2.0.0 to ~/.mujoco/mujoco200.
Create a conda environment and install requirements.

cd PMDB
conda env create -f PMDB_env.yml
conda activate PMDB_env
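
Before creating the environment, it can help to confirm that MuJoCo sits where the step above expects it. The following is a minimal sketch, not part of the repository: the helper names `mujoco_dir` and `mujoco_installed` are hypothetical, and the check only tests that the directory exists, not that its contents or license key are valid.

```python
from pathlib import Path


def mujoco_dir(home=None):
    """Return the directory this README expects MuJoCo 2.0.0 to live in."""
    # Hypothetical helper: `home` is overridable only to make the check testable.
    base = Path(home) if home is not None else Path.home()
    return base / ".mujoco" / "mujoco200"


def mujoco_installed(home=None):
    """True if the expected MuJoCo directory exists (contents are not checked)."""
    return mujoco_dir(home).is_dir()


if __name__ == "__main__":
    status = "found" if mujoco_installed() else "missing"
    print(f"{mujoco_dir()}: {status}")
```

If the directory is reported as missing, install MuJoCo 2.0.0 to that path before running the conda commands.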

Usage

For example, the following command runs the hopper-medium-v2 task from the D4RL benchmark:

python main.py --task=hopper-medium-v2

Detailed configuration can be found in config.py.
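
To run several tasks back to back, the command above can be wrapped in a small launcher. This is a sketch under stated assumptions rather than a script shipped with the repository: `build_command` and `run_all` are hypothetical names, and only `hopper-medium-v2` is confirmed by this README. The other two entries are standard D4RL dataset names that are assumed, not verified, to work with `main.py`.

```python
import subprocess
import sys

# hopper-medium-v2 comes from the README; the other two are standard D4RL
# dataset names and are only assumed to be accepted by main.py.
TASKS = ("hopper-medium-v2", "halfcheetah-medium-v2", "walker2d-medium-v2")


def build_command(task):
    """Build the argument list for one run, mirroring the README command."""
    return [sys.executable, "main.py", f"--task={task}"]


def run_all(tasks=TASKS):
    """Run each task in sequence and return a {task: exit code} mapping."""
    results = {}
    for task in tasks:
        completed = subprocess.run(build_command(task))
        results[task] = completed.returncode
    return results


# Example (run from the PMDB directory with PMDB_env active):
#     print(run_all())
```

Running the tasks sequentially keeps the sketch simple. A non-zero exit code in the returned mapping marks a run that failed.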

Logging

By default, TensorBoard logs are generated in the log/ directory.
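
To check that a run actually wrote logs, the log directory can be scanned for TensorBoard event files. This is a minimal sketch: `find_event_files` is a hypothetical helper, and the layout of subdirectories under log/ is an assumption, so the search is recursive. It relies only on the fact that TensorBoard event file names contain `tfevents`.

```python
from pathlib import Path


def find_event_files(log_dir="log"):
    """Return TensorBoard event files found anywhere under log_dir, sorted by path."""
    root = Path(log_dir)
    if not root.is_dir():
        # No log directory yet, e.g. before the first run.
        return []
    return sorted(
        p for p in root.rglob("*") if p.is_file() and "tfevents" in p.name
    )


if __name__ == "__main__":
    for path in find_event_files():
        print(path)
```

The logs themselves can be viewed by running `tensorboard --logdir log` from the PMDB directory.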

Citing PMDB

@inproceedings{guo2022pmdb,
  title={Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief},
  author={Kaiyang Guo and Yunfeng Shao and Yanhui Geng},
  booktitle={NeurIPS},
  year={2022}
}
